Android_world
Visit ToolAndroidWorld is an environment and benchmark for autonomous agents. It provides a highly reproducible benchmark of 116 hand-crafted tasks across 20 apps on a live Android emulator.
At a glance
Trending
AndroidWorld is an environment and benchmark for autonomous agents. It provides a highly reproducible benchmark of 116 hand-crafted tasks across 20 apps on a live Android emulator.
Trending
About
AndroidWorld is an open-source environment and benchmark designed for building and evaluating autonomous computer control agents. It operates on a live Android emulator, offering a highly reproducible benchmark comprising 116 hand-crafted tasks across 20 real-world Android applications. These tasks are dynamically instantiated with randomly-generated parameters, creating millions of unique variations for robust testing. Key features include durable reward signals for reliable evaluation, experimental Docker support for simplified setup, and an open environment with access to millions of Android apps and websites. It also integrates with the MiniWoB++ web benchmark, rendering common input elements as native Android UI widgets. The platform is extensible, allowing users to easily add new tasks and benchmarks, and supports custom agent creation.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending