Google is rebuilding Android development for humans and agents

May 26, 2026

Blog_Google_is_rebuilding_2400x1256_81f2eecd64

The Android development ecosystem is undergoing its most significant transformation in decades. As Matthew McCullough, VP of Android Development Experiences at Google, points out, suggesting tools that serve agents rather than humans would have elicited deep confusion from developers just three years ago. Today, omitting agents from the conversation yields that same confusion.

This shift represents more than an incremental tooling update. It is a fundamental rethinking of how three million active Android developers work, what they build, and who gets to participate in software creation. For engineering leaders managing mobile teams, the strategic question is no longer whether to adopt AI-assisted development, but how to support teams across a rapidly widening spectrum of adoption maturity.

Google is building Android tooling for humans and agents

The most consequential decision Google made this year was committing to dual-mode tooling. They are building everything to serve both traditional human-driven workflows and emerging agentic execution patterns. McCullough explains that developers who prefer traditional IDE interfaces are still fully supported, but the platform is also built for agentic workflows ranging from simple chat queries to long-running autonomous tasks.

This approach acknowledges a critical reality. Even within a single organization, teams operate at vastly different points along the adoption curve. At the entry level, developers use AI primarily as an enhanced answer engine for documentation lookup, quick tips, and forum-style Q&A. This represents a productivity boost, but one that keeps developers firmly in control of every implementation decision.

The middle of the bell curve has shifted dramatically in recent months. McCullough notes that the center of gravity has moved from developers simply getting good answers to actually getting work done. The inflection point arrived somewhere between November and January when AI assistance crossed from peripheral support into the core development loop. Developers stopped context-switching to browsers and colleagues for answers and started staying inside tighter, more productive cycles where AI actively advances work rather than simply informing it.

At the frontier, a small but growing cohort of teams reports that the vast majority of their code is now generated through agentic processes. These developers have fundamentally redefined their role. They spend less time inspecting syntax and more time orchestrating intent, defining product vision, and reviewing agent output while the agents handle the implementation details.

The strategic insight here is that platform providers cannot force convergence. Teams will move at different speeds based on risk tolerance, domain complexity, and organizational culture. The winning approach supports cautious adopters and frontier teams simultaneously without fragmenting the ecosystem or creating incompatible development paths.

Google is making Android tools composable for agentic workflows

Supporting dual-mode development requires a return to compositional fundamentals. McCullough emphasizes that command-line interfaces and text user interfaces are relevant again, not out of nostalgia, but because composability has become essential infrastructure for agentic workflows. The reintroduction of Android CLI, a modern command-line surface for builds, project setup, and SDK management, exemplifies this philosophy.

Graphical IDEs remain important for developers who prefer visual interaction, but they can no longer be the only interface. Agentic systems need programmatic access to clean APIs, scriptable commands, and machine-readable outputs. This means every capability must be exposed at multiple levels: UI for human efficiency, CLI for automation, and API for agent integration.

The composability mandate extends beyond tooling architecture into how teams evaluate and adopt new capabilities. With so much hype and rapid iteration in the AI space, McCullough advocates for measurement over marketing. He stresses the importance of relying on hard numbers, measuring outcomes, and allowing people to repeat evaluations themselves. To support this, Google came up with an open benchmark that works on real codebases and provides updated scores on a monthly basis.

Android Bench represents this philosophy in practice as an open evaluation framework that tests models against real Android codebases and publishes reproducible performance scores. Engineering leaders can use these benchmarks to cut through vendor claims and make evidence-based decisions about which models to integrate into their workflows.

Composability also reinforces established engineering discipline. Code review, for instance, becomes more important as automation increases. When agents generate significant portions of a codebase, review shifts from catching syntax errors to ensuring architectural coherence, security posture, and long-term maintainability. The goal is to add to codebase quality over time, not gradually erode it through unchecked automation.

For teams building their own toolchains, the lesson is clear: design for multiple interaction modes from the start. Assume that both humans and agents will use your tools, and optimize for composability over feature richness. The most powerful systems will be those that can be recombined in unexpected ways rather than those that try to anticipate every use case through custom UI.

AI prototyping speeds product alignment and decisions

One of the most underappreciated shifts in AI-assisted development is how it changes alignment and decision-making. McCullough argues that prototypes have become a more effective coordination artifact than lengthy written specifications. He notes that everyone in software tends to have strong opinions quickly once they can actually play with a working prototype.

Traditional product development often begins with detailed PRDs, twenty-page documents outlining features, user flows, and technical requirements. These documents take significant time to produce, yet they remain abstract until implementation begins. Teams debate hypotheticals, make assumptions about user behavior, and commit to directions before anyone can interact with the proposed experience.

AI has inverted this dynamic by dramatically lowering the time cost of producing working prototypes. Instead of starting from scratch, teams can now bring in existing assets, web applications, React Native projects, Flutter apps, and even iOS codebases to use as starting points. McCullough points out that developers are no longer starting from zero; AI lets them jump straight to an advanced starting position.

This shift makes product development more grounded in user reality. Rather than debating abstractions, teams can put working software in front of stakeholders and customers early in the cycle. Opinions form faster, feedback becomes more concrete, and misalignments surface before significant engineering investment has been made.

The strategic implication is that rapid prototyping becomes an organizational capability, not just a technical one. McCullough explains that this leads to more inclusive software development because participation is no longer limited to those with specific technical titles. Product managers, designers, and domain experts can now contribute directly to early product exploration rather than waiting for engineering resources to translate their ideas into code.

For engineering leaders, this means rethinking how teams validate ideas. Instead of gate-keeping prototype creation behind formal engineering sprints, consider enabling broader access to prototyping tools. AI Studio, for example, is evolving into a prototyping surface for Android applications, allowing non-engineers to experiment with product concepts before formal development begins.

The risk, of course, is that prototypes become production code without proper architectural review. The discipline here is ensuring that rapid prototyping remains a validation tool, not a shortcut around sound engineering practices. Prototypes should inform decisions, not become technical debt.

Engineers gain leverage by managing parallel agents

Perhaps the most profound change is the evolution of the engineering role itself. McCullough describes it as effectively receiving a promotion to manager, even if that means managing agents rather than people. This is not an exaggeration. The job is fundamentally shifting from direct code authorship to orchestration, planning, and quality oversight.

Six months ago, agentic workflows were largely sequential. A developer would spin up a task, grab coffee, and return to review the output. That model is already obsolete. Now, teams need a queue of concurrent tasks where one agent works on a bug backlog, another tackles a new feature, and a third prototypes something forward-looking.

Managing parallel agent workflows introduces new challenges. Developers must now coordinate multiple concurrent tasks without reintroducing the context-switching overhead they were trying to escape. This requires updated interfaces, visual dashboards, task queues, priority management, and new practices for tracking work that happens outside direct human control.

The skills that matter in this model are less about syntax mastery and more about strategic thinking. Engineers must define clear plans, articulate product intent precisely, and review output for architectural soundness rather than syntactic correctness. The ability to write a perfect loop matters less than the ability to describe what the loop should accomplish and validate that the agent's implementation aligns with broader system design.

For engineering leaders, this transition demands investment in coordination artifacts. Teams need better tools for expressing intent, tracking agent progress, and reviewing generated code at scale. They also need updated performance frameworks that recognize orchestration and oversight as core engineering contributions, not secondary activities.

The manager-of-agents model also expands leverage. A senior engineer who can effectively coordinate multiple agents can have far greater impact than one who writes every line of code personally. This creates opportunities for experienced engineers to multiply their influence while also opening pathways for newer engineers to contribute meaningfully by managing narrower agent workflows.

The key is recognizing that this is not a future state. It is happening now. Teams at the frontier are already operating this way, and the gap between them and traditional workflows is widening rapidly. Engineering organizations that fail to adapt risk losing their most innovative talent to environments that embrace this new operating model.

Google is redesigning Android apps around natural language intent

The conversation around natural language often focuses narrowly on developer productivity and how engineers prompt AI tools to generate code. But McCullough pushes the discussion into product territory, arguing that developers and product managers must update how they think about what they are building, not just how they are building it.

Android 17 represents a platform-level bet that user expectations are evolving just as rapidly as developer workflows. Consumers increasingly expect intent-driven interactions, telling apps what they want rather than navigating through multi-step flows. This has profound implications for mobile product design.

McCullough identifies three areas ripe for rethinking. First, the input modality is changing. Design is an easy place to start. Instead of pixel-perfect mockups adjusted through manual interface manipulation, designers could simply describe aesthetic changes in natural language, asking for less purple, deeper shadows, or a more modern look. The time savings come not from eliminating design thinking, but from eliminating the mechanical translation of that thinking into interface adjustments.

Second, contextual adaptation is critical. People are frequently on the run, on a bike, or in a car. In those moments, typing characters on a small keyboard is not optimal. Mobile interfaces must increasingly support voice, simplified choice menus, and open-ended prompts that adapt to user context. The goal is reducing friction when users are multitasking, moving, or otherwise unable to engage with traditional touch interfaces.

Third, multi-step wizards represent accumulated interface debt. Users increasingly expect to state their intent once and have the system figure out the necessary steps, permissions, and integrations. As McCullough notes, users just want to skip the middle part and get straight to the final result.

For engineering leaders, this shift demands closer collaboration between engineering and product teams. The technical capability to support natural language interactions is becoming table stakes. The strategic differentiation lies in reimagining user experiences from first principles rather than simply adding conversational interfaces to existing flows.

This also means that mobile developers need to think more deeply about the business outcomes their applications drive. Natural language interfaces that reduce friction can dramatically improve conversion rates, user retention, and completion of high-value actions. McCullough points out that developers receive a free upgrade in their role because of the immense leverage they can now exert at their company. Engineers who understand how interface design impacts business metrics will have outsized influence in product strategy.

Pluralism over convergence

The Android ecosystem is navigating a rare moment of simultaneous disruption and opportunity. The platform must support millions of developers at radically different points along the AI adoption curve while also enabling frontier teams to push into entirely new development paradigms. The strategic response of dual-mode tooling, composable foundations, rapid prototyping infrastructure, and support for agentic orchestration represents a coherent bet on pluralism over convergence.

For engineering leaders, the lesson is that transformation does not require forcing teams into a single operating model. It requires building systems flexible enough to meet teams where they are while providing clear pathways forward. The organizations that thrive will be those that embrace measurement over hype, composability over complexity, and coordination over control.

To dive deeper into the future of Android development and dual-mode tooling, listen to Matthew McCullough's full episode on the Dev Interrupted podcast.

Andrew Zigler

Andrew Zigler is a GTM Engineer at LinearB and the host of Dev Interrupted, a twice-weekly podcast and newsletter where 40k+ builders decode the transition to AI-native development and agentic orchestration. A classicist by training with a degree from The University of Texas at Austin, Andrew spent his early career teaching in Japan before channeling his interdisciplinary instincts into the tech world. His polymath background informs everything he builds, from automated workflows to the stories he tells about the seismic shifts reshaping software creation.

Connect with

Your next read

Cover image for AI is rebuilding software delivery from SDLC to ADLC