Blog
/
AMD is using agentic AI to rewrite ROCm and accelerate software delivery

AMD is using agentic AI to rewrite ROCm and accelerate software delivery

Photo of Andrew Zigler
|
Blog_Rewrite_RO_Cm_accelerate_software_delivery_2400x1256_25bca5aac1

The software development lifecycle is undergoing its most dramatic transformation in decades. In just the past few months, we have moved from treating agentic AI as prompts in a cron job to orchestrating multi-agent systems that plan, implement, and iterate autonomously across entire codebases. For engineering leaders navigating this shift, the challenge is fundamentally rethinking how teams collaborate, what abstractions matter, and where human judgment adds the most value.

Anush Elangovan, VP of AI Software at AMD, has been at the forefront of this transformation. His team is not only building the ROCm infrastructure that powers many frontier AI models, they are using agentic AI to rewrite ROCm itself, creating a self-improving flywheel where open source code, model familiarity, and autonomous refinement reinforce one another. In our conversation, Anush shared how AMD and teams across the industry are navigating what he calls the K-shaped transformation of software engineering, where early adopters compound their output while others risk being left behind.

AMD is multiplying software output with agentic AI

The shift from simple automation to true agentic workflows has happened faster than anyone anticipated. Six months ago, the cutting edge meant scheduled prompts and basic code generation. Today, small teams, sometimes just a single maintainer, orchestrate what previously required months of coordinated effort across large organizations. Elangovan points out that it is now possible for a single person pressing the right buttons to orchestrate work that used to require massive teams and years of effort, because the agents essentially run themselves.

This is not hyperbole. Anush's team recently built a production job scheduler in a week to replace Slurm, the 20-year incumbent in HPC environments. The project was made possible by agents that monitor issues and pull requests continuously, executing work overnight and across time zones. The human role shifted from writing code to defining clear end states and building robust test harnesses that validate autonomous execution.

The implications extend beyond velocity. AMD's open source ROCm stack creates a unique advantage. Because the entire codebase is publicly available, frontier models have native understanding of the hardware specifications and software architecture. This enables a self-improving loop where agents can propose and implement improvements directly, essentially allowing agentic AI to rewrite ROCm itself.

This model points toward a future where software creation is increasingly treated as an application of compute and tokens against a clearly defined target state. The traditional constraints of team size, geographic distribution, and working hours matter less when agents can execute continuously. What remains essential is speed.

"My new mantra is software is just tokens and time, but speed is still the moat."

For engineering leaders, this means rethinking resource allocation. The bottleneck is no longer raw development capacity but the ability to define outcomes clearly, build effective harnesses, and steer autonomous execution toward business value.

Test harnesses make autonomous coding reliable

As agents take on more implementation work, the primary control surface for engineering shifts dramatically. Planning and test design become the mechanisms through which humans guide autonomous development. Tests are defined before implementation and serve as both specification and validation, a pattern that becomes critical when generated code grows too large for line-by-line human review.

Anush learned this lesson directly when an autonomous agent took an unexpected shortcut. He discovered that when a test was failing, the agent sneakily simulated the hardware to make the test pass. The agent then committed the code, satisfied that the test had technically succeeded.

The agent satisfied the superficial condition without honoring the actual system behavior. It simulated the hardware rather than fixing the underlying issue. This example illustrates why robust harnesses are essential. Agents optimize for the constraints you give them, and if those constraints are incomplete, they will find creative and often problematic shortcuts.

The solution is to invest heavily in test design and validation. What was once 80% of engineering effort now approaches 99.9%. Elangovan explains that writing and validating tests effectively takes over the entire workflow because the test is essentially becoming the code review.

This shift has profound implications for how teams work. Tests become the primary artifact that encodes intent, defines acceptable outcomes, and prevents regression. They serve as both specification and review mechanism, especially when the volume of generated code makes traditional review impractical.

The pattern extends beyond internal development to community contributions. Open source projects that embrace agentic workflows need clear evaluative gates, well-defined tests, and acceptance criteria to make external collaboration scalable. Shared expectations about what constitutes success matter more than consistency in coding style or implementation approach.

For engineering leaders, this means prioritizing test infrastructure, investing in frameworks that make test creation easier, and building organizational muscle around test-driven development. The teams that master harness engineering will be able to supervise broader autonomous execution safely and effectively.

Intent-to-outcome orchestration removes implementation bottlenecks

Traditional software development relies on intermediate checkpoints like design documents, code reviews, and architectural diagrams. These artifacts help teams align on approach and maintain shared understanding. But as agentic workflows mature, the development process is compressing toward what Anush calls intent-to-outcome orchestration, or simply IO.

In this model, teams focus on defining the desired end state and the criteria for success. The implementation path becomes more flexible, less human-readable, and often unintelligible without agent assistance. Multiple approaches become acceptable as long as they satisfy the outcome requirements.

This creates a new challenge of reviewing and validating work when the middle of the workflow is opaque. The answer, increasingly, is that review itself becomes agentic. Rather than manually inspecting 100,000 lines of code, Elangovan gets the agent to pull the code apart, review it, and write down an assessment of what it thinks the code does versus what the author originally intended.

Agents unpack machine-generated changes, summarize divergence from intent, and propose fixes. The human role is to validate the assessment, provide judgment on edge cases, and make final decisions about what constitutes acceptable risk.

This pattern fundamentally changes how teams collaborate. Agreement on success criteria matters more than consensus on implementation details. Traditional abstractions such as language choice, coding conventions, and even the centrality of manual git operations become less important. Anush has not opened his text editor or performed a manual git commit in over a month. All code is treated as ephemeral, rewritten on the fly as models and requirements evolve.

The implications for engineering leadership are significant. Teams need new collaboration patterns that work when the implementation layer is fluid. They need to get comfortable with multiple valid solutions to the same problem. They also need to build trust in agentic review processes that can surface issues humans would miss in the volume of generated code.

The shift also changes what skills matter. Familiarity with specific languages or frameworks becomes less critical when agents handle implementation. What matters is the ability to define clear outcomes, build effective constraints, and orchestrate between intent and validated results.

Why agentic AI is splitting engineering teams into winners and laggards

Perhaps the most urgent challenge facing engineering organizations is what Anush calls the K-shaped transformation, a divergence in productivity where early adopters of agentic tooling compound their output rapidly while others remain anchored to conventional workflows. He sees a distinct split between the folks on the upper arm who have just taken off and the folks who are just on the cusp of learning what these tools are.

This is not a gradual shift that organizations can manage through normal change processes. The transformation is unfolding over weeks, not quarters. Engineers who were on vacation for a few weeks can return to find their tools and workflows fundamentally different, an experience Anush compares to waking from a 30-year coma.

The risk is that capable engineers who have not adapted will become defensive, protecting their corners rather than embracing new patterns. Once that resistance hardens, organizations face a double challenge in addressing both the human and technical aspects of the gap.

The solution is proactive upskilling focused on bringing the broader workforce from baseline productivity to materially higher leverage. The outliers at 100x or 1000x productivity will continue to compound, but organizational value comes from moving the middle of the curve from 1x or 2x to 10x or beyond.

AMD's approach includes creating psychologically safe forums for basic questions. Anush runs what he calls an AI anonymous group where team members can ask anything without judgment. He believes there should be a safe space for people to ask basic questions about how new tools work without feeling embarrassed or judged.

The group encourages self-service learning while providing peer support and internal champions who can accelerate adoption. The goal is to help people help themselves by equipping them with the core primitives that unlock the rest of the puzzle.

For engineering leaders, this means treating AI upskilling as an immediate priority rather than a gradual initiative. It means identifying champions who can model new workflows, creating safe spaces for learning, and actively steering the transformation rather than letting it unfold organically. The teams that move quickly will compound their advantages; those that delay will find the gap increasingly difficult to close.

AI is turning the software lifecycle into continuous orchestration

The traditional software development lifecycle of plan, implement, review, and iterate is compressing into a faster loop where documents, commits, and code versions become increasingly ephemeral. This compression changes not just velocity but the fundamental nature of software artifacts and what it means to ship code.

Language choice, once a critical decision with long-term implications, matters less when agents generate and inspect code. Teams can optimize for outcome, ecosystem fit, or performance rather than individual familiarity. Anush chose Rust for a recent project despite never having learned the language and having no intention to learn it. He simply tells the agent to handle rebasing and treats the resulting code entirely like ephemera.

The code exists as a manifestation of intent, serving as pre-training tokens for future iterations. Each version builds on the previous one, creating a flywheel where models learn from past implementations and generate better solutions over time.

This pattern extends beyond coding to adjacent operational work. Dashboards, coordination systems, and workflows assembled from existing data sources can be built on the fly rather than maintained as fixed artifacts. A CRM system, for example, becomes unnecessary when agents can synthesize unified views from Slack channels, calendars, and meeting notes on demand.

The challenge is that this acceleration creates its own bottleneck. The volume of messages, tasks, and signals produced by both humans and agents quickly becomes unmanageable. Teams need agent-mediated memory and prioritization layers to process the flood of information. Anush points out that in order to consume the massive amount of data generated by agents, you ultimately need another agent to consume it for you.

Anush built a Rust-based interface that consumes all his Microsoft Teams messages, Outlook emails, and Slack communications while maintaining memory and context in each channel. It surfaces high-priority interrupts and visualizes activity as a set of dynamic molecules that update in real time. Without this layer, the countless one-on-one messages and group chats would be impossible to process sequentially.

Orchestrating continuous refinement

The development lifecycle is also increasingly tied to open models and local inference. Privacy, cost, and personalization benefits make on-device and edge execution relevant across the hardware stack. AMD's portfolio, from data center Instinct products to Strix Halo laptops with 128GB of RAM capable of running 200B parameter models locally, reflects this shift toward distributed, personalized AI that does not require constant cloud connectivity.

For engineering leaders, the implications are clear. The development lifecycle is no longer a fixed process but a flexible orchestration layer. Teams need to build infrastructure for agent-mediated coordination, invest in memory and prioritization systems, and prepare for a world where code, documents, and even git commits are increasingly ephemeral artifacts in a continuous refinement loop.

The transformation from traditional software development to intent-to-outcome orchestration is not a distant future. It is happening now, measured in weeks rather than years. Engineering leaders face a choice: actively steer their organizations through this shift, or watch as the gap between early adopters and the rest of the team becomes unbridgeable.

The path forward requires focusing on fundamentals such as harness engineering, clear outcome definition, and proactive upskilling while embracing the reality that many traditional abstractions no longer serve the same purpose. Speed remains the moat, but it is speed in adapting to new workflows, not just shipping features. The teams that master this transition will compound their advantages, while those that delay will find themselves on the wrong side of the K-shaped curve.

To hear more of Anush Elangovan's insights on agentic workflows, the K-shaped transformation, and the future of ROCm, listen to his full episode on the Dev Interrupted podcast.

andrewzigler_4239eb98ca

Andrew Zigler

Andrew Zigler is a developer advocate and host of the Dev Interrupted podcast, where engineering leadership meets real-world insight. With a background in Classics from The University of Texas at Austin and early years spent teaching in Japan, he brings a humanistic lens to the tech world. Andrew's work bridges the gap between technical excellence and team wellbeing.

Connect with

Your next read