Goldman Sachs Pilots Its First Autonomous Coder

Written by Cam Sivesind | Tue | Jul 15, 2025 | 4:08 PM Z

Goldman Sachs has officially entered the era of the "hybrid workforce" with the announcement of Devin, an autonomous AI software engineer from Cognition. Marco Argenti, Goldman's chief information officer, told CNBC that Devin will soon join the ranks of the bank's 12,000-plus developers, with initial deployments numbering in the hundreds and potentially scaling into the thousands.

The introduction of Devin represents a significant inflection point impacting not just the finance sector but also setting a new standard for cybersecurity strategies and the integration of AI-driven automation across industries.

For the finance industry, and Goldman Sachs specifically, there will be unprecedented productivity gains. Argenti estimates Devin could boost developer output by three to four times, freeing humans from routine, error-prone code updates.

It's interesting to see banks at the forefront of AI adoption. Typically cautious, major financial institutions are now leading agentic-AI integration. Goldman's move is joined by broader AI-powered initiatives across JPMorgan Chase, Morgan Stanley, and others.

Devin and its capabilities adds to turbulence in the job market. Bloomberg projects that up to 200,000 banking jobs may be eliminated over the next three to five years due to AI. While Devin handles repetitive tasks, it raises urgent questions about reskilling and workforce realignment.

With AI writing—and potentially rewriting—critical banking systems, the cybersecurity stakes are higher than ever.

Academic research warns of inherent vulnerabilities with AI-generated code that could contain serious security flaws. Developers may unknowingly introduce insecure patterns when relying on AI suggestions.

Fast Company notes that Devin successfully solved only three of 20 real-world tasks in one test—underscoring the risk of hidden bugs or misconfigurations entering production.

GenAI systems in finance amplify threats like AI-generated phishing, deepfakes, and adversarial attacks.

Human oversight will be a non-negotiable. Goldman's model relies on human supervision of Devin's outputs—errors must be caught before deployment. Security teams need to audit not just code but AI prompt history and decision logs.

Reddit commentary underscores growing caution and real-world developer skepticism. "Devin is known for being false promise… other times, I can hardly get anything to function," said one poster. "There will be so much code, the engineers will need to work with the AI to solve it," said another.

The Reddit community's voices highlight the reality: AI tools may speed outputs, but they require extensive validation and cleanup.

Goldman's Devin isn't just about finance—it signals a pivotal shift in how industries may deploy AI. Agentic AI is maturing. Unlike copilots or chat assistants, agentic systems autonomously execute complex, multi-step tasks. Devin can code, test, debug, and even deploy.

What is the central role of human-AI collaboration? The new normal centers on developers defining problems, crafting prompts, supervising outcomes, and refining results—a dramatic shift in job roles.

Professional services is being, and will be, further redefined. Some experts predict advisory roles—from financial analysis to compliance—may also be disrupted by agentic AI.

Sectors like healthcare, logistics, legal services, and government are closely watching. The same automation principles could apply to report generation, compliance reviews, risk modeling, and beyond.

Bryan Muehlberger, CIO at Lumiy, and former CIO/CTO at Vuori, Red Bull, and Beachbody, posted about Goldman Sachs' addition of Devin on LinkedIn, asking this of his community of peers:

"1. As AI takes on more technical tasks, where do you believe human creativity, ethics, and strategic insight will remain irreplaceable in software development? 2. What is the role of the CIO? Are they the new head of 'human' resources? (replace 'human' with 'AI'.) 3. What is the role of the CHRO/CPO as AI Agents begin taking on the tasks, previously performed by humans?"

For cybersecurity professionals, there are a few key takeaways:

Prepare for AI-generated code security audits: Build frameworks to review not only the outcome but the decision pathway and prompt usage.
Incorporate adversarial testing: Treat AI agents as potential threat vectors; test for malicious prompt injections, hallucinations, and runtime failures.
Evolve best practices: Train teams in secure prompt engineering, guardrails, and how to vet AI outputs with the same diligence used for human code reviews.
Advocate regulatory alignment: Push for standards around AI audits, transparency, and traceability. Finance is already under heavy regulatory scrutiny.

Goldman Sachs' pilot with Devin is more than a tech experiment—it's a clarion call. It tells us agentic AI is ready for institutional-scale deployment, and that a hybrid workforce—where humans set the course and AI charts the path—is becoming the new frontier.

View full post