📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

The latest research emphasizes that AI models constitute only about 10% of effective AI systems. The majority of performance depends on the harness, context, and verification processes, shifting strategic focus for developers and organizations.

A new Google whitepaper titled “The New SDLC With Vibe Coding” highlights a fundamental shift in software engineering: the AI model accounts for only about 10% of system behavior, while the harness, context, and verification comprise the remaining 90%. This insight challenges conventional focus on model advancements alone and underscores the importance of configuration and strategic design in AI deployment.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the dominant factor in AI system performance is how the model is integrated and controlled. Experiments demonstrate that tweaking the harness—prompts, tools, rules—can dramatically improve outcomes, even when using the same underlying model. For example, a coding agent improved its ranking from outside the Top 30 to Top 5 solely through harness adjustments.

The authors argue that cost and effectiveness in AI development hinge on configuration, context engineering, and verification. They emphasize that the actual AI model is a small part of the equation, with the majority of the system’s success depending on how it is structured, monitored, and guided. This shift has profound implications for organizations investing heavily in model development, suggesting a need to reallocate focus toward system design and maintenance.

At a glance
reportWhen: published March 2026
The developmentA Google whitepaper introduces a paradigm shift in software engineering, highlighting that the AI model itself is only a small part of the system, with the majority of value in configuration and verification.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Implications for AI Development Strategies

This revelation redefines how organizations should approach AI integration. Instead of prioritizing the latest model releases, companies are encouraged to invest in building robust harnesses, context management, and verification processes. These elements determine system performance, cost-efficiency, and security. The insight suggests that long-term competitive advantage lies in mastering configuration and system architecture, not just model acquisition.

Furthermore, by focusing on system design, organizations can reduce costs—since ad-hoc prompting and vibe coding are shown to be more expensive over time—while increasing reliability and security. This strategic pivot could influence AI procurement, development, and operational practices across industries.

LEAN PROGRAMMING FOR FORMAL SOFTWARE VERIFICATION: Mathematical proof systems and logical frameworks for verified computation

LEAN PROGRAMMING FOR FORMAL SOFTWARE VERIFICATION: Mathematical proof systems and logical frameworks for verified computation

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Evolution of AI System Design and The SDLC Shift

The whitepaper builds on recent trends where AI adoption skyrocketed, with 85% of developers now using AI coding agents regularly, and over 41% generating AI-based code. Traditionally, the focus has been on improving model capabilities, but recent experiments and benchmarks reveal that configuration and context management are more impactful than model improvements alone.

This perspective aligns with ongoing industry discussions about the limitations of chasing model size and raw performance, and highlights a move toward systematic engineering practices akin to traditional software development, but adapted for AI systems. The paper situates this as the most significant evolution in SDLC (Software Development Life Cycle) since the rise of cloud computing and automation.

“The model is only about 10% of what determines behavior; the harness and context are the other 90%.”

— Addy Osmani

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unclear Aspects of Implementation and Industry Adoption

While the whitepaper presents compelling evidence, it is still unclear how quickly organizations will adapt to this paradigm shift and reorient their development practices. The long-term impact on AI costs, security, and performance across different industries remains to be fully observed. Additionally, the precise methods for best practices in harness design and context engineering are still evolving and lack standardized benchmarks.

YAML Made Simple: A Beginner’s Guide to Configuration and Data Structuring

YAML Made Simple: A Beginner’s Guide to Configuration and Data Structuring

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for Organizations and Developers

Organizations are expected to reevaluate their AI strategies, emphasizing system configuration, verification, and context management. Future developments may include standardized tools and frameworks for harnessing AI models more effectively. Industry leaders will likely publish best practices, and further research will aim to quantify cost savings and performance improvements from this approach. Monitoring how this shift influences AI security and maintenance will be key in the coming months.

AI-Powered Software Testing: Volume 1: Foundational Patterns and Principles for Architects and Technical Leads

AI-Powered Software Testing: Volume 1: Foundational Patterns and Principles for Architects and Technical Leads

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system’s effectiveness?

According to the whitepaper, the model itself accounts for a small part of the overall system behavior. The majority depends on how the model is integrated, configured, and verified through harnesses, context, and rules.

How does this shift affect AI development costs?

Focusing on configuration and verification can reduce long-term costs by decreasing token wastage, improving reliability, and minimizing security vulnerabilities, despite higher upfront investment in system design.

What practical steps should organizations take now?

Organizations should invest in building robust harnesses, improving context management, and establishing verification processes. Training teams in system architecture and configuration will be essential.

Does this mean model improvements are no longer important?

Model improvements remain valuable but are now seen as only one part of a broader system. The whitepaper emphasizes that system design and configuration have a greater impact on performance and cost-efficiency.

Will this change how AI tools are marketed?

Yes, vendors may shift focus from promoting model size and raw capabilities to highlighting system integration, configuration options, and verification tools.

Source: ThorstenMeyerAI.com

You May Also Like

Wireless Inductive Charging: Advances and Real‑World Deployments

Optimized for convenience and efficiency, wireless inductive charging advancements are transforming everyday device use—discover how these innovations are shaping our world.

One Video In, a Whole Publishing Kit Out — Without the Cloud

Discover how to turn a single video into a complete publishing package offline. Save time, control your assets, and avoid cloud dependence with this local-first workflow.

One upload in. A whole channel’s worth of content out.

ChannelHelm’s v1.5 release enables creators to convert a single video into all necessary content, learning and optimizing over time.

Battery Swapping Networks: Case of Mitsubishi Fuso and Ample Partnership

Spearheading innovative fleet solutions, the Mitsubishi Fuso and Ample partnership transforms electric vehicle operations—discover how their battery swapping network is reshaping transportation.