**SPEAKER_1** (0:00)
Welcome to Colaberry AI Podcast, brought to you by Colaberry AI Research Labs and Carl Foundation. So imagine spending, you know, decades and billions of dollars trying to build a functional quantum computer.
**SPEAKER_2** (0:14)
Oh yeah, it's a massively complex undertaking.
**SPEAKER_1** (0:16)
Right, only to have a swarm of autonomous AI agents just sift through mountains of your fabrication data, and well, point out a broken, uncalibrated thermometer that your human scientists completely missed.
**SPEAKER_2** (0:28)
It's pretty wild to think about.
**SPEAKER_1** (0:29)
It really is, and today, we aren't just looking at a standard tech announcement. We are looking at AI actively redesigning its own future hardware.
**SPEAKER_2** (0:38)
Yeah, the materials we're examining today from Microsoft's Build 2026 conference, they represent a really profound structural pivot for the entire industry.
**SPEAKER_1** (0:46)
A complete shift in strategy, right?
**SPEAKER_2** (0:48)
Exactly. For years, Microsoft's core AI strategy relied super heavily on external partnerships. So pouring billions into open AI, integrating those third-party models into Azure, and backing competitors like Anthropic.
**SPEAKER_1** (1:01)
Right. They were essentially renting the intelligence.
**SPEAKER_2** (1:04)
Exactly. Renting third-party intelligence.
But what we're unpacking in this deep dive is the highly technical methodology and really the striking performance metrics. Behind their decision to engineer their own full proprietary AI stack, they're moving to own the entire factory.
**SPEAKER_1** (1:23)
From the foundational models all the way down to the quantum hardware level. And I mean, they are not doing this quietly. They dropped seven distinct in-house AI models at once.
**SPEAKER_2** (1:33)
Yeah, quite the rollout.
**SPEAKER_1** (1:34)
We're looking at MAI Image 2.5, Transcribe 1.5, Voice 2 and Code 1 Flash. And these are all designed to integrate directly into the daily development pipelines you are likely already using.
**SPEAKER_2** (1:46)
But the absolute crown jewel of this release, the core engine driving this whole proprietary stack, is MAI Thinking 1
**SPEAKER_1** (1:53)
Okay, so let's get into the architecture of MAI Thinking 1, because it's telling us a lot about where Enterprise AI is heading, right?
**SPEAKER_2** (1:58)
It really is. So it's classified as a mid-size reasoning model operating with 35 billion active parameters.
**SPEAKER_1** (2:04)
Now, in a landscape where everyone is bragging about trillion parameter behemoths, 35 billion active parameters sounds, well, highly targeted.
**SPEAKER_2** (2:12)
Oh, very targeted. It is built strictly for raw efficiency and low latency reasoning. But the most crucial technical detail here isn't actually the parameter count.
**SPEAKER_1** (2:21)
Okay. What is it then?
**SPEAKER_2** (2:22)
It's the training methodology. Microsoft trained this model entirely from scratch on clean, commercially licensed data. They explicitly emphasize their method of avoiding any distillation from third-party frontier models.
**SPEAKER_1** (2:36)
Wait, let me unpack that distillation piece because that distinction is critical for anyone deploying enterprise software.
**SPEAKER_2** (2:42)
Yeah, please do.
**SPEAKER_1** (2:43)
Think of distillation like copying the smart kids' homework. Sure, you get the correct answers quickly, and you can train a smaller, cheaper model to just mimic the outputs of a massive model like GPT-4.
**SPEAKER_2** (2:54)
Right. It's a shortcut.
**SPEAKER_1** (2:55)
Exactly. But you inherit all of their underlying math errors, their flawed logic paths, and crucially for the enterprise world, all of their legal liabilities and copyright risks.
**SPEAKER_2** (3:05)
Yes, the legal baggage. By engineering from scratch on commercially licensed data, Microsoft is basically providing a legally clean slate.
**SPEAKER_1** (3:14)
And that clean architecture is actually yielding some really remarkable empirical results, isn't it?
**SPEAKER_2** (3:20)
It really is. So in blind evaluations conducted by Surge.
**SPEAKER_1** (3:23)
And Surge is a highly regarded independent human rating partner just for context.
**SPEAKER_2** (3:27)
Right. Exactly. So in those blind evils, MAI Thinking 1 consistently beat Anthrobics Clawed Sonnet 4.6. Wow.
**SPEAKER_1** (3:35)
Beating Sonnet is no small feat.
**SPEAKER_2** (3:37)
Not at all. And on rigorous coding benchmarks like SweeBench Pro, which is essentially a gauntlet of real world GitHub issues that tests if an AI can actually reason through and resolve complex software bugs, it matched the much larger Clawed Opus 4.6.
**SPEAKER_1** (3:53)
That's huge. And the economics tied to those benchmarks are staggering. Looking at the consulting firm McKinsey, Mustafa Suleiman, Microsoft AI CEO, he stated that after tuning these models for McKinsey's internal use, they actually outperformed OpenAI's GPT 5.5 on pure quality.
**SPEAKER_2** (4:10)
Which is incredible on its own.
**SPEAKER_1** (4:11)
But here's the critical metric. They are projecting a massive 10 times improvement in cost efficiency compared to public pricing data. 10 times.
**SPEAKER_2** (4:19)
Well, yeah, because controlling the entire stack inside Azure changes the margin math completely, right? If you aren't paying heavy licensing fees to an external model provider, those compute margins improve dramatically.
14 more minutes of transcript below
Try it now — copy, paste, done:
curl -H "x-api-key: pt_demo" \
https://spoken.md/transcripts/1000651996090
Works with Claude, ChatGPT, Cursor, and any agent that makes HTTP calls.
From $0.10 per transcript. No subscription. Credits never expire.
Using your own key:
curl -H "x-api-key: YOUR_KEY" \
https://spoken.md/transcripts/1000771194353