**Frank Wang** (0:00)
You know, when you get into a car, even if it's a car you have literally never driven before, you just instantly know how to use it.
**SPEAKER_2** (0:07)
Right. Yeah. Because the setup is always basically the same.
**Frank Wang** (0:10)
Exactly. You have a steering wheel which, you know, affords turning, you have pedals that afford pressing.
Industrial designers call these affordances. The physical form of the interface tells your brain exactly what the machine can do before you even turn the key.
**SPEAKER_2** (0:25)
It's a completely established paradigm. The form follows function and we just parse it in a fraction of a second.
**Frank Wang** (0:32)
But then if you look at how we've been interacting with artificial intelligence over the last couple of years, suddenly that steering wheel is just, it's entirely gone.
**SPEAKER_2** (0:40)
Oh, completely. It's just a blank box.
**Frank Wang** (0:42)
Right. The prompt box offers absolutely zero affordances. Yeah.
It's like we've been handed the keys to this incredible Formula One engine, but the only way to actually drive it is by passing it tiny handwritten notes on little slips of paper under a locked door.
**SPEAKER_2** (0:58)
That is a very accurate, if slightly terrifying, way to describe it.
**Frank Wang** (1:03)
And today, we're putting on our Design Leadership Hats to unpack a highly relevant executive briefing on what the industry is calling the agentic shift. Yeah.
**SPEAKER_2** (1:12)
This is a massive topic right now.
**Frank Wang** (1:13)
It really is. The goal of today's deep dive is to look at how recent AI breakthroughs are forcing a fundamental ground up pivot in product design.
Because if you read the news, the tech conversation is so skewed toward like model architecture.
**SPEAKER_2** (1:29)
For sure. It's all about parameters and training runs and raw compute power.
**Frank Wang** (1:33)
Exactly.
But this briefing highlights that the real battlefield, the place where this actually matters for you, the listener, has shifted completely away from the backend engineering. The technology has evolved from just answering our questions to actually doing autonomous work.
**SPEAKER_2** (1:49)
The user interface simply has not caught up.
**Frank Wang** (1:51)
No, not at all. The data highlighting this shift is staggering.
Specifically, there's this index from Vercel's AI Gateway. For those who aren't super deep in the engineering weeds, Vercel is a massive platform that powers a huge chunk of the modern web's front-end infrastructure.
**SPEAKER_2** (2:06)
They basically act as the router for millions of AI requests every single day.
**Frank Wang** (2:12)
Right. And the recent index shows that these agentic workloads have doubled in just the last six months. In fact, these agents now dominate total token consumption across the web, hitting 59%.
**SPEAKER_2** (2:24)
Which is wild, and we should probably clarify what that 59% actually represents, because token consumption is one of those metrics that usually stays totally siloed in AI engineering departments.
**Frank Wang** (2:35)
Yeah, let's break that down.
**SPEAKER_2** (2:36)
So a token is basically the fundamental unit of data for a large language model. Think of it as like a chunk of a word. It's the core currency of compute.
If an AI is just generating a quick poem for a user or summarizing a single email, it uses a pretty small handful of tokens. It reads the prompt, it spits out the answer, and it's done.
**Frank Wang** (2:58)
But an agetic workload is totally different. An agent isn't just answering a question, it's executing a multi-step plan.
**SPEAKER_2** (3:04)
Yeah, exactly. It is entering a self-reflection loop. The agent gets a goal, and then it starts talking to itself.
**Frank Wang** (3:10)
Having an internal monologue.
**SPEAKER_2** (3:11)
Basically, yeah. It writes a plan, it queries an external tool, reads the result, realizes it made an error, rewrites the code, and then tries again.
That entire internal monologue consumes thousands upon thousands of tokens.
**Frank Wang** (3:25)
Wow.
**SPEAKER_2** (3:26)
So when Vercell says agents are hitting 59 percent of total token consumption, it means the vast majority of AI processing power right now is not interacting directly with a human user.
**Frank Wang** (3:36)
It's just AI doing deep iterative work in the background.
**SPEAKER_2** (3:40)
Exactly.
**Frank Wang** (3:41)
So if the majority of the compute is agents talking to themselves, then forcing a user to just sit there and watch a chat transcript of that internal monologue is totally broken.
**SPEAKER_2** (3:51)
It makes no sense from a user experience perspective.
**Frank Wang** (3:53)
This is the immediate design implication of that tipping point. The standard chatbot UI is rapidly becoming a legacy pattern. The era of the blank text box is dying. I actually look at the blank AI text box like the old MS-DOS command prompt from the 1990s.
15 more minutes of transcript below
Try it now — copy, paste, done:
curl -H "x-api-key: pt_demo" \
https://spoken.md/transcripts/1000651996090
Works with Claude, ChatGPT, Cursor, and any agent that makes HTTP calls.
From $0.10 per transcript. No subscription. Credits never expire.
Using your own key:
curl -H "x-api-key: YOUR_KEY" \
https://spoken.md/transcripts/1000768118285