Navigating GPT 5.5 Features artwork

Navigating GPT 5.5 Features

Latent Space AI

May 6, 2026

In this episode, we explore the latest updates from OpenAI, including the release of GPT 5.5 and their new self-serve ads manager for U.S. advertisers.
**SPEAKER_1** (0:00)
OpenAI has dropped GPT 5.5 instant. They also have self-serve ads manager that OpenAI has been rolling out. Greg Brockman was in court admitting that they're gonna spend $50 billion on compute this year. This all happened in the span of about six hours. Before that, we had Apple that is finally going to open Siri up to Claude and Gemini and let you basically pick any model you want to run inside of there. There's also some other interesting Apple news. They're being fined $250 million for anyone that bought a iPhone 15 Pro or iPhone 16 because of their marketing campaign around Apple Intelligence. They're gonna have to pay $100 back to anyone for basically advertising since they missed their deadline. There's also a researcher that caught Chrome silently downloading a four gigabyte Gemini model onto your computer without asking. And so much more. We're gonna get into all of it on the podcast. Kicking this off, let's talk about Apple. They are finally cracking Siri Open. Mark Gurman, he first was reporting on this whole framework back in March. But there is a new report that came from 925 Mac that is confirming that this is actually going to ship at WWDC on June 8th. This is gonna be part of iOS 27 and iPad OS 27 and Mac OS 27 So all of the latest updates are gonna be coming out in June. And the mechanism is basically called extensions. What this means is that you're gonna be able to go into settings in Apple Intelligence and Siri and they're gonna pick what model handles what. So you can plug Gemini in, you can plug ChatGPT in, you can plug Clot in. And what's interesting is you can actually set custom voices depending on which model is responding. The customization is going to be nice and hopefully a good enough feature to apologize to everyone for why this has all taken so long, especially at the same moment that this lawsuit has come in, where basically they're gonna have to pay a hundred dollars back to everyone that bought an iPhone 15 Pro and an iPhone 16 because Apple Intelligence didn't ship when those were launched and the marketing all basically pointed to it shipping. Okay, let's talk about what's going on with Chrome. There was a security researcher named Alexander Hanf. He's a computer scientist and a lawyer based in the EU. And he caught Google Chrome downloading a four gigabyte Gemini Nano model onto the user device. There was no consent prompt. There was like, it didn't ask you, it just downloads this four gigabyte model onto your computer. The file is called weights.bin and it lives in a folder called opt guide on device model. Apparently, according to Hanf, he verified the install by reading macOS kernel file system logs. There's a bunch of reasons why this is problematic. According to this particular researcher Hanf, he says that this violates the ePrivacy directive article 5.3, which prohibits storing data on a user's equipment without prior consent, plus GDPR article 5 and 25
So, you know, this isn't really super great to give 4 gigabytes onto, you know, a billion plus Chrome users' computers without any sort of, you know, telling them that they could do this. And the precedent isn't great if, you know, every AI company could start downloading models onto your computer and say it's just, you know, part of their software upgrade. I mean, these models, the nano one, even their 4 gigs, these things just get bigger and bigger. So precedent, not good on this. And speaking of multiple different AI models, if you're already paying for Chat GPT and Cloud Pro and Gemini and Grok and maybe 11 Labs and all of these different audio image AI tools out there, I'd love to tell you about my software company called AI Box.AI. We have one platform with 80 different AI models on there, everything from the top AI Labs, and it is 8.99 a month. So you get access to all of the top image, audio, video, music models in one place for one subscription. You don't have to have tons of different accounts and, most importantly, you don't have to have $20 subscriptions on 20 different accounts. They cost you a ton of money. So hope this saves you a lot of money. It is AI Box.AI. It's how I recommend people get access to all of the different AI models, use them all, test them all, try them all. I love using Claude for writing and tons of different tasks, but it doesn't have an image generator. So I got to go over to ChaiGBT to do that. And then that doesn't have, you know, an audio MP3 file generator. So I go over to 11 Labs for that. Anyways, all of those in one platform.

8 more minutes of transcript below

Feed this to your agent

Try it now — copy, paste, done:

curl -H "x-api-key: pt_demo" \
  https://spoken.md/transcripts/1000766510529

Works with Claude, ChatGPT, Cursor, and any agent that makes HTTP calls.

From $0.10 per transcript. No subscription. Credits never expire.

Using your own key:

curl -H "x-api-key: YOUR_KEY" \
  https://spoken.md/transcripts/1000766510529