Minimax M3 + Hermes Agent is Insane (FREE!)

**Julian Goldie** (0:00)
MiniMax M3 just dropped out of China. When you plug it into Hermes Agent, you get a free AI worker that can run on your computer for 12 hours straight. We're talking about an AI that can open up your app, schedule tasks, search your web, send reports, all on its own, using this powerful agentic model that literally just dropped. And in this video, I'm gonna show you how to set up, test it live, a real task, and this one thing you absolutely have to do to unlock the full power of this. Plus, I'll show you something really cool at the end, which is a cloud-hosted version of this you can access from anywhere. Let's get into it. So this is MiniMax, it's just come out from China. MiniMax M3, brand new coding and agentic model. And we can check out the benchmarks here as an example. So you can see here that it's performing pretty well on benchmarks. And what you can actually do is plug it into Hermes using these commands. And you can actually test out for free, using these commands too. So the way that you can get started with this, if you want to plug MiniMax M3 directly into Hermes Agent, is we can use the commands right here. So the first thing that we need to do is make sure that we have Hermes Agent installed. Make sure that you have Ollama downloaded as well. And once you've done that, make sure you have Ollama running in the background, like you can see. Now we can run this simple command. So Ollama launch Hermes and then pick the model, which is MiniMax M3, of course. You can run that inside a terminal and boom shakalaka. We now have Hermes Agent connected to MiniMax M3 Cloud. Perfect. Now, if you're on the free plan on Ollama, obviously there's token limits. Most people won't hit those. But also the cool thing about this is in one single click, you can also use MiniMax M3 inside Claw Code, inside Codex app, inside OpenClaw, Codex, OpenCode, et cetera. So what you can also see here is this 500,000 token context window and the input is text and image as well. Now, if we go into my Agent OS, you can see that it's automatically synced to MiniMax M3 Cloud, which is perfect. So if we go and test this out now, we can try it over here. And just for a quick agentic task, we'll say go to chatgpt.com and let's see how that goes. Another test I've run here is open up the Notes app locally and say hello. And you can see it says Note Created and the Notes app is now in the background. We've the note typed in. Here's what it did. And then if we go over to our notes, we can see hello from Hermes, right? So it can talk to us through the computer. It can navigate to local files. It can open up apps and also it can do that in the background whilst we're talking to it directly, which is perfect. I say schedule afternoon tea at 3 p.m. Daily just see how it responds to more agentic tasks, to tasks running on a schedule. The one thing that I'll say here is if you are using it through Ollama, it can be a little bit slow because it's running through the cloud models, right? If you want a faster version, what I'd actually recommend is you either use open router or even better, you can actually use M3 directly with the coding plan and then use that to use Hermes, right? And the good thing about that is once you signed up to the coding plan, then you don't need to use APIs because you can use OAuth to log in to MiniMax directly. Now also something that's interesting here, and 99% of people don't even know about this, but you can actually get a cloud hosted version of Hermes called Max Hermes. And since this release of MiniMax M3, it's now powered by M3 as well. So if you're using Max Hermes in the cloud, you can deploy it in one single click, and then you can use Hermes and also MaxClaw, which is a version of OpenClaw in the cloud too.
So if you wanted to deploy this, you can just click on start now, login, and then you just need to make sure you have the plan like so. So this is what it looks like once it's deployed. Honestly, for me personally, you could use a terminal. Do you have the full conversation history? No. You could use Max Hermes in the cloud, but it still doesn't look that nice. I personally prefer using my Agent OS system. It's a lot nicer. It's a lot easier to navigate and the UI is just beautiful.

Feed this to your agent

Try it now — copy, paste, done:

curl -H "x-api-key: pt_demo" \
  https://spoken.md/transcripts/1000651996090

Works with Claude, ChatGPT, Cursor, and any agent that makes HTTP calls.

From $0.10 per transcript. No subscription. Credits never expire.

Using your own key:

curl -H "x-api-key: YOUR_KEY" \
  https://spoken.md/transcripts/1000770860040