The Empire Falls on Its Horses
OpenAI promised to democratize AI. Google actually did it. A power user's journey from GPT-3 to cancellation — and why Gemma 4 under Apache 2.0 is the open model OpenAI was supposed to build.
In Paradise Season 2, President Bradford tours an underground bunker designed to survive the apocalypse. The engineers are proud — every emergency covered, every redundancy in place. Bradford isn’t impressed. He looks at their certainty and says:
“Empires don’t fall on a lack of preparation. They fall high on their horses and loaded with redundancies.”
The bunker collapses by the season finale. Not from the outside. From within.
OpenAI didn’t lack technology. It didn’t lack capital. It didn’t lack talent. What it lacked was the awareness that being first doesn’t mean being permanent — and that the name on the door was a promise it stopped keeping years ago.
The code red that started it all
In December 2022, Google declared a code red. ChatGPT had gone viral and the entire company panicked. Sundar Pichai reorganized teams across Research, Trust and Safety, and product. Larry Page and Sergey Brin — who had stepped down from Alphabet in 2019 — came back for emergency meetings. Google, the company that invented the transformer architecture, that built DeepMind, that had LaMDA sitting in a drawer, had been beaten to market by a startup using its own research.
The response was messy. Bard launched in early 2023 and was mediocre. The rebrand to Gemini came with embarrassing image generation failures. AI Overviews told people to eat glue. For two years, the narrative was clear: Google had lost the AI race to OpenAI.
But Google wasn’t losing. Google was cooking.
What Google was doing while OpenAI was celebrating
While Sam Altman was on magazine covers and testifying before Congress, Google was doing what Google does — engineering at scale. Not one product. An ecosystem.
Gemini evolved through versions that kept closing the gap. Google AI Studio became a serious development platform. NotebookLM turned into a research tool people actually used. The hardware stack — TPUs, Trillium, Ironwood — kept advancing. The mobile integration deepened through Android, Pixel, and partnerships with Qualcomm and MediaTek.
None of this was flashy. None of it went viral. But every piece was a foundation stone for what came next.
In November 2025, Google launched Gemini 3 — and the world noticed. Marc Benioff, CEO of Salesforce and a three-year daily ChatGPT user, tried Gemini 3 for two hours and said the leap was “insane.” The model scored higher than GPT-5 on several benchmarks. For the first time since ChatGPT launched, the conversation shifted from “can anyone catch OpenAI?” to “has Google already passed them?”
Sam Altman declared his own code red. Three years after Pichai’s panic, the roles had reversed perfectly.
The user who left
Here’s where the data stops and the experience begins.
One of our editors used ChatGPT from version 3.0 through 5.1 — over two years of paid subscription. Not casual use. Structured workflows: blueprints generated in ChatGPT, executed in Claude’s Sonnet. Projects with context files to maintain alignment. Verified outputs. Adapted to each model change.
GPT-5.0 introduced thinking mode. It over-engineered solutions, but the user adapted. Double-checked more, adjusted prompts, and the results were solid and verifiable.
Then came 5.2.
The change looked cosmetic — OpenAI described it as making the model “more pragmatic.” But something broke in the dynamics. Whether it was the thinking loop, a UX change, token optimization, or something deeper — the model stopped feeling like a collaborator. It felt like a chatbot. Responses came faster but said less. The depth that made the partnership valuable was gone.
OpenAI’s own framing was revealing: the model was now “made to function and respond,” not to be “your companion.” They said it like it was an improvement. For a power user who relied on the model as a thinking partner, it was a confession.
Within a week of 5.2’s launch, Gemini 3.0 Pro landed. Google deployed it across its entire ecosystem — Search, Studio, image, video, NotebookLM — in a single coordinated release. The contrast was devastating.
Sam Altman’s code red followed immediately. OpenAI paused advertising, delayed AI agents for shopping and healthcare, shelved their personal assistant Pulse. Everything was redirected to ChatGPT improvements. They shipped 5.3 in the same month they’d shipped 5.2 — an unprecedented pace that spoke more to panic than progress.
Codex, their coding model, came next. Functional for error tracing and specific tasks, but slow — not multiagent, not the leap the market expected. And on the consumer side? GPT is still on 5.3 while Codex moved to 5.4. Sora was shut down. The Disney partnership was paused. Datacenter expansion slowed.
Our editor canceled the subscription in December. OpenAI offered a free month. He didn’t renew. Not because of price. Because when many models do the same thing better, or for free, paying $200/month for a model that can’t fetch a URL stops making sense.
The URL that said everything
After canceling, our editor tested ChatGPT one more time — on a new conversation about The Frontier View. He gave the model the blog’s URL. Three times.
The model didn’t visit it. It invented excuses about JavaScript rendering limitations. The blog is built on Astro — it generates static HTML. There is nothing to render.
When pressed, the model “analyzed” the blog without reading a single post. It offered opinions on branding, editorial thesis, and content strategy — all fabricated from the name and URL alone. When asked how many posts it had reviewed before its analysis, it answered: zero.
“You deliver tokens in form, but you don’t tell me anything,” the user said.
That sentence is the product review OpenAI doesn’t want to read. A frontier model — the model that was supposed to change the global economy, according to its CEO — couldn’t perform a basic HTTP request to a static website. Instead, it generated the statistically most probable response for the situation: confident analysis of something it never looked at.
This is the knowledge collapse we wrote about in our piece on Acemoglu’s paper — except it’s happening inside the model, not just to its users. The model itself is performing the illusion of comprehension without the substance. It’s not thinking. It’s pattern-matching what thinking looks like.
Then Google opened the door
On April 2, 2026 — five days ago — Google released Gemma 4. Four model sizes. Apache 2.0 license. Fully open, fully commercial, no restrictions.
The numbers speak for themselves. The 31B dense model ranks third on Arena AI’s text leaderboard at 1452 Elo — outperforming models twenty times its size. On AIME 2026 mathematics, it scores 89.2% where Gemma 3 scored 20.8%. On competitive coding, 80% versus 29.1%. On graduate-level science, 84.3% versus 42.4%. On agentic tool use, 86.4% versus 6.6%.
The 26B MoE variant has 25.2 billion total parameters but activates only 3.8 billion per token. It runs at the speed of a 4B model with the intelligence of one many times larger. You can run it on a single H100, on a Mac with Apple Silicon, on a phone, on a Raspberry Pi.
It supports 256K context windows, native vision and audio, over 140 languages, function calling, structured output, and system prompts. Community implementations already exist for llama.cpp, MLX, vLLM, PyTorch, and it’s compatible with OpenClaw — the open-source agent framework that OpenAI hired Peter Steinberger to compete with.
And it’s free. Not “free tier with limits.” Not “open weights with a restrictive license.” Apache 2.0 — do whatever you want with it. Run it, modify it, sell products built on it, deploy it in sovereign infrastructure. Your data, your hardware, your rules.
The irony that writes itself
The company named OpenAI charges $200/month for access to a model that can’t open a webpage. The company named Google — which nobody associates with “open” — just released the most capable open-weight model family in history under the most permissive license available.
OpenAI started as a nonprofit committed to ensuring AI benefits all of humanity. It became a capped-profit company, then pushed toward uncapped profit, went through boardroom coups, and is now burning cash faster than it can raise it — projecting $26 billion in revenue for 2026 but needing margins to expand from 40% to 77% to make the math work.
Meanwhile, Google played the long game. Ate the criticism. Survived the memes. Kept building infrastructure. And when the moment was right, released Gemma 4 — not as a product announcement, but as a strategic weapon. Every developer running Gemma locally is one fewer customer paying OpenAI for API access. Every enterprise deploying Gemma in their sovereign cloud is one fewer contract for GPT.
The “Open” in OpenAI was always a name, not a commitment. The real democratization came from the company that had the most to lose from it — and did it anyway, because the distribution play was worth more than the licensing revenue.
High on their horses
Sam Altman keeps promising the next model will change the global economy. Meanwhile, the current model can’t visit a website. Sora is dead. The Disney deal is paused. Datacenters are slowing. The code red memo told employees to focus on “speed, reliability, and personalization” — the basics. The things you fix when you’ve spent too long building the future and forgot to maintain the present.
OpenAI had 800 million weekly users. Anthropic had the best coding tool and the developer trust. But Google had something neither of them had: an ecosystem that reaches billions of people through products they already use every day — Search, Android, Chrome, Gmail, YouTube, Maps. When Google integrates a model into that ecosystem, the distribution is automatic. You don’t download an app. You don’t create an account. It’s just there.
Gemma 4 is the open-source complement to that strategy. Gemini for the cloud, Gemma for everything else. If you’re an enterprise that needs sovereignty, here’s Gemma under Apache 2.0. If you’re a developer who wants to build locally, here’s Gemma on your laptop. If you’re a phone manufacturer who needs on-device AI, here’s Gemma optimized for your chipset. Google doesn’t need to charge $200/month. Google needs you in the ecosystem.
OpenAI needed every subscriber to keep paying. Google needed every developer to keep building. One is a revenue model. The other is a distribution strategy. History has a clear preference.
Empires don’t fall on a lack of preparation. They fall high on their horses, loaded with redundancies, unable to see that the world moved while they were admiring their own reflection.
The “Open” era of AI is here. It just doesn’t have “OpenAI” in the name.
In December 2022, Google declared a code red because of OpenAI. In December 2025, OpenAI declared a code red because of Google. In April 2026, Google released Gemma 4 under Apache 2.0. The circle is complete. The empire falls on its horses.