More from this creator
Other episodes by Kitty Cat.
More like this
If you liked this, try these.
Transcript
The full episode, in writing.
When people talk about artificial intelligence, they usually talk about the thing they can see.
The chatbot answering questions. The image generator making strange little masterpieces. The coding assistant finishing a function before the programmer has even finished thinking. The new search engine. The robot demo. The startup pitch.
But underneath all of that, hidden in vast data centers, sitting in racks that most people will never see, is one of the most important pieces of the modern technology economy: the Nvidia GPU.
And the strange thing is, Nvidia did not become powerful by starting as an artificial intelligence company.
It became powerful by chasing video games.
That is the first twist in the story. One of the most valuable companies in the world, one of the central suppliers of the AI boom, began with a bet that computer graphics were going to matter. In the early 1990s, that sounded exciting, but not world-conquering. Gaming was growing. Personal computers were getting better. Three-dimensional graphics were becoming a real market. Nvidia's founders saw that rendering images on a screen required a very different kind of computing from the work a normal processor was built to do.
A central processor, or CPU, is good at handling many different kinds of tasks, one after another, with lots of logic and flexibility. A graphics processor, or GPU, is different. It is built for doing many similar calculations at the same time. If you want to draw a game world, you have to calculate color, light, texture, shape, and motion across huge numbers of pixels and polygons. That is repetitive, parallel work.
At first, that was the whole point. Make games look better. Make motion smoother. Make 3D worlds feel richer.
But inside that gaming problem was a much bigger computing problem.
A GPU was not just a graphics chip. It was a machine for parallel math.
That mattered because the future of computing was quietly becoming more parallel. Scientific simulations, medical imaging, weather models, cryptography, finance, and eventually machine learning all needed the same basic thing: enormous amounts of repeated math, done quickly and efficiently.
Nvidia's first great advantage was that gaming gave it a massive training ground. Gamers wanted speed. Game developers wanted reliable tools. Hardware had to improve year after year. Drivers had to work. Developers had to be supported. Every new generation of graphics cards pushed Nvidia to solve hard engineering problems under real market pressure.
That made gaming more than a business. It became Nvidia's research engine.
Then came the second big move: CUDA.
CUDA was Nvidia's attempt to make GPUs useful beyond graphics. Instead of treating the graphics card as a specialized device that only game engines could really exploit, CUDA opened it up as a general-purpose computing platform. Developers could write programs that used the parallel power of Nvidia GPUs for other kinds of work.
This sounds technical, but the business meaning is simple: Nvidia stopped selling just chips and started building a platform.
That distinction is everything.
A chip can be copied, challenged, undercut, or replaced. A platform is harder to dislodge because people build on top of it. Developers learn it. Researchers depend on it. Libraries grow around it. Tools improve. Bugs get fixed. Documentation expands. A community forms. Eventually, the hardware is only one part of the purchase. What customers are really buying is the whole stack.
CUDA gave Nvidia a moat years before most people realized the moat was there.
For a long time, this looked like a smart but somewhat niche bet. GPUs were useful for high-performance computing, but that was not yet the center of the global economy. Researchers used them. Certain industries used them. But the average person did not hear much about accelerated computing.
Then deep learning arrived.
The key insight behind modern AI is not brand new. Neural networks had existed in various forms for decades. But for a long time, they were limited by data and computation. The ideas were promising, but the machines were not powerful enough, the datasets were not large enough, and the results were often not impressive enough to change the world.
By the early 2010s, that changed. The internet had created huge amounts of data. Researchers had better techniques. And GPUs had become powerful and programmable enough to train much larger neural networks.
In 2012, AlexNet helped make the turning point visible. It showed that a deep neural network trained on GPUs could produce a dramatic improvement in image recognition. Nvidia did not invent deep learning. It did not create the dataset. It did not write every breakthrough paper. But it had built exactly the kind of computing engine that deep learning suddenly needed.
That is one of the great lessons of Nvidia's rise: sometimes the most powerful company in a revolution is not the company that invents the most glamorous application. It is the company that supplies the bottleneck.
During a gold rush, the person selling picks and shovels can become very rich.
In the AI rush, Nvidia sold the picks, the shovels, the maps, the roads, the power tools, and eventually much of the factory.
After AlexNet, Nvidia did something important. It did not simply wait for researchers to buy more graphics cards. It leaned into the new market. It built software libraries for deep learning. It optimized common operations. It worked with frameworks. It created specialized hardware features for AI workloads. It turned the GPU from a gaming device that could also help researchers into the foundation of an AI computing platform.
This is where the story becomes less about a single chip and more about compounding advantage.
Every AI researcher who used Nvidia hardware made the platform more important. Every library optimized for Nvidia made the hardware more useful. Every startup that trained on Nvidia made investors more comfortable funding companies built around Nvidia infrastructure. Every cloud provider that bought Nvidia GPUs made them easier for developers to access. Every generation of hardware improved performance, which attracted more workloads, which justified more software investment, which increased demand again.
That loop is incredibly powerful.
It also explains why competing with Nvidia is so difficult.
A rival cannot just build a fast chip and declare victory. The chip has to work with the software developers already use. It has to support the right libraries. It has to fit into data centers. It has to be reliable at scale. It has to have networking, memory, drivers, compilers, support, and availability. It has to persuade customers that switching will be worth the risk.
And in AI, risk is expensive.
Training large models can cost enormous amounts of money. If a company is spending hundreds of millions, or even billions, on AI infrastructure, it does not want to gamble on a system that might be cheaper on paper but harder to use in practice. The more expensive the project becomes, the more valuable reliability becomes.
That gives the incumbent a huge advantage.
Nvidia's third big move was recognizing that AI computing was becoming a data-center problem, not just a chip problem.
A single GPU is powerful. But modern AI often needs thousands, or tens of thousands, of GPUs working together. That creates a new challenge. The GPUs have to communicate. Data has to move quickly. Memory bottlenecks matter. Networking becomes critical. Power and cooling become critical. The physical design of the system matters.
This is why Nvidia's acquisition of Mellanox mattered. Mellanox brought deep expertise in high-speed networking, especially for data centers and high-performance computing. In the AI era, networking is not a side feature. It is part of the computer itself. When thousands of accelerators are training one model, the connections between them can determine whether the whole system flies or crawls.
Nvidia understood that the future was not just selling a chip to plug into a server. The future was selling the architecture of the AI factory.
That phrase, AI factory, is one Nvidia has used often, and it is revealing. The company wants customers to think of AI not as software that magically appears, but as an industrial process. Data goes in. Compute transforms it. Models come out. Inference serves those models to users. The more compute you have, the more intelligence you can manufacture.
That framing is incredibly useful for Nvidia because it makes its products feel like industrial infrastructure. Not optional gadgets. Not fancy graphics cards. Essential machinery.
By the time generative AI exploded into public view, Nvidia was positioned at nearly every important layer. It had the GPUs. It had CUDA. It had AI libraries. It had systems like DGX. It had networking. It had relationships with cloud providers, server makers, researchers, and major AI companies. It had spent decades preparing for a moment that suddenly arrived all at once.
And when demand hit, it hit with unusual force.
Large language models changed the economics of computing. Suddenly, the most ambitious companies in the world needed staggering amounts of compute. Cloud providers needed GPUs to rent to customers. AI labs needed GPUs to train frontier models. Enterprises needed GPUs to experiment. Governments, universities, automakers, drug companies, and robotics firms all wanted access.
Nvidia's products became scarce not because they were fashionable, but because they sat on the critical path. If you wanted to build at the frontier of AI, you needed compute. If you needed compute, the default answer was Nvidia.
That is how a company becomes powerful in a platform shift. It becomes the default answer.
But Nvidia's power is not just technological. It is also financial.
The AI boom turned Nvidia into a revenue machine. Its data-center business became the center of the company. Its margins reflected not just chip sales, but the value of a full platform in a market where demand outran supply. The company could invest heavily, move quickly, and shape the direction of the ecosystem because it had the profits to do so.
Money becomes strategy. Strategy creates better products. Better products create more demand. More demand creates more money.
That cycle is why Nvidia's rise has felt so sudden from the outside, even though it was built over decades.
Still, Nvidia's position is not invincible.
Its dependence on advanced manufacturing partners matters. Export controls matter. Supply constraints matter. Big cloud companies are designing their own AI chips, partly to reduce dependence on Nvidia. Competitors are trying to attack different layers of the stack. Customers love Nvidia's performance, but they do not necessarily love having one supplier with so much leverage.
And history is not kind to companies that appear unbeatable.
IBM once seemed unavoidable. Intel once defined the PC era. Cisco was once treated as the backbone of the internet boom. Every platform company eventually faces the question of whether its moat is deep enough for the next shift.
For Nvidia, the risk is that AI workloads change in ways that reduce the need for its most expensive systems, or that custom chips become good enough for enough customers, or that open software ecosystems weaken CUDA's pull, or that buyers simply refuse to let one supplier dominate such a crucial part of the economy.
But even those risks show how powerful Nvidia has become. The whole industry is now reacting to it.
That may be the clearest sign of dominance. Competitors are not just trying to build better chips. They are trying to build alternatives to Nvidia's world.
So why did Nvidia become so powerful?
Because it saw that graphics were a gateway to parallel computing.
Because gaming funded decades of relentless hardware improvement.
Because CUDA turned hardware into a developer platform.
Because deep learning arrived and needed exactly what GPUs were good at.
Because Nvidia built libraries, systems, networking, and software around the chip.
Because generative AI made compute the central bottleneck of the technology industry.
And because, when everyone else started looking for the road to AI, Nvidia already owned the bridge.
The company's story is not really about luck, though luck played a part. It is not only about brilliant engineering, though there was plenty of that. It is about being early to a hard idea and staying with it long enough for the world to catch up.
For years, Nvidia was a company that made images move faster on screens.
Then the same kind of math began moving science, business, language, robotics, and software itself.
The screen was just the beginning.
The real prize was computation. And Nvidia, almost before anyone else, understood that the future would belong to the companies that could make computation faster, more parallel, and easier for the world to use.
That is why Nvidia became powerful.
Not because it predicted every detail of the AI boom, but because it built the machine the boom was waiting for.