The first Google TPU for the age of inference


Today at Google Cloud Next 25, we’re introducing Ironwood, our seventh-generation Tensor Processing Unit (TPU) — our most performant and scalable custom AI accelerator to date, and the first designed specifically for inference. For more than a decade, TPUs have powered Google’s most demanding AI training and serving workloads, and have enabled our Cloud customers to do the same. Ironwood is our most powerful, capable and energy efficient TPU yet. And it’s purpose-built to power thinking, inferential AI models at scale.

Ironwood represents a significant shift in the development of AI and the infrastructure that powers its progress. It’s a move from responsive AI models that provide real-time information for people to interpret, to models that provide the proactive generation of insights and interpretation. This is what we call the “age of inference” where AI agents will proactively retrieve and generate data to collaboratively deliver insights and answers, not just data.

Ironwood is built to support this next phase of generative AI and its tremendous computational and communication requirements. It scales up to 9,216 liquid cooled chips linked with breakthrough Inter-Chip Interconnect (ICI) networking spanning nearly 10 MW. It is one of several new components of Google Cloud AI Hypercomputer architecture, which optimizes hardware and software together for the most demanding AI workloads. With Ironwood, developers can also leverage Google’s own Pathways software stack to reliably and easily harness the combined computing power of tens of thousands of Ironwood TPUs.

Here’s a closer look at how these innovations work together to take on the most demanding training and serving workloads with unparalleled performance, cost and power efficiency.



Source link

Share

Latest Updates

Frequently Asked Questions

Related Articles

Remaining Windsurf team and tech acquired by Cognition, makers of Devin: ‘We’re friends with Anthropic again’

Want smarter insights in your inbox? Sign up for our weekly newsletters to...

Crypto-focused Grayscale confidentially files for potential US listing

Crypto-focused asset manager Grayscale said on Monday it has confidentially submitted paperwork with...

Securing the new high ground: tackling export loopholes in space tech

Outer space, the new high ground, is no longer the solitary domain of...

AI models highly vulnerable to health disinfo weaponisation

Artificial intelligence chatbots can be easily manipulated to deliver dangerous health disinformation, raising...
Mahjong Ways 2 Live Draw Hk Live Casino Online Mahjong Ways Judi Bola Online Sabung Ayam Online Judi Bola Online Sv388 Kisah Budi menang Veloz dari Mahjong Perjalanan Kkajhe di Mahjong Ways 2 Hoho sukses temukan trik Mahjong belajar trik spin kecil tono sukses miliki peternakan sapi berkat scatter hitam mahjong ways kisah unik kades bertari yang sukses kelola bumdes dengan scatter hitam mahjong ways