Google says its TPU v4 supercomputer is more powerful and efficient than ever, thanks to optical circuit switching and a new architecture, and presents a challenge to Nvidia.

A new white paper from Google details the company’s use of optical circuit switches in its machine learning training supercomputer, saying that the TPU v4 system with those switches in place offers better performance and greater energy efficiency than general-purpose processors.

Google’s Tensor Processing Units — the basic building blocks of the company’s AI supercomputing systems — are essentially ASICs, meaning that their functionality is built in at the hardware level, as opposed to the general-purpose CPUs and GPUs used in many AI training systems. The white paper details how, by interconnecting more than 4,000 TPUs through optical circuit switching, Google has been able to achieve speeds 10 times faster than previous models while consuming less than half as much energy.

Aiming for AI performance, price breakthroughs

The key, according to the white paper, is the way optical circuit switching (performed here by switches of Google’s own design) enables dynamic changes to the system’s interconnect topology. Compared to a system like InfiniBand, which is commonly used in other HPC areas, Google says its system is cheaper, faster, and considerably more energy efficient.

“Two major architectural features of TPU v4 have small cost but outsized advantages,” the paper said. “The SparseCore [data flow processors] accelerates embeddings of [deep learning] models by 5x-7x by providing a dataflow sea-of-cores architecture that allows embeddings to be placed anywhere in the 128 TiB physical memory of the TPU v4 supercomputer.”

According to Peter Rutten, research vice president at IDC, the efficiencies described in Google’s paper are in large part due to the inherent characteristics of the hardware being used — well-designed ASICs are almost by definition better suited to their specific task than general-purpose processors trying to do the same thing.

“ASICs are very performant and energy efficient,” he said. “If you hook them up to optical circuit switches where you can dynamically configure the network topology, you have a very fast system.”

While the system described in the white paper is only for Google’s internal use at this point, Rutten noted that the lessons of the underlying technology could have broad applicability for machine learning training.

“I would say it has implications in the sense that it offers them a sort of best practices scenario,” he said. “It’s an alternative to GPUs, so in that sense it’s definitely an interesting piece of work.”

Google-Nvidia comparison is unclear

While Google also compared TPU v4’s performance to systems using Nvidia’s A100 GPUs, which are common HPC components, Rutten noted that Nvidia has since released the much faster H100 processors, which may shrink any performance difference between the systems.

“They’re comparing it to an older-gen GPU,” he said. “But in the end it doesn’t really matter, because it’s Google’s internal process for developing AI models, and it works for them.”
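To give a rough sense of what placing embeddings across a pod’s aggregate memory looks like from the software side, here is a minimal JAX sketch that shards an embedding table along its vocabulary axis across whatever devices are available and gathers rows from it. This is an illustrative assumption, not Google’s SparseCore implementation; the mesh axis name, table shape, and lookup function are made up for the example.

```python
# Illustrative sketch only (not Google's SparseCore): shard an embedding
# table across a mesh of accelerators in JAX so each device holds one
# slice of the table, in the spirit of spreading embeddings across a
# pod's combined memory rather than pinning them to a single chip.
import numpy as np
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# Build a 1-D mesh over whatever devices are present (TPU, GPU, or CPU).
mesh = Mesh(np.array(jax.devices()), axis_names=("devices",))

# A (vocab, dim) embedding table, sharded along the vocabulary axis.
vocab, dim = 65_536, 128
table = jax.device_put(
    jnp.zeros((vocab, dim)),
    NamedSharding(mesh, P("devices", None)),
)

@jax.jit
def lookup(table, ids):
    # Gather rows by ID; the compiler inserts whatever cross-device
    # communication is needed to fetch rows held on other devices' shards.
    return jnp.take(table, ids, axis=0)

ids = jnp.array([3, 42, 65_535])
print(lookup(table, ids).shape)  # (3, 128)
```

Sharding along the vocabulary axis keeps each device’s slice of the table small; the trade-off is the gather traffic between devices, which is exactly the kind of communication a fast, reconfigurable interconnect is meant to absorb.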