Close Menu
  • Crypto News
  • Markets
  • Bitcoin
  • Ethereum
  • XRP
  • Altcoins
  • Technology
  • More
    • Crypto Prices – Latest from BTC, ETH & XRP
    • NFT
    • DeFi

Subscribe to Updates

Get the latest crypto news and updates directly to your inbox.

Trending

NEXST Launches Web3 VR Entertainment Platform with K-Pop Group UNIS as First Global Partner

July 7, 2025

Liquity price prediction | Is Liquity a good investment?

July 7, 2025
LetsBonk Surpasses Pump.fun in Daily Revenue, Per DefiLlama Data

LetsBonk Surpasses Pump.fun in Daily Revenue, Per DefiLlama Data

July 7, 2025

The Blockchain Group boosts Bitcoin holdings by 116 BTC, reports 1,349% BTC yield YTD

July 7, 2025

BTC, XRP holders’ new choice DOT Miners helps assets grow

July 7, 2025
Facebook X (Twitter) Instagram
  • Advertise
en English
nl Nederlandsen Englishfr Françaisde Deutschit Italianoru Русскийes Españolzh-CN 简体中文hi हिन्दीja 日本語
Crypto Observer
  • Crypto News

    Ripple Price Analysis: Will XRP’s Consolidation End With Significant Correction?

    July 7, 2025

    XRP Set To Shock The Crypto Market With 30% Share: Analyst

    July 7, 2025

    Ethereum Price Analysis: Is ETH Primed for Further Gains After Surge Past $2.5K?

    July 7, 2025

    Ethereum Risks Downside If Resistance Holds: $2,700 Level Is Critical

    July 7, 2025

    Ripple (XRP) Exploded by 600% the Last Time This Happened: Details Inside

    July 7, 2025
  • Markets
  • Bitcoin
  • Ethereum
  • XRP
  • Altcoins
  • Technology
  • More
    • Crypto Prices – Latest from BTC, ETH & XRP
    • NFT
    • DeFi
Facebook X (Twitter) Instagram
Crypto Observer
Home » Technology » AI » Amazon unveils new chips for training and running AI models
AI

Amazon unveils new chips for training and running AI models

Crypto Observer StaffBy Crypto Observer StaffNovember 29, 2023No Comments4 Mins Read
Facebook Twitter Pinterest Reddit Telegram Email LinkedIn Tumblr
Share
Facebook Twitter LinkedIn Pinterest Email

There’s a shortage of GPUs as the demand for generative AI, which is often trained and run on GPUs, grows. Nvidia’s best-performing chips are reportedly sold out until 2024. The CEO of chipmaker TSMC was less optimistic recently, suggesting that the shortage of GPUs from Nvidia — as well as from Nvidia’s rivals — could extend into 2025.

To lessen their reliance on GPUs, firms that can afford it (that is, tech giants) are developing — and in some cases making available to customers — custom chips tailored for creating, iterating and productizing AI models. One of those firms is Amazon, which today at its annual re:Invent conference unveiled the latest generation of its chips for model training and inferencing (i.e. running trained models).

The first of two, AWS Trainium2, is designed to deliver up to 4x better performance and 2x better energy efficiency than the first-generation Trainium, unveiled in December 2020, Amazon says. Set to be available in EC Trn2 instances in clusters of 16 chips in the AWS cloud, Tranium2 can scale up to 100,000 chips in AWS’ EC2 UltraCluster product.

One hundred thousand Trainium chips delivers 65 exaflops of compute, Amazon says — which works out to 650 teraflops per a single chip. (“Exaflops” and “teraflops” measure how many compute operations per second a chip can perform.) There’s likely complicating factors making that back-of-the-napkin math not necessarily incredibly accurate. But assuming a single Tranium2 chip can indeed deliver ~200 teraflops of performance, that puts it well above the capacity of Google’s custom AI training chips circa 2017.

Amazon says that a cluster of 100,000 Trainium chips can train a 300-billion parameter AI large language model in weeks versus months. (“Parameters” are the parts of a model learned from training data and essentially define the skill of the model on a problem, like generating text or code.) That’s about 1.75 times the size of OpenAI’s GPT-3, the predecessor to the text-generating GPT-4.

“Silicon underpins every customer workload, making it a critical area of innovation for AWS,” AWS compute and networking VP David Brown said in a press release. “[W]ith the surge of interest in generative AI, Tranium2 will help customers train their ML models faster, at a lower cost, and with better energy efficiency.”

Amazon didn’t say when Trainium2 instances will become available to AWS customers, save “sometime next year.” Rest assured we’ll keep eyes peeled for more information.

The second chip Amazon announced this morning, the Arm-based Graviton4, is intended for inferencing. The fourth generation in Amazon’s Graviton chip family (as implied by the “4” appended to “Graviton”), it’s distinct from Amazon’s other inferencing chip, Inferentia.

Amazon claims Graviton4 provides up to 30% better compute performance, 50% more cores and 75% more memory bandwidth than one previous-generation Graviton processor, Graviton3 (but not the more recent Graviton3E), running on Amazon EC2. In another upgrade from Graviton3, all of Graviton4’s physical hardware interfaces are “encrypted,” Amazon says — ostensibly better securing AI training workloads and data for customers with heightened encryption requirements. (We’ve asked Amazon about what “encrypted” implies, exactly, and we’ll update this piece once we hear back.)

“Graviton4 marks the fourth generation we’ve delivered in just five years and is the most powerful and energy-efficient chip we have ever built for a broad range of workloads,” Brown continued in a statement. “By focusing our chip designs on real workloads that matter to customers, we’re able to deliver the most advanced cloud infrastructure to them.”

Graviton4 will be available in Amazon EC2 R8g instances, which are available in preview today with general availability planned in the coming months.

Read more about AWS re:Invent 2023 on TechCrunch

Read the full article here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

‘Improved’ Grok criticizes Democrats and Hollywood’s ‘Jewish executives’

July 6, 2025

Researchers seek to influence peer review with hidden AI prompts

July 6, 2025

How Brex is keeping up with AI by embracing the ‘messiness’

July 6, 2025

Google faces EU antitrust complaint over AI Overviews

July 5, 2025
Add A Comment

Leave A Reply Cancel Reply

Subscribe to Updates

Get the latest crypto news and updates directly to your inbox.

Top Posts

NEXST Launches Web3 VR Entertainment Platform with K-Pop Group UNIS as First Global Partner

July 7, 2025

Liquity price prediction | Is Liquity a good investment?

July 7, 2025
LetsBonk Surpasses Pump.fun in Daily Revenue, Per DefiLlama Data

LetsBonk Surpasses Pump.fun in Daily Revenue, Per DefiLlama Data

July 7, 2025
Advertisement
Demo

Crypto Observer is your one-stop website for the latest crypto news and updates, follow us now to get the news that matters to you.

Facebook X (Twitter) Instagram
Crypto News

XRP Set To Shock The Crypto Market With 30% Share: Analyst

July 7, 2025

Ethereum Price Analysis: Is ETH Primed for Further Gains After Surge Past $2.5K?

July 7, 2025

Ethereum Risks Downside If Resistance Holds: $2,700 Level Is Critical

July 7, 2025
Get Informed

Subscribe to Updates

Get the latest crypto news and updates directly to your inbox.

Facebook X (Twitter)
  • Privacy Policy
  • Terms of use
  • Advertise with us | Publishing
  • Contact us
  • Crypto News – Press release
  • Newsletter sign up
  • Markets
  • Altcoins
  • Bitcoin
  • Crypto News
  • DeFi
  • Ethereum
  • Technology
  • Blockchain
  • AI
  • NFT
  • Thanks for joining us
© 2025 Crypto Observer. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.