

Not Replaced, But Encircled: How Google, AWS, Meta, and Microsoft Are Quietly Reshaping the AI Chip Landscape

Google, AWS, Microsoft, and Meta are rolling out custom AI chips one after another. The reality of "de-NVIDIA" is not exclusion but diversification: NVIDIA is not being replaced so much as encircled.

The fashionable phrase “de-NVIDIA” suggests a clean break, but the reality emerging in April 2026 is subtler and, in some ways, more consequential: not abandonment, but diversification. At Google Cloud Next on April 22, 2026, Google introduced its eighth-generation TPUs, splitting the line for the first time into TPU 8t for training and TPU 8i for inference. That design choice matters. It implies that the AI industry no longer believes one kind of accelerator can optimally serve every workload, especially in an “agentic” era defined by long-context reasoning, orchestration, and relentless inference at scale. (blog.google)

The technical claims are striking. Google says TPU 8t delivers nearly three times the compute performance per pod of the previous generation, with a superpod scaling to 9,600 chips and 121 exaflops. TPU 8i, meanwhile, is engineered for latency-sensitive serving, pairing 288 GB of high-bandwidth memory with 384 MB of on-chip SRAM and delivering 80% better performance per dollar than its predecessor. Both chips are slated for general availability later in 2026. Yet Google also said it will be among the first cloud providers to offer NVIDIA Vera Rubin NVL72 systems, which makes the strategic message unmistakable: Google wants optionality, not ideological purity. (blog.google)
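To put the superpod figures in perspective, a quick back-of-the-envelope division gives the implied per-chip throughput. Only the 9,600-chip and 121-exaflop numbers come from the text; the per-chip result is our own arithmetic, not a published spec.

```python
# Implied per-chip compute of a TPU 8t superpod, using only the
# pod-level figures quoted above (9,600 chips, 121 exaflops).
superpod_chips = 9_600
superpod_exaflops = 121

# Convert exaflops to petaflops (1 EFLOPS = 1,000 PFLOPS), then divide.
per_chip_petaflops = superpod_exaflops * 1_000 / superpod_chips
print(f"~{per_chip_petaflops:.1f} PFLOPS per chip")  # ~12.6 PFLOPS
```

The precision format Google used for the 121-exaflop figure is not stated in the article, so this per-chip estimate is only meaningful as a rough order-of-magnitude check.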

And Google is hardly alone. AWS announced the general availability of Trn3 UltraServers in December 2025, built around Trainium3, its first 3 nm AI chip; Amazon says the system offers up to 4.4 times higher performance and 4 times better performance per watt than Trn2. Microsoft followed in January 2026 with Maia 200, an inference accelerator for Azure delivering over 10 petaFLOPS at FP4 and scaling to clusters of up to 6,144 accelerators. These are not experimental side projects anymore; they are becoming first-class pillars of cloud strategy. (aws.amazon.com)
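The same sanity-check arithmetic works for the Maia 200 numbers: multiplying the quoted per-accelerator floor ("over 10 petaFLOPS at FP4") by the maximum cluster size gives an aggregate lower bound. This product is our own estimate, not a Microsoft claim.

```python
# Aggregate FP4 throughput of a maximal Maia 200 cluster, from the
# figures quoted above (>10 PFLOPS per accelerator, up to 6,144 units).
per_accel_pflops_fp4 = 10   # stated lower bound per accelerator
cluster_accels = 6_144      # maximum cluster size

# Convert petaflops to exaflops (1,000 PFLOPS = 1 EFLOPS).
cluster_exaflops = per_accel_pflops_fp4 * cluster_accels / 1_000
print(f"> {cluster_exaflops:.2f} EFLOPS FP4 per cluster")  # > 61.44 EFLOPS
```

Note that FP4 figures are not directly comparable to the TPU numbers above unless Google's exaflop count uses the same precision, which the article does not specify.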

Meta’s roadmap underlines the same shift. In March 2026, it said it was developing and deploying four new generations of MTIA chips within two years, while emphasizing an inference-first philosophy and noting that hundreds of thousands of MTIA chips are already handling production inference workloads. The implication is hard to miss: the center of gravity in AI infrastructure is moving from a single dominant chip vendor toward a more fragmented, workload-specific silicon ecosystem. So, will “de-NVIDIA” accelerate? Architecturally, yes. Commercially, however, NVIDIA still looks less replaced than encircled. (about.fb.com)

by EigoBoxAI
Created: 2026/04/29 15:05
Level: Very Advanced (target vocabulary: 8,000+ words)
