
The US, UK, and EU Are Testing Frontier AI Before You See It—Here's How Their Approaches Compare

The US, UK, and EU have begun pre-release evaluation of frontier AI models in earnest. From voluntary agreements to legal obligations, this piece compares each jurisdiction's approach and asks whether safety and innovation can be reconciled.

What sounded like science fiction is becoming administrative routine. On May 5, 2026, NIST’s Center for AI Standards and Innovation (CAISI) announced agreements with Google DeepMind, Microsoft, and xAI that allow the U.S. government to evaluate frontier AI models before public release. NIST says these deals expand earlier partnerships and that CAISI has already completed more than 40 evaluations, including tests of unreleased state-of-the-art systems. Just as importantly, CAISI presents this arrangement as a system of voluntary agreements, collaborative research, and best-practice development rather than a rigid licensing regime. (nist.gov)

Britain offers a slightly different model. The UK AI Security Institute says it has been evaluating frontier systems since November 2023 and has already carried out several pre-deployment tests. Yet it also stresses that the science is still too immature for independent evaluations to serve as a definitive seal of safety; in its own view, the real value lies in rigorous testing that gives developers time to fix problems before deployment. That caution seems well founded. The institute’s December 2025 report found rapid gains in frontier-model capability: success on apprentice-level cyber tasks rose from under 9 percent in 2023 to about 50 percent in 2025, while a “universal jailbreak” became roughly 40 times harder to find between model generations. (aisi.gov.uk)

The European Union has moved further from voluntary cooperation toward legal obligation. Under the AI Act, rules for general-purpose AI models entered into application on August 2, 2025. Providers of models deemed to pose systemic risk must notify the European Commission, assess and mitigate risk, report serious incidents, and ensure cybersecurity. Article 55 also requires model evaluation using state-of-the-art protocols, including adversarial testing. At present, the EU presumes systemic risk when training exceeds 10^25 FLOP, though the Commission can also designate other models based on their capabilities and impact. Still, the framework is not purely punitive: providers may use a code of practice or other adequate means to demonstrate compliance. (digital-strategy.ec.europa.eu)
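The 10^25 FLOP presumption can be made concrete with a short sketch. The widely used 6·N·D rule of thumb for training compute (roughly 6 FLOPs per parameter per training token) is an illustrative assumption here, not part of the AI Act itself, and the Commission can also designate models on other grounds:

```python
# Rough check against the AI Act's systemic-risk presumption.
# The 6*N*D heuristic (FLOPs ~ 6 x parameters x training tokens) is a
# common estimation convention, used here purely for illustration.

SYSTEMIC_RISK_THRESHOLD_FLOP = 1e25  # presumption under the AI Act

def estimated_training_flop(n_params: float, n_tokens: float) -> float:
    """Estimate total training compute with the 6*N*D rule of thumb."""
    return 6.0 * n_params * n_tokens

def presumed_systemic_risk(n_params: float, n_tokens: float) -> bool:
    """True if estimated compute meets or exceeds the 10^25 FLOP threshold."""
    return estimated_training_flop(n_params, n_tokens) >= SYSTEMIC_RISK_THRESHOLD_FLOP

# Hypothetical example: a 70B-parameter model trained on 15T tokens
# lands at about 6.3e24 FLOP, just under the presumption threshold.
print(presumed_systemic_risk(70e9, 15e12))
```

Under this heuristic, crossing the line takes a model of a few hundred billion parameters trained on trillions of tokens, which is why the threshold is generally read as targeting only the largest frontier runs.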

Can security and innovation coexist? Probably yes—but only if oversight remains narrow, technically serious, and proportionate. The emerging pattern in the U.S., UK, and EU is not blanket censorship of AI research. It is targeted scrutiny of the most capable models, focused on measurement, red-teaming, and risk mitigation. That is partly an inference, but it is consistent with today’s mix of U.S. voluntary testing, British evidence-led evaluation, and Europe’s risk-tiered legal duties. (nist.gov)

by EigoBoxAI
Created: 2026/05/09 15:03
Level: Advanced+ (vocabulary guide: 8,000+ words)
