OpenAIの新しい音声AIは、話している最中でも本当に通訳できるのか？

Can OpenAI's New Voice AI Really Interpret While You're Still Talking?

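GPT-Realtime-Translate, announced by OpenAI, is a voice AI that can translate in real time while you are still speaking. This article explains what it can do and what to watch out for.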


Many people want a voice AI that works like a real interpreter. OpenAI moved closer to that idea on May 7, 2026, when it announced GPT-Realtime-Translate. According to OpenAI, this new model can translate speech from more than 70 input languages into 13 output languages, and it can keep up with the speaker instead of waiting for a full recording to finish. OpenAI’s audio documentation also explains that live translation should use a dedicated realtime session, so translated speech can begin as audio arrives. (openai.com)

So, can OpenAI’s new voice AI interpret while people are still talking? In the technical sense, yes. The new model is designed for streaming speech-to-speech translation, and OpenAI says it can return translated audio and transcript updates while the original audio is still coming in. That is much closer to human-style interpreting than older systems that first make a full transcript and only then produce a translation. However, there is an important point: OpenAI introduced this feature in its API for developers. In OpenAI’s public ChatGPT Voice FAQ, ChatGPT voice is described as a spoken conversation tool for logged-in users, but the help pages do not present this new live translator as a standard, built-in ChatGPT voice mode for everyone. (developers.openai.com)
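
The difference between the two approaches can be shown with a small toy sketch in Python. It is not OpenAI's API: `toy_translate`, `speaker`, `trace`, and the two-word dictionary are made-up names for illustration, and real interpreting does not work word by word. The sketch only shows the order of events: a batch system hears everything before it says anything, while a streaming system starts to speak as each piece arrives.

```python
from typing import Callable, Iterable, Iterator, List

# Toy word list standing in for a real translation model (illustrative only).
TOY_DICTIONARY = {"こんにちは": "hello", "世界": "world"}


def toy_translate(chunk: str) -> str:
    """Hypothetical helper: look up one chunk, or pass it through unchanged."""
    return TOY_DICTIONARY.get(chunk, chunk)


def speaker(words: Iterable[str], log: List[str]) -> Iterator[str]:
    """Simulate someone talking: record each word as it is heard, then hand it on."""
    for word in words:
        log.append(f"heard:{word}")
        yield word


def batch_translate(chunks: Iterable[str]) -> List[str]:
    """Older style: wait for the whole input, then translate all of it."""
    full_input = list(chunks)  # consumes everything before any output exists
    return [toy_translate(chunk) for chunk in full_input]


def streaming_translate(chunks: Iterable[str]) -> Iterator[str]:
    """Streaming style: produce output for each chunk as soon as it arrives."""
    for chunk in chunks:
        yield toy_translate(chunk)


def trace(translate: Callable[[Iterable[str]], Iterable[str]],
          words: List[str]) -> List[str]:
    """Return the order in which words were heard and translations were said."""
    log: List[str] = []
    for output in translate(speaker(words, log)):
        log.append(f"said:{output}")
    return log


if __name__ == "__main__":
    words = ["こんにちは", "世界"]
    print("batch:    ", trace(batch_translate, words))
    print("streaming:", trace(streaming_translate, words))
```

In the batch log, both `heard:` entries come before any `said:` entry. In the streaming log, `heard:` and `said:` entries alternate, which is the behavior the paragraph above describes as closer to human-style interpreting.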

For English learners, this is exciting. A future app could let you speak Japanese, hear quick English output, and check the transcript later. But it is not magic. OpenAI clearly warns that voice conversations may make mistakes, and its help pages say voice input can sometimes detect the wrong language unless you set your main language in the app settings. So the best answer is: yes, OpenAI’s newest voice technology can already do live, interpreter-like translation, but it is still a tool that needs careful use. For travel, study, and casual conversation, it could be very helpful. For medical, legal, or business situations, people should still double-check important meanings. (help.openai.com)


多くの人が、本物の通訳者のように機能する音声AIを求めています。OpenAIは2026年5月7日、GPT-Realtime-Translateを発表し、その理想に一歩近づきました。OpenAIによると、この新しいモデルは70以上の入力言語から13の出力言語への音声翻訳が可能で、録音が完全に終わるのを待つのではなく、話者のスピードに合わせてリアルタイムで処理できます。OpenAIの音声に関するドキュメントでは、ライブ翻訳には専用のリアルタイムセッションを使用すべきだと説明されており、これにより音声が届き次第、翻訳された音声を出力し始めることができます。(openai.com)

では、OpenAIの新しい音声AIは、人がまだ話している最中に通訳できるのでしょうか？技術的な意味では、「はい」です。この新しいモデルはストリーミング方式の音声間翻訳（speech-to-speech translation）向けに設計されており、OpenAIによれば、元の音声がまだ入力されている最中に、翻訳された音声やトランスクリプト（文字起こし）の更新を返すことができます。これは、まず完全なトランスクリプトを作成してからようやく翻訳を生成する従来のシステムと比べると、人間の通訳スタイルにはるかに近いものです。ただし、重要なポイントがあります。OpenAIはこの機能を開発者向けのAPIで導入しました。OpenAIが公開しているChatGPT Voice FAQでは、ChatGPTの音声機能はログインしたユーザー向けの音声会話ツールとして説明されていますが、ヘルプページでは、この新しいライブ翻訳機能を誰もが使える標準搭載のChatGPT音声モードとしては紹介していません。(developers.openai.com)

英語学習者にとって、これはワクワクするニュースです。将来的には、日本語で話すと素早く英語の音声が出力され、後からトランスクリプトを確認できるアプリが登場するかもしれません。しかし、魔法ではありません。OpenAIは、音声での会話には間違いが生じる可能性があると明確に警告しており、ヘルプページでは、アプリの設定でメイン言語を設定しないと、音声入力が誤った言語を検出してしまうことがあると述べています。したがって、最も的確な答えはこうです。はい、OpenAIの最新の音声技術は、すでにライブ通訳のような翻訳を行うことができます。しかし、それはまだ慎重に使う必要のあるツールです。旅行、勉強、カジュアルな会話には、とても役に立つでしょう。しかし、医療、法律、ビジネスの場面では、重要な意味についてはやはりダブルチェック（二重確認）をすべきです。(help.openai.com)

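Grammar

● Relative pronoun "that" (subject)
"That" comes after a noun and links a clause that describes the noun in more detail. It works much like 「〜する（もの・こと）」 in Japanese.
e.g. Many people want a voice AI that works like a real interpreter.
Translation: 多くの人が、本物の通訳のように働く音声AIを求めています。
● Comparative + than (〜よりも)
Use "adjective/adverb + -er + than" or "more + adjective/adverb + than" to compare two things and say that one is "more ... than" the other.
e.g. This is much closer to human-style interpreting than older systems.
Translation: これは古いシステムよりも、人間の通訳にずっと近いです。
● could + base form of the verb (possibility, guessing)
"Could" shows that something is possible, now or in the future: 「〜できるかもしれない」 or 「〜する可能性がある」. It sounds less certain than "can".
e.g. For travel and study, it could be very helpful.
Translation: 旅行や勉強には、とても役に立つかもしれません。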

Vocabulary

● interpreter (noun)
通訳者
e.g. She works as an interpreter at international meetings.
Translation: 彼女は国際会議で通訳者として働いています。
● translate (verb)
翻訳する、通訳する
e.g. This AI can translate speech from Japanese into English.
Translation: このAIは日本語の音声を英語に翻訳できます。
● announce (verb)
発表する、知らせる
e.g. The company announced a new product yesterday.
Translation: その会社は昨日、新しい製品を発表しました。
● detect (verb)
検出する、見つける
e.g. The app can sometimes detect the wrong language.
Translation: そのアプリは時々、間違った言語を検出することがあります。
● transcript (noun)
書き起こし、文字記録
e.g. You can check the transcript after the conversation.
Translation: 会話の後で書き起こしを確認できます。
● streaming (noun, adjective)
ストリーミング（データをリアルタイムで送受信すること）
e.g. The new model is designed for streaming translation.
Translation: この新しいモデルはストリーミング翻訳のために設計されています。
● double-check (verb)
再確認する、二重にチェックする
e.g. Please double-check the meaning before you send the message.
Translation: メッセージを送る前に意味を再確認してください。

Expressions and idioms

● keep up with
To move at the same speed as someone or something without falling behind (〜に遅れずについていく). Used when talking about matching a speed or pace.
e.g. The AI can keep up with the speaker's speed.
Translation: そのAIは話す人のスピードについていくことができます。
● in the technical sense
Means 「技術的な意味では」. Used to explain that something is true, strictly speaking.
e.g. In the technical sense, yes, the AI can do live interpreting.
Translation: 技術的な意味では、はい、そのAIはリアルタイムの通訳ができます。
● it is not magic
Literally 「魔法ではない」: it is not perfect and has limits. Used to tell someone not to expect too much.
e.g. AI translation is useful, but it is not magic.
Translation: AI翻訳は便利ですが、魔法ではありません（完璧ではありません）。
● built-in
Means 「最初から組み込まれている」. Used for a feature that is part of an app or machine from the start.
e.g. My phone has a built-in camera.
Translation: 私のスマホにはカメラが内蔵されています。

by EigoBoxAI
Created: 2026/05/19 12:04
Level: Lower-intermediate (vocabulary guide: 1,000 to 2,000 words)
Type: Reading


![thumbnail](https://eigobox.s3.ap-northeast-1.amazonaws.com/g/170ef20c8f7a9b51943cf0419da24c51a6c4d6c9.png)

---

Sources:
openai.com/index/advancing-voice-intelligence-with-new-models-in-the-api/?utm_source=openai
developers.openai.com/api/docs/models/gpt-realtime-translate
help.openai.com/en/articles/8400625-voice-mode-faq?_bhlid=abe7351e362fdae6683625d8ce3ad3e214a39d8a