Illustration depicting linguists studying why human language resists compression like computer code, contrasting brain processing with digital efficiency.

Study explores why human language is not compressed like computer code

Friday, February 20, 2026

Reported by AI

Image generated by AI

Fact checked

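A new model by linguists Richard Futrell and Michael Hahn suggests that many hallmark features of human language, such as familiar words, predictable word order, and meaning that builds up step by step, reflect the constraints of sequential information processing rather than a drive toward maximal data compression. The study was published in Nature Human Behaviour.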

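Human language is remarkably rich and complex. From the standpoint of information theory, the same ideas could in principle be conveyed in far more compact strings, much as computers represent information using binary digits.

Linguist Michael Hahn of Saarland University in Saarbrücken, Germany, and Richard Futrell of the University of California, Irvine, set out to explain why everyday speech does not resemble a tightly compressed digital code. In a paper published in Nature Human Behaviour in November 2025, the researchers present a model in which "natural-language-like" structure arises from limits on sequential prediction: how much information must be carried forward from what has already been heard in order to predict what comes next.

In that framework, language benefits from patterns that people can process easily as a stream. The ScienceDaily summary, citing material from Osaka University, illustrates the idea with examples: a coined word such as "gol" for a half-cat, half-dog hybrid concept is hard to understand because it does not map cleanly onto shared experience, and a messy blend such as "gadcot" is just as hard to interpret. By contrast, "cat and dog" makes sense immediately.

The researchers also point out that word order acts as a signal that reduces the listener's uncertainty in real time. The ScienceDaily release uses the German noun phrase "Die fünf grünen Autos" ("the five green cars") to show how meaning is built up incrementally as each word narrows the range of possible interpretations. Rearranging the words, as in "Grünen fünf die Autos", disrupts that predictability and makes the phrase harder to understand.

Beyond explaining why language is not "maximally compressed", the paper's discussion also draws a connection to machine learning. Futrell and Hahn argue that natural language is structured in a way that makes next-token prediction relatively easy under cognitive constraints, which they say is relevant to modern large language models.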
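The incremental narrowing described for "Die fünf grünen Autos" can be illustrated with a small calculation. The sketch below is not the model from the paper; it is a toy under loudly stated assumptions: a hypothetical world of eight equally likely referents (two counts, two colours, two nouns, all invented for illustration), where each word simply filters out incompatible referents and the remaining uncertainty is measured in bits. It shows only that uncertainty can fall word by word; it does not capture why a scrambled order is harder to process.

```python
import math
from itertools import product

# Hypothetical toy world: every combination of count, colour and noun is an
# equally likely referent. These word lists are illustrative assumptions and
# do not come from the paper or the press release.
COUNTS = ["drei", "fünf"]
COLOURS = ["grünen", "roten"]
NOUNS = ["Autos", "Häuser"]

CANDIDATES = [
    {"count": n, "colour": c, "noun": k}
    for n, c, k in product(COUNTS, COLOURS, NOUNS)
]


def remaining_bits(candidates):
    """Uncertainty in bits, assuming the remaining candidates are equally likely."""
    return math.log2(len(candidates))


def narrow(candidates, word):
    """Keep only candidates compatible with the word.

    Words that match no attribute at all (such as the article "Die")
    are treated as uninformative and leave the set unchanged.
    """
    matching = [c for c in candidates if word in c.values()]
    return matching if matching else candidates


def uncertainty_trace(words, candidates=CANDIDATES):
    """Return the uncertainty before any word and after each successive word."""
    trace = [remaining_bits(candidates)]
    for word in words:
        candidates = narrow(candidates, word)
        trace.append(remaining_bits(candidates))
    return trace


print(uncertainty_trace(["Die", "fünf", "grünen", "Autos"]))
# prints [3.0, 3.0, 2.0, 1.0, 0.0]
```

In this toy setting the listener starts with 3 bits of uncertainty, and each content word removes one bit until a single referent remains. A realistic account would replace the uniform candidates with probabilities learned from language data.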

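Fact check

Trust score

Trust comment

The most specific claims (the authors, their affiliations, the paper's title and publication date, the core "predictive information" argument, and the concrete examples of "gol", "gadcot" and the German phrase) are directly supported by the ScienceDaily release and the underlying Nature Human Behaviour paper. Two elements were softened because they were not clearly supported as written: the article's framing of a strict trade-off against "maximal information compression", and the exact figure of "about 7,000" languages, which appears in the release but is not established in the paper itself. Overall reliability is strong, because the report relies mainly on peer-reviewed research and a consistent institutional summary.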

Study points to whole-brain network coordination as a key feature of general intelligence

Tuesday, March 3, 2026 | Reported by AI | Image generated by AI | Fact checked

University of Notre Dame researchers report evidence that general intelligence is associated with how efficiently and flexibly brain networks coordinate across the whole connectome, rather than being localized to a single “smart” region. The findings, published in Nature Communications, are based on neuroimaging and cognitive data from 831 Human Connectome Project participants and an additional 145 adults from the INSIGHT Study.


Related articles

Study points to whole-brain network coordination as a key feature of general intelligence

Study uncovers 40,000-year-old signs as early information systems

Computer language spots error in widely cited physics paper

Study uncovers overlap in brain networks for episodic and semantic memory

US commission credits China’s AI edge to open-source models, manufacturing

Northwestern engineers print artificial neurons that can stimulate living brain cells

Scientists say defining consciousness is increasingly urgent as AI and neurotechnology advance

AIs frequently recommend nuclear strikes in war simulations

Quantum method promises AI boost from computers

OpenAI unveils biology-tuned large language model GPT-Rosalind

Cortical Labs to build biological data centres in Melbourne and Singapore

Human brain cells on chip learn to play Doom in a week

Study shows AI can deanonymize online users from posts

Generative AI outperforms human teams in analyzing medical data

Two-month-old babies categorize objects earlier than thought

Hackers are using LLMs to build next-generation phishing attacks
