Training Large Language Models To Reason In Continuous Latent Space Deep Papers podcast

Artwork

Science Tech Math Business Arize AI

コンテンツは Arize AI によって提供されます。エピソード、グラフィック、ポッドキャストの説明を含むすべてのポッドキャストコンテンツは、Arize AI またはそのポッドキャストプラットフォームパートナーによって直接アップロードされ、提供されます。誰かがあなたの著作物をあなたの許可なく使用していると思われる場合は、ここで概説されているプロセスに従うことができますhttps://ja.player.fm/legal。

Deep Papers »
Training Large Language Models to Reason in Continuous Latent Space

13d ago 24:58

シェア

MP3•エピソードのホーム

コンテンツは Arize AI によって提供されます。エピソード、グラフィック、ポッドキャストの説明を含むすべてのポッドキャストコンテンツは、Arize AI またはそのポッドキャストプラットフォームパートナーによって直接アップロードされ、提供されます。誰かがあなたの著作物をあなたの許可なく使用していると思われる場合は、ここで概説されているプロセスに従うことができますhttps://ja.player.fm/legal。

LLMs have typically been restricted to reason in the "language space," where chain-of-thought (CoT) is used to solve complex reasoning problems. But a new paper argues that language space may not always be the best for reasoning. In this paper read, we cover an exciting new technique from a team at Meta called Chain of Continuous Thought—also known as "Coconut." In the paper, "Training Large Language Models to Reason in a Continuous Latent Space" explores the potential of allowing LLMs to reason in an unrestricted latent space instead of being constrained by natural language tokens.
Read a full breakdown of Coconut on our blog

Learn more about AI observability and evaluation in our course, join the Arize AI Slack community or get the latest on LinkedIn and X.

… continue reading

41 つのエピソード

#Science #Tech #Math #Business #Arize AI

Artwork

Training Large Language Models to Reason in Continuous Latent Space

22 subscribers

published 13d ago

シェア

MP3•エピソードのホーム

コンテンツは Arize AI によって提供されます。エピソード、グラフィック、ポッドキャストの説明を含むすべてのポッドキャストコンテンツは、Arize AI またはそのポッドキャストプラットフォームパートナーによって直接アップロードされ、提供されます。誰かがあなたの著作物をあなたの許可なく使用していると思われる場合は、ここで概説されているプロセスに従うことができますhttps://ja.player.fm/legal。

LLMs have typically been restricted to reason in the "language space," where chain-of-thought (CoT) is used to solve complex reasoning problems. But a new paper argues that language space may not always be the best for reasoning. In this paper read, we cover an exciting new technique from a team at Meta called Chain of Continuous Thought—also known as "Coconut." In the paper, "Training Large Language Models to Reason in a Continuous Latent Space" explores the potential of allowing LLMs to reason in an unrestricted latent space instead of being constrained by natural language tokens.
Read a full breakdown of Coconut on our blog

Learn more about AI observability and evaluation in our course, join the Arize AI Slack community or get the latest on LinkedIn and X.

… continue reading

41 つのエピソード

#Science #Tech #Math #Business #Arize AI

すべてのエピソード

×

プレーヤーFMへようこそ！

Player FMは今からすぐに楽しめるために高品質のポッドキャストをウェブでスキャンしています。これは最高のポッドキャストアプリで、Android、iPhone、そしてWebで動作します。全ての端末で購読を同期するためにサインアップしてください。

500+以上のトピックを聴こう

クイックリファレンスガイド

トップポッドキャスト

鈴木淑子の地球は競馬でまわってる

文化放送競馬中継～今週のメインレース

ボードゲームおっぱい

聴く日経ヘッドライン

サンドウィッチマンの東北魂

トータルテンボスのぬきさしならナイト！

流行りモノ通信簿

RADIO365-TOKYO BAYSIDE RADIO STATION

町田徹のふかぼり！

足立明穂の週刊ＩＴトレンドＸ

伊藤洋一のRound Up World Now！

The other side journal

KIQTAS（キクタス）

ヴォイニッチの科学書

武田邦彦のヒバリクラブ

IchibanTalk 海外で頑張る日本人トーク

探検しながらこの番組を聞いてください