Csaba Szepesvari

TalkRL: The Reinforcement Learning Podcast

Content provided by Robin Ranjit Singh Chauhan. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Robin Ranjit Singh Chauhan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined at https://ja.player.fm/legal.

4y ago 48:42

Csaba Szepesvari is:

Head of the Foundations Team at DeepMind
Professor of Computer Science at the University of Alberta
Canada CIFAR AI Chair
Fellow at the Alberta Machine Intelligence Institute
Co-Author of the book Bandit Algorithms along with Tor Lattimore, and author of the book Algorithms for Reinforcement Learning

References

Bandit Based Monte-Carlo Planning, Levente Kocsis, Csaba Szepesvári
Bandit Algorithms, Tor Lattimore, Csaba Szepesvári
Algorithms for Reinforcement Learning, Csaba Szepesvári
The Predictron: End-To-End Learning and Planning, David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris
A Bayesian framework for reinforcement learning, Strens
Solving Rubik’s Cube with a Robot Hand, OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang
The Nonstochastic Multiarmed Bandit Problem, Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire
Deep Learning with Bayesian Principles, Mohammad Emtiyaz Khan
Tackling Climate Change with Machine Learning, David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

#Reinforcement Learning #Machine Learning #Robin Ranjit Singh Chauhan #Artificial Intelligence #Tech

