104-Open Talk on Open Source Dataset Format for AI

37:53
 
Content by Kashif Manzoor was discovered by Player FM and its community. Copyright is held by the publisher, and the show's audio is streamed directly from the publisher's servers.

An open talk on the challenges of building a data pipeline for images, audio, and video. Hub gives access to several well-known machine learning datasets with a single command, including CIFAR-10, MNIST, Fashion-MNIST, Google Objectron, ImageNet, COCO, and many others.

Coming from a relational database management system (RDBMS) background, I found that this talk gave me a new perspective and helped me think beyond familiar territory. Enjoy the talk with the CEO of Activeloop. This session was recorded in October 2021 and is now being published.

Today’s Guest

Davit Buniatyan, CEO at ActiveLoop.ai

An insightful talk with the guest speaker, a product owner focused on a dataset format that offers an API for creating, storing, and collaborating on AI datasets of any size.

  • What challenges arise in storing unstructured data, and how does Hub address them?
  • Where should you store large datasets of images and videos? The talk answers this question.
  • How the open-source GitHub repo helps thousands of people load datasets into PyTorch or TensorFlow with one line of code.

Resources:
