Artwork

Contenido proporcionado por Stack Overflow and The Stack Overflow Podcast. Todo el contenido del podcast, incluidos episodios, gráficos y descripciones de podcast, lo carga y proporciona directamente Stack Overflow and The Stack Overflow Podcast o su socio de plataforma de podcast. Si cree que alguien está utilizando su trabajo protegido por derechos de autor sin su permiso, puede seguir el proceso descrito aquí https://es.player.fm/legal.
Player FM : aplicación de podcast
¡Desconecta con la aplicación Player FM !

The server-side rendering equivalent for LLM inference workloads

21:44
 
Compartir
 

Manage episode 501111517 series 1123725
Contenido proporcionado por Stack Overflow and The Stack Overflow Podcast. Todo el contenido del podcast, incluidos episodios, gráficos y descripciones de podcast, lo carga y proporciona directamente Stack Overflow and The Stack Overflow Podcast o su socio de plataforma de podcast. Si cree que alguien está utilizando su trabajo protegido por derechos de autor sin su permiso, puede seguir el proceso descrito aquí https://es.player.fm/legal.

Ryan is joined by Tuhin Srivastava, CEO and co-founder of Baseten, to explore the evolving landscape of AI infrastructure and inference workloads, how the shift from traditional machine learning models to large-scale neural networks has made GPU usage challenging, and the potential future of hardware-specific optimizations in AI.

Episode notes:

Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring AI products to market fast.

Connect with Tuhin on LinkedIn or reach him at his email [email protected].

Shoutout to user Hitesh for winning a Populist badge for their answer to Cannot drop database because it is currently in use.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

  continue reading

894 episodios

Artwork
iconCompartir
 
Manage episode 501111517 series 1123725
Contenido proporcionado por Stack Overflow and The Stack Overflow Podcast. Todo el contenido del podcast, incluidos episodios, gráficos y descripciones de podcast, lo carga y proporciona directamente Stack Overflow and The Stack Overflow Podcast o su socio de plataforma de podcast. Si cree que alguien está utilizando su trabajo protegido por derechos de autor sin su permiso, puede seguir el proceso descrito aquí https://es.player.fm/legal.

Ryan is joined by Tuhin Srivastava, CEO and co-founder of Baseten, to explore the evolving landscape of AI infrastructure and inference workloads, how the shift from traditional machine learning models to large-scale neural networks has made GPU usage challenging, and the potential future of hardware-specific optimizations in AI.

Episode notes:

Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring AI products to market fast.

Connect with Tuhin on LinkedIn or reach him at his email [email protected].

Shoutout to user Hitesh for winning a Populist badge for their answer to Cannot drop database because it is currently in use.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

  continue reading

894 episodios

すべてのエピソード

×
 
Loading …

Bienvenido a Player FM!

Player FM está escaneando la web en busca de podcasts de alta calidad para que los disfrutes en este momento. Es la mejor aplicación de podcast y funciona en Android, iPhone y la web. Regístrate para sincronizar suscripciones a través de dispositivos.

 

Guia de referencia rapida

Escucha este programa mientras exploras
Reproducir