Artwork

Contenido proporcionado por Raza Habib. Todo el contenido del podcast, incluidos episodios, gráficos y descripciones de podcast, lo carga y proporciona directamente Raza Habib o su socio de plataforma de podcast. Si cree que alguien está utilizando su trabajo protegido por derechos de autor sin su permiso, puede seguir el proceso descrito aquí https://es.player.fm/legal.
Player FM : aplicación de podcast
¡Desconecta con la aplicación Player FM !

Why Your AI Product Needs Evals with Hamel Husain and Swyx

1:09:02
 
Compartir
 

Manage episode 441766382 series 3586305
Contenido proporcionado por Raza Habib. Todo el contenido del podcast, incluidos episodios, gráficos y descripciones de podcast, lo carga y proporciona directamente Raza Habib o su socio de plataforma de podcast. Si cree que alguien está utilizando su trabajo protegido por derechos de autor sin su permiso, puede seguir el proceso descrito aquí https://es.player.fm/legal.

Hamel Husain is a seasoned AI consultant and engineer with experience at companies like GitHub, DataRobot, and Airbnb. He is a trailblazer in AI development, known for his innovative work in literate programming and AI-assisted development tools. Shawn Wang (aka Swyx) is the host of the Latent Space podcast, the author of the essay 'Rise of the AI Engineer,' and the founder of the AI Engineer World Fair. In this episode, Hamel and Swyx share their unique insights on building effective AI products, the critical importance of evaluations, and their vision for the future of AI engineering.

Chapters
00:00 - Introduction and recent AI advancements

06:14 - The critical role of evals in AI product development

15:33 - Common pitfalls in AI product development

26:33 - Literate programming: A new paradigm for AI development

39:58 - Answer AI and innovative approaches to software development

51:56 - Integrating AI with literate programming environments

58:47 - The importance of understanding AI prompts

01:00:37 - Assessing the current state of AI adoption

01:07:10 - Challenges in evaluating AI models

--------------------------------------------------------------------------------------------------------------------------------------------------
Humanloop is an Integrated Development Environment for Large Language Models. It enables product teams to develop LLM-based applications that are reliable and scalable. To find out more go to humanloop.com

  continue reading

25 episodios

Artwork
iconCompartir
 
Manage episode 441766382 series 3586305
Contenido proporcionado por Raza Habib. Todo el contenido del podcast, incluidos episodios, gráficos y descripciones de podcast, lo carga y proporciona directamente Raza Habib o su socio de plataforma de podcast. Si cree que alguien está utilizando su trabajo protegido por derechos de autor sin su permiso, puede seguir el proceso descrito aquí https://es.player.fm/legal.

Hamel Husain is a seasoned AI consultant and engineer with experience at companies like GitHub, DataRobot, and Airbnb. He is a trailblazer in AI development, known for his innovative work in literate programming and AI-assisted development tools. Shawn Wang (aka Swyx) is the host of the Latent Space podcast, the author of the essay 'Rise of the AI Engineer,' and the founder of the AI Engineer World Fair. In this episode, Hamel and Swyx share their unique insights on building effective AI products, the critical importance of evaluations, and their vision for the future of AI engineering.

Chapters
00:00 - Introduction and recent AI advancements

06:14 - The critical role of evals in AI product development

15:33 - Common pitfalls in AI product development

26:33 - Literate programming: A new paradigm for AI development

39:58 - Answer AI and innovative approaches to software development

51:56 - Integrating AI with literate programming environments

58:47 - The importance of understanding AI prompts

01:00:37 - Assessing the current state of AI adoption

01:07:10 - Challenges in evaluating AI models

--------------------------------------------------------------------------------------------------------------------------------------------------
Humanloop is an Integrated Development Environment for Large Language Models. It enables product teams to develop LLM-based applications that are reliable and scalable. To find out more go to humanloop.com

  continue reading

25 episodios

Todos los episodios

×
 
Loading …

Bienvenido a Player FM!

Player FM está escaneando la web en busca de podcasts de alta calidad para que los disfrutes en este momento. Es la mejor aplicación de podcast y funciona en Android, iPhone y la web. Regístrate para sincronizar suscripciones a través de dispositivos.

 

Guia de referencia rapida

Escucha este programa mientras exploras
Reproducir