Filtering by Tag: #LLMs

What’s Left to Build When Software Is Free, with Chip Huyen

Added on June 10, 2026 by Jon Krohn.

For today's landmark episode (#999!), I asked rockstar Chip Huyen to be my guest and she said "yes"! We discuss her book "AI Engineering" (the most popular O'Reilly book in 2025) and how the A.I. job landscape is shifting.

In case you haven't heard of her, more on Chip:
• Her most recent book is "AI Engineering", which was the most popular book on the O'Reilly platform last year.
• Previously wrote “Designing Machine Learning Systems”, which was also an O'Reilly mega-bestseller and was based on the Stanford University course she created and taught on the same topic.
• Is currently building a new stealth startup.
• Previously worked as VP of AI at Voltron Data, co-founder of Claypot AI, ML Engineer at Snorkel AI and Sr Deep Learning Engineer at NVIDIA.
• Holds a Master's in Computer Science from Stanford.
• Her invaluable posts have earned her over 300k followers on LinkedIn.

In this episode, Chip breaks down:
• What separates AI engineering from machine learning engineering.
• The case for a "start simple" workflow.
• The real costs of running LLMs in production.
• Physical AI.
• Robotics.
• World models.
• Why the durable problems worth solving are increasingly human ones.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

End-to-End Foundation Models for the Energy Industry, with Jazmia Henry

Added on May 28, 2026 by Jon Krohn.

What does it take to build foundation LLMs from scratch today? The deeply impressive Jazmia Henry breaks down the four stages in today's episode. Enjoy!

Jazmia:
• Holds degrees from Tulane University and Columbia University... and is partway through a PhD at the University of Oxford.
• Held a technical fellowship at Stanford University.
• Previously worked as data strategist at Morgan Stanley, head of ML at The Motley Fool and Lead Applied AI Engineer at Microsoft.
• Published a top paper at NeurIPS, the world's most prestigious academic AI conference.
• Currently works as "Member of Technical Staff for AI/ML" at collide., a Texas-based startup that’s building AI infrastructure (including all aspects of specialized foundation models) for the energy industry.

Key topics covered in this episode include:
• What foundation models are.
• The four distinct stages of her "full-stack" approach to building foundation models.
• How reinforcement learning (RL) models are "bursty" because they idle the GPU during reward calculation and then dump enormous loads on it all at once.
• Reward hacking by RL models.

Thanks to Mark Freeman II for recommending Jazmia as a guest.

How to Build AI-First Organizations, with Jacob Miller and Jeremy Mumford

Added on May 20, 2026 by Jon Krohn.

After today's fun episode with Jacob and Jeremy — authors of the brand-new book "Architected Intelligence" — you’ll have all the key info to build successful AI features, AI products and AI-first companies. Enjoy!

Jeremy Mumford and Jacob Miller serve as Lead AI Engineer and Vice President of Platform Intelligence, respectively, at Pattern, a giant Utah-based tech company that IPO’ed on the Nasdaq exchange about six months ago.

Jacob and Jeremy's brand-new book, "Architected Intelligence", was published by Wiley, and this episode focuses almost exclusively on it.

Episode highlights include:
• The "User Agnosticism Tenet", which means designing products and processes so they can be executed equally well by a human, an AI agent, or any hybrid combo.
• The shift in the "define-build-feedback" loop today where "building" is no longer the bottleneck, which means "definition" and "feedback" are where teams win or lose.
• Why workflows are deterministic, predictable, and cheaper than agents, and why the natural progression is skills first, then workflows, and only then agents.
• Why data engineering is the bedrock of AI engineering.
• Why velocity is the only durable moat in a world where everyone has access to the same frontier models.
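
To make the workflow point above concrete: a workflow can be pictured as a fixed sequence of steps, so every input takes the same path through the code. The sketch below is a minimal illustration under that assumption. The ticket-routing scenario and every name in it (`extract_fields`, `classify`, `route`, `run_workflow`) are hypothetical and are not taken from the book or the episode.

```python
from typing import Callable

Step = Callable[[dict], dict]


def extract_fields(ticket: dict) -> dict:
    # Hypothetical step: normalize the raw text.
    return {**ticket, "text": ticket["text"].strip().lower()}


def classify(ticket: dict) -> dict:
    # Hypothetical rule-based step; in practice this could be a single model call.
    label = "billing" if "invoice" in ticket["text"] else "general"
    return {**ticket, "label": label}


def route(ticket: dict) -> dict:
    # Hypothetical mapping from label to team queue.
    queues = {"billing": "finance-team", "general": "support-team"}
    return {**ticket, "queue": queues[ticket["label"]]}


WORKFLOW: list[Step] = [extract_fields, classify, route]


def run_workflow(ticket: dict, steps: list[Step] = WORKFLOW) -> dict:
    # The step order is fixed in code, so the path is the same for every input.
    for step in steps:
        ticket = step(ticket)
    return ticket


result = run_workflow({"text": "  Question about my INVOICE  "})
print(result["queue"])  # finance-team
```

An agent, by contrast, would let a model choose the next step at run time, which buys flexibility at the price of extra model calls and less predictable behavior.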

Thanks to podcast superfan Jonathan Bown for recommending Jeremy and Jacob as guests!

AI Infrastructure, Ray, and Why Nonlinear Careers Win, with Linda Haviv

Added on April 28, 2026 by Jon Krohn.

For folks in A.I., software and data science, things are moving so fast that it's easy to be overwhelmed. Luckily, A.I. engineer Linda Haviv makes it a joy to stay up to date! Today, we discuss career tips as well as open-source A.I. tech like Ray.

More on Linda:
• Until recently, was Staff Developer Advocate at Anyscale, makers of Ray, an open-source framework for managing, executing and optimizing A.I. compute.
• Previously was A.I. Developer Advocate at Amazon Web Services (AWS).
• Before that, was a software developer at Fox Corporation.
• Was a professional singer in New York up until the second of her three (!) children was born.
• Holds a degree in philosophy from Baruch College.

In this episode, Linda ebulliently covers:
• How "A.I. infrastructure" refers to the compute stack, tooling and frameworks purpose-built for A.I. and ML workloads.
• How Ray, a Python-native open-source distributed computing framework, lets engineers distribute training, data processing and model serving across GPUs without needing to become distributed systems experts.
• How building in public, creating content and contributing to open source are not just career insurance... they're how you find your community, attract unexpected opportunities and learn faster through teaching.
• And much more!

The Four Types of Memory Every AI Agent Needs, with Richmond Alake

Added on April 22, 2026 by Jon Krohn.

To build an effective A.I. agent, getting its memory right is essential. In today's episode, our agent-memory guide is brilliant (and very funny!) machine-learning architect and engineer, Richmond Alake.

More on Richmond:
• Director of A.I. developer experience at Oracle.
• Previous roles include: staff developer advocate for AI/ML at MongoDB, ML architect at Slalom, writer for NVIDIA and computer-vision engineer at Loveshark.
• Holds a master's in ML and robotics from the University of Surrey.

In this episode, Richmond magnificently covers:
• How agent memory is the encapsulation of systems (embedding models, rerankers, databases, and LLMs) that allow AI agents to learn from and adapt to new information over time, rather than starting from scratch every session.
• The four types of agent memory (all drawn from human cognition).
• Memory-first agent harnesses.
• Predictions for a flattening of AI engineering roles, where the future developer will need end-to-end understanding of the full agent stack.
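
The core idea in the first bullet, an agent that carries information across sessions instead of starting from scratch, can be sketched in a few lines of Python. This is a toy under loud assumptions: the `ToyMemory` class is hypothetical, a JSON file stands in for the database, and plain word overlap stands in for the embedding model and reranker. It does not model the four memory types from the episode.

```python
import json
import tempfile
from pathlib import Path


class ToyMemory:
    """Hypothetical stand-in for an agent memory layer.

    A JSON file plays the role of the database, and word overlap plays the
    role of the embedding model and reranker.
    """

    def __init__(self, path: Path) -> None:
        self.path = path
        # Reload anything saved by an earlier session.
        self.notes: list[str] = json.loads(path.read_text()) if path.exists() else []

    def remember(self, note: str) -> None:
        # Append the note and persist the full list to disk.
        self.notes.append(note)
        self.path.write_text(json.dumps(self.notes))

    def recall(self, query: str, k: int = 1) -> list[str]:
        # Rank notes by shared words with the query; drop notes with no overlap.
        query_words = set(query.lower().split())

        def overlap(note: str) -> int:
            return len(query_words & set(note.lower().split()))

        ranked = sorted(self.notes, key=overlap, reverse=True)
        return [note for note in ranked[:k] if overlap(note) > 0]


store = Path(tempfile.mkdtemp()) / "memory.json"

# Session 1: the agent learns two facts, then the session ends.
session1 = ToyMemory(store)
session1.remember("the user prefers answers in french")
session1.remember("the deploy target is kubernetes")

# Session 2: a fresh object reloads what session 1 saved.
session2 = ToyMemory(store)
print(session2.recall("which language does the user prefer"))
# ['the user prefers answers in french']
```

A production system would swap each toy piece for the real component the bullet names, but the shape stays the same: write during one session, retrieve by relevance in the next.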
