The Math Still Matters: Deep Skills in the Age of AI, with Dr. Catherine Williams

Added on July 22, 2026 by Jon Krohn.

Dr. Catherine Williams was solving black-hole equations with pen and paper before she ever wrote a line of code and, in today’s episode, she makes the case that going deep on AI/ML math matters more than ever...

...even now that AI can do the math for you.

More on Dr. Williams:
• PhD in math researching general relativity and black holes.
• Postdocs at Stanford University and Columbia University.
• Became one of the very first data scientists when she joined AppNexus back in 2012, around the same time "data scientist" became a job title.
• Across more than a decade of senior data leadership at AppNexus, Xander, Qualtrics and now Candid, she's watched our field get born and then reinvent itself again and again.

In today's episode, Catherine traces the data science and AI evolution — from Bayesian models to BERT to today's LLMs — and shares sharp guidance on which skills will still matter as machines take over more of the technical work.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Fable 5 as Advisor: Anthropic’s Two-Model Pattern for Smarter, Cheaper Agents

Added on July 20, 2026 by Jon Krohn.

Want near-frontier A.I. agent quality at a fraction of the cost? Anthropic recently productized the Advisor Strategy that pairs a cheap "executor" model with a brilliant "advisor" to give you the best of both worlds:

HOW IT WORKS
• A fast, cheap model (e.g., Claude Haiku or Sonnet) runs the entire agent loop: calling tools, writing code, drafting output.
• A frontier model (e.g., Claude Opus or Fable) sits on standby as a "tool" the executor can consult (like a junior worker phoning their supervisor when unsure).
• Everything happens inside one API call: Anthropic's servers hand the advisor the full conversation transcript and return just 400-700 tokens of advice, making this fast and inexpensive (it's also usually only a one-line code change so it's easy to implement).

THE RESULTS
• Sonnet + Opus advisor beat Sonnet alone on the "SWE-bench Multilingual" benchmark by 2.7 percentage points while cutting cost per task by 11.9%. Better quality AND slightly lower cost.
• Unsurprisingly, the biggest gains come from pairing a very fast/cheap model with a much more capable advisor: For example, on BrowseComp (web research benchmark), Haiku alone scored 19.7%; Haiku + Opus advisor scored 41.2% (more than double!) at 85% less cost than Sonnet alone.
• Newest data, from last week: On "SWE-bench Pro", Sonnet 5 + a Fable 5 advisor captured ~92% of Fable's standalone performance at ~63% of its cost.

WHY IT WORKS
• The advisor's output is tiny relative to the whole task, and a good plan delivered early prevents wasted attempts and misguided tool calls.
• Unlike OpenAI's router (which dispatches queries to a model up front), the cheap model runs the show and escalates itself mid-task with full shared context.

PRACTICAL LESSONS
• Skip it for single-turn Q&A; it shines on long-horizon agentic work (like coding, research, computer use).
• Executors under-call the advisor by default so prompt them to consult it early (before committing to an approach) and late (before declaring the task done).
• Cap advisor output at ~2,000 tokens (~7x cost reduction, no quality loss) and enable prompt caching for long loops.
• The pattern is spreading: OpenRouter now offers a cross-provider version (e.g., a Google Gemini executor consulting Claude).
• Alternative design patterns such as having a powerful "orchestrator" (shown below the advisor pattern in the chart I included in this post) might work even more effectively for your use case so it could be worth comparing them.

BOTTOM LINE
Frontier A.I. progress is no longer just bigger models... it's smarter economics in composing the models we already have.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

The AI-Native Startup Playbook

Added on July 16, 2026 by Jon Krohn.

Anthropic recently published a 35-page "Founder's Playbook" for building an A.I.-native startup. It doubles as marketing for their products, but the guidance is disciplined, specific and useful:

THE PREMISE
• A.I. has removed the three bottlenecks that historically gated company-building: capital, headcount and technical skill.
• The founder's role shifts from individual contributor to "orchestrator of agents": Your scarce attention goes to deciding what to build and why; A.I. handles much of the execution.
• Each of the 4 stages of the playbook boils down to one principle: Keep your sense-making ahead of your building, especially when building feels effortless.

STAGE 1: IDEA
• The #1 trap is "mistaking building for validating". 42% of startups already failed by building something nobody wanted; expect that rate to climb now that prototypes take hours, not months.
• Sharpen your problem statement into a testable hypothesis: exactly who has the problem, how often, how severely and what they currently do about it.
• Use A.I. as a structured devil's advocate. Ask it to argue *against* your idea and find disconfirming evidence... A.I. tools have given confirmation bias a serious power-up.
• In customer interviews, ask about the specific past ("tell me about the last time..."), not the hypothetical future ("would you use...?").

STAGE 2: MVP
• Beware "agentic technical debt": Without written specs and architectural constraints, each AI coding session re-derives decisions from scratch and your codebase drifts.
• Fix: Document your architecture BEFORE you build, and log key decisions after each session. Five minutes of documentation is cheap insurance.
• Write a scope document stating what the MVP deliberately does NOT do; frictionless building makes scope creep nearly free.
• Define your retention and activation benchmarks before launch so early buzz doesn't masquerade as product-market fit.

STAGE 3: LAUNCH
At Launch, *you* become the bottleneck. Audit everything you handle: What can be automated, what needs a human (not necessarily you) and what merits founder judgment.

STAGE 4: SCALE
At Scale, the question is defensibility: If a well-funded incumbent copied you today, would users stay? Moats come from encoded domain expertise, compounding user data and workflow lock-in.

Thanks to my friend and A.I.-native founder Jeff Tompkins for pointing this guide out to me! Very helpful indeed :)

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

How AI Is Quietly Saving Lives, with Steve Mock

Added on July 16, 2026 by Jon Krohn.

Negative A.I. buzz makes most of the headlines but there are lots of ways that A.I. has made big (even life-changing!) positive impacts on people. In today's episode, Steve Mock, shares many such inspiring stories.

More on Steve:
• Investor at the venture capital firm Blumberg Capital.
• Entrepreneur involved in growing five successful software businesses.
• Creator and developer (without writing any code!) of a website called AISavedMe.org that has a wide range of inspiring examples from healthcare to education to more trivial engineering stories.

In today's episode, we discuss:
• AISavedMe.org and the stories users have posted on the site.
• How he built the website without having a technical background.
• Lots of market insights from his entrepreneur-investor brain.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Official Sizzle Reel

Added on July 9, 2026 by Jon Krohn.

In show business, they call this a "Sizzle Reel"... For a few years now, I've done a fair bit of work for TV; this short, punchy video highlights the best bits.

Thanks to Mario Pombo for creating such an exceptional reel. Don't hesitate to reach out to Mario (or I can introduce you) if you need exceptional media-editing done.

And thanks to Dylan Silverstein from AGI Entertainment Media and Management LLC for representing me and finding me such exciting projects for on both stage and screen. Don't hesitate to reach out to Dylan if you have ideas on ways we could collaborate.

P.S.: I believe the "AGI" in "AGI Entertainment" is derived from "Artists Group International" not "Artificial General Intelligence" 😂

How to Find Solid Career Ground in the AI Era, with 80,000 Hours Founder Ben Todd

Added on July 8, 2026 by Jon Krohn.

Today's episode is not to be missed! Benjamin Todd returns with alarming A.I. possibilities (e.g., outcomes *worse* than human extinction) but also practical roadmaps for you to mitigate risks and thrive in the A.I. future.

Ben Todd:
• Author of the new Penguin Random House book "80,000 Hours: Find a Fulfilling Career that Does Good".
• Co-founder and President of 80,000 Hours, a globally renowned charity dedicated to helping people find careers they love.
• Is particularly expert at A.I. futures, A.I. careers and the future of work.
• Holds a Master's in physics and philosophy from the University of Oxford.

In this episode, we discuss:
• Ben's data-backed, extensively-researched guidance for the A.I. era.
• The best career advice from his brand-new book.
• Ways A.I. practitioners can find solid ground and do the most good.
• ...as well as ways A.I. could do the most bad!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Case You Missed It in June 2026

Added on July 6, 2026 by Jon Krohn.

It is mighty hot in New York rn... but not nearly as spicy as the interviews on my podcast in June! ICYMI, here are the best bits of my on-air convos last month:

1. Two-time mega-bestselling O'Reilly author Chip Huyen on what's left for humans to do when the cost of building software is headed to $0.

2. Andrey Kurenkov, co-host of my favorite podcast ("Last Week in A.I.") and Founding A.I. Lead at Astrocade, on effective vibe-coding.

3. Lightning AI's VP of Infrastructure Frank Basso on what it's actually like inside an A.I. data center.

4. Gilbert Eijkelenboom on why 85% of data scientists can't communicate their work effectively... and the framework for fixing this.

5. In a role-reversal for landmark Episode #1001, the founder and original host of the SuperDataScience Podcast, Kirill Eremenko, interviewed me. In this clip, we discussed whether AGI would require something like consciousness to be realized.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

People Skills for Analytical Thinkers, with Bestselling Author Gilbert Eijkelenboom

Added on July 1, 2026 by Jon Krohn.

Gilbert Eijkelenboom was a pro poker player who read his opponents through data alone but in today's episode, the bestselling author explains why the people side of data science matters more than the math.

More on Gilbert:
• Wrote the bestselling book "People Skills for Analytical Thinkers".
• Run MindSpeaking, a firm that's trained over 15,000 (mostly technical) folks on "people skills".
• Folks love his invaluable content, allowing him to gather over 200k followers.
• Was previously Managing Consultant on data and digital analytics for Capgemini, as well as a professional poker player on BetVictor.
• Holds a Master's in behavioral economics from Maastricht University.

In today's episode, Gilbert covers:
• Why no matter how good your model or analysis is, it only creates value once people actually use it, which makes communication a core data skill rather than an optional extra.
• His "and, but, therefore" communications framework.
• How research suggests only around 15% of people are self-aware and his tips for closing that gap.
• How experiences in childhood install personal "algorithms" in our adult behavior like avoiding conflict or staying silent... but we can change as adults (and he also provides tips on how).

Thanks to Kate Strachnyi for suggesting Gilbert as a guest!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Recursive Self-Improvement

Added on June 29, 2026 by Jon Krohn.

Recursive Self-Improvement (RSI) is suddenly a term that's everywhere. What is RSI? How concerned should be about it? And how soon can we expect it? Here's the skinny:

WHAT IS RSI?
• The idea: An A.I. gets good enough at A.I. research to build a more capable successor, which builds an even better one, in a loop that compounds every turn.
• What we have today is *not* RSI but "A.I.-assisted coding", in which humans still set the goals and judge the results (actual RSI takes the human out of the loop, as shown in the diagram).
• RSI isn't a new concept; it's been around since at least 1965 when mathematician I.J. Good described an "intelligence explosion".

WHAT'S THE CONCERN?
RSI could unleash Artificial Superintelligence (ASI) and "the singularity", a point beyond which there could be radical abundance and radically positive outcomes for humanity... but we have no idea what will happen beyond the singularity and that's also a cause for concern (e.g., human extinction risk, Terminator-style "SkyNet", etc.).

HOW CLOSE ARE WE TO RSI?
• Anthropic reports that, as of May 2026, over 80% of code merged into its production codebase was written by Claude — up from low single digits before early 2025.
• On the hardest open-ended problems, its models' success rate jumped from under 20% in late 2025 to 76% by May.
• Think-tank METR finds the length of tasks A.I. can handle solo is now doubling roughly every four months, up from the "doubling every seven months" trend of the past few years.
• Anthropic co-founder Jack Clark puts a 60% chance on an A.I. creating its own successor, with no human involved, by the end of 2028.

REASONS TO BE SKEPTIC
• Skeptics flag two bottlenecks: compute (chips are scarce) and data (success is hard to verify outside code and math, risking "recursive drift").
• Others note the gap between today's coding agents and real RSI is wider than the hype suggests.

BOTTOM LINE
The productivity gains from coding assistants are real, accelerating rapidly and already in your hand. The closer we get to systems that improve themselves, the more it pays to keep human checkpoints, monitoring and oversight firmly in place.

Listen to the most recent episode of my podcast (Episode #1004) to hear more on all of the above, including what you can do personally to mitigate the risks of RSI if that's a way you might like to make an impact!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Building an AI Data Center End to End, with Lightning AI’s Frank Basso

Added on June 24, 2026 by Jon Krohn.

We've done over 1,000 episodes of this show on every layer of the A.I. stack... except the one that physically runs all of it: the A.I. data center. Today we fix that in a fascinating episode with Lightning AI's Frank Basso.

Frank is VP of Infrastructure at Lightning AI, a New York-based company that has over 35,000 modern GPUs, over $500m in ARR, and that makes it easy to go from A.I. idea to product, "lightning fast" (I hold a fellowship at Lightning so am not an unbiased source on the business, btw). Frank himself is based in Los Angeles and, prior to Lightning, he spent decades directing the development of data centers in California.

In this exceptionally informative episode, Frank explains:
• How Lightning provisions its 35,000+ GPUs through hyperscale co-location.
• Why everything new is liquid-to-chip cooled.
• How GPUs talk to each other over ultra-fast east-west networks.
• What it’s actually like to stand inside a 110-decibel A.I. data hall.
• The most persistent myths about data-center water and electricity use.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Fable 5: The Full Story from Capabilities to Drama

Added on June 22, 2026 by Jon Krohn.

The dust has settled, allowing me to provide you with all the key context you need to know on Fable 5, the most capable A.I. model ever offered to the public, and the US government forcing it off shelves three days later:

A NEW CLASS OF MODEL
• Anthropic stacks its models in tiers: Haiku (small and fast), Sonnet (the capable middle) and Opus (the powerful top). Sitting above all of them now is a "Mythos-class" tier.
• Fable 5 and its locked-down sibling Mythos 5 are the same underlying model... the only difference is the safeguards.
• Mythos 5 goes to trusted cyberdefenders with guardrails largely lifted; Fable 5 went to the public with them switched on.

WHAT IT COULD DO
• State-of-the-art on nearly every benchmark Anthropic tested... and the lead grows the longer and more complex the task (see chart).
• Stripe ran a codebase-wide migration on 50M lines of Ruby in a single day; work estimated at 2+ months for a full engineering team.
• Beat video "Pokémon FireRed" from raw screenshots alone, and got a 3x bigger memory boost than Opus on "Slay the Spire".
• Priced at $10/$50 per million input/output tokens: roughly 2x Opus 4.8, but under half the original Mythos Preview.

SAFETY BY DESIGN
• Classifiers watch three sensitive areas: cybersecurity, biology/chemistry and distillation (extracting a model to train a rival).
• Flagged requests quietly fall back to Opus 4.8 and the user is told.
• Triggers fire in under 5% of sessions. Anthropic admits it tuned conservatively, so some harmless prompts get bounced too.

THE THREE-DAY SHUTDOWN
• On Friday evening the federal government ordered Anthropic to switch off both Fable 5 and Mythos 5 worldwide, citing national security.
• The mechanism was an export-control action covering foreign nationals everywhere (including even, say, Canadian Anthropic employees living in the US!)... so broad that Anthropic pulled the model for absolutely everyone.
• The trigger was a reported jailbreak of the cyber safeguards by Amazon. Anthropic disputes its severity, calling it narrow and non-universal.

BOTTOM LINE
A premium-tier model, wrapped in deliberately cautious safeguards, pulled by its own government not long before Anthropic's reported IPO and the latest in a public battle between the firm and the federal government. Sessions now fall back to Opus 4.8. Whether Fable returns (and on what terms) depends on a fight that's far from over.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

How AI Erased My Career Moat, an Episode #1001 Special: Jon Krohn interviewed by Kirill Eremenko

Added on June 17, 2026 by Jon Krohn.

To mark cresting over 1000 episodes, today’s features a role reversal: Kirill Eremenko (who founded the podcast a decade ago) returns to host and welcomes *me* as the guest. Kirill's still got it, enjoy!

Kirill hosted the first 431 episodes of the SuperDataScience Podcast before handing me the reins five years ago. In today's role-reversal episode, we discuss:
• A.I. rapidly usurping our technical skills
• Whether we’re in an A.I. bubble
• The one key reason why I’ve seen A.I. projects fail
• Relationships between A.I. and biological neuroscience.

... so, as usual, lots of A.I. in this episode, but unusually, I’m the one answering the questions instead of asking them!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Ten Years of the Super Data Science Podcast, with Jon, Kirill and Special Guests

Added on June 15, 2026 by Jon Krohn.

Today, we published Episode #1000 of the SuperDataScience Podcast! To celebrate, the show's original host Kirill Eremenko joined me and dozens of regular listeners on air to predict what the next 10 years of A.I. will bring.

In a bit more detail:
• We publish 104 episodes per year so Episode #1000 coincides with the show being about ten years old.
• The show was founded by Kirill Eremenko in 2016, who hosted over 400 episodes before handing me the reins in 2021.
• In a first for the show, Episode #1000 was streamed live online with our audience invited to join on air.
• Most folks interacted via chat functionality but a number of surprise guests came right onto the recording including Natalie Ziajski and Mario Pombo from the podcast team, rockstar A.I. entrepreneur Jepson Taylor, my 96-year-old grandmother and my very own pa, William Krohn.
• Kirill and I looked back on a decade of the podcast and fielded listener questions on topics such as A.I.’s biggest opportunities, the build-versus-buy dilemma, how to break into the field today, and how to stay grounded amid the relentless pace of A.I.

Thank you for support and listenership over all these years — we make this show for you and couldn't do it without you! We're excited to see what the next decade brings :)

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

CBS PrimeTime News: OpenAI IPO & Public Ownership of AI

Added on June 11, 2026 by Jon Krohn.

Last night, I was on CBS PrimeTime News to discuss OpenAI's upcoming IPO, as well as Senator Bernie Sanders' plans for an A.I. sovereign wealth fund... an idea I thought could be "awkward".

What’s Left to Build When Software Is Free, with Chip Huyen

Added on June 10, 2026 by Jon Krohn.

For today's landmark episode (#999!), I asked rockstar Chip Huyen to be my guest and she said "yes"! We discuss her book "A.I. Engineering" (the most popular O'Reilly book in 2025) and how the A.I. job landscape is shifting.

In case you haven't heard of her, more on Chip:
• Her most recent book is "AI Engineering", which was the most popular book in the O'Reilly platform last year.
• Previously wrote “Designing Machine Learning Systems”, which was also an O'Reilly mega-bestseller and was based on the Stanford University course she created and taught on the same topic.
• Is currently building a new stealth startup.
• Previously worked as VP of AI at Voltron Data, co-founder of Claypot AI, ML Engineer at Snorkel AI and Sr Deep Learning Engineer at NVIDIA.
• Holds a Master's in Computer Science from Stanford.
• Her invaluable posts have earned her over 300k followers on LinkedIn.

In this episode, Chip breaks down:
• What separates AI engineering from machine learning engineering.
• The case for a "start simple" workflow.
• The real costs of running LLMs in production.
• Physical AI.
• Robotics.
• World models.
• Why the durable problems worth solving are increasingly human ones.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Case You Missed It in May 2026

Added on June 8, 2026 by Jon Krohn.

Well, I certainly learned a lot from the outstanding guests we had on my podcast in May. ICYMI, today's episode features the best parts of my conversations with them:

1. Rubrik's Anneka Gupta and Cal Al-Dhubaib on how, in the Mythos era, the old cybersecurity playbook of prevention and detection is no longer enough, and how A.I. agents themselves are becoming a new source of data exposure inside organizations.

2. marimo's Dr. Trevor Manz on why code notebooks have become the natural working memory for A.I. coding agents. Trevor walks me through the Marimo Pair skill, which lets you drive a notebook from your agent, collaborating with Claude Code or Codex in real time as you load, explore, and visualize your data.

3. Jazmia Henry of collide. walks me through her work as a "full-stack" foundation model builder. We cover all four stages of the process: the often unglamorous slog of data curation, building bespoke tokenizers and embeddings, model training and reinforcement learning, and the inference layer that serves it all to end users.

4. Jacob Miller and Jeremy Mumford of Pattern (and authors of the great, brand-new book "Architected Intelligence") argue that the most expensive AI mistake an organization can make is failing slowly and sticking with prototypes long past their sell-by date because the traditional software mindset says you have to. We, of course, also discuss a solution.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

How This Text-to-Video-Game AI Startup Hit 20M Users

Added on June 3, 2026 by Jon Krohn.

Imagine being able to vibe-code full-blown video games... for free! My returning guest, Dr. Andrey Kurenkov, helped engineer Astrocade to do just that... and already 20 million people have played games through their platform.

More on Andrey:
• Founding A.I. Lead at Astrocade, a Bay Area-based startup that has raised $68m in venture capital to create the TikTok of video games, where creators create games for free and you play them for free.
• Co-host (alongside Jeremie Harris) of my favorite podcast, "Last Week in A.I.".
• Holds a PhD from Stanford University, where his research focused on machine vision and robotics.

In this episode, we discuss:
• The fascinating Astrocade journey, of course.
• The surprising pace of humanoid robotics.
• Why he's a skeptic on Artificial Super Intelligence.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

TrueFoundry’s Nikunj Bajaj on How to Get $100M Returns on AI Agent Deployments

Added on June 1, 2026 by Jon Krohn.

Imagine being able to deploy an AI agent and getting a return of over $100m from that single deployment. My guest today, Nikunj Bajaj, has facilitated that multiple times! Lots to learn from him, enjoy!

Nikunj:
• CEO and co-founder of TrueFoundry, a Bay Area-based startup that has raised over $20m to solve the thorniest problems that enterprises face when deploying agents.
• His clients include demanding organizations like NVIDIA and Siemens.
• Was previously ML tech lead at Facebook.
• Holds a master's in computer science from University of California, Berkeley.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Episode #1000: Join Us Live for the First-Ever Interactive SDS Podcast!

Added on May 29, 2026 by Jon Krohn.

Ten years ago, Kirill Eremenko founded The SuperDataScience Podcast. To celebrate the upcoming Episode #1000, we are inviting you to join us both in a format we've never tried before:

It will be the first-ever interactive episode where you can join in online as we record, and ask your questions... or I suppose just make comments! You'll be able to ask share your thoughts in the chat or come right onto the show via video.

Kirill (founder, original host) and I (current host) will both be there, so you can ask us anything, e.g.:

• Did Kirill think the show would last ten years and 1000 episodes?

• How has data science transformed over the past decade?

• Did Jon have hair on his head ten years ago?

Date: Next Thursday, June 4th

Time: 5pm Eastern Time / 2pm Pacific Time

To get a calendar invite that includes the URL to join us live, check out the Luma link below ⬇️
luma.com/7vl7mdos

The "Super Data Science Podcast with Jon Krohn" is available on all major podcasting platforms and a video version is on YouTube. Whether you join us or not for the interactive recording, Episode #1000 will be published on Friday June 12th!

End-to-End Foundation Models for the Energy Industry, with Jazmia Henry

Added on May 28, 2026 by Jon Krohn.

What does it take to build foundation LLMs from scratch today? Deeply impressive Jazmia Henry breaks down the four stages in today's episode, enjoy!

Jazmia:
• Holds degrees from Tulane University and Columbia University... and is partway through a PhD at the University of Oxford.
• Held a technical fellowship at Stanford University.
• Previously worked as a data strategist at Morgan Stanley, head of ML at The Motley Fool and a Lead Applied AI engineer at Microsoft.
• Published a top paper at NeurIPS, the world's most prestigious academic AI conference.
• Currently works as "Member of Technical Staff for AI/ML" at collide., a Texas-based startup that’s building AI infrastructure (including all aspects of specialized foundation models) for the energy industry.

Key topics covered in this episode include:
• What foundation models are.
• Her "full-stack" foundation-model building's four distinct stages.
• How reinforcement learning (RL) models are "bursty" because they idle the GPU during reward calculation and then dump enormous loads on it all at once.
• Reward hacking by RL models.

Thanks to Mark Freeman II for recommending Jazmia as a guest.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.