Filtering by Category: Data Science

Fable 5 as Advisor: Anthropic’s Two-Model Pattern for Smarter, Cheaper Agents

Added on July 20, 2026 by Jon Krohn.

Want near-frontier A.I. agent quality at a fraction of the cost? Anthropic recently productized the Advisor Strategy that pairs a cheap "executor" model with a brilliant "advisor" to give you the best of both worlds:

HOW IT WORKS
• A fast, cheap model (e.g., Claude Haiku or Sonnet) runs the entire agent loop: calling tools, writing code, drafting output.
• A frontier model (e.g., Claude Opus or Fable) sits on standby as a "tool" the executor can consult (like a junior worker phoning their supervisor when unsure).
• Everything happens inside one API call: Anthropic's servers hand the advisor the full conversation transcript and return just 400-700 tokens of advice, making this fast and inexpensive (it's also usually only a one-line code change so it's easy to implement).

THE RESULTS
• Sonnet + Opus advisor beat Sonnet alone on the "SWE-bench Multilingual" benchmark by 2.7 percentage points while cutting cost per task by 11.9%. Better quality AND slightly lower cost.
• Unsurprisingly, the biggest gains come from pairing a very fast/cheap model with a much more capable advisor: For example, on BrowseComp (web research benchmark), Haiku alone scored 19.7%; Haiku + Opus advisor scored 41.2% (more than double!) at 85% less cost than Sonnet alone.
• Newest data, from last week: On "SWE-bench Pro", Sonnet 5 + a Fable 5 advisor captured ~92% of Fable's standalone performance at ~63% of its cost.

WHY IT WORKS
• The advisor's output is tiny relative to the whole task, and a good plan delivered early prevents wasted attempts and misguided tool calls.
• Unlike OpenAI's router (which dispatches queries to a model up front), the cheap model runs the show and escalates itself mid-task with full shared context.

PRACTICAL LESSONS
• Skip it for single-turn Q&A; it shines on long-horizon agentic work (like coding, research, computer use).
• Executors under-call the advisor by default so prompt them to consult it early (before committing to an approach) and late (before declaring the task done).
• Cap advisor output at ~2,000 tokens (~7x cost reduction, no quality loss) and enable prompt caching for long loops.
• The pattern is spreading: OpenRouter now offers a cross-provider version (e.g., a Google Gemini executor consulting Claude).
• Alternative design patterns such as having a powerful "orchestrator" (shown below the advisor pattern in the chart I included in this post) might work even more effectively for your use case so it could be worth comparing them.

BOTTOM LINE
Frontier A.I. progress is no longer just bigger models... it's smarter economics in composing the models we already have.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

How AI Is Quietly Saving Lives, with Steve Mock

Added on July 16, 2026 by Jon Krohn.

Negative A.I. buzz makes most of the headlines but there are lots of ways that A.I. has made big (even life-changing!) positive impacts on people. In today's episode, Steve Mock, shares many such inspiring stories.

More on Steve:
• Investor at the venture capital firm Blumberg Capital.
• Entrepreneur involved in growing five successful software businesses.
• Creator and developer (without writing any code!) of a website called AISavedMe.org that has a wide range of inspiring examples from healthcare to education to more trivial engineering stories.

In today's episode, we discuss:
• AISavedMe.org and the stories users have posted on the site.
• How he built the website without having a technical background.
• Lots of market insights from his entrepreneur-investor brain.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

People Skills for Analytical Thinkers, with Bestselling Author Gilbert Eijkelenboom

Added on July 1, 2026 by Jon Krohn.

Gilbert Eijkelenboom was a pro poker player who read his opponents through data alone but in today's episode, the bestselling author explains why the people side of data science matters more than the math.

More on Gilbert:
• Wrote the bestselling book "People Skills for Analytical Thinkers".
• Run MindSpeaking, a firm that's trained over 15,000 (mostly technical) folks on "people skills".
• Folks love his invaluable content, allowing him to gather over 200k followers.
• Was previously Managing Consultant on data and digital analytics for Capgemini, as well as a professional poker player on BetVictor.
• Holds a Master's in behavioral economics from Maastricht University.

In today's episode, Gilbert covers:
• Why no matter how good your model or analysis is, it only creates value once people actually use it, which makes communication a core data skill rather than an optional extra.
• His "and, but, therefore" communications framework.
• How research suggests only around 15% of people are self-aware and his tips for closing that gap.
• How experiences in childhood install personal "algorithms" in our adult behavior like avoiding conflict or staying silent... but we can change as adults (and he also provides tips on how).

Thanks to Kate Strachnyi for suggesting Gilbert as a guest!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Ten Years of the Super Data Science Podcast, with Jon, Kirill and Special Guests

Added on June 15, 2026 by Jon Krohn.

Today, we published Episode #1000 of the SuperDataScience Podcast! To celebrate, the show's original host Kirill Eremenko joined me and dozens of regular listeners on air to predict what the next 10 years of A.I. will bring.

In a bit more detail:
• We publish 104 episodes per year so Episode #1000 coincides with the show being about ten years old.
• The show was founded by Kirill Eremenko in 2016, who hosted over 400 episodes before handing me the reins in 2021.
• In a first for the show, Episode #1000 was streamed live online with our audience invited to join on air.
• Most folks interacted via chat functionality but a number of surprise guests came right onto the recording including Natalie Ziajski and Mario Pombo from the podcast team, rockstar A.I. entrepreneur Jepson Taylor, my 96-year-old grandmother and my very own pa, William Krohn.
• Kirill and I looked back on a decade of the podcast and fielded listener questions on topics such as A.I.’s biggest opportunities, the build-versus-buy dilemma, how to break into the field today, and how to stay grounded amid the relentless pace of A.I.

Thank you for support and listenership over all these years — we make this show for you and couldn't do it without you! We're excited to see what the next decade brings :)

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

What’s Left to Build When Software Is Free, with Chip Huyen

Added on June 10, 2026 by Jon Krohn.

For today's landmark episode (#999!), I asked rockstar Chip Huyen to be my guest and she said "yes"! We discuss her book "A.I. Engineering" (the most popular O'Reilly book in 2025) and how the A.I. job landscape is shifting.

In case you haven't heard of her, more on Chip:
• Her most recent book is "AI Engineering", which was the most popular book in the O'Reilly platform last year.
• Previously wrote “Designing Machine Learning Systems”, which was also an O'Reilly mega-bestseller and was based on the Stanford University course she created and taught on the same topic.
• Is currently building a new stealth startup.
• Previously worked as VP of AI at Voltron Data, co-founder of Claypot AI, ML Engineer at Snorkel AI and Sr Deep Learning Engineer at NVIDIA.
• Holds a Master's in Computer Science from Stanford.
• Her invaluable posts have earned her over 300k followers on LinkedIn.

In this episode, Chip breaks down:
• What separates AI engineering from machine learning engineering.
• The case for a "start simple" workflow.
• The real costs of running LLMs in production.
• Physical AI.
• Robotics.
• World models.
• Why the durable problems worth solving are increasingly human ones.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Case You Missed It in May 2026

Added on June 8, 2026 by Jon Krohn.

Well, I certainly learned a lot from the outstanding guests we had on my podcast in May. ICYMI, today's episode features the best parts of my conversations with them:

1. Rubrik's Anneka Gupta and Cal Al-Dhubaib on how, in the Mythos era, the old cybersecurity playbook of prevention and detection is no longer enough, and how A.I. agents themselves are becoming a new source of data exposure inside organizations.

2. marimo's Dr. Trevor Manz on why code notebooks have become the natural working memory for A.I. coding agents. Trevor walks me through the Marimo Pair skill, which lets you drive a notebook from your agent, collaborating with Claude Code or Codex in real time as you load, explore, and visualize your data.

3. Jazmia Henry of collide. walks me through her work as a "full-stack" foundation model builder. We cover all four stages of the process: the often unglamorous slog of data curation, building bespoke tokenizers and embeddings, model training and reinforcement learning, and the inference layer that serves it all to end users.

4. Jacob Miller and Jeremy Mumford of Pattern (and authors of the great, brand-new book "Architected Intelligence") argue that the most expensive AI mistake an organization can make is failing slowly and sticking with prototypes long past their sell-by date because the traditional software mindset says you have to. We, of course, also discuss a solution.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

How This Text-to-Video-Game AI Startup Hit 20M Users

Added on June 3, 2026 by Jon Krohn.

Imagine being able to vibe-code full-blown video games... for free! My returning guest, Dr. Andrey Kurenkov, helped engineer Astrocade to do just that... and already 20 million people have played games through their platform.

More on Andrey:
• Founding A.I. Lead at Astrocade, a Bay Area-based startup that has raised $68m in venture capital to create the TikTok of video games, where creators create games for free and you play them for free.
• Co-host (alongside Jeremie Harris) of my favorite podcast, "Last Week in A.I.".
• Holds a PhD from Stanford University, where his research focused on machine vision and robotics.

In this episode, we discuss:
• The fascinating Astrocade journey, of course.
• The surprising pace of humanoid robotics.
• Why he's a skeptic on Artificial Super Intelligence.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

TrueFoundry’s Nikunj Bajaj on How to Get $100M Returns on AI Agent Deployments

Added on June 1, 2026 by Jon Krohn.

Imagine being able to deploy an AI agent and getting a return of over $100m from that single deployment. My guest today, Nikunj Bajaj, has facilitated that multiple times! Lots to learn from him, enjoy!

Nikunj:
• CEO and co-founder of TrueFoundry, a Bay Area-based startup that has raised over $20m to solve the thorniest problems that enterprises face when deploying agents.
• His clients include demanding organizations like NVIDIA and Siemens.
• Was previously ML tech lead at Facebook.
• Holds a master's in computer science from University of California, Berkeley.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

End-to-End Foundation Models for the Energy Industry, with Jazmia Henry

Added on May 28, 2026 by Jon Krohn.

What does it take to build foundation LLMs from scratch today? Deeply impressive Jazmia Henry breaks down the four stages in today's episode, enjoy!

Jazmia:
• Holds degrees from Tulane University and Columbia University... and is partway through a PhD at the University of Oxford.
• Held a technical fellowship at Stanford University.
• Previously worked as a data strategist at Morgan Stanley, head of ML at The Motley Fool and a Lead Applied AI engineer at Microsoft.
• Published a top paper at NeurIPS, the world's most prestigious academic AI conference.
• Currently works as "Member of Technical Staff for AI/ML" at collide., a Texas-based startup that’s building AI infrastructure (including all aspects of specialized foundation models) for the energy industry.

Key topics covered in this episode include:
• What foundation models are.
• Her "full-stack" foundation-model building's four distinct stages.
• How reinforcement learning (RL) models are "bursty" because they idle the GPU during reward calculation and then dump enormous loads on it all at once.
• Reward hacking by RL models.

Thanks to Mark Freeman II for recommending Jazmia as a guest.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

AI’s Putting Recent Grads Out of Work; Here’s How to Get Hired Anyway!

Added on May 25, 2026 by Jon Krohn.

Computer science/engineering grads had an employment advantage (see chart) that, since ChatGPT's release, has disappeared. Is A.I. to blame? Here's what the data say and what new grads (or anyone!) can do about it:

THE EMPLOYMENT LANDSCAPE
• NY Fed: unemployment for recent computer-science grads (22-27) sits at 7.0%, and computer engineering at 7.8% (roughly on par with fine arts and anthropology grads!)
• Compare that to ~5.8% for recent grads overall and ~4% for the whole US workforce.
• Eighteen-year-olds are voting with their feet: US undergrad CS enrolment fell 11% in 2025; computer programming fell a stunning 26%.
• Demand is shrinking too: Handshake postings are down ~50% from their 2022 peak, and Revelio Labs data suggest entry-level software and data-analysis postings have dropped as much as 67%.

IS A.I. TO BLAME?
• "Yes" camp: A 2025 Stanford University study found employment for 22-25-year-olds in A.I.-exposed jobs dropped 13% since 2022, while older workers held steady. The Dallas Fed replicated it... and the decline comes from juniors never being hired, not layoffs.
• "Not so fast" camp: Google economists found posting declines were just as steep for senior workers and predate ChatGPT. A Fed study of 1M+ firms found "null effects." Their take: high interest rates and a post-pandemic hangover, with A.I. as a convenient scapegoat.

WHAT YOU CAN DO:
1. Stop competing on raw code. The human edge is now system design, architecture and deciding what to build in the first place.

2. Pick a domain. "A.I. engineer" is a common résumé; "A.I. engineer who worked alongside a hospital team for two summer internships" is a short list.

3. Build a public portfolio. Substantive GitHub repos and a Kaggle project beat CVs sent into the void.

4. Get fluent with agentic tooling, e.g., RAG, model evaluation, multi-agent orchestration. PwC found A.I.-skilled workers earn a 56% wage premium (!!!)

5. Lean on your network. Referrals and warm intros are crushing mass (often GenAI-produced) applications in this market.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

How to Build AI-First Organizations, with Jacob Miller and Jeremy Mumford

Added on May 20, 2026 by Jon Krohn.

After today's fun episode with Jacob and Jeremy — authors of the brand-new book "Architected Intelligence" — you’ll have all the key info to build successful AI features, AI products and AI-first companies. Enjoy!

Jeremy Mumford and Jacob Miller serve as Lead AI Engineer and Vice President of Platform Intelligence, respectively, at Pattern, a giant Utah-based tech company that IPO’ed on the Nasdaq exchange about six months ago.

Jacob and Jeremy's brand-new "Architected Intelligence" book was published by Wiley and this episode focuses almost exclusively on this invaluable book.

Episode highlights include:
• The "User Agnosticism Tenet", which means designing products and processes so they can be executed equally well by a human, an AI agent, or any hybrid combo.
• The shift in the "define-build-feedback" loop today where "building" is no longer the bottleneck, which means "definition" and "feedback" are where teams win or lose.
• Why workflows are deterministic, predictable, and cheaper than agents, and why the natural progression is skills first, then workflows, and only then agents.
• Why data engineering is the bedrock of AI engineering.
• Why velocity is the only durable moat in a world where everyone has access to the same frontier models.

Thanks to podcast superfan Jonathan Bown for recommending Jeremy and Jacob as guests!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Tokenmaxxing vs AI Hardware Bottlenecks

Added on May 19, 2026 by Jon Krohn.

Humans (like Reinforcement Learning algos) can "reward hack": "Tokenmaxxing" being a perfect example, after employers started using "number of tokens" consumed as a proxy for developers' productivity.

Even if humans weren't engaging in this pointless time-, money- and energy-consuming behavior, however, demand for A.I. compute is so vast that everyone's scrambling to to make more available. Alas, four tricky hardware bottlenecks face us:

1. GPUs:
• NVIDIA data-center GPU lead times now run 36–52 weeks, with Blackwell chips sold out through mid-2026.
• The real choke point isn't fabrication: It's TSMC's "CoWoS" advanced packaging, which is sold out through 2026. Nvidia alone has locked up ~60% of CoWoS capacity through 2027.

2. High-Bandwidth Memory (HBM):
• Demand has quintupled since 2023, and only three companies (SK hynix, Samsung and Micron) make it.
• All three are sold out well into 2026 and new HBM factories take 18–24 months to come online.

3. CPUs:
• As workloads shift toward agentic AI, the CPU:GPU ratio jumps from ~1:12 (for GenAI-only chatbots) to 1:1.
• Intel's CFO says the server-CPU shortfall "starts with a B" — billions in unmet demand so server CPU prices are up 10–20% in just the past couple of months.

4. Electricity: Hyperscaler build-outs are now gated by grid interconnect (18–36 months) and transformer lead times.

THE BIG MISMATCH
• The top 5 hyperscalers alone (Alphabet, Amazon, Meta, Microsoft and Oracle) are on track for ~$725B in combined 2026 capex.
• That's roughly 6x the hyperscalers' 2022 spend, with ~75% going to A.I. infrastructure.
• Hardware suppliers, however, have grown capex by only ~50%.... a 6x increase in demand met by only a 50% increase in supply is a big mismatch!

REASONS FOR OPTIMISM
Demand will continue to be high but I'm optimistic we'll continue to squeeze more juice from every lemon because, e.g.:
• Algorithmic efficiency keeps improving — Google's TurboQuant recently briefly tanked memory stocks by promising to materially cut inference memory needs.
• LLM efficiency gains via mixture-of-experts and smarter inference scheduling continue to compound.
• The tokenmaxxing trend is a corporate farce that will fade.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Pair Programming with AI in Your Python Notebook, with Dr. Trevor Manz

Added on May 13, 2026 by Jon Krohn.

Exceptional technical episode today with Dr. Trevor Manz on "marimo Pair", an actually!) game-changing pair-programming A.I.-agent companion that lifts heavy loads within your Python data-science notebook.

More on Trevor:
• 27-time NCAA Swimming All-American & National Champion.
• Master's in Computational Biology from University of Cambridge.
• PhD in Bioinformatics from Harvard University.
• Creator of the popular open-source "anywidget" project (amongst many others, particularly in visualizing bioinformatics data, e.g., genomics data).
• Now a founding engineer at marimo.io, where he is leading the charge on marimo Pair.

Seriously, marimo Pair is unreal. A complete reimagining of what's possible in a Jupyter notebook-style environment in the agentic A.I. era. You will hear (and see) my mind explode in this episode!

We also discuss:
• Agent skills.
• Recursive language models.
• A number of other open-source projects, largely in data viz/analysis.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Web Summit Vancouver 2026

Added on May 11, 2026 by Jon Krohn.

"Collision" has grown and re-branded as "Web Summit Vancouver". I'm looking forward to experiencing the new brand for the first time next week! See you there? Here's where you can catch me:

• Tue May 12 at 11am: Mentor Hours on "scaling your startup"

• Wed May 13 at 1:30pm: Delivering my agentic A.I. talk ("Something Big is Happening") on the "A.I. Summit" stage.

• Wed May 13 at 1:50pm: Emceeing the "A.I. Summit" stage all afternoon.

More on Web Summit Vancouver:
• Taking place May 11–14 at the Vancouver Convention Centre.
• It's the second year in a row the conference, under this new brand, has taken place (the previous "Collision"-branded event was held annually in Toronto and the photo in this post is from a talk I gave there in 2024).
• Connects over 35,000 startup founders, investors and industry leaders to discuss A.I., entrepreneurship and tech trends.

Security for Mythos-Era Agentic Risks, with Rubrik’s Anneka Gupta and Cal Al-Dhubaib

Added on May 6, 2026 by Jon Krohn.

Mythos finds security vulnerabilities at ~100X the rate of publicly available models, and comparable open-weight models are ~6 months away. Scary? Thankfully my guests today, Anneka and Cal, have solutions!

Anneka:
• Chief Product Officer at Rubrik.
• Lecturer in Product Management at Stanford University.
• Climbed the ladder from software engineer to President (!!) during an 11-year tenure at LiveRamp.
• Holds a degree in math and computational sciences from Stanford.

Cal:
• Principal Technologist at Rubrik.
• Formerly founder and CEO of Pandata, which was acquired by Further.
• Highly sought-after keynote speaker.
• Holds a degree in data science from Case Western Reserve University.

This is an exceptional episode with two brilliant, entertaining and highly knowledgeable guests. It can be enjoyed by anyone! In it, they cover:

• How Anthropic's Mythos model can be pointed at a code repository and autonomously surface every vulnerability inside it, and how Anthropic itself estimates Mythos-class capabilities will reach other labs within six to eighteen months, with open-weight versions likely to follow.
• How code-gen models make it easy for attackers by scaling up their capabilities... and by vibe-coders not being aware of vulnerabilities they have!
• How Rubrik's Agent Cloud delivers three pillars of resilience: visibility into every agent in your environment, governance and runtime control through the SAGE small language model, and remediation through Agent Rewind.
• Why the next wave of knowledge work is inherently cross-functional, with A.I. attorneys, security pros, and data scientists all needing shared literacy in A.I. risk.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Case You Missed It in April 2026

Added on May 4, 2026 by Jon Krohn.

Whoa, it's May Day... and our podcast-production team was *on the ball* with getting our ICYMI-in-April episode together lickety-split. In case you missed it, these were the best bits of my on-air convos last month:

1. Oracle's Director of A.I. Developer Experience Richmond Alake defines the four types of memory A.I. agents can have... and the biological inspiration for each of them.

2. Matthew J. Glickman, co-founder/CEO of Genesis Computing, describes how A.I. agents allow data engineers to dramatically scale up their impact in an enterprise.

3. The A.I. infrastructure engineer Linda Haviv has amassed a following of over 250,000 folks on social media. In her clip from last month, she combines both worlds — detailing why A.I. infrastructure has now become everyone's problem while also discussing her work in lowering the barrier to access A.I. education.

4. Traci Walker Griffith, principal of The Eliot School in Boston, shares her novel perspective on what critical thinking is... in the context of how fifth-graders are leveraging A.I. to evaluate their work and prepare for tests.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

AI Infrastructure, Ray, and Why Nonlinear Careers Win, with Linda Haviv

Added on April 28, 2026 by Jon Krohn.

For folks in A.I., software, data science, things are moving so fast, it's easy to be overwhelmed. Luckily, A.I. engineer Linda Haviv makes it a joy to stay up to date! Today, we discuss career tips as well as open-source A.I. tech like Ray.

More on Linda:
• Until recently, was Staff Developer Advocate at Anyscale, makers of Ray, an open-source framework for managing, executing and optimizing A.I. compute.
• Previously was A.I. Developer Advocate at Amazon Web Services (AWS).
• Before that, was a software developer at Fox Corporation.
• Was a professional singer in New York up until her second (of three!) children was born.
• Holds a degree in philosophy from Baruch College.

In this episode, Linda ebulliently covers:
• How "A.I. infrastructure" refers to the compute stack, tooling and frameworks purpose-built for A.I. and ML workloads.
• Ray is a Python-native open-source distributed computing framework that lets engineers distribute training, data processing and model serving across GPUs without needing to become distributed systems experts.
• How building in public, creating content and contributing to open source are not just career insurance... they're how you find your community, attract unexpected opportunities and learn faster through teaching.
• And much more!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Building Hardware is Hard but AI Agents Help, with Kishore Subramanian

Added on April 27, 2026 by Jon Krohn.

In software, when something goes wrong, you push a patch. In hardware? Oooph. You're dealing with big headaches and huge costs. Thankfully, my guest today — Kishore Subramanian — is using AI to transform the way physical products get built for the better.

Kishore:
• Is CTO of Propel Software, a Bay Area company that combines product data with agentic AI to make the production of physical hardware (including high tech and medtech devices) as seamless as possible.
• Prior to Propel, held senior engineering roles at Google, where he worked on Google Assistant, so he has particularly rich experience with agent development.
• Holds a degree in electronics, computers and process control… as well as a 200-hour yoga-teaching certificate!

In this episode, Kishore covers:
• How product lifecycle management (PLM) is the system that takes a physical product from concept all the way to the customer and beyond.
• How AI agents can review engineering change orders — the hardware equivalent of pull requests — to flag risks, compliance gaps, and downstream impacts before they become expensive problems.
• How Propel built their AI platform, Propel One, on top of Salesforce's Agentforce 360 Platform, which gave them security, governance, data infrastructure, and a reasoning engine out of the box, allowing them to ship in about six months.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

The Four Types of Memory Every AI Agent Needs, with Richmond Alake

Added on April 22, 2026 by Jon Krohn.

To build an effective A.I. agent, getting its memory right is essential. In today's episode, our agent-memory guide is brilliant (and very funny!) machine-learning architect and engineer, Richmond Alake.

More on Richmond:
• Director of A.I. developer experience at Oracle.
• Previously roles include: staff developer advocate for AI/ML at MongoDB, ML architect at Slalom, writer for NVIDIA and computer-vision engineer at Loveshark.
• Holds a master's in ML and robotics from the University of Surrey.

In this episode, Richmond magnificently covers:
• How agent memory is the encapsulation of systems (embedding models, rerankers, databases, and LLMs) that allow AI agents to learn and adapt with new information over time, rather than starting from scratch every session.
• The four types of agent memory (all drawn from human cognition).
• Memory-first agent harnesses.
• Predictions for a flattening of AI engineering roles, where the future developer will need end-to-end understanding of the full agent stack.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Building AI Agents Where 99.9% Accuracy Isn't Good Enough, with Raju Malhotra

Added on April 17, 2026 by Jon Krohn.

The headlines shout “SaaSpocalypse,” but I don’t buy it. Neither does my guest today, Raju Malhotra, who argues that, thanks to humans collaborating with agents on optimized workflows, the SaaS opportunity is now far bigger than ever before.