Today, we published Episode #1000 of the SuperDataScience Podcast! To celebrate, the show's original host Kirill Eremenko joined me and dozens of regular listeners on air to predict what the next 10 years of A.I. will bring.
In a bit more detail:
• We publish 104 episodes per year so Episode #1000 coincides with the show being about ten years old.
• The show was founded by Kirill Eremenko in 2016, who hosted over 400 episodes before handing me the reins in 2021.
• In a first for the show, Episode #1000 was streamed live online with our audience invited to join on air.
• Most folks interacted via chat functionality but a number of surprise guests came right onto the recording including Natalie Ziajski and Mario Pombo from the podcast team, rockstar A.I. entrepreneur Jepson Taylor, my 96-year-old grandmother and my very own pa, William Krohn.
• Kirill and I looked back on a decade of the podcast and fielded listener questions on topics such as A.I.’s biggest opportunities, the build-versus-buy dilemma, how to break into the field today, and how to stay grounded amid the relentless pace of A.I.
Thank you for support and listenership over all these years — we make this show for you and couldn't do it without you! We're excited to see what the next decade brings :)
The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.
Filtering by Tag: #datascience
In Case You Missed It in May 2026
Well, I certainly learned a lot from the outstanding guests we had on my podcast in May. ICYMI, today's episode features the best parts of my conversations with them:
1. Rubrik's Anneka Gupta and Cal Al-Dhubaib on how, in the Mythos era, the old cybersecurity playbook of prevention and detection is no longer enough, and how A.I. agents themselves are becoming a new source of data exposure inside organizations.
2. marimo's Dr. Trevor Manz on why code notebooks have become the natural working memory for A.I. coding agents. Trevor walks me through the Marimo Pair skill, which lets you drive a notebook from your agent, collaborating with Claude Code or Codex in real time as you load, explore, and visualize your data.
3. Jazmia Henry of collide. walks me through her work as a "full-stack" foundation model builder. We cover all four stages of the process: the often unglamorous slog of data curation, building bespoke tokenizers and embeddings, model training and reinforcement learning, and the inference layer that serves it all to end users.
4. Jacob Miller and Jeremy Mumford of Pattern (and authors of the great, brand-new book "Architected Intelligence") argue that the most expensive AI mistake an organization can make is failing slowly and sticking with prototypes long past their sell-by date because the traditional software mindset says you have to. We, of course, also discuss a solution.
The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.
Episode #1000: Join Us Live for the First-Ever Interactive SDS Podcast!
Ten years ago, Kirill Eremenko founded The SuperDataScience Podcast. To celebrate the upcoming Episode #1000, we are inviting you to join us both in a format we've never tried before:
It will be the first-ever interactive episode where you can join in online as we record, and ask your questions... or I suppose just make comments! You'll be able to ask share your thoughts in the chat or come right onto the show via video.
Kirill (founder, original host) and I (current host) will both be there, so you can ask us anything, e.g.:
• Did Kirill think the show would last ten years and 1000 episodes?
• How has data science transformed over the past decade?
• Did Jon have hair on his head ten years ago?
Date: Next Thursday, June 4th
Time: 5pm Eastern Time / 2pm Pacific Time
To get a calendar invite that includes the URL to join us live, check out the Luma link below ⬇️
luma.com/7vl7mdos
The "Super Data Science Podcast with Jon Krohn" is available on all major podcasting platforms and a video version is on YouTube. Whether you join us or not for the interactive recording, Episode #1000 will be published on Friday June 12th!
Pair Programming with AI in Your Python Notebook, with Dr. Trevor Manz
Exceptional technical episode today with Dr. Trevor Manz on "marimo Pair", an actually!) game-changing pair-programming A.I.-agent companion that lifts heavy loads within your Python data-science notebook.
More on Trevor:
• 27-time NCAA Swimming All-American & National Champion.
• Master's in Computational Biology from University of Cambridge.
• PhD in Bioinformatics from Harvard University.
• Creator of the popular open-source "anywidget" project (amongst many others, particularly in visualizing bioinformatics data, e.g., genomics data).
• Now a founding engineer at marimo.io, where he is leading the charge on marimo Pair.
Seriously, marimo Pair is unreal. A complete reimagining of what's possible in a Jupyter notebook-style environment in the agentic A.I. era. You will hear (and see) my mind explode in this episode!
We also discuss:
• Agent skills.
• Recursive language models.
• A number of other open-source projects, largely in data viz/analysis.
The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.