• Home
  • Fresh Content
  • Courses
  • Resources
  • Podcast
  • Talks
  • Publications
  • Sponsorship
  • Testimonials
  • Contact
  • Menu

Jon Krohn

  • Home
  • Fresh Content
  • Courses
  • Resources
  • Podcast
  • Talks
  • Publications
  • Sponsorship
  • Testimonials
  • Contact
Jon Krohn

In Case You Missed It in May 2024

Added on June 17, 2024 by Jon Krohn.

We had another incredible set of guests in May on the SuperDataScience Podcast I host. ICYMI, today's episode highlights the most fascinating moments of my conversations with them.

Specifically, conversation highlights include:

1. Dr. Luis Serrano, a math- and ML-education YouTuber with 150k subscribers, explaining what language embeddings are, how they function, and how essential they are for running semantic search queries.

2. Sol Rashidi, serial C-suite data-role executive at Fortune 100s and bestselling author of "Your A.I. Survival Guide", on her approach to building data teams.

3. Co-founder of the MLOps Community, Demetrios Brinkmann, on the differences between ML Engineering and MLOps roles.

4. Navdeep Martin, an entrepreneur blending climate tech and generative A.I. in her latest startup, on opportunities where you can tackle climate change with technological innovation yourself.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Interview, Podcast, Professional Development, SuperDataScience, YouTube Tags superdatascience, ai, ML, podcast

Reinforcement Learning from Human Feedback (RLHF), with Dr. Nathan Lambert

Added on June 11, 2024 by Jon Krohn.

In today's episode, the renowned RLHF thought-leader Dr. Nathan Lambert digs into the origins of RLHF, its role today in fine-tuning LLMs, emerging alternatives to RLHF... and how GenAI may democratize (human) education!

Nathan:

• Is a Research Scientist at the Allen Institute for AI (AI2) in Seattle, where he’s focused on fine-tuning Large Language Models (LLMs) based on human preferences as well as advocating for open-source AI.

• He’s renowned for his technical newsletter on AI called "Interconnects".

• Previously helped build an RLHF (reinforcement learning from human feedback) research team at Hugging Face.

• Holds a PhD from University of California, Berkeley in which he focused on reinforcement learning and robotics, and during which he worked at both Meta AI and Google DeepMind.

Today’s episode will probably appeal most to hands-on practitioners like data scientists and machine learning engineers, but anyone who’d like to hear from a talented communicator who works at the cutting edge of AI research may learn a lot by tuning in.

In today’s episode, Nathan details:

• What RLHF is and how its roots can be traced back to ancient philosophy and modern economics.

• Why RLHF is the most popular technique for fine-tuning LLMs.

• Powerful alternatives to RLHF such as RLAIF (reinforcement learning from A.I. feedback) and direct distilled preference optimization (dDPO).

• Limitations of RLHF.

• Why he considers AI to often be more alchemy than science.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Podcast, SuperDataScience, YouTube Tags superdatascience, machinelearning, ai, llms, RLHF, reinforcementlearning

Open-Source Libraries for Data Science at the New York R Conference

Added on June 7, 2024 by Jon Krohn.

For today's short episode, I asked four data-science luminaries about their favorite open-source libraries. Hear what Emily Zabor, James David Long, Drew Conway and Jared Lander chose, live on stage!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, YouTube, SuperDataScience, Podcast Tags superdatascience, datascience, opensource, rlang, rlanguage

ML for Wind-Powered Energy Generation, with Dr. Jason Yosinski

Added on June 4, 2024 by Jon Krohn.

One of my all-time favorite A.I. researchers, Dr. Jason Yosinski, is my guest today! He details how his startup is using ML to collect wind energy more efficiently and digs into visualizing/understanding deep neural networks.

Jason:

• Is Co-Founder and CEO of Windscape AI, a startup using ML to increase the efficiency of energy generation via wind turbines.

• Is Co-Founder and President of the ML Collective, a research group that’s open to ML researchers anywhere.

• Was a Co-Founder of the A.I. Lab at the ride-share company Uber.

• Holds a PhD in Computer Science from Cornell, during which he worked at the NASA Jet Propulsion Laboratory, Google DeepMind and with the eminent Yoshua Bengio in Montreal.

• His work has been featured in The Economist, on the BBC and, coolest of all, in an XKCD comic!

Today’s episode gets fairly technical in parts so may be of greatest interest to hands-on practitioners like data scientists and ML engineers, although there are also parts that will appeal to anyone keen to hear how ML is being used to produce more clean energy.

In today’s episode, Jason details:

• How ML can make wind direction more predictable, thereby making wind turbines and power grids in general more efficient.

• How to infer what individual neurons in a deep learning model are doing by using visualizations.

• Why freezing a particular layer of a neural net prior to doing any training at all can lead to better results.

• How you can get involved in a cutting-edge research community no matter where you are in the world.

• What traits make for successful A.I. entrepreneurs.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Podcast, SuperDataScience, YouTube Tags superdatascience, machine learning, ai, ClimateChange, wind energy

Multi-Agent Systems: How Teams of LLMs Excel at Complex Tasks

Added on May 31, 2024 by Jon Krohn.

Groundbreaking multi-agent systems (MAS, for short) are transforming the way AI models collaborate to tackle complex challenges.

Read More
In Data Science, Five-Minute Friday, Podcast, SuperDataScience, YouTube Tags superdatascience, machine learning, ai, llms, multi agent systems

MLOps: The Job and The Key Tools, with Demetrios Brinkmann

Added on May 28, 2024 by Jon Krohn.

Today, global MLOps community leader Demetrios Brinkmann details why MLOps is essential, how it differs from related roles like LLMOps, DevOps and A.I. Engineering, and the best tools for deploying and scaling LLMs.

Demetrios:

• Is Founder and CEO of MLOps Community, an organization dedicated to supporting MLOps professionals that has quickly grown to over 20,000 members.

• Was previously founder of the Data on Kubernetes community.

• Before that, worked in public-facing roles at a number of European tech startups.

Today’s episode will be of interest to anyone who’s keen to better understand the critical function of MLOps in bringing machine learning models to the real world.

In today’s episode, Demetrios details:

• What exactly MLOps is and how it relates to other jobs like LLMOps, DevOps and A.I. Engineer.

• The key MLOps tools and approaches.

• What it takes to build a thriving community of tens of thousands of professionals in just a few years.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Podcast, SuperDataScience, YouTube Tags superdatascience, machinelearning, mlops, ai, devops, aiengineer

The Six Keys to Data Scientists’ Success, with Kirill Eremenko

Added on May 24, 2024 by Jon Krohn.

For today's episode, Kirill Eremenko — who has taught more than 2.8 million people data science — fills us in on his six most valuable insights about data science careers.

More on Kirill:

• Founder and CEO of SuperDataScience, an e-learning platform that is the namesake of this very podcast.

• Launched the SuperDataScience Podcast in 2016 and hosted the show until he passed me the reins four years ago.

• Has reached more than 2.8 million students through the courses he’s published on Udemy, making him Udemy’s most popular data science instructor.

At a high level, Kirill's six data science insights are:

1. Unlike many other careers, there’s no need for formal credentials to become a data scientist.

2. Mentors can be invaluable guides in a DS career, but you should also try to give back to your mentors when you can.

3. Portfolios are the key to landing the DS job of your dream because they showcase your DS abilities for all to see.

4. Hands-on labs are a fun, interactive way to develop your portfolio and are a great complement to classes.

5. Collaborations can make lots of aspects of DS career development fun, including learning new materials, completing labs and developing your portfolio.

6. Data scientists can come from any background and work from anywhere in the world with an Internet connection.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Five-Minute Friday, Interview, Personal Improvement, Podcast, SuperDataScience, YouTube Tags superdatascience, datascience, datascientist, datacareer, podcast

Math, Quantum ML and Language Embeddings, with Dr. Luis Serrano

Added on May 21, 2024 by Jon Krohn.

Today, Dr. Luis Serrano (a master at making complex math and ML topics friendly) leads a mind-expanding discussion on embeddings in LLMs, Quantum ML and what the next big trends in A.I. will be. I wouldn't miss this one 🤯

Luis:

• Is the beloved creator behind the Serrano Academy, an educational YouTube channel on math and ML with over 146,000 subscribers.

• Until this month, he worked as Head of Developer Relations at Cohere, one of the world’s few A.I. labs that is actually at the frontier of LLMs.

• Prior to that, he was a Quantum A.I. Research Scientist at Zapata Computing, Lead A.I. Educator at Apple, Head of Content for A.I. at Udacity and ML Engineer at Google.

• Holds a PhD in Math from the University of Michigan.

Today’s episode should be appealing to just about anyone! In it, Luis details:

• How supposedly complex topics like math and A.I. can be made easy to understand.
• How Cohere’s focus on enterprise use cases for LLMs has led it to specialize in embeddings, the most important component of LLMs.
• The promising application areas for Quantum Machine Learning.
• What the next big trends in A.I. will be.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Interview, Podcast, SuperDataScience, YouTube Tags SuperDataScience, machine learning, ai, llms

Aligning Large Language Models, with Sinan Ozdemir

Added on May 17, 2024 by Jon Krohn.

For today’s quick Five-Minute Friday episode, the exceptional author, speaker and entrepreneur Sinan Ozdemir provides an overview of what it actually means for an LLM to be “aligned”.

More on Sinan:
• Is Founder and CTO of LoopGenius, a generative AI startup.
• Has authored several excellent books, including, most recently, the bestselling "Quick Start Guide to Large Language Models".
• Is a serial AI entrepreneur, including founding a Y Combinator-backed generative AI startup way back in 2015 that was later acquired.

This episode was filmed live at the Open Data Science Conference (ODSC) East in Boston last month. Thanks to ODSC for providing recording space.

The Super Data Science Podcast is available on all major podcasting platforms and a video version is on YouTube. This is episode #784!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Five-Minute Friday, Interview, Podcast, SuperDataScience, YouTube Tags superdatascience, machinelearning, ai, generativeai, llms, podcast

Generative A.I. for Solar Power Installation, with Navdeep Martin

Added on May 14, 2024 by Jon Krohn.

A startling 70% of solar-power projects fail. In today's episode, hear how Navdeep Martin's startup Flypower is using Generative A.I. to ensure we install renewable energy sources more effectively and efficiently.

Navdeep:

• Co-founder and CEO of Flypower, a generative A.I. startup dedicated to ensuring clean-energy projects, particularly solar-power projects, succeed.

• Previously held senior product leadership roles at VC-backed Bay Area AI startups as well as for AI products at Comcast and The Washington Post.

• Before that, was a software engineer for the CIA.

• Holds a degree in computer science from William & Mary and an MBA from the University of Virginia.

Today’s episode will appeal to anyone who’d like to hear about the evolution of generative A.I. technologies in products and applications, including how you can best make use of the various categories of Gen-A.I. technologies today and how, in particular, A.I. is being used to overcome the social and regulatory hurdles associated with combating climate change.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Podcast, SuperDataScience, YouTube Tags superdatascience, machinelearning, ai, climatetech, generativeai

In Case You Missed It in April 2024

Added on May 11, 2024 by Jon Krohn.

Other than excessive maleness and paleness*, April 2024 was an excellent month for the podcast, packed with outstanding guests. ICYMI, today's episode highlights the most fascinating moments of my convos with them.

Specifically, conversation highlights include:

1. Iconic open-source developer Dr. Hadley Wickham putting the "R vs Python" argument to bed.

2. Aleksa Gordić, creator of a digital A.I.-learning community of 160k+ people, on the movement from formal to self-directed education.

3. World-leading futurist Bernard Marr on how we can work with A.I. as opposed to it lording over of us.

4. Educator of millions of data scientists, Kirill Eremenko, on why gradient boosting is so powerful for making informed business decisions.

5. Prof. Barrett Thomas on how drones could transform same-day delivery.

*Remedied in May!

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Five-Minute Friday, Podcast, SuperDataScience, YouTube Tags data science, ai, llm, python, R

Ensuring Successful Enterprise AI Deployments, with Sol Rashidi

Added on May 7, 2024 by Jon Krohn.

Prodigious Sol Rashidi has deployed nearly 40 large-scale data and A.I. projects at Fortune 100 companies. Her rich insights on doing this successfully fill her new book and are distilled into today's fun episode.

Sol ☀:

• Has been a C-suite data/analytics/A.I. leader at Estée Lauder, Merck pharmaceuticals, Sony Music and Royal Caribbean Cruise Lines.

• Was Senior Partner leading the Digital and Innovation Practice at EY and was the Partner who led the Watson go-to-market at IBM.

• Has been involved in over three dozen large-scale data/A.I. deployments.

• Is recognized with a string of international awards for her leadership.

• Holds eight patents with many more pending.

Today’s episode will be invaluable to anyone who’d like to succeed at deploying A.I. models commercially. In it, Sol details:

• Her straightforward system for selecting the enterprise A.I. projects that will be successfully deployed.

• What kinds of A.I. projects should always be avoided.

• Why larger enterprises drag their feet on impactful A.I. projects and how to overcome such corporate logjams.

• When you should patent an innovation.

• Why Chief Data Officers and related C-suite roles have such high turnover.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Podcast, SuperDataScience, YouTube, Professional Development Tags superdatascience, ai, data, machinelearning, artificialintelligence

How to Become a Data Scientist, with Dr. Adam Ross Nelson

Added on May 4, 2024 by Jon Krohn.

Today's episode features Dr. Adam Ross Nelson providing his #1 most useful piece of guidance on "How to Become a Data Scientist" from his book of that very name!

This was filmed live at the Open Data Science Conference (ODSC) East in Boston last week — thanks ODSC East for providing valuable conference space for us to shoot podcast episodes.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Interview, Five-Minute Friday, Personal Improvement, Podcast, Professional Development, SuperDataScience, YouTube Tags superdatascience, datascience, datascientist, career, podcast

The Tidyverse of Essential R Libraries and their Python Analogues, with Dr. Hadley Wickham

Added on April 30, 2024 by Jon Krohn.

Many-time bestselling author and prolific open-source R developer Hadley Wickham is our guest today. In it, we discuss Posit's rebrand and why the Tidyverse needs to be in every data scientist's toolkit.

More on Hadley:
• Chief Scientist at Posit PBC
• Adjunct Professor of Statistics at Stanford University, Rice University and The University of Auckland.
• Is best-known as the creator of the Tidyverse suite of open-source R libraries for data science, including the essential libraries dplyr and ggplot2.
• Has written seminal books on R programming for O'Reilly, Springer and CRC Press, including the mega-bestselling "R for Data Science".

Today’s episode will primarily be of interest to hands-on practitioners like data scientists and machine learning engineers. In it, Hadley details:
• Why the iconic open-source company RStudio rebranded to Posit.
• The philosophy of the tidyverse, amusing backstories on its most iconic packages and why the tidyverse is invaluable for all data scientists to be familiar with.
• The open-source projects he’s most excited about today.
• How you can easily get involved with career-bolstering open-source projects yourself.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Podcast, SuperDataScience, YouTube, Professional Development Tags superdatascience, datascience, machinelearning, statistics, rlanguage

Mixtral 8x22B: SOTA Open-Source LLM Capabilities at a Fraction of the Compute

Added on April 26, 2024 by Jon Krohn.

Today, I’m going to do my best to give you a five-minute update on a groundbreaking new open-source Large Language Model called Mixtral 8x22B groundbreaking new open-source Large Language Model called Mixtral 8x22B out of an extremely hot French startup called Mistral.

Read More

Generative AI in Practice, with Bernard Marr

Added on April 23, 2024 by Jon Krohn.

In today's episode, Bernard Marr — world-leading futurist (>4m social-media followers) and prolific author (20+ books!) — details how GenAI will revolutionize industries, enhance our lives and solve pressing global issues.

In case he isn’t already on your radar, Bernard:

• World-leading futurist who’s consulted with NVIDIA, Google, Microsoft, Amazon and many more on digital transformation and A.I. in business.

• His 20+ books have been translated into 20+ languages and earned several business and management "book of the year" awards; many have also been bestsellers.

• His writing has been featured in The Guardian, Financial Times, The Wall Street Journal, the Harvard Business Review and many other leading media outlets.

• Has over 4 million combined social media followers.

Today’s episode will be of interest to anyone who’d like to better understand Generative A.I. and how to adopt GenAI effectively at work or at home.

In this episode, Bernard details:

• The history of GenAI.

• How GenAI will pair with other industries like energy, healthcare and education to accelerate hyper-innovation across every aspect of society.

• The regulatory and ethical challenges associated with GenAI and how we can overcome them.

• How AI paradoxically makes us more human.

• How to successfully implement GenAI both professionally and personally.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

Tags superdatascience, machinelearning, ai, genai, llms, future

Deep Utopia: AI Could Solve All Human Problems in Our Lifetime

Added on April 20, 2024 by Jon Krohn.

Today’s episode focuses on Nick Bostrom's latest book, Deep Utopia. Published a couple of weeks ago, it delves into the possibilities of a future where artificial intelligence has solved humanity's deepest problems.

Read More
In Five-Minute Friday, Podcast, SuperDataScience, YouTube Tags SuperDataScience, utopia, ai, ml, llm, machinelearning

What will humans do when machines are vastly more intelligent? With Aleksa Gordić

Added on April 17, 2024 by Jon Krohn.

Aleksa Gordić — the famed A.I. educator and multilingual-LLM entrepreneur — is my guest today. Brilliant and widely-read, Aleksa opines on what it will take to realize Artificial Super Intelligence and the consequences for humans.

Aleksa:

• Is Founder & CEO of Runa AI, a startup focused on building multilingual LLMs.

• Is an online educator that has built a community of 160,000 people in the A.I. space, including through his A.I. Epiphany YouTube channel.

• Previously, he was an A.I. Research Engineer at Google DeepMind in London and a Machine Learning Software Engineer at Microsoft.

• He holds a degree in Electronics and Computer Science from the University of Belgrade in Serbia.

Today’s episode contains tidbits here and there that will appeal primarily to hands-on machine learning practitioners, but it mostly should be of great interest to anyone.

In this episode, wildly-intelligent Aleksa details:

• Why multilingual LLMs provide so much value despite the cutting-edge LLMs like Claude 3, Gemini Ultra and GPT-4 supporting so many languages.

• His frameworks for entrepreneurial success and for effective self-directed learning.

• His analogy for how humans are born as a checkpoint of a Bayesian model that’s fine-tuned with reinforcement learning from human feedback (RLHF).

• What he thinks it will take to realize artificial super intelligence and what it could mean for human society when it arrives.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Podcast, SuperDataScience, YouTube Tags superdatascience, machine learning, ai, agi, entrepreneur, llms

RFM-1 Gives Robots Human-like Reasoning and Conversation Abilities

Added on April 14, 2024 by Jon Krohn.

Today’s episode is all about an LLM trained for robotics applications called RFM-1 that completely blows my mind because of the implications for what can now suddenly be accomplished so easily with robotics.

Read More
In Data Science, Podcast, SuperDataScience, YouTube Tags ai, robotics, NLP, physics, GPT4, Covariant

Deep Reinforcement Learning for Maximizing Profits, with Prof. Barrett Thomas

Added on April 9, 2024 by Jon Krohn.

Today, Prof. Barrett Thomas blends his rich technical understanding of Deep Reinforcement Learning with his commercial savviness to eloquently detail how Deep RL can be leveraged to minimize costs and maximize profits.

Barrett:

• Is Research Professor in Business Analytics and Senior Associate Dean at the University of Iowa’s College of Business.

• As will soon be unsurprising to you when you hear how well he communicates complex concepts, he’s won multiple teaching awards (amongst other academic prizes).

• He holds a PhD in Industrial and Operations Engineering from the University of Michigan.

Today’s episode is a technical one that will appeal primarily to hands-on practitioners like data scientists, software developers and machine learning engineers.

In this episode, Barrett details:

• What Markov Decision Processes are and how they relate to Deep Reinforcement Learning.

• How operations research leverages neural networks to minimize business costs and maximize business profits.

• How same-day delivery has been made possible by machine learning.

• How aerial drones and autonomous vehicles will revolutionize supply chains and transportation.

The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Interview, SuperDataScience, YouTube Tags superdatascience, machinelearning, ai, deepreinforcementlearning, logistics, supplychain, drones, profit
← Newer Posts Older Posts →
Back to Top