• Home
  • Fresh Content
  • Courses
  • Resources
  • Podcast
  • Talks
  • Publications
  • Sponsorship
  • Testimonials
  • Contact
  • Menu

Jon Krohn

  • Home
  • Fresh Content
  • Courses
  • Resources
  • Podcast
  • Talks
  • Publications
  • Sponsorship
  • Testimonials
  • Contact
Jon Krohn

AutoDiff Explained

Added on August 7, 2021 by Jon Krohn.

New YouTube video live! This one introduces what Automatic Differentiation — a technique that allows us to scale up the computation of derivatives to machine-learning scale — is.

A new video for my "Calculus for ML" course published on YouTube every Wednesday. Playlist is here.

More detail about my broader "ML Foundations" curriculum and all of the associated open-source code is available in GitHub here.

In Calculus, Data Science, ML Foundations, Professional Development, YouTube Tags machinelearning, datascience, calculus, mathematics

Bringing Data to the People

Added on August 4, 2021 by Jon Krohn.

This week's guest is super-cool Anjali Shrivastava. Anjali makes data accessible and broadly appealing by analyzing pop culture — from TikTok mansions to Star Wars timelines — in her fun and creative YouTube videos.

Anjali is an expert in data-science visualization. She has used this skill set to engineer visualizations of data in production systems in a number of roles and recently took up a data science role at the lab technology giant Thermo Fisher Scientific.

We dig into her technical expertise, including her favorite software tools and applications for viz. We also discuss Anjali's mission to bring a face to data, which she accomplishes through journalism as well as through her brilliant and fun "Vastava" YouTube channel.

Anjali holds dual degrees from the prestigious University of California, Berkeley in data science, as well as in industrial engineering and operations research. A recent graduate, she fill us in on what a data science degree curriculum is like at a top university like Berkeley, as well as how anyone can access their world class data science lectures online.

Listen or watch here.

In Data Science, Podcast, SuperDataScience, Interview Tags SuperDataScience, datascience, dataanalytics, dataviz, popculture, journalism, podcast

The World is Awful (and it’s Never Been Better)

Added on August 2, 2021 by Jon Krohn.

Feel like the world is kinda poopy? Well, it is! BUT, covid pandemic not withstanding, it's also WAY better than ever before. I articulate this idea with data and charts for this week's Five-Minute Friday episode.

Thanks to Benjamin Todd for pointing me in the direction of a blog post by Max Roser (founder of Our World in Data) that formed the basis of this podcast episode.

Watch or listen here.

In Five-Minute Friday, Data Science, Podcast, SuperDataScience

Automatic Differentiation – Segment 3 of Subject 3, "Limits & Derivatives" – Machine Learning Foundations

Added on July 28, 2021 by Jon Krohn.

Automatic Differentiation is a computational technique that allows us to move beyond calculating derivatives by hand and scale up the calculation of derivatives to the massive scales that are common in machine learning.

The YouTube videos in this segment, which we'll release every Wednesday, introduce AutoDiff in the two most important Python AutoDiff libraries: PyTorch and TensorFlow.

My growing "Calculus for ML" course is available on YouTube here.

More detail about my broader "ML Foundations" curriculum and all of the associated open-source code is available in GitHub here.


In Calculus, Data Science, ML Foundations, Professional Development, YouTube Tags machinelearning, datascience, calculus, mathematics, python, pytorch, tensorflow

R in Production

Added on July 27, 2021 by Jon Krohn.

Dutch national-podium-level powerlifter Veerle van Leemput joins me this week to detail how R is not only an option for production, but may in fact be the *best* production option if data models are central to your application.

Over the course of the episode, Veerle runs down for us her favorite R tools for:
• Data gathering
• Model development
• Deployment into production systems

Veerle has held a number of data-science leadership roles at Dutch companies. She now serves as Managing Director and Head of Data Science at Analytic Health, a London-based firm that builds data-centric software for the healthcare industry. And she was silver medalist in the 57kg class of the 2021 Dutch national powerlifting championships with a total of 335kg (~739 pounds) across the back squat, bench press, and deadlift.

Listen or watch here.


In Data Science, Podcast, Professional Development, SuperDataScience Tags SuperDataScience, datascience, machinelearning, analytics, healthcaredata, rlanguage, software

Intro to Regression Models – O'Reilly Live Lessons

Added on July 26, 2021 by Jon Krohn.

My new 80-minute intro to Regression Models is up on YouTube! It's packed with hands-on code demos in Python-based Jupyter notebooks to make learning regression intuitive, interactive, and maybe even fun :)

This lesson is an excerpt from my 9-hour "Probability and Statistics for Machine Learning" video tutorial, which is available via O'Reilly here.


All of the code is available open-source via GitHub.


In Data Science, Live Training, O'Reilly, Professional Development, YouTube Tags machinelearning, datascience, statistics, probability, regression, tutorials, python, jupyternotebook

Say No to Pie Charts

Added on July 23, 2021 by Jon Krohn.

Public Service Announcement for this week's Five-Minute Friday: Don't use pie charts! (Nor, in almost all circumstances, ANY circular chart!)

Listen or watch here.

In Data Science, Five-Minute Friday, Professional Development, Podcast, SuperDataScience Tags SuperDataScience, podcast, dataviz, datascience, dataanalytics
1626875518572.jpeg

DataScienceGo This Weekend

Added on July 21, 2021 by Jon Krohn.

The DataScienceGO conference is this weekend — registration for Friday and Saturday is 100% free! I'm speaking Saturday on the pros and cons of TensorFlow vs PyTorch for training and deploying deep-learning models.

Awesome speakers — whom you may already be familiar with from recent SuperDataScience episodes — include:
• Erica Greene (episode # 435)
• Harpreet Sahota (# 457)
• Andrew Jones (# 483)

I don't (yet!) personally know the other speakers pictured here but their weighty reputations precede them and I'm looking forward to getting to know them better over the course of the weekend: Gabriela de Queiroz, Karen JEAN-FRANCOIS, Yudan Lin, Ken Jee, and Danny Ma.

Free registration here!


In Data Science, Professional Development, SuperDataScience Tags datascience, machinelearning, tensorflow, pytorch, deeplearning, conference, training

Monetizing Machine Learning

Added on July 21, 2021 by Jon Krohn.

This week's guest is the legendary Vin Vashishta! Vin details his A.I. commercialization strategy, which allows data science teams and machine learning companies alike to be profitable and successful long-term.

Vin is founder of and chief data scientist at V Squared, his own consulting practice that specializes in monetizing machine learning by helping Fortune 100 companies with A.I. strategy. He's also the creator of several platforms (including The ML Rebellion) for learning about critical skill gaps related to artificial intelligence such as commercial strategy, data science leadership, and model explainability.

In addition to the episode's focus on A.I. strategy, Vin answers questions from SuperDataScience listeners (thanks, Serg, Joe, Daniel, Nikhil, and Michael!), including on:
• Efficiency gains from no-code or low-code machine learning tools
• The biggest skills gaps that data scientists have
• The most disturbing data sets
• Investing in socially beneficial models
• The most challenging problem with commercializing AI

Listen or watch here.

(With thanks to Harpreet Sahota for another stellar guest suggestion!)

In Data Science, Podcast, Professional Development, SuperDataScience Tags SuperDataScience, datascience, machinelearning, aistrategy, startups

The Power Rule on a Function Chain — Topic 61 of Machine Learning Foundations

Added on July 19, 2021 by Jon Krohn.

This is the FINAL (of nine) videos in my Machine Learning Foundations series on the Derivative Rules. It merges together the Power Rule and the Chain Rule into a single easy step.

Next begins a chunk of long, meaty videos on Automatic Differentiation — i.e., using the PyTorch and TensorFlow libraries to, well, automatically differentiate equations (e.g., ML models) instead of needing to do it painstakingly by hand.

Because these forthcoming videos are so meaty, we're moving from a twice-weekly publishing schedule to a weekly one: Starting next week, we'll publish a new video to YouTube every Wednesday.

My growing "Calculus for ML" course available on YouTube here.

More detail about my broader "ML Foundations" curriculum and all of the associated open-source code is available in GitHub here.

In Calculus, Data Science, ML Foundations, Professional Development, YouTube Tags machinelearning, datascience, calculus, mathematics, python

The Price of Your Attention

Added on July 19, 2021 by Jon Krohn.

Time is money. Every second of your life is yours to use and one of the options you have is to generate income. You can do this hourly, or, as a data scientist, invest time in a digitally-sharable product with a huge potential ROI.

Listen or watch here.

In Five-Minute Friday, Podcast, Personal Improvement, Professional Development, SuperDataScience Tags SuperDataScience, datascientist, software, earning, scalability

Advanced Exercises on Derivative Rules — Topic 60 of Machine Learning Foundations

Added on July 19, 2021 by Jon Krohn.

Having now covered the product rule, quotient rule, and chain rule, we're well-prepared for advanced exercises that confirm your comprehension of all of the derivative rules in my Machine Learning Foundations series.

There’s just one quick derivative rule left after this — one that conveniently combines together two of the rules we’ve already covered — and then we’re ready to move on to the next segment of videos on Automatic Differentiation with PyTorch and TensorFlow.

New videos are published every Monday and Thursday to my "Calculus for ML" course, which is available on YouTube here.

More detail about my broader "ML Foundations" curriculum and all of the associated open-source code is available in GitHub here.


In Calculus, Data Science, ML Foundations, YouTube, Professional Development Tags machinelearning, datascience, calculus, mathematics, python
unnamed.jpg

TensorFlow vs PyTorch @ DataScienceGo Virtual

Added on July 14, 2021 by Jon Krohn.

The DataScienceGO Virtual conference is coming up next Saturday and it is FREE! I'm giving a talk on TensorFlow vs PyTorch with lots of time for audience questions.

In Accouncement, Data Science, Professional Development, SuperDataScience Tags deeplearning, tensorflow, pytorch, conference

Fixing Dirty Data

Added on July 14, 2021 by Jon Krohn.

My guest this week is the fixer of dirty data herself, the one and only Susan Walsh. We have a lot of laughs in this episode as we discuss how organizations can save substantial sums by tidying up their data.

Susan has worked for a decade as a data-quality specialist for a wide range of firms across the private and public sectors. For the past four years, she's been doing this work as the founder and managing director of her own company, The Classification Guru Ltd. She's also the author of the forthcoming book, "Between the Spreadsheets", and she hosts her own video interview show called "Live from the Data Den".

Listen or watch here.

In Data Science, Podcast, SuperDataScience, Professional Development Tags SuperDataScience, dataanalytics, datacleaning, procurement, data

The Chain Rule for Derivatives — Topic 59 of Machine Learning Foundations

Added on July 12, 2021 by Jon Krohn.

Today's video introduces the Chain Rule — arguably the single most important differentiation rule for ML. It facilitates several of the most ubiquitous ML algorithms, such as gradient descent and backpropagation.

Gradient descent and backprop will be covered in great detail later in my "Machine Learning Foundations" video series. This video is critical for understanding those applications.

New videos are published every Monday and Thursday to my "Calculus for ML" course, which is available on YouTube.

More detail about my broader "ML Foundations" curriculum and all of the associated open-source code is available in GitHub.

In Calculus, Data Science, ML Foundations, Professional Development, YouTube Tags machinelearning, datascience, calculus, mathematics, python

The History of Calculus

Added on July 11, 2021 by Jon Krohn.

Y'all seem to love these "History of..." episodes, so for Five-Minute Friday this week, here's another one. It's on the History of Calculus! Enjoy 😄

(Leibniz and Newton, who independently devised modern calculus around the same time, are pictured.)

Listen or watch here.


In Calculus, Five-Minute Friday, Podcast, SuperDataScience, YouTube Tags superdatascience, math, calculus, datascience, podcast

The Quotient Rule for Derivatives — Topic 58 of Machine Learning Foundations

Added on July 8, 2021 by Jon Krohn.

This is the penultimate Derivative Rule and then we're moving onward to AutoDiff with TensorFlow and PyTorch! The Quotient Rule is analogous to the Product Rule introduced on Monday but is for division instead of multiplication.

New videos are published every Monday and Thursday. The playlist for my "Calculus for ML" course is here.

More detail about my broader "ML Foundations" series and all of the associated open-source code is available in GitHub here.

In Calculus, Data Science, ML Foundations, Professional Development, YouTube Tags machinelearning, datascience, calculus, mathematics, python
Waitlist-update_alt.png

Upcoming O'Reilly Calculus Classes

Added on July 8, 2021 by Jon Krohn.

Starting a week today, I'm offering my entire "ML Foundations" curriculum as a series of 14 live, interactive workshops via O'Reilly Media. The first five classes are open for registration; two are already waitlist-only, so grab a spot now:

• Jul 14 — Intro to Linear Algebra (waitlisted)
• Jul 21 — LinAlg II: Matrix Tensors (5 spots remaining)
• Jul 28 — LinAlg III: Eigenvectors (waitlisted)
• Aug 12 — Intro to Calculus (143 spots remaining)
• Aug 18 — Calc II: AutoDiff (148 spots remaining)

REGARDING THE WAITLIST: I have a made a request with O'Reilly to increase the maximum class size from 600 students to 1000, so if you sign up for a waitlisted class now, you should still be able to get in.

Overall, there will be four subject areas covered:
• Linear Algebra (3 classes)
• Calculus (4 classes)
• Probability and Statistics (4 classes)
• Computer Science (3 classes)

Sign up opens about two months prior to each class. All 14 training dates, running from next week through December, are provided at jonkrohn.com/talks

A detailed curriculum and all of the code for my ML Foundations series is available open-source in GitHub here.

In Data Science, Professional Development, O'Reilly, Live Training Tags python, datascience, machinelearning, tensorflow, pytorch, math

Financial Data Engineering

Added on July 7, 2021 by Jon Krohn.

This week's guest is Doug Eisenstein, an exceptionally clear and content-rich communicator. He fills us in on the complexity of engineering a coherent source of truth for financial models, integrating hundreds of data sources.

Topics covered in the episode include:
• A breakdown of the primary financial sectors and departments
• Why data source integration for finance is wildly complicated
• Specific data engineering approaches that resolve these issues including entity resolution, knowledge graph mapping and tri-temporality.

20 years ago, Doug founded the consulting firm, Advanti and they have since become a critical provider of solutions to complex data engineering problems faced by some of the world's largest banks and asset managers including Morgan Stanley, Bank of America, Citibank and State Street.

Listen or watch here.

In Data Science, Personal Improvement, Podcast, Professional Development, SuperDataScience

The Product Rule for Derivatives

Added on July 5, 2021 by Jon Krohn.

Today's video is on the Product Rule, a relatively advanced Derivative Rule. Only a couple such rules remain and then we move onward to Automatic Differentiation with PyTorch and TensorFlow.

New videos are published every Monday and Thursday. The playlist for my "Calculus for ML" course is here.

More detail about my broader "ML Foundations" series and all of the associated open-source code is available in GitHub here.

machinelearning,datascience,calculus,mathematics,python


In Calculus, Data Science, ML Foundations, Professional Development, YouTube Tags machinelearning, datascience, calculus, mathematics, python
← Newer Posts Older Posts →
Back to Top