• Home
  • Fresh Content
  • Courses
  • Resources
  • Podcast
  • Talks
  • Publications
  • Sponsorship
  • Testimonials
  • Contact
  • Menu

Jon Krohn

  • Home
  • Fresh Content
  • Courses
  • Resources
  • Podcast
  • Talks
  • Publications
  • Sponsorship
  • Testimonials
  • Contact
Jon Krohn

Open-Source Analytical Computing (pandas, Apache Arrow)

Added on November 16, 2021 by Jon Krohn.

The legend Wes McKinney is this week's guest! He details the genesis of the ubiquitous pandas library, the forthcoming edition of his bestselling book, and how Apache Arrow brings analytics into the distributed computing era.

Wes:
• Created pandas, the industry-standard Python library for data analytics
• Co-created Apache Arrow, a language-agnostic open-source library for efficient analytics on modern distributed CPUs and GPUs
• Wrote the classic O'Reilly Media desk reference "Python for Data Analysis"
• Has worked as technical expert at prestigious firms like Cloudera, RStudio PBC, Two Sigma, and AQR Capital Management
• Today serves as co-founder and CTO of Voltron Data

In this episode, Wes takes us on a technical deep-drive through:
• The creation story of his now-ubiquitous pandas library
• A sneak peek at the third edition of his international-bestselling book
• What the Apache Arrow project is and why it's poised to revolutionize the data science and software industries
• The software and hardware tools that he uses daily to be such an epically productive software developer and entrepreneur
• Responses to great questions by listeners Daniel, David, Doug, and Brett

The SuperDataScience show's available on all major podcasting platforms, YouTube, and at SuperDataScience.com.

In Data Science, Interview, Podcast, Professional Development, SuperDataScience, YouTube Tags superdatascience, datascience, python, dataanalytics, distributedcomputing, opensourcedevelopment, pandas
← Newer: The Gradient of Quadratic Cost Older: Data Tools vs. Data Platforms →
Back to Top