Don't miss this mind-blowing episode with Jason Warner, who compellingly argues that code-specialized LLMs will bring about AGI. His firm, poolside, was launched to achieve this and facilitate an "AI-led, developer-assisted" coding paradigm en route.
Jason:
• Is Co-Founder and CEO of poolside, a hot venture capital-backed startup that will shortly be launching its code-specialized Large Language Model and an accompanying interface designed specifically for people who code, such as software developers and data scientists.
• Previously was Managing Director at the renowned Bay-Area VC Redpoint Ventures.
• Before that, held a series of senior software-leadership roles at major tech companies, including serving as CTO of GitHub and overseeing Product Engineering for Ubuntu.
• Holds a degree in computer science from Penn State University and a Master's in CS from Rensselaer Polytechnic Institute.
Today’s episode should be fascinating to anyone keen to stay abreast of the state of the art in A.I. today and what could happen in the coming years.
In today’s episode, Jason details:
• Why a code-generation-specialized LLM like poolside’s will be far more valuable to humans who code than generalized LLMs like GPT-4 or Gemini.
• Why he thinks AGI itself will be brought about by a code-specialized ML model like poolside’s.
The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.
Data Mesh
"Data Mesh" may be the trendiest term in data science. What is it and how will its Distributed A.I. transform your organization? The founder of the Data Mesh concept herself, Zhamak Dehghani, explains in this episode.
Zhamak:
• Authored the O'Reilly Media book "Data Mesh" and also co-authored an O’Reilly book on software architecture.
• Recently became CEO and founder of a stealth tech startup reimagining the future of the data developer experience through the Data Mesh.
• Previously worked as a software engineer, software architect, and as a technology incubation director.
• Holds a Bachelor of Engineering degree in Computer Software from Shahid Beheshti University in Iran and a Master's in Information Technology Management from the University of Sydney in Australia.
Today’s episode should be broadly interesting to anyone who’s keen to get a glimpse of the future of how organizations will work with data and A.I.
In this episode, Zhamak details:
• What a data mesh is.
• Why data meshes are essential today and will be even more so in the coming years.
• The biggest challenges of distributed data architectures.
• Why now was the right time for her to launch her own data mesh startup.
• Her tricks for keeping pace with the rapid pace of tech progress.
Geospatial Data and Unconventional Routes into Data Careers
This week, the remarkably well-read Christina Stathopoulos details open-source software for working with geospatial data, as well as how you can navigate your data-career path, no matter what your background.
Christina:
• Has worked at Google for nearly five years in several data-centric roles.
• For the past year, she’s worked as an Analytical Lead for Waze, the popular crowdsourced navigation app owned by Google.
• Is also an adjunct professor at IE Business School in Madrid, where she teaches courses on business analytics, machine learning, data visualization, and data ethics.
• Previously worked as a data engineer at media analytics giant Nielsen.
• Holds a Master’s in Business Analytics and Big Data from IE Business School and a Bachelor’s in Science, Tech, and Society from North Carolina State University.
Today’s episode will appeal to a broad audience of technical and non-technical listeners alike.
In this episode, Christina details:
• Geospatial data and open-source packages for working with it.
• Her tips for getting a foothold in a data career if you come from an unconventional background.
• Guidance to help women and other underrepresented groups thrive in tech.
• The hard and soft skills most essential to success in a data role today.
• Her #bookaweekchallenge and her top data book recommendations.
R in Production
Dutch national-podium-level powerlifter Veerle van Leemput joins me this week to detail how R is not only an option for production, but may in fact be the *best* production option if data models are central to your application.
Over the course of the episode, Veerle runs down for us her favorite R tools for:
• Data gathering
• Model development
• Deployment into production systems
Veerle has held a number of data-science leadership roles at Dutch companies. She now serves as Managing Director and Head of Data Science at Analytic Health, a London-based firm that builds data-centric software for the healthcare industry. And she was silver medalist in the 57kg class of the 2021 Dutch national powerlifting championships with a total of 335kg (~739 pounds) across the back squat, bench press, and deadlift.
Listen or watch here.
The Price of Your Attention
Time is money. Every second of your life is yours to use, and one of your options is to use it to generate income. You can do this by the hour or, as a data scientist, by investing time in a digitally shareable product with a huge potential ROI.
Listen or watch here.
How to Thrive as an Early-Career Data Scientist
Getting started in data science? Today's episode is for you! Sidney Arcidiacono is absolutely crushing her first year in the field; we discuss the options for getting started and her top tips for early-career success.
Trained as a phlebotomist (blood-sample collection), Sidney was inspired by the potential for machine learning to revolutionize healthcare, so she jumped feet first into a full-time computer science degree at Make School, specializing in the data science track. From no familiarity with code or models just a year ago, Sidney's immersion has paid off: she's now fluent in the modern data science software stack and has landed a summer data science internship at GreenLight Biosciences, Inc., a firm developing RNA-molecule therapeutics (RNA being the molecule behind the Pfizer/BioNTech and Moderna vaccines).
Sidney is terrifically sharp and engaging; I think you'll enjoy hearing from her as much as I did during filming.
Watch or listen here.