Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
937 episodes
4 days ago
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.
Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.
We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
All content for Super Data Science: ML & AI Podcast with Jon Krohn is the property of Jon Krohn and is served directly from their servers
with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.
Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.
We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
How much power – and risk – do we carry around with us in our pockets? A Reuters investigation about how easily LLMs can be utilized for online phishing scams is the subject of this week’s Five-Minute Friday with Jon Krohn. By asking six of the most popular LLMs (Grok, ChatGPT, Meta AI, Claude, DeepSeek and Gemini) to generate phishing emails specifically targeting elderly people, Reuters found the safety sometimes severely lacking in the models. Listen to the episode to hear Jon quantify this problem with real-world examples, why mere content warnings in LLM models don’t work, and the troubling results of the phishing requests.
Additional materials: www.superdatascience.com/936
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Jon Krohn speaks to researcher, broadcaster and author Stephanie Hare about how the Hippocratic Oath might apply to artificial intelligence, and a guiding ethos for pushing innovation while protecting users from harm. A code of conduct, she says, could be one approach to ensuring that people are using technology more mindfully and ethically, as well as an opportunity for users to feel that they belong to a wider, global community. Although she sympathizes with people concerned by overregulation undermining innovation, Stephanie also notes that we expect certain standards to be met elsewhere, such as vehicle and drug safety, as well as fair journalistic practices. As Stephanie explains, we need to find a realistic middle ground between innovation and regulation.
This episode is brought to you by the Dell, by Intel, by Fabi and by Gurobi.
Additional materials: www.superdatascience.com/935
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(01:23) What ‘technology ethics’ is
(14:46) Developing a Hippocratic Oath for tech
(42:32) How to protect against sensationalism
(53:38) How to maintain a balance of growth and infrastructure
With the number of jobs dramatically slowing in the last year, many question if this decline is down to companies turning to AI for completing entry-level tasks in particular. Research published earlier this month by Yale University shows no major difference in the types of roles and tasks in so-called `white-collar jobs` since late 2022, an auspicious date that coincides with the launch of ChatGPT. In this week‘s Five-Minute Friday, host Jon Krohn discusses if and when AI will undercut junior-level jobs, particularly in the US.
Additional materials: www.superdatascience.com/934
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Sheamus McGovern, CEO of Open Data Science, takes Jon Krohn and his listeners on a journey to launching his popular data science and AI conference, now in its tenth year, as well as the great shifts to the fields that he has seen on the way. For Seamus, the growth of his Open Data Science Conference has shown him that an AI engineer is just the beginning of several roles that will emerge from the industry. He asks Jon to consider the breadth of tasks demanded of today’s engineers, from data profiling and transformation to feature engineering, hyper-parameter tuning, and model deployments. Just as the AI engineer emerged from the data scientist role, Seamus expects the industry to respond to the broadening range of projects and tools with new, niche, and dynamic job roles.
This episode is brought to you by the Trainium2, the latest AI chip from AWS, by Gurobi, by Dell and by Intel.
Additional materials: www.superdatascience.com/933
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(02:50) Why Seamus started ODSC
(18:27) The differences in AI engineers and data scientists
(24:20) How to keep up with AI’s rapid pace
(33:51) How people hire for AI orchestration
(46:26) How companies can get team skillsets right
Larissa Schneider speaks to Jon Krohn in this Feature Friday about finding the right time to invest in AI solutions, and when it’s better to build them yourself. She discusses her work leading global strategy and operations at Unframe, and how they raised $50 million in venture capital since the company’s launch in March 2025.
Additional materials: www.superdatascience.com/932
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
AI predictions, and how to act on them: Data Science Strategist at Gurobi, Jerry Yurchisin, speaks to Jon Krohn about how mathematical optimization helps enterprises automate decisions for business success and where to find the resources to make it happen.
This episode is brought to you by the ODSC, the Open Data Science Conference, by Fabi, by Dell, and by Intel.
Additional materials: www.superdatascience.com/931
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(02:34) What mathematical optimization is
(13:58) How to get started with mathematical optimization
(45:56) Gurobi’s use cases
(56:29) Quantum computing and mathematical optimization
Jon Krohn’s highlights from this month of interviews focus on ways to future-proof your career, looking at the hardware that will get you the most mileage, the emerging roles that are well worth a look, and the developments in AI that will endure in a field constantly testing the durability of its own breakthroughs.
Additional materials: www.superdatascience.com/930
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Breaking news: Jon Krohn welcomes Adrian Kosowski to the show to talk about the groundbreaking research happening at Pathway. Adrian and his team demonstrate how they have brought attention in AI closer to the way the brain functions, creating, in essence, a “massively parallel system of [artificial] neurons” that communicate with one another and exhibit properties similar to natural neurons. The goal is to move beyond the current limitations of transformers, where reasoning can be generalized across more complex and extended reasoning patterns, approximating a more human-like approach to problem-solving.
This episode is brought to you by the Trainium2, the latest AI chip from AWS, by Dell, by Intel, by and Gurobi.
Additional materials: www.superdatascience.com/929
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(01:27) Pathway’s ground-breaking new biologically inspired architecture
(20:40) Limitless context windows
(34:39) BDH architecture as positive space
(53:11) Building multilingual models
(1:01:07) How to access the BDH architecture
Prompt injections, malicious code, and AI agents: In this week’s Five-Minute Friday, Jon Krohn looks into the current security weaknesses found in AI systems. A structural vulnerability that The Economist dubs a “lethal trifecta” could cause havoc for AI users, unless we take the necessary steps to contain our systems.
Additional materials: www.superdatascience.com/928
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Earlier this year, David Loker joined CodeRabbit as their Director of AI. As more people come to write code with the help of large language models, David believes CodeRabbit will become a helpful assistant for code reviewing and pull requests. He tells Jon Krohn how CodeRabbit assists developers with real-time feedback, as well as the reality of vibe coding, the optimization challenges of agentic AI, and other pressing questions in AI and tech.
This episode is brought to you by the Dell, by Intel, by Gurobi and by ODSC, the Open Data Science Conference.
Additional materials: www.superdatascience.com/927
In this episode you will learn:
(01:26) How CodeRabbit helps with coding
(17:30) Context engineering in context
(40:40) How CodeRabbit keeps data secure
(46:10) David’s thoughts on “vibe coding”
(1:03:04) If machines will ever be truly creative
In this Five-Minute Friday, Jon Krohn explores how AI is reshaping the legal industry. He investigates how AI tools are helping lawyers make conclusions faster, how paralegals are being retrained, and the latest in-demand role in law (hint: It concerns AI). Listen to hear how Harvey AI and Thomson Reuters’ CoCounsel are using AI to help lawyers get ahead.
Additional materials: www.superdatascience.com/926
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Tech innovation’s dependence on economic systems, trust in technology throughout history, and job displacement through AI: The Dieter Schwartz Associate Professor of AI and work at the University of Oxford, Carl Benedikt Frey, talks to Jon Krohn about his latest book, How Progress Ends, as well as how different economic systems deal with innovation and scaling, dealing with the homogeneity of generative AI output, and how to stay afloat in the new wave of job automation.
This episode is brought to you by the Dell, by Intel, by ODSC, the Open Data Science Conference and by Gurobi.
Additional materials: www.superdatascience.com/925
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(04:00) All about How Progress Ends: Technology, Innovation, and the Fate of Nations
(14:26) The role of weak ties in driving technological innovation
(18:22) How to keep innovating as a big business
(48:05) What we can learn and apply from previous industrial revolutions
(54:33) How workers can try to ‘future-proof’ themselves
MIT lab NANDA (“Networked AI Agents in Decentralized Architecture”) reveals less than promising results for the future of AI adoption in businesses. According to “The GenAI Divide: State of AI in Business 2025”, a whopping 95% of enterprise AI projects “are getting zero return” on their $30-40 billion investment. Jon Krohn takes this Five-Minute Friday to look into why this has happened, with help from a critical response to the report written by Futuriom’s R. Scott Raynovich.
Additional materials: www.superdatascience.com/924
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Graphs, but not as you would expect them: Graph analytics guru Amy Hodler speaks to Jon Krohn about the graph data structure and graph applications, graph algorithms, graph RAG, and graphs as memory systems for AI agents. We can use graphs in a surprising number of ways. Money laundering and fraud, as well as supply-chain crime, leave breadcrumbs at multiple “touch-points” over time, behaviors that graphs are better suited to reveal than rows and tables. Amy sees that most interest in graphs has been in the cybersecurity space. But this work isn’t only restricted to fighting crime! Listen to the episode to hear more case examples and how to get into graph work.
This episode is brought to you by the Dell, by the Intel, by ODSC, the Open Data Science Conference and by Gurobi.
Additional materials: www.superdatascience.com/923
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
01:49) A brief history of graphs
(10:08) Uncovering fraud with graphs
(28:31) Where graphs are most commonly applied, to date
(34:49) Retrieval augmented generation graphs
(48:04) The future of graphs
Hugo Dozois-Caouette speaks to Jon Krohn about his startup MaintainX and how he secured $100 million in venture capital. MaintainX manages and maintains computerized maintenance management systems (CMSs), or work-execution software, for the industrial and manufacturing industries. This “digitized version of a clipboard” with the help of web and mobile applications, provide a list of procedures, guidelines and regulations to help increase worker productivity and give a company the data-driven insights it needs to refine its processes. Listen to the episode to hear Hugo’s thoughts on the gaps in the manufacturing industry that technology can fill, the tech stack used by MaintainX, and the discrepancy of information in manufacturing environments.
Additional materials: www.superdatascience.com/922
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Using Windows for AI development and the bleeding edge of NPUs: Shirish Gupta and Ish Shah from Dell Technologies speak to Jon Krohn about the latest products from Dell, the future of neural-processing units (NPUs), and how AI developers can make sound hardware investments.
This episode is brought to you by the Trainium2, the latest AI chip from AWS, by ODSC, the Open Data Science Conference and by Gurobi.
Additional materials: www.superdatascience.com/921
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(04:18) Why Windows still outranks other operating systems
(20:58) The difference between GPUs and NPUs
(32:44) How to access and use Dell’s NPUs and GPUs
(49:08) Using processing units on the cloud versus locally
(57:43) About the Dell Pro Max
This month’s episode of In Case You Missed It gives us reasons to be cautiously optimistic about the future of large language models (LLMs), with guests discussing what to do about recent reports that found AI agents blackmailed human users when threatened, the importance of post-training LLMs, and the training we have available for data and AI engineers to create robust, secure, and useful AI. Jon Krohn includes clips from his interviews with Akshay Agrawal (Episode 911), Julien Launay (Episode 913), Michelle Yi (Episode 915), and Kirill Eremenko (Episode 917).
Additional materials: www.superdatascience.com/920
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
PyTorch, AGI, and the future of alignment research: Aurélien Géron joins Jon Krohn in this live interview to talk about the fourth edition of his bestselling Hands-On Machine Learning as well as what superintelligence makes him hopeful for, as well as what concerns him about machines surpassing human intelligence.
This episode is brought to you by Gurobi and by the Dell AI Factory with NVIDIA
Additional materials: www.superdatascience.com/919
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(02:04) Why Aurélien wrote Hands-On Machine Learning
(20:54) How Aurélien came to decide on material for the new edition
(28:53) Aurélien’s predictions for AGI
(51:21) How to support alignment research
(1:13:42) Does superintelligence mean super-capability
In this Five-Minute Friday, Jon Krohn introduces listeners to CrewAI, an open-source Python framework that can create and manage multi-agent teams. The clue is in the title: CrewAI assembles specialized agents into single “crews” that achieve complex goals between them. CrewAI’s agent teams can also learn and iterate, meaning that after the crew has achieved its goals for the first time, they can refine and tailor their approach to future goals.
Additional materials: www.superdatascience.com/918
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Founder of SuperDataScience, Kirill Eremenko, talks to Jon Krohn about how he found the best tools and approaches to help launch his 8-week AI engineering bootcamp. He breaks down the topics participants cover each week, and he also shares his tips with listeners who might want to start their own tech bootcamp or sign up for SuperDataScience’s September 2025 cohort.
This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference
Additional materials: www.superdatascience.com/917
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(10:58) Weeks 1-4 of the SuperDataScience bootcamp
(37:52) How to use AI to drive the bottom line in business
(47:50) Weeks 5-8 of the SuperDataScience bootcamp
(54:50) How to convert LLMs to agents
(1:09:33) Jon’s feedback on the SuperDataSciencebootcamp
Super Data Science: ML & AI Podcast with Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.
Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.
We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.