Join Josh and Zeljko live at HETT 2025 in London - covering the most exciting topics and highlights that are upcoming in AI for healthcare. Coming from the duo who are living and breathing AI for healthcare, and together, have worked across every area of healthTech - from the hospital frontlines, to university research, to NHS implementation, to building industry grade agents including AI scribes, computer control and digital twins, to product and compliance. This is one not to miss!
00:00 start and intro
    2:15 What are AI agents? (and why they're different from chatbots)
    3:52 AI scribes: the 150 company sprint to "scribe plus" features
    8:02 AI psychosis and mental health - all LLMs reinforce delusional beliefs
    9:34 Computer control: Automating hospital workflows by mimicking human actions
    13:42 Digital twins for health are the future: A safer path forward?
    18:40 How does the national health service become AI enabled?
    22:22 closing remarks - Is AI in healthcare a hype or hope?
    25:12 questions - digital twins for individuals or for cohorts?
    26:52 questions - Lessons from building AVTs and digital twins for consumer space
    29:02 questions - LLM clinical summarisation - risks and benefits
    31:17 questions - ethics of AI vs Human errors. is it the same?
    33:02 questions - challenges and barriers to AI deployment in NHS
👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
👨🏻⚕️ Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-auyeung/
🤖 Dev - Zeljko Kraljevic - https://twitter.com/zeljkokr
Follow us:
    YT - https://youtube.com/@DevAndDoc
    Spotify - https://podcasters.spotify.com/pod/show/devanddoc
    Apple - https://podcasts.apple.com/gb/podcast/dev-and-doc-ai-for-healthcare-podcast/id1751495120
    Substack - https://aiforhealthcare.substack.com/
For enquiries:
    📧 Devanddoc@gmail.com
Credits:
    🎞️ Editor - Dragan Kraljević - https://www.instagram.com/dragan_kraljevic/
    🎨 Brand design and art direction - Ana Grigorovici - https://www.behance.net/anagrigorovici027d
Whenever there was AI, there were benchmarks- from the turing test, to society-changing benchmarks like MNIST and ImageNet to modern problems like the ARC prize, benchmarked served a vital purpose to measure the performance of AI models. But something has shifted in modern times, in the LLM era have benchmarks lost their utility, becoming mere advertisement for big tech?
Even seemingly more sophisticated benchmarks like LM Arena can be gamed by tech giants. We also deep dive into healthcare benchmarks like OpenAI's Healthbench (deeply problematic) and Microsoft's AI-DXO orchestrator agent for diagnosis. Where is this all going? How do we make the perfect benchmark? Or is the real work to be done afterwards in the real world?
👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
---
Timestamps
00:00 Intro - The OG benchmarks - Turing test, MNIST, ImageNET
06:40 Are large language models benchmarks similar to humans taking tests?
10:05 Are we testing model capability vs production ready?
12:00 LLM era - data contamination
15:30 LM Arena - The leaderboard illusion paper - how big tech games benchmarks
28:35 Goodhart's law - When a measure becomes a target, it ceases to be a good measure
32:05 Some good benchmarks - games - Pokemon, ARC prize, Minecraft
34:35 Medical benchmarks - OpenAI's healthbench has some big problems
46:50 Microsoft AI-DXO orchestrator for case reports
---
Connect with Us
Your Hosts:
👨🏻⚕️ Doc - Dr. Joshua Au Yeung - LinkedIn
🤖 Dev - Zeljko Kraljevic - Twitter
Follow & Subscribe:
YT: https://youtube.com/@DevAndDoc
Spotify: Follow us on Spotify
Apple Podcasts: Listen on Apple Podcasts
Substack: https://aiforhealthcare.substack.com/
For enquiries:
📧 Devanddoc@gmail.com
---
Production Credits
🎞️ Editor: Dragan Kraljević - Instagram
🎨 Brand & Art: Ana Grigorovici - Behance
AI agents are here, but how did we get here in the first place? How do we build and leverage AI agents for high stakes domains like healthcare? In this episode of Dev and Doc, we go deep into the forest that is AI agents and computer control - starting from the "caveman" era of LLMs discovering tools, to cultivating intelligent models and agentic workflows. We dissect everyday agents like MANUS AI, and deep dive into how, where and when AI agents should be used. Are these agents hype or hope, is this actually the second deepseek moment?
👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
Episode Timestamps:
00:00 Highlight
3:13 start / intro
5:20 LLM's caveman era - tool usage
6:46 Agents have autonomy and interact with environment
11:15 workflows and agentic flows
15:30 when should you be using an agent?
24:27 vibe coding is like driving a car
29:07 Demo - MANUS gathering financial trends, computer control
35:55 Demo MANUS AI- website creation for Autism Assessment
49:05 computer control factions- Freedom vs Process automation
55:00 Autism website testing
59:13 summary + end
Hosts:
👨🏻⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-auyeung/
🤖Dev - Zeljko Kraljevic https://twitter.com/zeljkokr
Find us on:
YT - https://youtube.com/@DevAndDoc
Spotify - https://podcasters.spotify.com/pod/show/devanddoc
Apple- https://podcasts.apple.com/gb/podcast/dev-and-doc-ai-for-healthcare-podcast/id1751495120
Substack- https://aiforhealthcare.substack.com/
For enquiries:
📧Devanddoc@gmail.com
Credits:
🎞️ Editor- Dragan Kraljević https://www.instagram.com/dragan_kraljevic/
🎨Brand design and art direction - Ana Grigorovici https://www.behance.net/anagrigorovici027d
Claude sonnet 3.7 was released less than 48 hours ago, the model is highly intelligent and is one of the best we have seen in recent memory. Definitely passes the vibe check.
We give some amazing examples of coding with claude with few shot prompts, and cover technical and clinical evaluations and share our first thoughts. We even tested claude to take a patient history!
NB - PLEASE don't do this at home, obviously this is a demo and we do not in any way condone or recommend using an LLM as your doctor or healthcare provider, we are just demonstrating what the future could be. If you are sick, please seek a medical professional.
👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
👨🏻⚕️Doc - Dr. Joshua Au Yeung - linkedin.com/in/dr-joshua-auyeung
🤖Dev - Zeljko Kraljevic twitter.com/zeljkokr
Spotify:podcasters.spotify.com/pod/show/devanddoc
Apple:podcasts.apple.com/gb/podcast/dev-and-doc-ai-for-healthcare-podcast/id1751495120
Substack:aiforhealthcare.substack.com
For enquiries - 📧 Devanddoc@gmail.com
🎞️ Editor - Dragan Kraljević instagram.com/dragan_kraljevic
🎨 Brand design - Ana Grigorovici behance.net/anagrigorovici027d
Is the academic system broken in this publish-or-perish landscape? When is a PhD not worth pursuing?
In this Dev and Doc episode, Zeljko (now associate professor!) and Josh (doctor, PhD drop out) talk about the good and the bad of PhD life. They provide insight into the academic world with a focus on computer science and machine learning.
Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
For enquiries - devanddoc@gmail.com
Dev and Doc put Deepseek R1 to the test in a technical and clinical deep dive.
👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
 👨🏻⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-au-yeung/
 🤖Dev - Zeljko Kraljevic https://twitter.com/zeljkokr
TIMESTAMPS
  00:00 Highlights 
  04:36 Intro
  08:29 response from OpenAI, Anthropic- model training costs, tightening restrictions on China, pricing wars 
  13:13 what an open-source deepseek means for the world. 
  15:38 Sam altman and Dario amodei feeling the pressure 
  23:10 TECHNICAL deep dive - RLHF, ppo, dpo
  37:08 GRPO, R1s secret sauce 
  45:02 the aha moment, learning like a human?
  50:25 deepseek R1 training and controversy 
  59:08 deepseek healthcare evaluation - Ethnic Bias
  1:06:17 The diagnostic acid test (fail)
  1:12:46 Coding clinical data / Medical billing (shout out SNOMED)
  LinkedIn Newsletter https://www.linkedin.com/build-relation/newsletter-follow?entityUrn=7216474068085026817
  YT - https://youtube.com/@DevAndDoc
  Spotify - https://podcasters.spotify.com/pod/show/devanddoc
  Apple- https://podcasts.apple.com/gb/podcast/dev-and-doc-ai-for-healthcare-podcast/id1751495120
  Substack- https://aiforhealthcare.substack.com/
For enquiries - 📧Devanddoc@gmail.com
  🎞️ Editor- Dragan Kraljević  https://www.instagram.com/dragan_kraljevic/
  🎨Brand design and art direction - Ana Grigorovici https://www.behance.net/anagrigorovici027d
Dev and Doc - Latest News
Dev and Doc - Latest News
It's 2025, Dev and Doc cover the latest news including Google's deep research and notebook LM, DeepMind's Promptbreeder, and Anthropic's new RAG approach. We also go through what retrieval augmented generation (RAG) is, and how this technique is advancing LLM performance.
👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
Meet the Team
Where to Follow Us
Contact Us
📧 For enquiries - Devanddoc@gmail.com
Credits
Episode Timeline
References
Dev and Doc is joined by guest Annabelle Painter, doctor, CMO, and podcaster for the Royal Society of Medicine Digital Health Podcast. We deep dive into explainability and interpretability with concrete healthcare examples.
Check out Dr. Painter's Podcast here, she has some amazing guests and great insights into AI in healthcare! - https://spotify.link/pzSgxmpD5yb
👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
👨🏻⚕️ Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-auyeung/
🤖 Dev - Zeljko Kraljevic - https://twitter.com/zeljkokr
For enquiries - 📧 Devanddoc@gmail.com
🎞️ Editor - Dragan Kraljević - https://www.instagram.com/dragan_kraljevic/
🎨 Brand design and art direction - Ana Grigorovici - https://www.behance.net/anagrigorovici027d
Doc talks to Dr Derrick Khor - Cancer Doctor, HealthTech Consultant and Linkedin Guru. We share Derrick's insights from consulting over 120 companies and a step-by-step guide on how to build a successful Healthcare company. You can find more of Derrick and his helpful guides - https://adoptadoc.com/resources/ profile- https://www.linkedin.com/in/derrick-khor/ 👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
Dev&Doc is a podcast where doctors and developers deep dive into the potential of AI in healthcare.
👨🏻⚕️Doc - Dr. Joshua Au Yeung
🤖Dev - Zeljko Kraljevic
LinkedIn Newsletter
YouTube
Spotify
Apple
Substack
For enquiries - 📧 Devanddoc@gmail.com
<p>🎞️ Editor - <a href="https://www.instagram.com/dragan_kraljevic/">Dragan Kraljević</a></p>
<p>🎨 Brand design and art direction - <a href="https://www.behance.net/anagrigorovici027d">Ana Grigorovici</a></p>
Timestamps 00:00 Highlights and intro 3:01 Start 5:10 getting into health tech 8:03 lack of clinicians in start ups 15:07 Derrick's own healthtech journey to consulting 23:37 Start ups and failure 27:35 the start up road map 32:16 are you a medical device (samd)? Intended use 40:55 clinical evidence generation 48:16 go to market, NHS DTAC 57:57 power of networking, social media, linkedin 1:02:43 top UK health tech companies to look out for
Dev and Doc deconstruct digital biomarkers! This is a fascinating and nascent field in the world of medicine, how have biomarkers transformed the way we practice medicine, and how will AI and wearables, sensors and digital fingerprints transform the way we practice in the future?
Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)
find us on youtube- @Dev and Doc 📙Substack: https://aiforhealthcare.substack.com/👨🏻⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-auyeung/🤖Dev - Zeljko Kraljevic https://twitter.com/zeljkokr🎞️ Editor- Dragan Kraljević https://www.instagram.com/dragan_kraljevic/🎨Brand design and art direction - Ana Grigorovici https://www.behance.net/anagrigorovici027d
Timestamp 00:00 highlights 01:50 intro 02:40 how biomarkers evolved in the last century 6:02 what is the definition of a biomarker 10:00 biomarkers can be very biased depending on who you are testing 12:31 when does a test become a biomarker 17:30 the digital age and measurements - AI vision in retina scans, digital stethoscopes 23:50 what is an “analog” biomarker vs digital biomarker? 30:10 where do biomarkers fail in evidence based medicine? 34:55 Biomarkers are pretty poor for mental health 47:57 can AI predict depression better than humans? 51:21 Digital biomarkers to detect movement disorders 01:00:04 this can change clinical trials forever
Refs
- variable definitions of biomarkers https://informatics.bmj.com/content/31/1/e100914
-digital biomarkers convergence nature paper https://www.nature.com/articles/s41746-022-00583-z
-digital stethoscope for heart failure https://www.thelancet.com/pdfs/journals/landig/PIIS2589-7500(21)00256-9.pdf
-touch screen typing depression paper https://www.nature.com/articles/s41746-022-00583-z
- Duchennes body suit biomarker https://www.nature.com/articles/s41591-022-02045-1#Sec9
- Friedreichs ataxia body suit https://www.nature.com/articles/s41591-022-02159-6?fromPaywallRec=false#Sec9 
Dr Keith Grimes is a HealthTech consultant and General Practitioner working with companies to transform clinical ideas into something impactful. He worked as the digital health director in Babylon Health prior to its demise, and currently runs his own consulting firm, Curistica. This is one not to miss! References HealthTech consulting at Curistica www.curistica.com Prof Amanda Goodall on leadership theory https://amandagoodall.com/ For those interested in Leadership opportunities: -Faculty of medical leadership and management https://www.fmlm.ac.uk/ -Bite labs https://www.bitelabs.io/ <p>Dev&Doc is a podcast where doctors and developers deep dive into the potential of AI in healthcare.<br>
👨🏻⚕️Doc - <a href="https://www.linkedin.com/in/dr-joshua-auyeung/">Dr. Joshua Au Yeung</a><br>
🤖Dev - <a href="https://twitter.com/zeljkokr">Zeljko Kraljevic</a><br>
<a href="https://www.linkedin.com/build-relation/newsletter-follow?entityUrn=7216474068085026817">LinkedIn Newsletter</a><br>
<a href="https://youtube.com/@DevAndDoc">YouTube</a><br>
<a href="https://podcasters.spotify.com/pod/show/devanddoc">Spotify</a><br>
<a href="https://podcasts.apple.com/gb/podcast/dev-and-doc-ai-for-healthcare-podcast/id1751495120">Apple</a><br>
<a href="https://aiforhealthcare.substack.com/">Substack</a><br>
For enquiries - 📧 <a href="mailto:Devanddoc@gmail.com">Devanddoc@gmail.com</a>
</p>
🎞️ Editor- Dragan Kraljević https://www.instagram.com/dragan_kraljevic/ 🎨Brand design and art direction - Ana Grigorovici https://www.behance.net/anagrigorovici027d Timestamps 00:00 start 1:10 Career career career - GP, babylon health, digital consultancy 6:40 working as a rural GP in Scotland 9:21 time is the biggest factor of clinical impact 12:11 finding impact through data 21:29 leading by example 23:52 Should doctors be leading healthtech businesses? 30:10 why do healthtech start-ups not have clinicians earlier? 36:30 Babylon failure - importance of having clinical influence at the top 43:55 experience being grilled on BBC newsnight 49:45 lessons learnt from the downfall of Babylon 52:25 6 values of consulting firm Curistica 55:51 common problems in start ups 59:36 how AI will change the healthcare landscape
How do we align AI models for healthcare? 👨⚕️ And importantly, the moral codes and ethics that we practice everyday, how does the LLM deal with ethical scenarios like the trolley problem for example? This is a fascinating topic and one we spend a lot of time thinking about. In this episode Dev and Doc, Zeljko Kraljevic and I cover all the up to date topics around reinforcement learning, the benefits and where it can go wrong. We also discuss different RL methods including the algorithms used to train ChatGPT (RLHF). Dev and Doc is a Podcast where developers and doctors join forces to deep dive into AI in healthcare. Together, we can build models that matter. 👨🏻⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua... 🤖Dev - Zeljko Kraljevic https://twitter.com/zeljkokr The podcast 🎙️ 🔊Spotify: https://open.spotify.com/show/3QO5Lr3... 📙Substack: https://aiforhealthcare.substack.com/ Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :) 🎞️ Editor- Dragan Kraljević https://www.instagram.com/dragan_kral... 🎨Brand design and art direction - Ana Grigorovici https://www.behance.net/anagrigorovic...00:00 Highlights 01:27 start 4:38 aligning ethics of ai models 7:04 doctors ethical choices daily 8:00 RLHF and AI training methods 16:29 reinforcement learning 19:35 Preference model -rewarding models correctly can make or break the success 27:05 exploiting reward function, model degradation (and how to fix it) Ref AI intro paper - https://pn.bmj.com/content/23/6/476 Open AI RLHF paper - https://arxiv.org/abs/1909.08593 War and peace of LLMs! - https://arxiv.org/abs/2311.17227
In this episode Doc goes on an adventure to chair an LLM/ generative AI conference session and reflects on his experience. Dev and Doc also discuss big news on meta's Llama3 and Code LlaMa. Dev and Doc is a Podcast where developers and doctors join forces to deep dive into AI in healthcare. Together, we can build models that matter. 👨🏻⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-auyeung/ 🤖Dev - Zeljko Kraljevic https://twitter.com/zeljkokr The podcast 🎙️ 🔊Spotify: https://open.spotify.com/show/3QO5Lr3w4Rd6lqwlfKDaB7?si=e7915d844994403e 📙Substack: https://aiforhealthcare.substack.com/ Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :) 🎞️ Editor- Dragan Kraljević https://www.instagram.com/dragan_kraljevic/ 🎨Brand design and art direction - Ana Grigorovici https://www.behance.net/anagrigorovici027d 00:00 Highlight 00:36 Start 1:57 Are researchers just using Generative AI to get presentations /publications? 6:18 Hype cycles , lack of real world clinical studies using LLMs 8:08 LlaMa3 , Code LlaMa announcement and insights 13:30 Google bard / Gemini ultra second on leaderboard 17:30 wrap up and end
Dev And Doc are back ! Here we break down the biggest highlights of 2023, and AI predictions for 2024. Dev and Doc is a Podcast where developers and doctors join forces to deep dive into AI in healthcare. Together, we can build models that matter. 👨🏻⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-auyeung/ 🤖Dev - Zeljko Kraljevic https://twitter.com/zeljkokr 00:00 start 01:01 Intro, Advancing LLMs in healthcare 07:10 Ambient note documentation in Medicine 10:52 Meta LLaMa are the good guys ? 14:40 GPT store 19:40 Overhyped Google Gemini model 26:17 AGI again 29:05 6 big predictions Open source vs Closed source models 38:55 AI in healthcare- LLM clinical trials , AI drug discovery 42:05 end References GPT store- https://openai.com/blog/introducing-the-gpt-store Hugging face predictions- https://twitter.com/ClementDelangue/status/1729158744762626310 AI drug discovery (blog post to paper) - https://news.mit.edu/2023/using-ai-mit-researchers-identify-antibiotic-candidates-1220 Google AMIE blog - https://blog.research.google/2024/01/amie-research-ai-system-for-diagnostic_12.html The podcast 🎙️ 🔊Spotify: https://open.spotify.com/show/3QO5Lr3w4Rd6lqwlfKDaB7?si=e7915d844994403e 📙Substack: https://aiforhealthcare.substack.com/ Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :) 🎞️ Editor- Dragan Kraljević https://www.instagram.com/dragan_kraljevic/ 🎨Brand design and art direction - Ana Grigorovici https://www.behance.net/anagrigorovici027d
We have conversations between doctors and developers exploring the potential of AI in healthcare Josh is a training Neurologist in the NHS, and AI researcher in St Thomas' hospital and King's College Hospital. He is also a PhD student at King's College London. Zeljko is an AI researcher and PhD student at King's College London, as well as a CTO for a natural language processing company.