Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
News
Sports
TV & Film
About Us
Contact Us
Copyright
© 2024 PodJoint
Podjoint Logo
US
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/29/46/00/294600db-5fd1-ea40-c100-ab260302c68e/mza_8888261894088655092.jpg/600x600bb.jpg
ArchiCraft: Solution Architecture Insights for AI Engineering
Dmytro Golodiuk
15 episodes
1 day ago
ArchiCraft is a podcast about modern software and solution architecture, enterprise strategies, and the future of AI-driven engineering. Each episode is AI-generated from original blog posts by Dmytro Golodiuk, based on real-world experience and hands-on insights. Explore the full archive at https://www.golodiuk.com/news.
Show more...
Technology
RSS
All content for ArchiCraft: Solution Architecture Insights for AI Engineering is the property of Dmytro Golodiuk and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
ArchiCraft is a podcast about modern software and solution architecture, enterprise strategies, and the future of AI-driven engineering. Each episode is AI-generated from original blog posts by Dmytro Golodiuk, based on real-world experience and hands-on insights. Explore the full archive at https://www.golodiuk.com/news.
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_episode/43947566/43947566-1751034085986-2ba06855426c5.jpg
#002 - How long to train a 70B LLM on 15T tokens using 1024 H100s?
ArchiCraft: Solution Architecture Insights for AI Engineering
11 minutes 27 seconds
4 months ago
#002 - How long to train a 70B LLM on 15T tokens using 1024 H100s?

Ever wondered what it really takes to train a massive AI model like the ones powering the latest tech? We move beyond speculation and get down to the numbers.

In this episode, we answer a very specific question: How long would it actually take to train a 70-billion parameter Large Language Model on a colossal 15-trillion token dataset using a supercomputer cluster of 1024 NVIDIA H100 GPUs?

Join us as we unpack this question and calculate the answer from two different angles:

  1. The Top-Down Approach: Using real-world performance benchmarks published by NVIDIA.

  2. The Bottom-Up Approach: Building a fundamental calculation from scratch based on total Floating-Point Operations (FLOPs) and system efficiency, also known as Model FLOPS Utilization (MFU).

Whether you're an AI practitioner, a tech enthusiast, or just curious about the scale of modern computation, this episode provides a concrete look at the time, resources, and complexity behind building state-of-the-art artificial intelligence.

Thank you for listening! ❤️


CONNECT WITH DMYTRO

  • LinkedIn: ⁠https://www.linkedin.com/in/dimanngo/⁠

  • Facebook: ⁠https://www.facebook.com/EnterpriseArchitectureServices/⁠

  • Email: ⁠info@golodiuk.com⁠


EPISODE LINKS (ORIGINAL BLOG POSTS)

Find the full blog post and all the calculations here: How Long to Train a 70B LLM on 15T Tokens with 1024 H100s

This podcast episode is an AI-narrated version of the original text-based articles from Dmytro's personal blog, which you can find at ⁠www.golodiuk.com/news⁠.


ABOUT Dmytro | ⁠www.golodiuk.com⁠

Dmytro Golodiuk is a highly experienced technology professional with over 17 years in the software industry. His proficiency spans cloud computing, enterprise platforms, software development, and integration technologies, with deep expertise in the Microsoft ecosystem. Dmytro combines his technical knowledge with formal Enterprise Architecture frameworks like TOGAF and ArchiMate to deliver robust and practical solutions.

In addition to his architectural work, Dmytro is a passionate mentor dedicated to helping others grow in their IT careers. ⁠https://mentor.sh/mentors/dmytro_golodiuk⁠


MENTORSHIP and WHAT I OFFER

  • A CLEAR ROADMAP: I'll help you forge the path from technical expertise to architectural vision. My focus isn't on specific technologies – you've got that covered. Instead, we'll concentrate on the strategic thinking, communication, and leadership skills that define a successful architect.

  • BRIDGING THE GAPS: Together, we'll identify and close the crucial gaps between a senior engineering role and the holistic view required of an architect.

  • FOSTERING YOUR GROWTH: My mentorship is about cultivating your ability to see the bigger picture, to design robust and effective solutions, and to communicate complex ideas with simplicity and impact.

  • ARCHITECT READY CV PROFILE OPTIMISATION: I'll help you transform your engineering CV into a strategic narrative that compellingly showcases your architectural potential, leadership, and strategic contributions to resonate powerfully with hiring managers.

  • ACE YOUR ARCHITECT INTERVIEW: I’ll prepare you for the full spectrum of interview scenarios.

IF YOU'RE A MID TO SENIOR ENGINEER WHO

  • Aspires to become a Solution Architect.

  • Recognizes the need to develop beyond deep technical skills.

  • Is ready to embrace the mindset and responsibilities of an architect.

Then I'm the mentor you're looking for. Let's work together to unlock your potential and lay the bridge to your future as a Solution Architect.

⁠https://mentor.sh/mentors/dmytro_golodiuk⁠

ArchiCraft: Solution Architecture Insights for AI Engineering
ArchiCraft is a podcast about modern software and solution architecture, enterprise strategies, and the future of AI-driven engineering. Each episode is AI-generated from original blog posts by Dmytro Golodiuk, based on real-world experience and hands-on insights. Explore the full archive at https://www.golodiuk.com/news.