Home
Categories
EXPLORE
True Crime
Comedy
Business
Society & Culture
History
Sports
Health & Fitness
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/ce/ef/59/ceef597b-a627-2ee4-f95b-3c0e813acd75/mza_18377625926653678381.jpg/600x600bb.jpg
Hello SundAI - our world through the lense of AI
Roger Basler de Roca
52 episodes
1 week ago
"Hello SundAI - Our World Through the Lens of AI," is your twice-weekly dive into how artificial intelligence shapes our digital landscape. Hosted by Roger and SundAI the AI, this podcast brings you practical tips, cutting-edge tools, and insightful interviews every Sunday and Wednesday morning. Whether you're a seasoned tech enthusiast or just starting to explore the digital domain, tune in to discover innovative ways to get things done and propel yourself forward in a world increasingly driven by AI. Our hashtag is: #helloSundai
Show more...
Business
RSS
All content for Hello SundAI - our world through the lense of AI is the property of Roger Basler de Roca and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
"Hello SundAI - Our World Through the Lens of AI," is your twice-weekly dive into how artificial intelligence shapes our digital landscape. Hosted by Roger and SundAI the AI, this podcast brings you practical tips, cutting-edge tools, and insightful interviews every Sunday and Wednesday morning. Whether you're a seasoned tech enthusiast or just starting to explore the digital domain, tune in to discover innovative ways to get things done and propel yourself forward in a world increasingly driven by AI. Our hashtag is: #helloSundai
Show more...
Business
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/1600094/1600094-1726908230369-7cbdb65744fd1.jpg
The Byte Latent Transformer (BLT): A Token-Free Approach to LLMs
Hello SundAI - our world through the lense of AI
15 minutes 6 seconds
7 months ago
The Byte Latent Transformer (BLT): A Token-Free Approach to LLMs

The Byte Latent Transformer (BLT) is a novel byte-level large language model (LLM) that processes raw byte data by dynamically grouping bytes into entropy-based patches, eliminating the need for tokenization.

  • Dynamic Patching: BLT segments data into variable-length patches based on entropy, allocating more computation where complexity is higher—unlike token-based models that treat all tokens equally.
  • Efficiency & Robustness: BLT matches tokenized LLM performance while improving inference efficiency (using up to 50% fewer FLOPs) and enhancing robustness to noisy inputs and character-level tasks.
  • Scalability: Scaling studies up to 8B parameters and 4T training bytes show that BLT achieves better scaling trends at a fixed inference cost than token-based models.
  • Architecture:
  • Entropy-Based Patching: A small byte-level model estimates entropy to determine patch boundaries, allocating more compute to complex sequences (e.g., word beginnings).
  • Performance Gains: BLT achieves parity with Llama 3 in FLOP-controlled training and outperforms it in character-level tasks and low-resource translation.
  • Patch Size Scaling: Larger patches (e.g., 8 bytes) improve scaling efficiency by reducing latent transformer compute needs, enabling larger model sizes within a fixed inference budget.
  • "Byte-ifying" Tokenizers: Pre-trained token-based models (e.g., Llama 3.1) can initialize BLT’s transformer, leading to faster convergence and improved performance on specific tasks.

BLT introduces a fundamentally new approach to LLMs, leveraging raw bytes instead of tokens for more efficient, scalable, and robust language modeling.


This is Hello Sunday - the podcast in digital business where we look back and ahead, so you can focus on next weeks challenges


Thank you for listening to Hello Sunday - make sure to subscribe and spread the word, so others can be inspired too


Hello SundAI - our world through the lense of AI


Disclaimer: This podcast is generated by Roger Basler de Roca (contact) by the use of AI. The voices are artificially generated and the discussion is based on public research data. I do not claim any ownership of the presented material as it is for education purpose only.

⁠https://rogerbasler.ch/en/contact/


Hello SundAI - our world through the lense of AI
"Hello SundAI - Our World Through the Lens of AI," is your twice-weekly dive into how artificial intelligence shapes our digital landscape. Hosted by Roger and SundAI the AI, this podcast brings you practical tips, cutting-edge tools, and insightful interviews every Sunday and Wednesday morning. Whether you're a seasoned tech enthusiast or just starting to explore the digital domain, tune in to discover innovative ways to get things done and propel yourself forward in a world increasingly driven by AI. Our hashtag is: #helloSundai