Home
Categories
EXPLORE
True Crime
Comedy
Business
Society & Culture
History
Sports
Health & Fitness
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts126/v4/ca/ae/47/caae47e5-6b4f-4ccc-085a-6104dab5331e/mza_16644954937649900406.jpg/600x600bb.jpg
Weaviate Podcast
Weaviate
131 episodes
5 days ago
Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.
Show more...
Technology
RSS
All content for Weaviate Podcast is the property of Weaviate and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/34794122/34794122-1704376570442-9264e631e6abf.jpg
REFRAG with Xiaoqiang Lin - Weaviate Podcast #130!
Weaviate Podcast
1 hour
5 days ago
REFRAG with Xiaoqiang Lin - Weaviate Podcast #130!

Xiaoqiang Lin is a Ph.D. student at the National University of Singapore. During his time at Meta, Xiaoqiang lead the research behind REFRAG: Rethinking RAG-based Decoding. Traditional RAG systems use vectors to retrieve relevant context with semantic search, but then throw away the vectors when passing the context to the LLM. REFRAG instead feeds the LLM these pre-compute vectors, achieving massive gains in long context processing and LLM inference speed! REFRAG makes Time-To-First-Token (TTFT) 31x faster and Time-To-Iterative-Token (TTIT) 3x faster, boosting overall LLM throughput by 7x while also being able to handle much longer contexts!


There are so many interesting aspects to this and I really loved diving into the details with Xiaoqiang! I hope you enjoy the podcast!

Weaviate Podcast
Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.