Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
History
Technology
Fiction
About Us
Contact Us
Copyright
© 2024 PodJoint
Loading...
0:00 / 0:00
Podjoint Logo
US
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/0e/a0/6b/0ea06bef-d724-4ebf-61db-69b969390a17/mza_9075267775248923469.jpg/600x600bb.jpg
Programming Throwdown
Patrick Wheeler and Jason Gauci
183 episodes
18 hours ago
Programming Throwdown educates Computer Scientists and Software Engineers on a cavalcade of programming and tech topics. Every show will cover a new programming language, so listeners will be able to speak intelligently about any programming language.
Show more...
How To
Education,
News,
Tech News
RSS
All content for Programming Throwdown is the property of Patrick Wheeler and Jason Gauci and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Programming Throwdown educates Computer Scientists and Software Engineers on a cavalcade of programming and tech topics. Every show will cover a new programming language, so listeners will be able to speak intelligently about any programming language.
Show more...
How To
Education,
News,
Tech News
https://img.transistor.fm/bCwzToD5jY0S0vtUMWXCsqBr7Qu4WOoZF9SDEAcfhRs/rs:fill:3000:3000:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9mYmRl/NzI3NzNhZDdjYmU5/ZDFiNDA0MmU1Mjdk/N2M0NC5qcGc.jpg
177: Vector Databases
Programming Throwdown
1 hour 28 minutes
8 months ago
177: Vector Databases

Intro topic:  Buying a Car

News/Links:

  • Cognitive Load is what Matters
    • https://github.com/zakirullin/cognitive-load
  • Diffusion models are Real-Time Game Engines
    • https://gamengen.github.io/
  • Your Company Needs Junior Devs
    • https://softwaredoug.com/blog/2024/09/07/your-team-needs-juniors
  • Seamless Streaming / Fish Speech / LLaMA Omni
    • Seamless: https://huggingface.co/facebook/seamless-streaming
    • Fish: https://github.com/fishaudio/fish-speech 
    • LLaMA Omni: https://github.com/ictnlp/LLaMA-Omni 

Book of the Show

  • Patrick: 
    • Thought Emporium Youtube
      • https://youtu.be/8X1_HEJk2Hw?si=T8EaHul-QMahyUvQ
  • Jason: 
    • Novel Minds
      • https://www.novelminds.ai/


Patreon Plug https://www.patreon.com/programmingthrowdown?ty=h


Tool of the Show

  • Patrick: 
    • Escape Simulator
      • https://pinestudio.com/games/escape-simulator/
  • Jason: 
    • Cursor IDE
      • https://www.cursor.com/

Topic: Vector Databases (~54 min)

  • How computers represent data traditionally
    • ASCII values
    • RGB values
  • How traditional compression works
    • Huffman encoding (tree structure)
    • Lossy example: Fourier Transform & store coefficients
  • How embeddings are computed
    • Pairwise (contrastive) methods
    • Forward models (self-supervised)
  • Similarity metrics
  • Approximate Nearest Neighbors (ANN)
  • Sub-Linear ANN
    • Clustering
    • Space Partitioning (e.g. K-D Trees)
  • What a vector database does
    • Perform nearest-neighbors with many different similarity metrics
    • Store the vectors and the data structures to support sub-linear ANN
    • Handle updates, deletes, rebalancing/reclustering, backups/restores
  • Examples
    • pgvector: a vector-database plugin for postgres
    • Weaviate, Pinecone 
    • Milvus

★ Support this podcast on Patreon ★
Programming Throwdown
Programming Throwdown educates Computer Scientists and Software Engineers on a cavalcade of programming and tech topics. Every show will cover a new programming language, so listeners will be able to speak intelligently about any programming language.