Stanford MLSys Seminar
Dan Fu, Karan Goel, Fiodar Kazhamakia, Piero Molino, Matei Zaharia, Chris Ré
24 episodes
4 days ago
Machine learning is driving exciting changes and progress in computing. What does the ubiquity of machine learning mean for how people build and deploy systems and applications? What challenges does industry face when deploying machine learning systems in the real world, and how can academia rise to meet those challenges? Updates every Monday and Friday: old episodes on Mondays, new episodes on Fridays! Check out our website and our YouTube channel for full videos! https://mlsys.stanford.edu/ https://www.youtube.com/channel/UCzz6ructab1U44QPI3HpZEQ
Technology
All content for Stanford MLSys Seminar is the property of Dan Fu, Karan Goel, Fiodar Kazhamakia, Piero Molino, Matei Zaharia, Chris Ré and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
2/3/22 #53 Cody Coleman - Data Selection for Data-Centric AI
Stanford MLSys Seminar
55 minutes 25 seconds
3 years ago
Cody Coleman - Data Selection for Data-Centric AI: Data Quality Over Quantity

Data selection methods, such as active learning and core-set selection, improve the data efficiency of machine learning by identifying the most informative data points to label or train on. Across the data selection literature, there are many ways to identify these training examples. However, classical data selection methods are prohibitively expensive to apply in deep learning because of the larger datasets and models. This talk will describe two techniques to make data selection methods more tractable. First, "selection via proxy" (SVP) avoids expensive training and reduces the computation per example by using smaller proxy models to quantify the informativeness of each example. Second, "similarity search for efficient active learning and search" (SEALS) reduces the number of examples processed by restricting the candidate pool for labeling to the nearest neighbors of the currently labeled set instead of scanning over all of the unlabeled data. Both methods lead to order-of-magnitude performance improvements, making active learning applications on billions of unlabeled images practical for the first time.
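
To make the two ideas in the abstract concrete, here is a minimal, illustrative sketch in plain Python. It is not the speaker's implementation. The function names (`nearest_neighbor_pool`, `select_batch`, `toy_proxy`), the entropy-based uncertainty score, and the brute-force neighbor search are all assumptions chosen for readability. At real scale the neighbor lookup would presumably use a similarity-search index rather than sorting every unlabeled point.

```python
import math


def entropy(probs):
    """Shannon entropy of a class distribution (higher means more uncertain)."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def squared_distance(a, b):
    """Squared Euclidean distance between two embedding vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_neighbor_pool(embeddings, labeled, k):
    """SEALS-style candidate pool (assumed form): the k nearest unlabeled
    neighbors of each labeled example, instead of all unlabeled data."""
    labeled_set = set(labeled)
    unlabeled = [i for i in range(len(embeddings)) if i not in labeled_set]
    pool = set()
    for j in labeled:
        # Brute-force ranking for illustration only.
        ranked = sorted(
            unlabeled,
            key=lambda i: squared_distance(embeddings[i], embeddings[j]),
        )
        pool.update(ranked[:k])
    return pool


def select_batch(embeddings, labeled, proxy_predict, k, budget):
    """SVP-style scoring (assumed form): a cheap proxy model scores only the
    restricted pool, and the most uncertain examples are chosen for labeling."""
    pool = nearest_neighbor_pool(embeddings, labeled, k)
    ranked = sorted(
        pool,
        key=lambda i: entropy(proxy_predict(embeddings[i])),
        reverse=True,
    )
    return ranked[:budget]


def toy_proxy(x):
    """Hypothetical stand-in for a small proxy model: a fixed logistic score
    on the first embedding coordinate, returned as a two-class distribution."""
    p = 1.0 / (1.0 + math.exp(-x[0]))
    return [p, 1.0 - p]


if __name__ == "__main__":
    embeddings = [[0.0, 0.0], [0.1, 0.0], [5.0, 0.0],
                  [5.1, 0.0], [-4.0, 0.0], [0.2, 0.1]]
    print(select_batch(embeddings, labeled=[0], proxy_predict=toy_proxy,
                       k=2, budget=1))
```

In this toy setup only the two points closest to the single labeled example are ever scored, so the proxy runs on two candidates instead of five. That is the cost saving the abstract describes, on a very small scale.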