Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
History
Music
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts124/v4/b1/e7/03/b1e70326-4200-a297-ec08-bd8590545dc8/mza_12174766315714829581.jpg/600x600bb.jpg
Towards Data Science
The TDS team
130 episodes
5 days ago
Note: The TDS podcast's current run has ended. Researchers and business leaders at the forefront of the field unpack the most pressing questions around data science and AI.
Show more...
Technology
RSS
All content for Towards Data Science is the property of The TDS team and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Note: The TDS podcast's current run has ended. Researchers and business leaders at the forefront of the field unpack the most pressing questions around data science and AI.
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/production/podcast_uploaded_nologo400/473625/473625-1610835242571-7393225beb5b8.jpg
118. Angela Fan - Generating Wikipedia articles with AI
Towards Data Science
51 minutes 44 seconds
3 years ago
118. Angela Fan - Generating Wikipedia articles with AI

Generating well-referenced and accurate Wikipedia articles has always been an important problem: Wikipedia has essentially become the Internet's encyclopedia of record, and hundreds of millions of people use it do understand the world.

But over the last decade Wikipedia has also become a critical source of training data for data-hungry text generation models. As a result, any shortcomings in Wikipedia’s content are at risk of being amplified by the text generation tools of the future. If one type of topic or person is chronically under-represented in Wikipedia’s corpus, we can expect generative text models to mirror — or even amplify — that under-representation in their outputs.

Through that lens, the project of Wikipedia article generation is about much more than it seems — it’s quite literally about setting the scene for the language generation systems of the future, and empowering humans to guide those systems in more robust ways.

That’s why I wanted to talk to Meta AI researcher Angela Fan, whose latest project is focused on generating reliable, accurate, and structured Wikipedia articles. She joined me to talk about her work, the implications of high-quality long-form text generation, and the future of human/AI collaboration on this episode of the TDS podcast.

--- 

Intro music:

- Artist: Ron Gelinas

- Track Title: Daybreak Chill Blend (original mix)

- Link to Track: https://youtu.be/d8Y2sKIgFWc

---

Chapters:

  • 1:45 Journey into Meta AI
  • 5:45 Transition to Wikipedia
  • 11:30 How articles are generated
  • 18:00 Quality of text
  • 21:30 Accuracy metrics
  • 25:30 Risk of hallucinated facts
  • 30:45 Keeping up with changes
  • 36:15 UI/UX problems
  • 45:00 Technical cause of gender imbalance
  • 51:00 Wrap-up
Towards Data Science
Note: The TDS podcast's current run has ended. Researchers and business leaders at the forefront of the field unpack the most pressing questions around data science and AI.