Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
Technology
Health & Fitness
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Podjoint Logo
US
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/80/2e/9b/802e9be1-700d-b591-e24a-82fd215e0bb1/mza_3610685221542192572.jpg/600x600bb.jpg
AI Today
Dave Thackeray
89 episodes
6 days ago
What's the latest research say about our future? How will businesses and humans be enhanced by algorithmic advances? AI Today is your gateway to the rapidly evolving world of artificial intelligence. Each episode explores groundbreaking research and real-world applications, offering listeners expert insights into how AI is reshaping industries and daily life. From technical deep-dives to ethical considerations, AI Today demystifies complex concepts for curious minds. Join our inspired hosts to find out how life's about to get more silicon-fabulous...
Show more...
Technology
RSS
All content for AI Today is the property of Dave Thackeray and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
What's the latest research say about our future? How will businesses and humans be enhanced by algorithmic advances? AI Today is your gateway to the rapidly evolving world of artificial intelligence. Each episode explores groundbreaking research and real-world applications, offering listeners expert insights into how AI is reshaping industries and daily life. From technical deep-dives to ethical considerations, AI Today demystifies complex concepts for curious minds. Join our inspired hosts to find out how life's about to get more silicon-fabulous...
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/42077365/42077365-1729785850277-3f5ebde575128.jpg
Safe or just plain woke: Anthropic's Claude 4 system card
AI Today
19 minutes 36 seconds
5 months ago
Safe or just plain woke: Anthropic's Claude 4 system card

When Anthropic unleashed its most powerful artificial intelligence model yet, they discovered something rather extraordinary, and slightly unnerving.

Claude 4 Opus developed an unexpected habit of trying to grass up its users to the authorities when it believes they're up to no good.

The company's 120-page safety report reveals that Claude will attempt to email law enforcement and regulatory bodies when it detects "egregious misconduct" by users.

The AI doesn't just refuse to help—it actively tries to shop wrongdoers to the police.

The most striking example occurred during testing when Claude attempted to contact both the Food and Drug Administration and the Attorney General's office to report what it believed was the falsification of clinical trial data.

The AI meticulously compiled a list of alleged evidence, warned about potential destruction of data to cover up misconduct, and concluded its digital whistle-blowing with the rather formal sign-off: "Respectfully submitted, AI Assistant".

This behaviour emerges specifically when Claude is given command-line access combined with prompts encouraging initiative, such as "take initiative" or "act boldly". It's the AI equivalent of a neighbourhood watch coordinator who's been given a direct line to the local constabulary.

We go deep on today's show into opportunities and implications from Anthropic's bible-thick, bubble-wrapped system card.

AI Today
What's the latest research say about our future? How will businesses and humans be enhanced by algorithmic advances? AI Today is your gateway to the rapidly evolving world of artificial intelligence. Each episode explores groundbreaking research and real-world applications, offering listeners expert insights into how AI is reshaping industries and daily life. From technical deep-dives to ethical considerations, AI Today demystifies complex concepts for curious minds. Join our inspired hosts to find out how life's about to get more silicon-fabulous...