By Freedom Bunker

Anthropic’s Anti-Nuke AI Filter Sparks Debate Over Real Risks


Now, for some news on the lighter side… like “how to prevent machines from enabling nuclear armageddon.”

In August, Anthropic announced that its chatbot Claude would not — and could not — help anyone build a nuclear weapon. The company said it worked with the Department of Energy (DOE) and the National Nuclear Security Administration (NNSA) to ensure Claude couldn’t leak nuclear secrets, according to a new writeup from Wired.

Anthropic deployed Claude “in a Top Secret environment so that the NNSA could systematically test whether AI models could create or exacerbate nuclear risks,” says Marina Favaro, Anthropic’s head of National Security Policy & Partnerships. Using Amazon’s Top Secret cloud, the agencies “red-teamed” Claude and developed “a sophisticated filter for AI conversations.”

This “nuclear classifier” flags when chats drift toward dangerous territory using an NNSA list of “risk indicators, specific topics, and technical details.” Favaro says it “catches concerning conversations without flagging legitimate discussions about nuclear energy or medical isotopes.”

Wired writes that NNSA official Wendin Smith says AI “has profoundly shifted the national security space” and that the agency’s expertise “places us in a unique position to aid in the deployment of tools that guard against potential risk.”

But experts disagree on whether the risk even exists. “I don’t dismiss these concerns, I think they are worth taking seriously,” says Oliver Stephenson of the Federation of American Scientists. “I don’t think the models in their current iteration are incredibly worrying … but we don’t know where they’ll be in five years.”

He warns that secrecy makes it hard to judge the system’s impact. “When Anthropic puts out stuff like this, I’d like to see them talking in a little more detail about the risk model they’re really worried about,” he says.

Others are more skeptical. “If the NNSA probed a model which was not trained on sensitive nuclear material, then their results are not an indication that their probing prompts were comprehensive,” says Heidy Khlaaf, chief AI scientist at the AI Now Institute. She calls the project “quite insufficient” and says it “relies on an unsubstantiated assumption that Anthropic’s models will produce emergent nuclear capabilities … not aligned with the available science.”

Anthropic disagrees. “A lot of our safety work is focused on proactively building safety systems that can identify future risks and mitigate against them,” a spokesperson says. “This classifier is an example of that.”

Khlaaf also questions giving private firms access to government data. “Do we want these private corporations that are largely unregulated to have access to that incredibly sensitive national security data?” she asks.

Anthropic says its goal isn’t to enable nuclear work but to prevent it. “In our ideal world, this becomes a voluntary industry standard,” Favaro says. “A shared safety practice that everyone adopts.”

Tyler Durden Wed, 10/22/2025 – 23:00


Source: https://freedombunker.com/2025/10/22/anthropics-anti-nuke-ai-filter-sparks-debate-over-real-risks/







