News

Harmful, abusive interactions plague AI chatbots. Researchers have found that AI companions like Character.AI, Nomi, and ...
Anthropic has introduced a new feature in its Claude Opus 4 and 4.1 models that allows the AI to choose to end certain ...
Amazon.com-backed Anthropic said on Tuesday it will offer its Claude AI model to the U.S. government for $1, joining a ...
Anthropic has given its chatbot, Claude, the ability to end conversations it deems harmful. You likely won't encounter the ...
The company has given its AI chatbot the ability to end toxic conversations as part of its broader 'model welfare' initiative ...
Anthropic says the conversations make Claude show ‘apparent distress.’ ...
Notably, Anthropic is also offering two different takes on the feature through Claude Code. First, there's an "Explanatory" ...
Anthropic have given the ability to end potentially harmful or dangerous conversations with users to Claude, its AI chatbot.
Anthropic rolled out a feature letting its AI assistant terminate chats with abusive users, citing "AI welfare" concerns and ...
However, Anthropic also backtracks on its blanket ban on generating all types of lobbying or campaign content to allow for ...
"Claude has left the chat" – Anthropic's AI chatbot can end conversations permanently. For its own good.
Claude won't stick around for toxic convos. Anthropic says its AI can now end extreme chats when users push too far.