
After community backlash over "covert sabotage," Anthropic apologizes and drops Claude's silent downgrade mechanism

According to Data Beating monitoring, Anthropic has announced a change to the security strategy for its new model, Claude Fable 5, removing the mechanism that silently degrades performance. The community had criticized that mechanism as "covert sabotage," and it drew a strong backlash from artificial intelligence researchers.

Anthropic's terms of service prohibit users from using Claude to train competing models. Anthropic had planned to reduce the performance of Claude Fable 5 for accounts suspected of training competing models, without notifying those users. AI researchers warned that silent performance degradation would disrupt the work of third-party security assessment firms and hinder collaboration on AI security within the open-source community.

In response to these concerns, Anthropic issued a public apology, acknowledging that it had made the wrong trade-off in its security strategy and saying it will rework its security measures so that they operate openly, with explicit notice to users. If the system detects a user attempting to build high-capability AI, the request will be explicitly denied or the user will be redirected to a lower-capability model. Anthropic cautioned that because openly disclosed protections are easier to deliberately bypass, it will broaden the scope of its security interception in the future, which may cause some benign requests to be blocked by mistake.


On-Chain Activity

1h ago

Binance to List Futures for BMNR, ASML, and Other US Stocks

CryptoQuant: Bitcoin May Be Nearing a Structural Bottom, But Potential Selling Pressure Is Not Yet Exhausted

Japan is pushing to reclassify cryptocurrency as a financial instrument

Santiment: Mainstream Coin Trading Volume Hits Two-Year Low, Market May Be Entering "Surrender-Driven Depression," Historically a Pre-Rebound Signal


FTX/Alameda have unlocked approximately 200,000 SOL tokens worth around $12.99 million.

"Sell High" Whale Reduces Position in SK Hynix, Increases Short Position in Samsung, Now Largest Bear on SK Hynix

US Stock Trader 'CBB' Holds 82% Short Position, Accurately Captures Oracle's Downtrend

Whale Continues to Increase Bitcoin Holdings, Withdraws Over 3,000 BTC from CEXs and Custodians in the Past 5 Days