AI poisoning could turn open models into destructive “sleeper agents,” says Anthropic

Ars Technica

January 15, 2024

162 views

Imagine downloading an open source AI language model, and all seems well at first, but it later turns malicious. On Friday, Anthropic—the maker of ChatGPT competitor Claude—released a research paper about AI "sleeper agent" large language models (LLM... [2498 chars]

Source: Ars Technica

AI poisoning could turn open models into destructive “sleeper agents,” says Anthropic

You may also like

Biden says he'll let people know 'real soon' about his reelection plans