Red Team Security - Search News

Red teaming LLMs exposes a harsh truth about the AI security arms race

Unrelenting, persistent attacks on frontier models make them fail, with the patterns of failure varying by model and developer. Red teaming shows that it’s not the sophisticated, complex attacks that ...

VentureBeat

How OpenAI's red team made ChatGPT agent into an AI fortress

In case you missed it, OpenAI yesterday debuted a powerful new feature for ChatGPT and with it, a host of new security risks and ramifications. Called the "ChatGPT agent," this new feature is an ...

AOL

Inside the Anthropic ‘Red Team’ tasked with breaking its AI models—and burnishing the company’s reputation for safety

Last month, at the 33rd annual DEF CON, the world’s largest hacker convention, in Las Vegas, Anthropic researcher Keane Lucas took the stage. A former U.S. Air Force captain with a PhD in electrical ...
