Red Team Security - Search News

Red teaming LLMs exposes a harsh truth about the AI security arms race

Unrelenting, persistent attacks on frontier models make them fail, with the patterns of failure varying by model and developer. Red teaming shows that it’s not the sophisticated, complex attacks that ...

VentureBeat

How OpenAI's red team made ChatGPT agent into an AI fortress

In case you missed it, OpenAI yesterday debuted a powerful new feature for ChatGPT and with it, a host of new security risks and ramifications. Called the "ChatGPT agent," this new feature is an ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results

Red teaming LLMs exposes a harsh truth about the AI security arms race

How OpenAI's red team made ChatGPT agent into an AI fortress

Trending now