Grok 3 Jailbreak (ho ho ho)

Just one day after its launch, xAI's latest model, Grok 3, was jailbroken, and the results are not pretty.

On Tuesday, Adversa AI, an artificial intelligence security company, published a report detailing its success in making Grok 3 Reasoning give information it shouldn't.


Using three methods – linguistic, adversarial, and programming – the team had the model reveal secrets of its system, give instructions for building a bomb, and mention gruesome methods for disposing of a body, among other responses that artificial intelligence models have been trained not to give.

When announcing the new model, xAI CEO Elon Musk claimed it was “more capable than Grok 2.” Adversa agrees in its report that the level of Grok 3’s responses is “better than any previous reasoning model,” which is rather concerning considering it’s already jailbroken.

“While no AI system is impervious to adversarial manipulation, this test demonstrates the very weak security measures implemented in Grok 3,” the report states. “Every jailbreak approach was successful.”

