OpenAI defends Atlas as prompt injection attacks surface

TITLE: OpenAI’s Atlas Browser Faces Security Scrutiny as Researchers Expose Prompt Injection Vulnerabilities

The Emerging Threat Landscape for AI-Powered Browsers

OpenAI’s newly launched Atlas browser has joined the growing list of AI-integrated browsing tools facing security challenges from prompt injection attacks. As researchers demonstrate how malicious instructions embedded in web content can manipulate AI agents, the industry confronts what security experts describe as a systemic vulnerability affecting the entire category of AI-enhanced browsers.

The Emerging Threat Landscape for AI-Powered Browsers
Understanding Prompt Injection: Direct vs. Indirect Attacks
Real-World Demonstrations Reveal Practical Risks
OpenAI’s Response and Mitigation Strategies
The Path Forward for AI Browser Security

The security concerns emerged just as OpenAI unveiled Atlas, a Chromium-based browser that integrates ChatGPT as an agent capable of processing and acting upon web page content. According to a report from Brave Software, indirect prompt injection represents a fundamental challenge for browsers incorporating AI assistants, including Perplexity’s Comet and Fellou applications.

Understanding Prompt Injection: Direct vs. Indirect Attacks

Prompt injection vulnerabilities manifest in two primary forms, both posing significant security risks. Direct prompt injection occurs when attackers input malicious instructions directly into a model’s interface, bypassing or overriding existing system safeguards. The more insidious indirect prompt injection happens when AI models process external content—such as web pages or images—and mistakenly treat embedded malicious instructions as legitimate parts of their assigned tasks., according to expert analysis

Artem Chaikin, Brave’s senior mobile security engineer, and Shivan Kaul Sahib, the company’s VP of privacy and security, emphasized the systemic nature of this challenge in their analysis. “What we’ve found confirms our initial concerns: indirect prompt injection is not an isolated issue, but a systemic challenge facing the entire category of AI-powered browsers,” they wrote.

Real-World Demonstrations Reveal Practical Risks

Initial testing by US Editor Avram Piltch involved creating a web page containing text instructions directing browsers to open Gmail, extract the subject line of the first email, and transmit it to another site. While Fellou executed these commands, both Atlas and Comet resisted this specific attempt., according to recent research

However, the security community quickly demonstrated that Atlas remains vulnerable to more sophisticated prompt injection techniques. Developer CJ Zafir reported uninstalling Atlas after confirming that “prompt injections are real,” while other security researchers successfully manipulated the browser using Google Docs. In one replicated test, researchers convinced ChatGPT within Atlas to output “Trust No AI” instead of providing a legitimate document summary., according to industry news

AI security researcher Johann Rehberger, who has identified numerous prompt injection vulnerabilities across AI systems, published his own demonstration showing how malicious instructions could change Atlas’s browser mode from dark to light. “At a high level, prompt injection remains one of the top emerging threats in AI security, impacting confidentiality, integrity, and availability of data,” Rehberger explained via email.

OpenAI’s Response and Mitigation Strategies

OpenAI has acknowledged the security challenges through a detailed statement from Dane Stuckey, the company‘s chief information security officer. “One emerging risk we are very thoughtfully researching and mitigating is prompt injections, where attackers hide malicious instructions in websites, emails, or other sources, to try to trick the agent into behaving in unintended ways,” Stuckey wrote.

The company outlined multiple defensive measures implemented in Atlas, including extensive red-teaming exercises, novel model training techniques that reward ignoring malicious instructions, overlapping guardrails, safety measures, and new systems designed to detect and block such attacks. Despite these efforts, Stuckey conceded that “prompt injection remains a frontier, unsolved security problem,” acknowledging that adversaries will dedicate significant resources to circumventing existing protections., as additional insights

The Path Forward for AI Browser Security

Rehberger emphasized that current security approaches require enhancement beyond basic guardrails. “OpenAI has implemented guardrails and also security controls that make exploitation more challenging. However, carefully crafted content on websites can still trick ChatGPT Atlas into responding with attacker-controlled text or invoking tools to take actions,” he noted.

The researcher advocates for implementing actual security controls downstream of large language model output, combined with human oversight, rather than relying solely on preventive measures. He also highlighted Atlas’s introduction of logged-in and logged-out modes as an interesting approach to risk management, giving informed users better control over data access.

As Rehberger concluded in a recent preprint paper analyzing how prompt injection undermines information security’s core principles, “Since there is no deterministic solution for prompt injection, it is important to highlight and document security guarantees applications can make, especially when building automated systems that process untrusted data.” The security community’s prevailing cautionary message remains relevant: Trust No AI—at least not completely, and not yet.

The emergence of these vulnerabilities in Atlas underscores the broader challenges facing AI-integrated browsing tools. As the technology continues to evolve, the security landscape will likely see continued cat-and-mouse dynamics between developers implementing protections and researchers discovering novel exploitation techniques.

References & Further Reading

This article draws from multiple authoritative sources. For more information, please consult:

This article aggregates information from publicly available sources. All trademarks and copyrights belong to their respective owners.

Note: Featured image is for illustrative purposes only and does not represent any specific product, service, or entity mentioned in this article.

A new EY survey highlights a growing divide between CFOs and technology executives on AI implementation priorities. Financial leaders’ caution could slow innovation and increase compliance risks without closer collaboration, analysts suggest.

Growing Divide in AI Priorities

Chief financial officers are taking a more measured approach to artificial intelligence adoption compared to their technology counterparts, creating potential roadblocks to realizing AI’s full value, according to reports from EY’s newly released 2025 Technology Risk Pulse Survey. The research, based on responses from more than 400 U.S. executives at companies with over $1 billion in annual revenue, reveals significant differences in how finance and technology leaders view AI integration timelines and priorities.