OpenAI Quietly Builds Restricted Cybersecurity AI As Industry Locks Down Access
OpenAI is moving deeper into cybersecurity, but not in a way the public will easily see.
A new report suggests the company is preparing an advanced AI model designed for cyber operations, with access limited to a small circle of vetted organisations rather than a broad release.
Trusted Access Programme Signals Shift To Controlled AI Rollouts
The model is expected to sit within OpenAI’s “Trusted Access for Cyber” programme, first introduced in February.
The initiative was designed to keep powerful systems out of general circulation and place them directly in the hands of defensive security teams.
Participants are supported with $10 million in API credits, alongside access to tools such as GPT-5.3-Codex, currently OpenAI’s most capable cybersecurity model.
While details remain limited, the restricted rollout reflects a deliberate decision to prioritise control over scale, particularly as AI systems grow more capable of identifying and exploiting vulnerabilities.
Anthropic’s Mythos Forces Industry To Rethink Openness
The timing is not coincidental.
Earlier this week, Anthropic revealed its Claude Mythos Preview model, describing it as capable of discovering “tens of thousands of vulnerabilities” across major operating systems and browsers.
The model reportedly identified zero-day flaws with a level of autonomy comparable to experienced human researchers, raising immediate concerns about misuse.
Anthropic responded by limiting access through its Project Glasswing programme, distributing the model only to a tightly selected group of companies including major cloud providers, chipmakers, and cybersecurity firms.
More than 40 organisations tied to critical infrastructure were granted controlled access under strict conditions.
How Powerful Is Too Powerful For Public Release?
Internal testing results added to the unease.
Mythos Preview uncovered previously unknown flaws in widely used software, including a 27-year-old vulnerability in C code and an issue in FFmpeg that had escaped millions of prior automated checks.
Anthropic stressed that the model was not specifically trained for security tasks, meaning its capabilities emerged from broader improvements in reasoning and code understanding.
That dual-use nature, the ability to both fix and exploit weaknesses, has become a central concern across the industry.
Security Benchmarks Struggle To Keep Up With AI Progress
The rapid leap in capability is also exposing limits in existing safety frameworks.
Anthropic acknowledged that Cybench, a benchmark used to assess cyber risk in AI systems, is “no longer sufficiently informative of current frontier model capabilities”.
The company added that safety evaluations now involve “judgment calls” and carry “more fundamental uncertainty”, signalling that established measurement tools are falling behind the technology they are meant to assess.
Regulators Increase Pressure As Risks Expand
Government scrutiny is rising alongside these developments.
Federal agencies have intensified their focus on AI safety protocols since early April, while Anthropic is already facing pressure after the Pentagon reportedly flagged it as a supply chain risk over restrictions tied to surveillance and weapons-related use cases.
Security experts and former officials have also warned that sufficiently advanced AI systems could be used to disrupt essential infrastructure, including power grids, water systems, and financial networks.
AI Cyber Tools Begin To Resemble Classified Technology
Against this backdrop, OpenAI’s decision to restrict access appears as much about positioning as precaution.
By limiting distribution early, the company signals alignment with regulators and distances itself from the risks of uncontrolled deployment.
At the same time, it reflects a broader shift in how frontier AI is being released.
Instead of public launches, the most capable systems are increasingly handled like sensitive assets — shared selectively, governed by agreements, and reserved for organisations judged capable of managing their risks.