美國出口管制解除,Anthropic 於7月1日重新部署 Claude Fable 5,並新增網路安全分類器
重點摘要
Anthropic 重新部署其最強大的通用模型 Claude Fable 5。該公司於6月30日宣佈美國出口管制已解除,這項管制先前涵蓋 Claude Fable 5 與 Claude Mythos 5。Fable 5 已於7月1日(週三)重新向全球用戶開放,而 Mythos 5 則恢復提供給特定美國組織。這些模型於6月12日被下架,原因是美國政府指令限制其僅供非外國籍人士使用,但 Anthropic 無法即時驗證用戶國籍,因此暫停所有人使用。本文說明觸發封鎖的原因,涵蓋新的安全防護機制與提出的越獄框架,並比較 Fable 5 與 GLM-5.2 等競爭對手的表現。快速事實:模型為 Claude Fable 5(一種經安全處理的 Mythos 級模型),事件為7月1日重新部署。
Anthropic is redeploying Claude Fable 5, its most capable generally available model. On June 30, it announced that US export controls had lifted. The controls had covered Claude Fable 5 and Claude Mythos 5. Fable 5 returned to users globally on Wednesday, July 1. Mythos 5 access is restored to a set of US organizations. The models were pulled on June 12. A US government directive restricted them to non-foreign-nationals. Anthropic could not verify nationality in real time. So it suspended both models for everyone. This article explains what triggered the block. It covers the new safeguard and the proposed jailbreak framework. It also shows how Fable 5 compares to rivals like GLM-5.2. Quick facts Model: Claude Fable 5 (a Mythos-class model made safe for general use) Event: Redeployed July 1, 2026 after export controls lifted Reason for pause: An Amazon report on a safeguard bypass Fix: A new safety classifier that blocks the reported technique Pricing: $10 per million input tokens, $50 per million output tokens Where: Claude Platform, Claude.ai, Claude Code, Claude Cowork What happened: the timeline Anthropic launched Fable 5 and Mythos 5 on June 9. Both share the same underlying model. Fable 5 ships with strong safeguards for general use. Mythos 5 has some safeguards lifted for defensive cybersecurity partners. On June 12, the US government applied export controls. The order took effect immediately. Anthropic suspended access rather than risk non-compliance. The trigger was a report from Amazon researchers. They found a method of bypassing Fable 5’s safeguards. The prompt made the model identify a number of software vulnerabilities. In one case, it produced code showing how to exploit one vulnerability. By June 26, the government approved restoring Mythos 5 for some US organizations. On June 30, the controls were fully lifted. Why Anthropic says the finding was not unique Anthropic tested whether the finding was unique to Fable 5. It was not. Less capable models identified the same vulnerabilities. That list includes Claude Opus 4.8, GPT-5.5, and Kimi K2.7. For the single exploit demonstration, every tested model reproduced it. That set included Haiku 4.5, Sonnet 4.6, Opus 4.6, and Opus 4.7. It also covered Opus 4.8, GPT-5.4, GPT-5.5, and Kimi K2.7. The Anthropic team states the technique exposed no unique Mythos-level cyber capabilities. It called the case a borderline one for Fable 5’s safeguards. The blocked behavior involved only routine defensive cybersecurity work. How the new classifier works Anthropic still moved to close the gap. It trained an improved safety classifier for the reported behavior. The classifier blocks the specific technique in over 99% of cases. Blocked requests are not refused outright. They are routed to Claude Opus 4.8 instead. Users are notified when this fallback happens. Researchers from the Department of Commerce’s CAISI tested both old and new safeguards. They agree the safeguards are extraordinarily strong. The tradeoff is more false positives during routine coding and debugging. This reflects Anthropic’s ‘defense in depth’ design. Classifiers are smaller AI systems that detect harmful cyber tasks. A deliberate ‘safety margin’ also blocks some benign requests. Fable 5 uses a much larger safety margin than prior models. The proposed jailbreak severity framework The episode exposed a gap. The industry has no shared standard for scoring a ‘jailbreak,’ a technique that bypasses a model’s safeguards. Anthropic is drafting one with Amazon, Microsoft, Google, and other Glasswing partners. The draft scores a jailbreak on four criteria: Capability gain — how far beyond existing tools it takes the user. Breadth of capability gain — how many distinct offensive tasks it unlocks. Ease of weaponization — how much human effort an attack still needs. Discoverability — how easily someone can obtain the technique. For the most severe class, Anthropic will deploy preliminary mitigations immediately. It is also standing up 24/7 monitoring of jailbreak submission channels. Interactive scorer Try this embedded interactive scorer to see how these four criteria combine. (function(){ window.addEventListener("message", function(e){ if(e.data && e.data.type === "mtp-embed-height"){ var f = document.getElementById("mtp-fable5-scorer"); if(f) f.style.height = e.data.height + "px"; } }); })(); Use cases with examples Fable 5 targets long-horizon, agentic work. Here is where early engineers can apply it. Codebase migrations: Stripe reported a codebase-wide migration in one day. The job spanned a 50-million-line Ruby codebase. Doing it by hand would take a team over two months. Financial analysis: On Hebbia‘s Finance Benchmark, Fable 5 posts the highest score. It gains on chart, table, and document reasoning. Vision-to-code: Fable 5 can rebuild a web app’s source code from screenshots alone. Long-running agents: File-based memory helps it stay focused across millions of tokens. How Fable 5 compares The pause created an opening for rivals. Days after the suspension, Zhipu AI released GLM-5.2 as open weights. Independent testers rank it the strongest openly available model. ModelDeveloperAccessContextPrice (in/out per 1M)Reported benchmarkCyber safeguardsClaude Fable 5AnthropicGeneral (Platform, .ai, Code, Cowork)Long-context$10 / $50Led AA-Briefcase at 1587 EloStrongest applied; falls back to Opus 4.8Claude Mythos 5AnthropicGlasswing / trusted US orgsLong-context$10 / $50Same base model as Fable 5Cyber safeguards liftedClaude Opus 4.8AnthropicGeneralLong-context~$5 / $25SWE-bench Pro 69.2; Terminal-Bench 85.0StandardGLM-5.2Zhipu AI (Z.ai)Open weights (MIT)1M tokens~$1.40 / $4.40SWE-bench Pro 62.1; Terminal-Bench 81.0None (open weight)GPT-5.5OpenAIGeneralLong-context~$5 / $30SWE-bench Pro 58.6StandardBenchmark and price figures are self-reported or from independent testers. Sources: felloai, Latent Space, TrendingTopics. GLM-5.2 uses a Mixture-of-Experts design. It has roughly 750 billion total parameters. Only about 40 billion activate per token. On Semgrep‘s IDOR benchmark, it scored 39% F1. That beat Claude Code at 32% on the same prompt. The gap narrows on cost. On AA-Briefcase, Fable 5 averaged $31 per task. GLM-5.2 averaged $2.40. Access and a quick API example For Pro, Max, Team, and select Enterprise plans, Fable 5 is included through July 7. It covers up to 50% of weekly usage limits. After that, access moves to usage credits. Anthropic is also re-enabling Fable 5 on AWS, Google Cloud, and Microsoft Foundry. Developers call the model with the claude-fable-5 string: Copy CodeCopiedUse a different Browserfrom anthropic import Anthropic # Reads your key from the ANTHROPIC_API_KEY environment variable client = Anthropic() message = client.messages.create( model="claude-fable-5", max_tokens=1024, messages=[ {"role": "user", "content": "Refactor this module for readability."} ], ) print(message.content) If a classifier fires, the response comes from Opus 4.8. Your code path stays the same. Key takeaways Fable 5 returns July 1 after export controls were lifted. A new classifier blocks the reported bypass in over 99% of cases. Blocked requests route to Opus 4.8, not an outright refusal. Anthropic proposes a four-criteria framework for scoring jailbreaks. GLM-5.2 emerged as a cheaper open-weight rival during the pause. Check out the Technical details. Also, feel free to follow us on Twitter and don’t forget to join our 150k+ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well. Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.? Connect with us The post Anthropic Redeploys Claude Fable 5 on July 1 After US Export Controls Lift, Adds New Cybersecurity Classifier appeared first on MarkTechPost.
Related
相關文章

谷歌貼身對標字節:最新輕量版 Nano Banana 2 四秒出圖,單張僅 0.034 美元,還能直出視頻
谷歌最新發布輕量版 Nano Banana 2,能在四秒內生成圖片,單張成本僅 0.034 美元,價格低於字節跳動的競品。該模型在出圖品質與延遲上實現雙重反超,並具備直接生成影片的功能。

時隔4個月,谷歌Nano Banana再發新模型
這篇消息聚焦「時隔4個月,谷歌Nano Banana再發新模型」。原始導語提到:4秒生成1K圖片,僅需0.23元。 從 AI 情報角度來看,這類內容值得關注其背後的技術進展、產品落地、產業競爭與後續市場影響。

美團開源萬億參數大模型 LongCat-2.0,五萬卡國產算力全流程訓練
美團開源了萬億參數的大模型 LongCat-2.0,該模型採用五萬張國產算力卡進行全流程訓練。此模型特別強調 Agentic Coding 能力,即具備自主編碼與任務執行的特性。
203.8億!AI視頻最大單筆融資,即將誕生
智東西 作者 | 江宇 編輯 | 雲鵬 智東西7月1日消息,《南華早報》援引知情人士消息稱,快手旗下視頻生成業務“可靈AI”將完成一輪30億美元(約合人民幣203.8億元)融資,投後估值將達到180億美元(約合人民幣1223億元),較今年4月最初設定的200億美元目標估值縮水20億美元。騰訊參與了可靈AI本輪融資。 報道稱,隨著中國AI視頻生成賽道競爭加劇,可靈AI正在加速推進獨立融資和後續上市安排。一位知情人士稱,可靈AI最新估值較此前目標有所下調,反映出市場對可靈AI估值預期的變化。 快手此前已經披露過相關動作。5月12日,快手在港交所公告稱,其正在評估一項關於可靈AI業務的重組建議,該重組可能涉及引入外部投資者融資。 上市方面,一位接近交易的人士稱,快手預計將在未來12個月內啟動可靈AI赴港上市流程,IPO募集資金將用於算力和數據中心建設,以及人才引進與留存。 可靈AI是快手在AI視頻生成領域的核心業務。 快手在今年3月曾宣佈,“可靈AI”基於1月份收入計算的年化收入已超過3億美元(約合人民幣122.34億元),並預計該業務2026年的全年收入將翻倍。一位知情人士透露,今年第一季度,“可靈AI”收入為7500萬美元(約合人民幣5.09億元),其中大部分來自包括北美在內的海外市場。 截至港股收盤,快手股價報41.6港元(約合人民幣36元),上漲1.46%,總市值約1799.99億港元(約合人民幣1559.2億元)。“可靈AI”當前180億美元的估值,大約相當於快手市值的78%。 ▲快手股價(圖源:騰訊自選股) 結語:可靈AI搶跑AI視頻IPO 從今年5月傳出分拆融資,到如今接近完成30億美元融資,可靈AI的資本化進程正在進一步明確。雖然最新估值較此前200億美元目標有所下調,但180億美元投後估值仍意味著,可靈AI已經成為國內AI視頻生成賽道估值最高的獨立化項目之一。

OpenAI:ChatGPT 全球採用率持續擴大,非英語用戶佔比過半
OpenAI 最新報告顯示,ChatGPT 全球採用率持續擴大,非英語用戶佔比已超過一半。用戶地域分佈更廣,非洲和亞洲增速最快,35 歲以上用戶群體也在擴大。工作用途中,視覺設計任務增長最快。#ChatGPT##AI#
物理 AI 到來,移動網絡該如何進化?
過去一年,幾乎所有人都在追問同一個問題: AI 大模型還能變得多聰明?Agent、多模態、端側部署,構成了 AI 產業最熱鬧的敘事。但當 AI 開始進入眼鏡、汽車、機器人、攝像頭這些物理載體,開始在道路、工廠、家庭和人類交互之後,我們發現,現實世界和 AI 大模型之間,還有最後一公里的路沒有打通。AI 眼鏡要識別眼前的文字,Robotaxi 要判斷周圍路況,機器人要決定下一步動作。它們依賴的不只是算法,也包括圖像、語音、位置、空間關係、設備狀態和環境變化等數據。這些數據需要先由終端採集,再通過網絡上傳,交給雲端或邊緣側模型處理。模型輸出的結果,也要通過網絡返回終端,才能轉化現實中的業務動作。這使得移動網絡在 AI 時代承擔了新的壓力。過去,網絡波動可能只是讓視頻緩衝幾秒;未來,同樣的波動可能導致實時識別失敗、遠程控制遲滯,或者智能體無法完成動作。移動網絡不再只是單純的內容分發通道。它需要識別連接對象,理解業務類型,判斷實時性要求,也要在資源緊張時決定哪些連接需要優先保障。物理 AI 時代的網絡,不能只是一條“更寬的管道”。移動網絡的下一批用戶,不只是人類很長一段時間裡,移動通信網絡服務的核心對象是人。在語音時代,網絡解決的是通話連接,讓人在移動狀態下完成實時溝通。進入 3G、4G 時代後,智能手機和在線內容快速普及,移動網絡的主要任務轉向大帶寬數據傳輸。網頁、圖片、音樂、視頻、直播、遊戲和在線會議,成為了移動互聯網時代最典型的業務形態。智能手機也在這個過程中變成了高度集成的入口。過去由獨立設備完成的功能,例如 GPS、音樂播放器、相機、錄音機和部分辦公工具,逐漸被手機吸收到一個終端裡。移動互聯網時代的一個基本特徵,是終端形態向手機集中,人則是最主要的連接主體。AI 的到來改變了這個前提。大模型要理解現實世界,首先需要獲得更豐富、更連續的現實輸入。圖像、語音、位置、動作、空間