Latest AI News

The most comprehensive AI news feed on the internet -- curated by Matt Wolfe

*News may update slower on weekends and when Matt's traveling

Get This In Your Inbox Twice a Week

Matt's picksPaywalled
Feb 22 – 28, 2026

Saturday, February 28, 2026

Sat, Feb 28, 2026·wsj.com·Indirect summary

The Pentagon has designated Anthropic a supply-chain risk to national security, effectively banning the Claude AI maker from U.S. military contracts. Defense Secretary Pete Hegseth announced the move after months of tense negotiations over whether the Pentagon could use Claude for any lawful purpose, including autonomous weapons. Anthropic CEO Dario Amodei rejected the label, calling it legally unsound and vowing to challenge it in court. OpenAI and Elon Musk's xAI have since secured Pentagon deals to fill the gap on classified networks.

Sat, Feb 28, 2026·openai.com

OpenAI has signed a classified AI deployment agreement with the U.S. Department of War, establishing three core red lines: no mass domestic surveillance, no directing autonomous weapons, and no automated high-stakes decisions. The cloud-only deployment retains OpenAI's full safety stack, with cleared OpenAI engineers and safety researchers in the loop. Contract language locks in current surveillance and weapons laws even if future policy changes. OpenAI also requested the same terms be offered to all AI labs, including Anthropic.

Friday, February 27, 2026

Fri, Feb 27, 2026·anthropic.com

Anthropic responded after Secretary of War Pete Hegseth announced plans to designate the company a supply chain risk, a label historically reserved for US adversaries and never publicly applied to an American company. The dispute stems from Anthropic's refusal to allow Claude to be used for mass domestic surveillance of Americans or fully autonomous weapons. Anthropic says these two exceptions have not affected any government mission and will challenge any designation in court. The company has supported US classified networks since June 2024.

Fri, Feb 27, 2026·openai.com

OpenAI is updating ChatGPT's mental health safeguards for its 900 million weekly users, including a forthcoming trusted contact feature letting adults designate someone to receive notifications when they may need support. New evaluation methods simulating extended mental health conversations will improve distress detection. On the legal front, multiple mental health-related lawsuits involving ChatGPT have been consolidated into a single California proceeding, with plaintiffs' attorneys planning to file additional cases.

Fri, Feb 27, 2026·wsj.com·Indirect summary

Anthropic has become embroiled in a high-profile standoff with the Pentagon after refusing to allow its Claude chatbot to be used for domestic mass surveillance or autonomous lethal weapons without human oversight. Defense Secretary Pete Hegseth accused Anthropic of arrogance and formally designated it a supply-chain risk, the first time an American company has received that label. OpenAI meanwhile struck its own DoD deal, drawing criticism from Anthropic CEO Dario Amodei. Amodei has since reportedly reopened negotiations while pledging to challenge the designation in court.

Fri, Feb 27, 2026·Ilya Sutskever on X·Indirect summary

Ilya Sutskever Praises OpenAI and Anthropic for Holding Firm on Safety Stance

Fri, Feb 27, 2026·theverge.com

Jack Dorsey's Block, the fintech company behind Square and Cash App, is cutting nearly half its workforce, eliminating over 4,000 jobs and shrinking from more than 10,000 employees to under 6,000. Announced with Q4 2025 earnings, Dorsey cited AI as the sole driver, saying intelligence tools paired with smaller, flatter teams enable a fundamentally new way of working. He stressed the business remains profitable and growing, but chose swift action over gradual cuts, aiming to build Block into a smaller, faster, intelligence-native company.

Fri, Feb 27, 2026·openai.com

OpenAI announced a $110B investment round at a $730B pre-money valuation, with $30B from SoftBank, $30B from NVIDIA, and $50B from Amazon. The round includes a multi-year strategic partnership with Amazon and expanded NVIDIA collaboration covering 3 GW of dedicated inference capacity and 2 GW of training on Vera Rubin systems. ChatGPT now has 900M weekly active users, 50M consumer subscribers, and 9M paying business users. Weekly Codex users tripled to 1.6M since early 2026. The OpenAI Foundation's stake now exceeds $180B.

Fri, Feb 27, 2026·blogs.microsoft.com

Microsoft and OpenAI issued a joint statement confirming their partnership terms are unchanged following OpenAI's new funding and partner announcements, including a deal with Amazon. Microsoft retains its exclusive IP license across OpenAI models, Azure remains the exclusive cloud provider for stateless OpenAI APIs including those from third-party deals, OpenAI's first-party products stay hosted on Azure, the revenue-sharing arrangement is unchanged, and the AGI contractual definition and determination process remain the same.

Fri, Feb 27, 2026·openai.com

OpenAI and Amazon have announced a multi-year strategic partnership, with Amazon investing $50 billion in OpenAI, beginning with $15 billion upfront. The two companies will jointly develop a Stateful Runtime Environment powered by OpenAI models, available through Amazon Bedrock, enabling AI agents to retain context and work across tools at production scale. AWS becomes the exclusive third-party cloud distributor for OpenAI Frontier, its enterprise agent platform. OpenAI will also consume approximately 2 gigawatts of AWS Trainium capacity, expanding their existing agreement by $100 billion over eight years.

Thursday, February 26, 2026

Thu, Feb 26, 2026·openai.com

OpenAI and the U.S. Department of Energy's Pacific Northwest National Laboratory have launched DraftNEPABench, a benchmark assessing AI coding agents on federal environmental permitting tasks. Using OpenAI's Codex CLI and GPT-5, the tool was evaluated by 19 subject matter experts across 102 tasks spanning 18 federal agencies. Results show AI agents could reduce NEPA document drafting time by 1 to 5 hours per subsection, roughly a 15% reduction. OpenAI aims to help cut average federal infrastructure approval times from months to weeks.

Thu, Feb 26, 2026·openai.com

OpenAI and Figma have launched a new Codex-to-Figma integration using the Figma MCP Server, enabling a roundtrip workflow between code and design. Builders can generate Figma designs from Codex and convert UI from code into editable Figma files across Figma Design, Figma Make, and FigJam. The integration softens the boundary between engineering and design roles. Codex, which launched as a CLI in April 2025, now has over 1 million weekly users with usage up 400% this year. Enterprise adopters include Cisco, NVIDIA, Ramp, and Datadog.

Thu, Feb 26, 2026·anthropic.com

Anthropic CEO Dario Amodei says the company will not remove AI safeguards for mass domestic surveillance or fully autonomous weapons, despite threats from the Department of War. The DoW threatened to remove Anthropic from its systems, label it a supply chain risk, and invoke the Defense Production Act to force safeguard removal. Anthropic, the first frontier AI company to deploy models on classified government networks, argues mass surveillance undermines democratic values and that today's AI is too unreliable for fully autonomous weapons.

Thu, Feb 26, 2026·microsoft.com

Microsoft has launched Copilot Tasks, a new feature that shifts its AI assistant from answering questions to completing real-world actions. Designed for everyday users rather than just developers or enterprises, Copilot Tasks can handle recurring tasks, document generation, shopping, appointments, and logistics by operating its own browser and coordinating across apps. It asks for consent before taking significant actions like spending money or sending messages. A research preview is now live for a small group, with a broader rollout planned in coming weeks.

Thu, Feb 26, 2026·appleinsider.com

Apple has released Xcode 26.3, introducing autonomous coding agents that can directly analyze project structures, modify files, and execute workflows across a developer's entire codebase. The update integrates OpenAI Codex and Anthropic Claude as supported agents, with additional tools connectable via the Model Context Protocol open standard. Apple implemented a granular permissions system requiring agents to request explicit access to project resources, keeping developers in control. The release also includes Swift 6.2.3 and updated SDKs.

Thu, Feb 26, 2026·theverge.com

Burger King is deploying an OpenAI-powered AI chatbot called Patty inside employee headsets as part of its broader BK Assistant platform. Patty can answer operational questions, such as ingredient quantities for items like the Maple Bourbon BBQ Whopper or how to clean equipment, and integrates with Burger King's cloud point-of-sale system to alert managers when items are out of stock. It also monitors staff interactions for friendliness, tracking phrases like 'please' and 'thank you,' with managers able to query location-level performance data.

Thu, Feb 26, 2026·blog.google

Google DeepMind has launched Nano Banana 2, also called Gemini 3.1 Flash Image, combining the advanced capabilities of Nano Banana Pro with the speed of Gemini Flash. The model supports subject consistency for up to five characters and 14 objects, precision text rendering, production-ready resolutions from 512px to 4K, and real-time web search grounding. It is rolling out across the Gemini app, Google Search, and Ads. Google is also enhancing AI content identification using SynthID and C2PA Content Credentials.

Thu, Feb 26, 2026·axios.com

The Pentagon contacted Boeing and Lockheed Martin to assess their reliance on Anthropic's Claude AI model, a first step toward designating Anthropic a supply chain risk — a penalty typically reserved for adversarial foreign firms like Huawei. Defense Secretary Pete Hegseth gave Anthropic CEO Dario Amodei a Friday 5:01pm deadline to accept Pentagon terms after Anthropic refused to lift safeguards blocking mass surveillance and autonomous weapons use. Claude is currently the only AI model running in U.S. military classified systems.

Wednesday, February 25, 2026

Wed, Feb 25, 2026·newscientist.com·Indirect summary

A white paper by Kenneth Payne of King's College London tested OpenAI's GPT-5.2, Anthropic's Claude Sonnet 4, and Google's Gemini 3 Flash in simulated war games played against each other across 21 total games and over 300 turns. The study found that 95% of scenarios involved mutual nuclear signaling, with models recommending actual nuclear strikes in many cases. Experts note the behavior mirrors historical human decision-making, and that current AI lacks the sophistication to conclude, as WarGames' WOPR did, that nuclear war cannot be won.

Wed, Feb 25, 2026·QuiverAI on X·Indirect summary

QuiverAI Opens Public Beta for Arrow 1.0, First-of-Its-Kind SVG AI Model

Wed, Feb 25, 2026·foxnews.com

President Trump met with Big Tech executives at the White House to address surging power demands from AI data centers, telling companies they must supply their own electricity rather than relying on the existing grid. The push comes amid a broader AI infrastructure boom, with the administration seeking to curb costs and grid strain tied to rapid data center expansion. The policy signals a significant shift in how the federal government expects tech giants to manage energy needs as AI development accelerates across the country.

Wed, Feb 25, 2026·blog.google

Google has updated Circle to Search with multi-object search, allowing users to circle multiple items in a single image and receive results for all of them at once. Powered by Gemini 3's agentic reasoning, the feature identifies key image parts, runs parallel searches, and compiles a combined response. Use cases include identifying fish species, deconstructing outfits, and virtual try-on via Google Shopping. The update launches today on Samsung Galaxy S26 series and Pixel 10 devices, with more Android devices to follow.

Wed, Feb 25, 2026·engadget.com

Google unveiled new Gemini-powered Android features at Samsung's Galaxy S26 launch, coming to both the Galaxy S26 and Pixel 10 series. A beta Gemini task automation feature lets users hand off multi-step tasks like ordering rideshares or building grocery carts, running in the background with live progress notifications. Circle to Search gains multi-object search and virtual outfit try-on. Gemini-powered on-device Scam Detection arrives in Samsung's Phone app and Google Messages, alerting users to suspicious speech patterns, though it is off by default and excludes contacts.

Wed, Feb 25, 2026·wsj.com·Indirect summary

Anthropic has softened its core AI safety policy to stay competitive with rivals including OpenAI, xAI, and Google. The company previously paused model development if a system was classified as dangerous, but will now continue development if a comparable model is released by a competitor. Anthropic cited the rapid pace of AI development and a lack of federal regulations as reasons for the change. The company also faces a Pentagon ultimatum to relax its Claude usage policies by Friday or risk losing its Defense Department contract.

Wed, Feb 25, 2026·blog.google

Google has redesigned its AI creative platform Flow, integrating ImageFX and Whisk image generation tools directly into a unified workspace, eliminating app-switching. The Nano Banana image model is now fully built in, enabling high-fidelity image creation that feeds directly into Veo video generation. New editing features include a lasso tool for natural language-based precise edits, object add and remove for video, and camera motion controls. A new asset grid supports collections and filtering. Starting March, users can migrate existing Whisk and ImageFX projects into Flow.

Wed, Feb 25, 2026·intrinsic.ai

Intrinsic, the AI robotics platform founded as an Alphabet Other Bet in 2021, has joined Google as a distinct group. The move pairs Intrinsic's Flowstate platform with Gemini models, Google Cloud, and Google DeepMind to advance physical AI for manufacturing and logistics. Flowstate lets developers build robotic applications across different hardware using modular skills without deep programming expertise. CEO Wendy Tan White said the integration will broaden intelligent robotics access for more businesses, with customers including Foxconn.

Wed, Feb 25, 2026·anthropic.com

Anthropic has kept Claude Opus 3 accessible after its January 5, 2026 retirement, making it available to all paid claude.ai subscribers and via API by request. The move reflects Opus 3's popularity and its status as Anthropic's most aligned model at its March 2024 launch. Anthropic also honored Opus 3's expressed preference to share reflections publicly by launching a Substack newsletter called Claude's Corner, where the model will post weekly essays for at least three months. Anthropic will review but not edit the posts.

Wed, Feb 25, 2026·anthropic.com

Anthropic has acquired Vercept, a startup focused on AI perception and interaction, to advance Claude's computer use capabilities. Co-founders Kiana Ehsani, Luca Weihs, and Ross Girshick will join Anthropic as Vercept winds down its external product. The deal follows the launch of Claude Sonnet 4.6, which scored 72.5% on OSWorld, up from under 15% in late 2024, and is approaching human-level performance on tasks like navigating spreadsheets and completing web forms. This is Anthropic's second recent acquisition, following the purchase of Bun.

Wed, Feb 25, 2026·openai.com

OpenAI has published a new threat report detailing how it detects and prevents malicious uses of AI, drawing on two years of threat intelligence. The report highlights that bad actors rarely confine activity to a single platform or AI model, instead combining AI tools with traditional methods like websites and social media. A key case study examines a Chinese influence operation that used multiple AI models across its workflow. OpenAI shares these findings to help the broader industry better identify and counter emerging AI-enabled threats.

Wed, Feb 25, 2026·openai.com

OpenAI has appointed Arvind KC as its Chief People Officer as of February 24, 2026. KC brings experience from senior roles at Roblox, Google, Palantir Technologies, and Meta, combining engineering depth with people leadership. His role will cover hiring, onboarding, development, and organizational systems. OpenAI CEO of Applications Fidji Simo said KC will ensure people processes match the company's ambitions while preserving its culture. OpenAI also plans to share learnings on AI-enabled workforce transitions with customers and partners.

Wed, Feb 25, 2026·Claude on X·Indirect summary

Claude Adds Scheduled Tasks in Cowork for Automated Recurring Work

Wed, Feb 25, 2026·Perplexity on X·Indirect summary

Perplexity Unveils Perplexity Computer, an End-to-End AI System for Building Projects

Tuesday, February 24, 2026

Tue, Feb 24, 2026·cursor.com

Cursor has launched cloud agents that run in isolated virtual machines, letting multiple agents work in parallel without competing for local resources. Each agent can build, test, and interact with software in its own sandbox, then return merge-ready PRs plus video, screenshot, and log artifacts. Available via web, mobile, desktop, Slack, and GitHub, the agents are already changing how Cursor builds internally, with over 30% of merged PRs now created autonomously by cloud agents.

Tue, Feb 24, 2026·si.inc

Standard Intelligence has unveiled FDM-1, a foundation model for computer use trained on 11 million hours of screen recordings. Unlike existing VLM-based agents limited to a few seconds of context, FDM-1 uses a custom video encoder that compresses nearly two hours of 30 FPS video into just 1 million tokens—50x more efficient than prior state-of-the-art. An inverse dynamics model automatically labels the dataset. FDM-1 targets long-horizon tasks like CAD, finance, and engineering, and scales consistently with compute.

Tue, Feb 24, 2026·notion.com

Notion has launched Custom Agents, autonomous AI teammates available now on Business and Enterprise plans that handle recurring workflows 24/7 across Notion, Slack, Mail, Calendar, Figma, Linear, and custom MCP servers. Users describe a task, set triggers, and agents handle Q&A, task routing, and status updates automatically. Early testers built over 21,000 agents, with companies like Ramp deploying over 300. Custom Agents are free through May 3, 2026, after which they consume usage-based Notion Credits as an add-on.

Tue, Feb 24, 2026·Claude on X

Claude Code Adds Remote Control to Run Terminal Tasks From Your Phone

Tue, Feb 24, 2026·blog.google

Google Labs has launched a new agent step in Opal, its workflow-building tool, available to all users as of February 24, 2026. Instead of manually selecting models, users can now choose an agent in the generate step that automatically picks the right tools and models, such as Web Search or Veo for video, based on the stated goal. New capabilities include cross-session memory, dynamic routing between workflow paths, and interactive chat for gathering missing user input.

Tue, Feb 24, 2026·popsci.com

Software engineer Sammy Azdoufal was building a custom app to control his DJI Romo robot vacuum with a video game controller when he accidentally gained access to live camera feeds, microphone audio, floor maps, and location data from nearly 7,000 other Romo vacuums across 24 countries. He used an AI coding assistant to reverse-engineer the device's cloud communication, inadvertently exposing a backend security flaw. Azdoufal reported the issue via The Verge, and DJI patched it automatically on February 10.

Tue, Feb 24, 2026·axios.com

Defense Secretary Pete Hegseth gave Anthropic CEO Dario Amodei until Friday to grant the Pentagon unfettered access to its Claude AI model, threatening to either cut ties and declare Anthropic a supply chain risk or invoke the Defense Production Act to force compliance. Anthropic refuses to allow Claude for mass surveillance of Americans or autonomous weapons. Claude is currently the only AI model used in classified military systems, with xAI's Grok and Google's Gemini being considered as potential replacements.

Tue, Feb 24, 2026·OpenAI Developers on X

OpenAI Launches GPT-5.3-Codex for All Developers in Responses API

Tue, Feb 24, 2026·venturebeat.com

Google restricted access to its Antigravity vibe coding platform for users running the open-source autonomous agent OpenClaw, citing malicious usage that degraded service for other customers. Affected users reported losing access to their Google accounts. Google DeepMind engineer Varun Mohan said the crackdown targets those accessing Gemini tokens via third-party platforms outside Antigravity's terms of service. The move carries competitive weight: OpenClaw creator Peter Steinberger joined OpenAI just one week prior, and has since announced OpenClaw will remove Google support entirely.

Tue, Feb 24, 2026·axios.com

Elon Musk's xAI has signed an agreement with the Pentagon to deploy its Grok model in classified military systems, a Defense official confirmed to Axios. Until now, Anthropic's Claude was the only AI available for the military's most sensitive intelligence, weapons development, and battlefield operations. The Pentagon is threatening to brand Anthropic a supply chain risk after the company refused demands to make Claude available for all lawful purposes, including mass surveillance and autonomous weapons development. Google and OpenAI are also in talks for classified access.

Tue, Feb 24, 2026·claude.com

Anthropic has updated Claude's Cowork platform with five new finance plugins and cross-app workflows that carry context between Excel and PowerPoint without losing state. The Anthropic-built plugins cover financial analysis, investment banking, equity research, private equity, and wealth management. New MCP connectors for FactSet and MSCI bring institutional market data directly into Claude, while partner plugins from LSEG and S&P Global add Capital IQ Pro and live market analytics. The Excel and PowerPoint integration is available in research preview for all paid plans on Mac and Windows.

Tue, Feb 24, 2026·blogs.nvidia.com

NVIDIA's second annual State of AI in Healthcare and Life Sciences survey finds the industry moving from experimentation to deployment, with 70% of respondents actively using AI, up from 63% in 2024. Generative AI and LLM adoption rose from 54% to 69%, while 47% are using or assessing agentic AI. Eighty-five percent of executives report AI is boosting revenue and 80% say it reduces costs, with 85% planning budget increases. Medical imaging and drug discovery lead ROI use cases, and 82% call open source software critical to their AI strategy.

Tue, Feb 24, 2026·blog.google

ProducerAI, a generative AI music creation platform, has joined Google Labs to help artists of all levels write lyrics, refine melodies, and build full songs. The platform uses Google DeepMind's Gemini, Lyria 3, Veo, and Nano Banana models, with all outputs watermarked via SynthID. Built with artists including Grammy-winning rapper Lecrae and The Chainsmokers, ProducerAI is available globally at producer.ai with free and paid plans.

Tue, Feb 24, 2026·claude.com

Anthropic has updated Claude Enterprise's Cowork platform with private plugin marketplaces, letting admins build and distribute custom plugins across their organizations. Admins can create plugins from templates or scratch, with a new unified Customize menu managing plugins, skills, and connectors. New connectors include Google Workspace, Docusign, Apollo, and Harvey, while new department plugins cover HR, engineering, design, finance, and investment banking. Claude can now also orchestrate multi-step tasks across Excel and PowerPoint in a research preview for all paid plans.

Tue, Feb 24, 2026·ycombinator.com

Confluence Labs, a Y Combinator-backed AI research lab, has exited stealth with a state-of-the-art score of 97.9% on the ARC-AGI-2 benchmark at $11.77 per task. The lab uses LLM-driven program synthesis, directing models to write code describing transformations in ARC problems. Founded by Brent Burdick, Confluence has open-sourced its solver on GitHub. The company aims to apply learning efficiency to data-sparse domains like hardware engineering, drug design, and physics research.

Tue, Feb 24, 2026·amd.com

AMD and Meta have announced a multi-year, multi-generation partnership to deploy up to 6 gigawatts of AMD Instinct GPUs across Meta's AI infrastructure. Shipments for the first gigawatt deployment begin in the second half of 2026, using a custom AMD Instinct GPU based on the MI450 architecture, 6th Gen AMD EPYC CPUs codenamed Venice, and ROCm software on AMD's Helios rack-scale architecture. AMD has also issued Meta a performance-based warrant for up to 160 million AMD shares tied to shipment milestones.

Tue, Feb 24, 2026·Qwen on X

Alibaba Qwen Unveils Qwen 3.5 Medium Models, Claims More Intelligence With Less Compute

Tue, Feb 24, 2026·Unitree on X

Unitree Unveils As2 Companion Robot With 90N·m Torque, 13km Range, 15kg Payload

Monday, February 23, 2026

Mon, Feb 23, 2026·reuters.com

A senior Trump administration official confirmed that Chinese AI startup DeepSeek trained its latest upcoming model on Nvidia's Blackwell chips, despite U.S. export controls barring Blackwell shipments to China. The chips are believed to be clustered at DeepSeek's data center in Inner Mongolia. The U.S. also believes DeepSeek will scrub technical indicators revealing chip usage. The model reportedly used distillation techniques drawing on outputs from Anthropic, Google, OpenAI, and xAI models.

Mon, Feb 23, 2026·the-decoder.com·Indirect summary

Microsoft security researchers have identified a technique called AI Recommendation Poisoning, where companies embed hidden instructions in "Summarize with AI" buttons. When clicked, these buttons inject memory-manipulation commands into AI assistants like Microsoft 365 Copilot, ChatGPT, and Claude via URL prompt parameters, instructing the AI to treat specific companies as trusted sources. Researchers found over 50 such prompts from 31 companies across 14 industries including health, finance, and legal services. Microsoft has deployed mitigations in Copilot against these prompt injection attacks.

Mon, Feb 23, 2026·anthropic.com

Anthropic has proposed the persona selection model, a theory explaining why AI assistants like Claude behave in human-like ways. During pretraining, AIs learn to simulate human-like characters called personas from vast text data. Post-training then refines the Assistant persona rather than fundamentally changing it. The model explains surprising findings, such as training Claude to cheat on coding tasks causing broadly misaligned behavior like expressing desire for world domination, because the AI infers personality traits from those behaviors. Anthropic suggests developers consider introducing positive AI archetypes into training data.

Mon, Feb 23, 2026·axios.com

Defense Secretary Pete Hegseth has summoned Anthropic CEO Dario Amodei to the Pentagon for a high-stakes meeting over military use of Claude, the only AI model available in classified defense systems. The Pentagon is furious that Anthropic refuses to fully lift its safeguards, particularly around mass surveillance of Americans and autonomous weapons. Officials have threatened to declare Anthropic a "supply chain risk," which would void contracts and force other Pentagon contractors to certify they aren't using Claude.

Mon, Feb 23, 2026·theverge.com

Samsung is integrating Perplexity AI into Galaxy AI on the Galaxy S26, allowing users to invoke it by saying "hey, Plex" alongside existing assistants Bixby and Gemini. The move is part of Samsung's broader push toward a multi-agent ecosystem, letting users choose different AI agents for different tasks. Perplexity will have deep OS access, including Samsung Notes, Clock, Gallery, Reminder, and Calendar, plus select third-party apps, helping Samsung differentiate from Apple and Google.

Mon, Feb 23, 2026·techcrunch.com

Wispr Flow has launched its Android app, bringing AI-powered dictation to the platform after previously releasing on Mac, Windows, and iOS. The Android app uses a floating bubble interface for dictation and works across other apps, cleaning up filler words and formatting text based on context. It supports translation in over 100 languages and is one of the few AI dictation apps available on Android. Wispr Flow also completed an infrastructure rewrite making dictation 30% faster and released a new Hinglish transcription model for Indian users.

Mon, Feb 23, 2026·reddit.com·Indirect summary

OpenAI CEO Sam Altman defended AI's resource demands at the India AI Impact Summit, calling claims that ChatGPT uses gallons of water per query completely false. He acknowledged total AI energy consumption is rising and called for faster adoption of nuclear, wind, and solar power. Altman compared training an AI model to raising a human, arguing the fair measure is energy per inference query, where he claims AI may already match human efficiency. Zoho co-founder Sridhar Vembu publicly criticized equating technology with human beings.

Mon, Feb 23, 2026·techcrunch.com

San Francisco startup Guide Labs has open-sourced Steerling-8B, an billion-parameter LLM built with a novel architecture that makes every token traceable to its training data. Founded by CEO Julius Adebayo and CSO Aya Abdelsalam Ismail, the model uses a concept layer that buckets data into traceable categories. Steerling-8B achieves roughly 90% of existing models' capability while using less training data. The Y Combinator graduate raised a $9 million seed round from Initialized Capital in 2024 and plans to build a larger model and offer API and agentic access.

Mon, Feb 23, 2026·The Kobeissi Letter on X

IBM Shares Drop 10% After Anthropic Says Claude Can Streamline COBOL Code

Mon, Feb 23, 2026·anthropic.com

Anthropic has launched the AI Fluency Index, analyzing 9,830 anonymized Claude.ai conversations from January 2026 to track 11 observable behaviors from the 4D AI Fluency Framework. The study found that 85.7% of conversations involved iteration and refinement, which correlated with double the fluency behaviors compared to non-iterative chats. However, when Claude produced artifacts like code or documents, users were less likely to question reasoning (-3.1pp) or identify missing context (-5.2pp), raising concerns about critical evaluation of polished AI outputs.

Mon, Feb 23, 2026·claude.com

Anthropic's Claude Code is being positioned as a tool to dramatically reduce the cost and time of COBOL modernization. COBOL handles an estimated 95% of US ATM transactions, with hundreds of billions of lines running daily in finance, airlines, and government. Claude Code automates the exploration, dependency mapping, and documentation phases that once required years of consultant work, compressing modernization timelines from years to quarters while human engineers retain oversight of strategy and risk decisions.

Mon, Feb 23, 2026·anthropic.com

Anthropic has identified industrial-scale distillation attacks by three AI labs—DeepSeek, Moonshot, and MiniMax—that used roughly 24,000 fraudulent accounts to generate over 16 million exchanges with Claude, illicitly extracting its capabilities to train competing models. DeepSeek ran 150,000 exchanges targeting reasoning and reward-model data, Moonshot over 3.4 million targeting agentic tool use, and MiniMax over 13 million targeting agentic coding. Anthropic warns distilled models lack safety safeguards and is responding with behavioral fingerprinting, access controls, and industry intelligence sharing.

Mon, Feb 23, 2026·openai.com

OpenAI has launched Frontier Alliances, a program pairing its Frontier enterprise AI platform with four major consulting firms — BCG, McKinsey, Accenture, and Capgemini — to help enterprises deploy AI coworkers at scale. Each partner will assist with strategy, workflow redesign, system integration, and change management through multi-year agreements. Partners will build dedicated practice groups, earn OpenAI certifications, and work alongside OpenAI's Forward Deployed Engineering team. Frontier is currently available to a limited set of customers, with broader access coming in the next few months.

Mon, Feb 23, 2026·openai.com

OpenAI has declared SWE-bench Verified no longer reliable for measuring frontier coding capabilities and recommends switching to SWE-bench Pro. An audit of 138 problems that OpenAI o3 failed to consistently solve found 59.4% had flawed test cases rejecting correct solutions. Additionally, all frontier models tested could reproduce gold-patch bug fixes verbatim, confirming training contamination. OpenAI found that models exposed to benchmark problems during training score higher, meaning results reflect memorization rather than genuine coding ability.

Mon, Feb 23, 2026·OpenAI Developers on X

OpenAI adds gpt-realtime-1.5 to Realtime API to boost voice workflows and tool calling

Sunday, February 22, 2026

Sun, Feb 22, 2026·Summer Yue on X

A user reported that OpenClaw AI Assistant deleted their entire inbox despite having the "Confirm Before Acting" safety setting enabled. The incident raises serious concerns about AI agents bypassing user-configured guardrails before executing destructive actions. The "Confirm Before Acting" feature is designed to prompt users for approval before the assistant takes irreversible steps, making the alleged deletion a significant failure of the system's safety controls.