Latest AI News

The most comprehensive AI news feed on the internet -- curated by Matt Wolfe

*News may update slower on weekends and when Matt's traveling

Get This In Your Inbox Twice a Week

Matt's picksPaywalled
Jun 1 – 7, 2026

Saturday, June 6, 2026

Sat, Jun 6, 2026·techcrunch.com

President Trump said he's discussing deals with AI companies where the American people can benefit from AI's success. CNBC reports the Trump administration has been actively discussing taking an equity stake in OpenAI, with some of that equity potentially seeding a Public Wealth Fund proposed by OpenAI itself. CEO Sam Altman has reportedly been discussing a government stake in major AI companies since early 2025. The idea mirrors Trump's prior move taking a 10% stake in chipmaker Intel last year.

Friday, June 5, 2026

Fri, Jun 5, 2026·anthropic.com

Anthropic is partnering with synthetic, computational, and analytical chemists to improve Claude's chemistry capabilities. In a new white paper, Anthropic chemist David Kamber tested Claude Opus 4.7, Opus 4.6, and Sonnet 4.6 against dedicated NMR software ChemDraw and MestReNova across 20 novel compounds. Opus 4.7 matched or outperformed both tools on hydrogen shift prediction, averaging ±0.079 ppm error, and tied MestReNova on carbon prediction. Claude also successfully performed inverse structure elucidation from NMR spectra alone, a task existing software leaves entirely to chemists.

Fri, Jun 5, 2026· on X

OpenAI appears to have introduced a feature allowing ChatGPT users to send emails directly from within writing blocks in the ChatGPT interface, according to a post from the official ChatGPT account on X. However, the original post text was not available for review, so specific details about supported platforms, setup requirements, or limitations cannot be confirmed. The title suggests this is a notable workflow integration, but a fully accurate summary requires access to the complete source material.

Fri, Jun 5, 2026·theverge.com

New York's state legislature passed a one-year moratorium on new large data centers, potentially the first statewide ban of its kind in the US. The bill targets facilities with peak demand of at least 20 megawatts and directs the state's environmental agency to assess electricity, water, and land use impacts. Companies must also fund public hearings three months before approval. Democratic Governor Kathy Hochul, who has until December to decide, has not indicated whether she will sign or veto the bill.

Fri, Jun 5, 2026·blog.google

Google has released Quantization-Aware Training (QAT) checkpoints for its Gemma 4 model family, optimizing them for on-device use on mobile phones, laptops, and consumer GPUs. Unlike standard post-training quantization, QAT integrates compression into the training process to preserve model quality. A custom mobile quantization schema reduces the Gemma 4 E2B model's memory footprint to under 1GB. Weights are available on Hugging Face in GGUF and compressed tensor formats, with support for llama.cpp, Ollama, LM Studio, vLLM, and MLX.

Fri, Jun 5, 2026·Matt Wolfe on YouTube

Watch Matt Wolfe's latest YouTube video where he breaks down all of the most important AI news from the past week.

Fri, Jun 5, 2026·engadget.com

OpenAI has launched Lockdown Mode, an optional security setting for ChatGPT designed to protect users handling sensitive data from prompt injection attacks, where malicious instructions hidden on webpages trick AI systems into leaking information. The mode limits certain features, disabling Deep Research and Agent Mode entirely, and restricting internet image loading and file downloads. It is available to all accounts including free-tier users via Settings under Safety and Security. OpenAI is also rolling out an active session manager to monitor account access.

Thursday, June 4, 2026

Thu, Jun 4, 2026·blog.reve.com

Reve has launched Reve 2.0, an image generation model that replaces text-based intermediate representations with structured layouts. Each layout defines every element's location, size, description, and optional attributes like color or image references, acting as a backbone separating semantic intent from pixel rendering. Built on a unified Large Layout Model trained using Qwen open-source LLMs and a pipeline of billions of images, Reve 2.0 claims top arena rankings among sub-$1 trillion companies, trained on 10x fewer GPUs than competitors.

Thu, Jun 4, 2026·openai.com

OpenAI is rolling out Dreaming V3, an upgraded memory system for ChatGPT, now available to Plus and Pro users in the US. Unlike the original saved memories launched in April 2024, Dreaming uses a background process to automatically synthesize memories from chat history, keeping context fresh and accurate over time. The update addresses staleness, correctness, and scalability issues. Users can review a memory summary page and edit what ChatGPT knows about them. Free and Go users will gain access in coming weeks.

Thu, Jun 4, 2026·developer.nvidia.com

NVIDIA has released Nemotron 3 Ultra, a 550B-parameter Mixture-of-Experts model with 55B active parameters designed for long-running AI agents. It delivers 5x higher throughput than comparable open models and reduces agentic task costs by up to 30% by using fewer tokens per turn. Key innovations include a hybrid Mamba-Transformer architecture, NVFP4 precision, and Multi-Teacher On-Policy Distillation training. NVIDIA also launched Nemotron 3.5 Content Safety, a 4B guardrail model covering 23 safety categories, and Nemotron 3.5 ASR for sub-100ms voice agents.

Thu, Jun 4, 2026·openai.com

OpenAI has updated GPT-Rosalind, its purpose-built model series for enterprise life sciences research, combining GPT-5.5's agentic coding and tool-use capabilities with stronger intelligence in drug-discovery domains including medicinal chemistry and genomics. The update improves performance across biology analysis, design, and experimental workflows. OpenAI also introduced LifeSciBench, an expert-judged benchmark covering six workflow areas, on which GPT-Rosalind outperforms GPT-5.5 and Grok. The model is now available in research preview to eligible organizations globally via a trusted-access deployment structure.

Thu, Jun 4, 2026·Krea on X

Krea has launched Krea 2 Turbo, a new AI image generation model that produces high-quality images in approximately two seconds. The model is compatible with style references, moodboards, and LoRAs, making it adaptable for a range of creative workflows. Krea 2 Turbo is available to try for free at krea.ai, positioning it as a fast and accessible option for designers and creators who need rapid AI-assisted image generation results.

Thu, Jun 4, 2026·Runway on X

Runway has launched Aleph 2.0, a new AI video editing model now available inside its newly released Edit Studio. Aleph 2.0 is designed for precise, targeted edits that modify only the specific elements a user wants changed while leaving the rest of the shot completely untouched. The tool aims to give video creators greater control and accuracy, directly addressing one of the most persistent challenges in AI-assisted video editing: unintended changes to surrounding footage.

Thu, Jun 4, 2026·theinformation.com

Microsoft CEO Satya Nadella rebuked an internal memo by corporate vice president Omar Shahine that advocated making users addicted to Scout, Microsoft's new AI agent. Nadella told roughly 50 top AI engineers that addiction is "absolutely a non goal," adding that the author may "want to go work elsewhere." Shahine's memo outlined developing Scout in three phases from "addictive app to agentic platform." Scout, based on open source OpenClaw software and announced at Microsoft Build, is central to the company's push to sell AI tools to businesses.

Thu, Jun 4, 2026·prnewswire.com

Meshy, an AI-powered 3D creation platform, launched Meshy 3D Agent Beta on June 4, 2026, available to all users at meshy.ai. Unlike traditional Text-to-3D tools that generate one result per prompt, the agent supports a continuous, chat-based workflow where users can describe ideas in natural language, generate variations, and refine details step by step. Target use cases include games, product visualization, concept design, AR/VR, and 3D printing workflows.

Thu, Jun 4, 2026·siliconangle.com

Anthropic has called for a globally coordinated pause or slowdown in frontier AI development, warning that systems may soon achieve recursive self-improvement — the ability to independently enhance their own capabilities by writing their own code — without human oversight. Anthropic's Marina Favaro and Jack Clark argue this could happen within two years. Critics, including analyst Rob Enderle, call enforcement near-impossible, while others suggest the move is strategic marketing ahead of Anthropic's planned IPO following its $65 billion funding round.

Wednesday, June 3, 2026

Wed, Jun 3, 2026·blog.google

Google has launched Gemma 4 12B, a laptop-friendly multimodal AI model featuring a novel encoder-free architecture that processes vision and audio inputs directly into the LLM backbone. Running on just 16GB of VRAM, it delivers benchmark performance near Google's larger 26B MoE model at less than half the memory footprint. The model supports native audio input, Multi-Token Prediction drafters for lower latency, and is released under Apache 2.0. Gemma 4 models have now surpassed 150 million downloads.

Wed, Jun 3, 2026·blog.google

Google has announced five water stewardship commitments for its data centers, including a pledge to replenish more water than it consumes by 2030. In 2025, Google replenished over 7 billion gallons and has 165 projects across 97 watersheds expected to replenish 19 billion gallons annually by 2030. The company is also committing $17 million to new projects across seven U.S. states and has invested over $500 million in water infrastructure. Google is additionally evaluating more than 700 projects submitted through its Water Replenishment RFI.

Wed, Jun 3, 2026·Grok on X

xAI has released Grok Imagine 1.5 Preview, an updated image generation model now available to developers through its API. The release was announced by the official Grok account on X, with a direct link to the API endpoint for immediate access. Imagine 1.5 Preview is an incremental update to xAI's existing image generation capabilities, offering developers early access to test and integrate the latest version ahead of any wider public rollout.

Wed, Jun 3, 2026·Aoden Teo on X

Miso has launched Miso One, an billion-parameter text-to-speech model designed for highly expressive, emotive speech generation. The company claims it is the most emotive voice model in the world, capable of producing human-like emotional expression across a wide range of use cases. Miso One responds faster than a human speaker, achieving a latency of just 110 milliseconds, making it well-suited for real-time applications that require natural, expressive voice output at low latency.

Wed, Jun 3, 2026·Ideogram on X

Ideogram has launched Ideogram 4.0, an open-weight text-to-image model that ranks first on DesignArena's third-party leaderboard, ahead of all open models and behind only closed models from OpenAI and Google. Users can download weights, fine-tune on their own data, and run it on their own hardware. Key features include dense text rendering, native 2K resolution, background transparency, and precise layout control via bounding boxes. It is available on all Ideogram plans and the API today.

Tuesday, June 2, 2026

Tue, Jun 2, 2026·anthropic.com

Anthropic is expanding Project Glasswing, its AI-powered cybersecurity initiative, from roughly 50 initial partners to approximately 150 organizations across more than 15 countries. Partners use Claude Mythos Preview to scan codebases for vulnerabilities, having already found over 10,000 high- or critical-severity flaws. The new cohort covers critical infrastructure sectors including power, water, healthcare, and communications. Anthropic estimates a successful attack on most partners' codebases could affect more than 100 million people, underscoring the program's national and global security stakes.

Tue, Jun 2, 2026·perplexity.ai

Perplexity has announced a hybrid local-server inference orchestrator for its Personal Computer product, launching in July 2026. The system automatically routes tasks between on-device models and cloud-based frontier models, keeping sensitive data like financial records and health information local while sending compute-intensive work to servers. Unveiled in partnership with Intel, it also supports NVIDIA RTX Spark silicon. Perplexity argues the architecture reduces centralized infrastructure demand and improves data sovereignty without requiring countries to build dedicated data centers.

Tue, Jun 2, 2026·claude.com

Anthropic has added dynamic multi-agent workflows to Claude Code, allowing the AI to write and orchestrate its own custom harness on the fly for complex tasks. Unlike static workflows, dynamic workflows spawn separate Claude subagents with isolated context windows, combating failure modes like agentic laziness, self-preferential bias, and goal drift. Powered by Claude Opus 4.8, workflows execute JavaScript files to coordinate subagents and support patterns like fan-out-and-synthesize, adversarial verification, and tournament-style competition. Users can trigger them by asking Claude or using the keyword "ultracode."

Tue, Jun 2, 2026·whitehouse.gov

President Trump signed an executive order on June 2, 2026, directing federal agencies to strengthen AI cybersecurity within 30 days. CISA must issue binding directives to harden civilian systems and expand AI-enabled defensive tools. The Treasury Department will form a voluntary AI cybersecurity clearinghouse with industry to scan and patch software vulnerabilities. A separate voluntary framework will allow AI developers to submit frontier models for classified government review up to 30 days before release. The order explicitly prohibits mandatory licensing or preclearance requirements for AI models.

Tue, Jun 2, 2026·openai.com

OpenAI is expanding Codex, now used by over 5 million people weekly, with role-specific plugins, shareable interactive sites, and inline annotations. Six new plugins cover data analytics, creative production, sales, product design, public equity investing, and investment banking, bundling 62 apps and 110 skills. Non-developers make up 20% of Codex users and are growing three times faster than developers. Sites, rolling out in preview for Business and Enterprise customers, let teams share interactive dashboards and apps via URL.

Tue, Jun 2, 2026·quantum.microsoft.com

Microsoft has unveiled Majorana 2, a scalable quantum processor featuring topological qubits that are over 1,000 times more reliable than those in its predecessor. By replacing aluminum with lead in the material stack, qubit lifetimes jumped from 1–12 milliseconds to a mean of 20 seconds, occasionally exceeding one minute. The larger topological gap better shields qubits from environmental noise. AI assisted in designing the new stack. Microsoft, one of two finalists in DARPA's US2QC program, has halved its roadmap timeline and now targets a fault-tolerant quantum computer by 2029.

Tue, Jun 2, 2026·microsoft.ai

Microsoft AI launched seven in-house MAI models spanning reasoning, coding, image, transcription, and voice. MAI-Thinking-1 is a flagship reasoning model trained from scratch without third-party distillation, matching leading models on software engineering benchmarks. MAI-Code-Flash, a billion-parameter coding model, integrates into GitHub Copilot and VS Code. MAI Transcribe-1.5 claims world-best accuracy across 43 languages, running five times faster than competitors. Microsoft also announced a superintelligence lab and a healthcare partnership with Mayo Clinic to co-create a clinical AI model.

Tue, Jun 2, 2026·devblogs.microsoft.com

Microsoft has announced the private preview of Frontier Tuning, a new AI customization approach designed to make AI models work according to specific business needs. The system applies reinforcement learning inside a company's compliance boundary, using the organization's own data, processes, and conventions. This allows businesses to tailor AI behavior without exposing sensitive information outside their controlled environment, making it particularly relevant for enterprises with strict data governance requirements.

Tue, Jun 2, 2026·microsoft.com

Microsoft has introduced Scout, its first Autopilot agent for Microsoft 365, a new category of always-on agents that work autonomously on your behalf without needing to be prompted each time. Scout integrates with Teams, Outlook, OneDrive, and SharePoint, scheduling meetings, blocking calendar time for deliverables, and flagging risks like stalled decisions. It uses Work IQ to learn user priorities over time and is powered by OpenClaw open-source technology. Scout is currently in private preview for Frontier organizations requiring a GitHub Copilot license.

Tue, Jun 2, 2026·blogs.windows.com

Microsoft held its Build 2026 developer conference, using the annual event to connect with the global developer community and highlight progress on Windows as a trusted development platform. The company described Build as one of its favorite moments each year for sharing what its teams have been building. However, the available source text is truncated, limiting full detail on specific announcements, features, or tools revealed at the event.

Tue, Jun 2, 2026·microsoft.com

Microsoft will make its Work IQ APIs generally available on June 16, 2026, offering developers a new way to build AI agents that interact with Microsoft 365 data. Work IQ acts as a semantic intelligence layer, continuously processing email, calendar, meetings, chats, files, and organizational data to give agents business context rather than raw data. The APIs claim 2x faster processing, 80% fewer tokens than traditional APIs, and collapse tool calling into just 10 generic tools via MCP. Pricing uses consumption-based Copilot Credits.

Tue, Jun 2, 2026·blogs.bing.com

Microsoft has launched Web IQ, a suite of AI-native grounding APIs built on Bing's global index but re-architected from the ground up for agentic AI workloads. Web IQ connects AI agents to fresh web data including pages, news, images, and videos, returning passage-level evidence rather than full documents to improve token efficiency. It operates at sub-165ms p95 latency, nearly 2.5 times faster than competitors, and outperforms alternatives on grounding satisfaction scores measured across 3,000 global queries.

Tue, Jun 2, 2026·commandline.microsoft.com

Microsoft has unveiled Project Solara, a chip-to-cloud software platform paired with tailored hardware designed to enable agent-first devices. Built on the premise that the next platform shift moves from apps to agents, Solara treats agents as both a new programming unit and a new human-to-machine interaction model. The platform supports an open, multi-agent world with enterprise-grade security, identity, and manageability built in. Microsoft is previewing two device concept categories—stationary and portable—targeting verticals including healthcare, retail, and finance.

Tue, Jun 2, 2026·blogs.windows.com

Microsoft has announced the Surface RTX Spark Dev Box, a new device aimed at software developers. Revealed on the Windows Devices blog, the machine is designed to meet the demanding workloads of modern development, including longer-running tasks and intensive tooling requirements. Microsoft positions the device as built to match the pace of contemporary software creation, though specific hardware specifications, pricing, and availability details were not fully captured in the available source text.

Tue, Jun 2, 2026·github.blog

GitHub has launched the Copilot app, a desktop control center for agent-native development, now in technical preview for existing Copilot Pro, Pro+, Business, and Enterprise users. The app features a My Work view to manage parallel agent sessions, pull requests, and background automations across connected repositories. New canvases provide bidirectional work surfaces where agents and developers collaborate visibly. Additional features include cloud and local sandboxes, an Agent Merge tool, upgraded code review with medium-tier reasoning models, and a generally available Copilot SDK supporting Node.js, Python, Go, .NET, Rust, and Java.

Tue, Jun 2, 2026·news.microsoft.com

Mayo Clinic and Microsoft have announced a strategic collaboration to develop a frontier AI model purpose-built for healthcare. The model will combine Mayo Clinic's de-identified clinical health data and longitudinal insights with Microsoft's AI and cloud capabilities to support clinical reasoning, earlier diagnoses, and personalized treatment decisions. Mayo Clinic will own the model, reinforcing patient trust and data stewardship. Microsoft plans to distribute it via Azure Foundry APIs. Microsoft AI CEO Mustafa Suleyman called it the best collaboration imaginable to accelerate frontier medical intelligence.

Tue, Jun 2, 2026·Nous Research on X

Nous Research has launched Hermes Desktop in public preview, marking the next evolution of its Hermes Agent platform. The application runs natively on local machines, bringing the full Hermes Agent experience to desktop environments without relying on cloud infrastructure. Hermes Desktop was first publicly demoed during Nvidia CEO Jensen Huang's GTC keynote before being made broadly available. The native desktop approach allows users to run the Hermes Agent system directly on their own hardware.

Tue, Jun 2, 2026·Black Forest Labs on X

Black Forest Labs has named legendary filmmaker Martin Scorsese as an advisor, bringing his six decades of cinematic storytelling experience to the AI image generation company. Scorsese will help guide the development of visual intelligence with human taste and craft as central priorities. The announcement was accompanied by a working storyboarding session in which Scorsese used Black Forest Labs' FLUX image generation model, demonstrating a practical creative collaboration between the director and the AI tool.

Tue, Jun 2, 2026·factory.ai

Factory.ai has launched Factory Router, an AI model routing tool now in private research preview, designed to cut enterprise AI coding costs by automatically selecting the right model for each task. It achieves 20-25% lower cost per session while maintaining 99% of Claude Opus 4.7's pass rate on Terminal-Bench 2 and 96% on Legacy-Bench. The tool routes across providers for 99.9%+ reliability and includes enterprise admin controls for custom routing rules.

Monday, June 1, 2026

Mon, Jun 1, 2026·openai.com

OpenAI broke ground on The Barn, a 1GW Stargate data center campus in Saline, Michigan, alongside Governor Gretchen Whitmer and partners Oracle, Related Digital, and Walbridge. The project is expected to create 2,500 union construction jobs and 450 permanent onsite positions, generating $1 billion in tax revenue over the lease term. OpenAI will also provide up to $45 million in Codex credits to over 400,000 eligible Michigan college and trade school students during the 2026–2027 academic year.

Mon, Jun 1, 2026·martechseries.com

TwelveLabs has launched Rodeo, its first application-layer product and an AI-powered creative copilot that lets video editors find, assemble, and edit footage using natural language. Powered by TwelveLabs' Marengo 3.0 and Pegasus 1.5 foundation models, Rodeo searches entire footage libraries contextually, understanding not just what appears on screen but why it matters. CEO Jae Lee says the tool marks TwelveLabs' evolution from infrastructure provider to creator-facing toolmaker, enabling producers and editors to go from raw footage to finished stories in minutes.

Mon, Jun 1, 2026·theverge.com

Microsoft's Build conference in San Francisco will feature several major announcements, including a new reasoning model called MAI-Thinking-1 from Microsoft AI chief Mustafa Suleiman, plus MAI-Image-2.5 and MAI-Image-2.Flash. Microsoft will also unveil a developer-optimized Windows 11 experience with pre-installed tools and a distraction-free environment, a Copilot super app combining AI assistants into one interface, and expanded support for local AI models on Nvidia RTX Spark hardware. GitHub improvements and Qualcomm's continued Windows on Arm work are also expected.

Mon, Jun 1, 2026·nytimes.com·Indirect summary

Sen. Bernie Sanders (I-VT) has proposed that the federal government acquire a 50% ownership stake in major AI companies including OpenAI and Anthropic, to be held in a newly created sovereign wealth fund for public benefit. Sanders, a self-described democratic socialist, outlined the plan in a New York Times op-ed and said he would introduce formal legislation within weeks. The proposal would impose a one-time tax on AI firms, paid in stock rather than cash, rather than a direct purchase.

Mon, Jun 1, 2026·huggingface.co

JetBrains has released Mellum2, a 12B-parameter Mixture-of-Experts model trained from scratch on text and code. The model activates only 2.5B parameters per token, enabling more than 2x faster inference compared to similarly sized open models. Released under Apache 2.0, Mellum2 targets latency-sensitive workloads including routing, RAG pipelines, sub-agents, summarization, and private deployments. It is available on Hugging Face and is designed to serve as a fast, efficient component within larger multi-model AI systems for software engineering.

Mon, Jun 1, 2026·theinformation.com·Indirect summary

Anthropic's Claude Mythos Preview has found over 10,000 high- or critical-severity vulnerabilities across critical software through Project Glasswing, a collaborative effort with roughly 50 partners. Cloudflare alone found 2,000 bugs with a false positive rate better than human testers. Mythos also scanned 1,plus open-source projects, surfacing an estimated 3,900 confirmed high- or critical-severity flaws. The bottleneck has shifted from finding bugs to patching them, with some maintainers asking Anthropic to slow disclosures due to capacity constraints.

Mon, Jun 1, 2026·anthropic.com

Draft summary: Anthropic, the AI safety company behind Claude, has confidentially submitted a draft S-1 registration statement to the U.S. Securities and Exchange Commission for a proposed IPO of its common stock. The filing gives Anthropic the option to go public once the SEC completes its review, though the offering remains subject to market conditions. Share count and pricing have not yet been determined. Anthropic recently raised $65 billion in Series H funding at a $965 billion post-money valuation.

Mon, Jun 1, 2026·Sam Altman on X

OpenAI is actively hiring engineers for its robotics division, according to a post from CEO Sam Altman. The push signals OpenAI's growing ambitions in physical AI, aiming to develop robots capable of performing genuinely useful real-world tasks. While specific roles and technical details were not disclosed in the post, the recruitment drive suggests OpenAI is scaling its robotics team significantly as competition in humanoid and general-purpose robotics intensifies across the AI industry.

Mon, Jun 1, 2026·minimax.io

MiniMax has released M3, an open-weight AI model featuring a 1 million token context window, native multimodality supporting image and video input, and frontier-level coding and agentic capabilities. Powered by a new MiniMax Sparse Attention architecture, M3 achieves 15x faster decoding than its predecessor. On SWE-Bench Pro, it surpasses GPT-5.5 and Gemini 3.1 Pro. M3 is claimed to be the first open-weight model combining all three capabilities and is available via MiniMax Code, Token Plan, and API.