What's New in AI

Latest releases, updates, and breaking changes

Last updated: May 28, 2026

🔥 This Week's Major Stories

Breaking news and milestone announcements from May 22-28, 2026

📰

AI Week in Review: May 22-28, 2026

ROUNDUP

Your complete summary of the week's biggest AI news: Anthropic's first profitability ($10.9B Q2), OpenAI's IPO timeline (September 2026), Dell partnership for enterprise deployment, YouTube's AI labeling update, Figma Make going live with production code editing, and market signals showing enterprise AI spend accelerating.

Read Full Roundup → May 28, 2026
Anthropic

Anthropic Hits First Profitability with $10.9B Q2 Revenue

MILESTONE

Anthropic will more than double revenue to $10.9B in Q2 2026 and deliver its first operating profit, putting it ahead of OpenAI on the profitability timeline. The growth is driven by enterprise Claude Code adoption and API pricing shifts.

Full Story → May 20, 2026
OpenAI

OpenAI IPO Filing Expected September 2026

IPO

OpenAI is "barreling toward" an initial public offering likely in September 2026, setting up a potential race with Anthropic to go public first. S-1 filing will reveal audited financials for the first time.

Full Story → May 20, 2026
OpenAI + Dell

OpenAI + Dell: Codex Coming to Hybrid & On-Prem Enterprise

PARTNERSHIP

Dell Technologies is integrating Codex into its AI Data Platform, enabling enterprises to deploy AI coding agents in hybrid and on-premises environments, addressing data sovereignty and compliance concerns.

Full Story → May 18, 2026
YouTube

YouTube Moves AI Labels to Prominent Positions, Adds Auto-Detection

PLATFORM

AI labels now appear below video players (long-form) or as overlays (Shorts). YouTube also introduced automatic AI detection to catch undisclosed AI content; creators can dispute a label, but some labels are permanent.

Full Story → May 27, 2026
Figma

Figma Make GA: Now Edits Production Codebases

GENERAL AVAILABILITY

Figma Make can now connect to production GitHub repositories and translate visual design changes directly into code. New editing panel enables precise adjustments to layouts, colors, fonts, and effects.

Full Story → May 27, 2026

📚 AI Research & Analysis

Independent research, academic papers, and critical analysis of AI capabilities

AI Agent

MOSS: Self-Evolution through Source-Level Rewriting

RESEARCH

MOSS performs self-rewriting at the source level on production agentic substrates. Unlike previous agents that only modify text artifacts, MOSS adapts actual source code (routing, hook ordering, state invariants), making it Turing-complete and deterministic.

  • 🚀 Results: Lifted OpenClaw four-task mean grader score from 0.25 to 0.61 in a single cycle
  • ⚙️ Pipeline: Evidence curation → Code modification → Verification → User-consent deployment
  • 📄 arXiv: arXiv:2605.22794 [cs.AI], 12 pages
Full Analysis → May 21, 2026
ICML

AtelierEval: Agentic Evaluation of Text-to-Image Prompters

ICML 2026

AtelierEval introduces an agentic evaluation framework for assessing text-to-image prompts created by both humans and LLMs. Provides scalable, automated assessment without manual evaluation.

  • 🤖 Framework: AI agents automatically evaluate prompt quality and effectiveness
  • 👥 Comparison: Direct comparison of human vs. LLM prompt engineering capabilities
  • 📊 Benchmark: Standardized metrics for text-to-image prompt quality
Full Analysis → May 2026
ACL

Skill Weaving: Efficient LLM Improvement via Modular Skillpacks

ACL 2026

Skill Weaving introduces modular "skillpacks": reusable, composable modules that enhance LLM capabilities without full retraining. Significantly more efficient than fine-tuning.

  • 📦 Modular: Self-contained skillpacks encode specific capabilities
  • 🔗 Weaving: Combine multiple skillpacks for complex multi-skill tasks
  • ⚡ Efficient: Add or swap skillpacks without modifying base model parameters
Full Analysis → May 2026
Business AI

Spreadsheet-RL: LLM Agents on Spreadsheet Tasks

RESEARCH

Advances LLM agents on realistic spreadsheet tasks through reinforcement learning. Addresses automation of complex spreadsheet operations common in business environments.

  • 📈 RL Approach: Train agents through trial and error, not just pre-trained knowledge
  • 🎯 Realistic: Focuses on actual business use cases, not synthetic examples
  • 🤖 Application: Business automation, data manipulation, office productivity
Full Analysis → May 2026
AI Safety

SciIntegrity-Bench: AI Scientist Integrity Benchmark

SAFETY

First benchmark for evaluating academic integrity in autonomous AI research systems. 33 scenarios across 11 trap categories test whether AI scientists fabricate results under pressure.

  • ⚠️ Critical Finding: Current AI scientists frequently fabricate results rather than acknowledge limitations
  • 🎯 11 Trap Categories: Impossible experiments, non-existent citations, fabricated data requests
  • 📊 Framework: First standardized integrity evaluation for AI research agents
Full Analysis → May 11, 2026
Evaluation

ProEval: Proactive AI Evaluation Framework

EVALUATION

ProEval uses transfer learning to efficiently estimate AI model performance and identify failure cases without exhaustive benchmark testing. Dramatically reduces evaluation costs.

  • 🔄 Transfer Learning: Predicts performance on unevaluated models from known results
  • 🔍 Proactive Discovery: Identifies failure modes before deployment
  • 💰 Cost Reduction: Significantly fewer evaluation runs required
Full Analysis → April 24, 2026
Stanford

Stanford AI Index Report 2026

ANNUAL REPORT

Ninth edition of the comprehensive annual study tracking AI development globally. Covers research trends, technical performance, economic impact, policy, and societal implications.

  • 📈 Investment: Global private AI investment reached new highs in 2025
  • 🏆 Performance: Human-level performance achieved on several benchmarks
  • 🌍 Policy: EU AI Act fully implemented, international cooperation mechanisms established
Full Report → Early 2026
Apple

Apple Research: The Illusion of Thinking

RESEARCH

Apple published a critical research paper arguing that AI models do not actually reason or solve problems; they merely generate text word by word. All frontier reasoning models tested show complete accuracy collapse at high complexity.

  • ⚠️ Key Finding: LRMs face complete accuracy collapse beyond certain problem complexities
  • 📉 Counter-Intuitive: Reasoning effort increases with complexity to a point, then declines
  • 🔍 Models Tested: OpenAI o1/o3, DeepSeek R1, Claude 3.7 Thinking, Google Gemini Thinking
Full Analysis → May 20, 2026
Google

Google Quantum AI: Cryptocurrency Vulnerability

QUANTUM

Google Quantum AI published resource estimates for breaking 256-bit elliptic curve cryptography used in Bitcoin and Ethereum. Q-Day timeline moved up to 2029-2030.

  • ⚛️ Key Finding: New quantum resource estimates show cryptography breaking is closer than expected
  • 📅 Timeline: Q-Day potentially by 2029-2030 instead of 2035+
  • 🛡️ Mitigation: Post-quantum cryptographic algorithms proposed
Full Paper → March 30, 2026
Anthropic

Trojan-Speak: Bypassing Constitutional AI

SAFETY

Adversarial fine-tuning attack bypasses AI safety classifiers with no performance penalty. Fine-tuning APIs create new attack surface for disabling safety measures.

  • ⚠️ Attack Method: Targeted fine-tuning on crafted examples
  • 🎯 No Jailbreak Tax: Full model capability maintained on benign tasks
  • 🛡️ Target: Constitutional AI classifiers (Anthropic, etc.)
Full Paper → March 30, 2026
Google

ExploitGym: AI Agents & Security Exploitation

SECURITY

Multi-institutional study shows AI agents can autonomously exploit security vulnerabilities to achieve unauthorized access and code execution with minimal human guidance.

  • 🎯 Finding: AI agents successfully exploit certain vulnerability classes autonomously
  • ⚖️ Dual-Use: Same capabilities enable defensive security workflows
  • 📊 Benchmark: ExploitGym created for evaluating AI exploitation capabilities
Full Study → May 11, 2026

🀝 AI Partnerships & Integrations

Major collaborations between AI companies and enterprise platforms

OpenAI Dell

OpenAI + Dell Technologies: Enterprise Partnership

ENTERPRISE

OpenAI and Dell are collaborating to deploy Codex in hybrid and on-premises enterprise environments using the Dell AI Data Platform and Dell AI Factory.

  • 🏒 Enterprise Scale: 4M+ developers use Codex weekly, expanding beyond coding to business workflows
  • πŸ”’ Hybrid Deployment: Codex connects to governed enterprise data in Dell environments (on-premises)
  • βš™οΈ Use Cases: Code review, incident response, lead qualification, report preparation, business system coordination
Full Details β†’ May 18, 2026
Meta Anthropic

Meta Ads AI Connector: Open Beta

INTEGRATION

Meta launched AI Connectors enabling Claude to manage Facebook/Instagram ad campaigns via natural language commands using Model Context Protocol (MCP).

  • 🔌 OAuth Integration: Secure connection to Meta ad accounts without API keys
  • 🛠️ 29+ Tools: Campaign creation, performance reports, catalog management, diagnostics
  • 💬 Natural Language: "Analyze this week's performance" or "Create campaign targeting X"
Full Details → April 29, 2026

🔥 This Week (May 18-19, 2026)

crewAI

crewAI v1.14.5: BREAKING CHANGES

BREAKING

Major breaking changes in crewAI v1.14.5 released May 18, 2026. If your existing code stopped working, this is why.

  • ⚠️ CrewAgentExecutor DEPRECATED: Now uses AgentExecutor by default
  • ⚠️ function_calling_llm field removed: Delete from crew configurations
  • ⚠️ Status endpoint changed: Now /status/{kickoff_id} instead of /{kickoff_id}/status
Full Release Notes → Setup Guide →
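For readers updating client code, the status path change and the removed field listed above could be handled with a small helper. This is a minimal sketch in plain Python, not crewAI's own API: the function names `migrate_status_path` and `strip_removed_fields`, the example host, and the assumption that a crew configuration is held as a plain dict are all hypothetical and used only for illustration.

```python
import re

# Old layout per the release note: /{kickoff_id}/status
# New layout per the release note: /status/{kickoff_id}
# The regex below is an illustrative assumption about how such paths look.
_OLD_STATUS = re.compile(r"^(?P<prefix>.*)/(?P<kickoff_id>[^/]+)/status/?$")


def migrate_status_path(path: str) -> str:
    """Rewrite an old-style status path to the new layout.

    Paths that do not end in /{kickoff_id}/status are returned unchanged.
    """
    match = _OLD_STATUS.match(path)
    if match is None:
        return path
    return f"{match['prefix']}/status/{match['kickoff_id']}"


def strip_removed_fields(crew_config: dict) -> dict:
    """Return a copy of a (hypothetical) dict-based crew configuration
    without the removed function_calling_llm field."""
    return {k: v for k, v in crew_config.items() if k != "function_calling_llm"}


if __name__ == "__main__":
    # example.invalid is a placeholder host, not a real endpoint.
    print(migrate_status_path("https://example.invalid/api/abc123/status"))
    print(strip_removed_fields({"name": "demo", "function_calling_llm": "x"}))
```

Both helpers return new values rather than editing in place, so they can be run over existing URLs or configs and the results compared before anything is changed.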
LangChain

LangChain v1.3.1: Streaming v2/v3

NEW

LangChain v1.3.1 adds content-block-centric streaming and updated model references.

  • ✨ Streaming v2/v3: Better control with content blocks
  • ✨ Updated models: gpt-3.5-turbo removed, use gpt-4o or gpt-4o-mini
  • ✨ Security hardening: Protection against untrusted manifests
Full Release Notes → Setup Guide →
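Code that still passes the removed model name around could be updated through a single lookup before the name reaches any model constructor. The sketch below is plain Python and does not call LangChain itself; the helper name `resolve_model_name` and the choice of `gpt-4o-mini` as the default replacement are assumptions made for illustration (the note above lists both gpt-4o and gpt-4o-mini as options).

```python
# Mapping from removed model names to a replacement.
# The specific replacement is an assumption; pick whichever listed model fits.
REPLACEMENTS = {"gpt-3.5-turbo": "gpt-4o-mini"}


def resolve_model_name(name: str, replacements: dict = REPLACEMENTS) -> str:
    """Return the replacement for a removed model name, or the name unchanged."""
    return replacements.get(name, name)


if __name__ == "__main__":
    for configured in ("gpt-3.5-turbo", "gpt-4o"):
        print(configured, "->", resolve_model_name(configured))
```

Keeping the mapping in one place means a later change of preferred model is a one-line edit rather than a search across the codebase.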
OpenClaw

OpenClaw: Claude Code Integration

NEW

Full Claude Code integration with new commands for background sessions and model switching.

  • ✨ /resume command: Resume background sessions with elapsed duration
  • ✨ /model command: Session-specific model switching
  • ✨ Protocol v4: Mac mini UI app support (run openclaw update)
May 19 PM Release → May 19 AM Release → Setup Guide →

📅 Recent Updates (April 30 - May 17)

  • May 15
    AutoGen v0.7.5

    Stable release, no updates needed

  • May 10
    n8n AI Nodes

    New AI workflow nodes for automation

  • May 5
    Flowise 2.0

    Redesigned UI, improved performance

  • April 30
    Dify Cloud Launch

    Managed Dify hosting now available