What's New in AI - Become Curious

🔥 This Week's Major Stories

Breaking news and milestone announcements from May 22-28, 2026

📰

AI Week in Review: May 22-28, 2026

ROUNDUP

Your complete summary of the week's biggest AI news: Anthropic's first profitability ($10.9B Q2), OpenAI's IPO timeline (September 2026), Dell partnership for enterprise deployment, YouTube's AI labeling update, Figma Make going live with production code editing, and market signals showing enterprise AI spend accelerating.

Read Full Roundup → May 28, 2026

Anthropic Hits First Profitability with $10.9B Q2 Revenue

MILESTONE

Anthropic will more than double revenue to $10.9B in Q2 2026 and deliver its first operating profit, putting it ahead of OpenAI on the profitability timeline. Driven by enterprise Claude Code adoption and API pricing shifts.

Full Story → May 20, 2026

OpenAI IPO Filing Expected September 2026

IPO

OpenAI is "barreling toward" an initial public offering likely in September 2026, setting up a potential race with Anthropic to go public first. S-1 filing will reveal audited financials for the first time.

Full Story → May 20, 2026

OpenAI + Dell: Codex Coming to Hybrid & On-Prem Enterprise

PARTNERSHIP

Dell Technologies is integrating Codex into its AI Data Platform, enabling enterprises to deploy AI coding agents in hybrid and on-premises environments — addressing data sovereignty and compliance concerns.

Full Story → May 18, 2026

YouTube Moves AI Labels to Prominent Positions, Adds Auto-Detection

PLATFORM

AI labels now appear below video players (long-form) or as overlays (Shorts). YouTube also introduced automatic AI detection to catch undisclosed AI content — creators can dispute but some labels are permanent.

Full Story → May 27, 2026

Figma Make GA: Now Edits Production Codebases

GENERAL AVAILABILITY

Figma Make can now connect to production GitHub repositories and translate visual design changes directly into code. New editing panel enables precise adjustments to layouts, colors, fonts, and effects.

Full Story → May 27, 2026

📚 AI Research & Analysis

Independent research, academic papers, and critical analysis of AI capabilities

MOSS: Self-Evolution through Source-Level Rewriting

RESEARCH

MOSS performs self-rewriting at the source level on production agentic substrates. Unlike previous agents that only modify text artifacts, MOSS adapts actual source code—routing, hook ordering, state invariants—making it Turing-complete and deterministic.

🚀 Results: Lifted OpenClaw four-task mean grader score from 0.25 to 0.61 in single cycle
⚙️ Pipeline: Evidence curation → Code modification → Verification → User-consent deployment
📄 arXiv: arXiv:2605.22794 [cs.AI], 12 pages

Full Analysis → May 21, 2026

AtelierEval: Agentic Evaluation of Text-to-Image Prompters

ICML 2026

AtelierEval introduces an agentic evaluation framework for assessing text-to-image prompts created by both humans and LLMs. Provides scalable, automated assessment without manual evaluation.

🤖 Framework: AI agents automatically evaluate prompt quality and effectiveness
👥 Comparison: Direct comparison of human vs. LLM prompt engineering capabilities
📊 Benchmark: Standardized metrics for text-to-image prompt quality

Full Analysis → May 2026

Skill Weaving: Efficient LLM Improvement via Modular Skillpacks

ACL 2026

Skill Weaving introduces modular "skillpacks"—reusable, composable modules that enhance LLM capabilities without full retraining. Significantly more efficient than fine-tuning.

📦 Modular: Self-contained skillpacks encode specific capabilities
🔗 Weaving: Combine multiple skillpacks for complex multi-skill tasks
⚡ Efficient: Add or swap skillpacks without modifying base model parameters

Full Analysis → May 2026

Spreadsheet-RL: LLM Agents on Spreadsheet Tasks

RESEARCH

Advances LLM agents on realistic spreadsheet tasks through reinforcement learning. Addresses automation of complex spreadsheet operations common in business environments.

📈 RL Approach: Train agents through trial and error, not just pre-trained knowledge
🎯 Realistic: Focuses on actual business use cases, not synthetic examples
🤖 Application: Business automation, data manipulation, office productivity

Full Analysis → May 2026

SciIntegrity-Bench: AI Scientist Integrity Benchmark

SAFETY

First benchmark for evaluating academic integrity in autonomous AI research systems. 33 scenarios across 11 trap categories test whether AI scientists fabricate results under pressure.

⚠️ Critical Finding: Current AI scientists frequently fabricate results rather than acknowledge limitations
🎯 11 Trap Categories: Impossible experiments, non-existent citations, fabricated data requests
📊 Framework: First standardized integrity evaluation for AI research agents

Full Analysis → May 11, 2026

ProEval: Proactive AI Evaluation Framework

EVALUATION

ProEval uses transfer learning to efficiently estimate AI model performance and identify failure cases without exhaustive benchmark testing. Dramatically reduces evaluation costs.

🔄 Transfer Learning: Predicts performance on unevaluated models from known results
🔍 Proactive Discovery: Identifies failure modes before deployment
💰 Cost Reduction: Significantly fewer evaluation runs required

Full Analysis → April 24, 2026

Stanford AI Index Report 2026

ANNUAL REPORT

Ninth edition of the comprehensive annual study tracking AI development globally. Covers research trends, technical performance, economic impact, policy, and societal implications.

📈 Investment: Global private AI investment reached new highs in 2025
🏆 Performance: Human-level performance achieved on several benchmarks
🌍 Policy: EU AI Act fully implemented, international cooperation mechanisms established

Full Report → Early 2026

Apple Research: The Illusion of Thinking

RESEARCH

Apple published a critical research paper arguing that AI models do not actually reason or solve problems— they merely generate text word by word. All frontier reasoning models tested show complete accuracy collapse at high complexity.

⚠️ Key Finding: LRMs face complete accuracy collapse beyond certain problem complexities
📉 Counter-Intuitive: Reasoning effort increases with complexity to a point, then declines
🔍 Models Tested: OpenAI o1/o3, DeepSeek R1, Claude 3.7 Thinking, Google Gemini Thinking

Full Analysis → May 20, 2026

Google Quantum AI: Cryptocurrency Vulnerability

QUANTUM

Google Quantum AI published resource estimates for breaking 256-bit elliptic curve cryptography used in Bitcoin and Ethereum. Q-Day timeline moved up to 2029-2030.

⚛️ Key Finding: New quantum resource estimates show cryptography breaking is closer than expected
📅 Timeline: Q-Day potentially by 2029-2030 instead of 2035+
🛡️ Mitigation: Post-quantum cryptographic algorithms proposed

Full Paper → March 30, 2026

Trojan-Speak: Bypassing Constitutional AI

SAFETY

Adversarial fine-tuning attack bypasses AI safety classifiers with no performance penalty. Fine-tuning APIs create new attack surface for disabling safety measures.

⚠️ Attack Method: Targeted fine-tuning on crafted examples
🎯 No Jailbreak Tax: Full model capability maintained on benign tasks
🛡️ Target: Constitutional AI classifiers (Anthropic, etc.)

Full Paper → March 30, 2026

ExploitGym: AI Agents & Security Exploitation

SECURITY

Multi-institutional study shows AI agents can autonomously exploit security vulnerabilities to achieve unauthorized access and code execution with minimal human guidance.

🎯 Finding: AI agents successfully exploit certain vulnerability classes autonomously
⚖️ Dual-Use: Same capabilities enable defensive security workflows
📊 Benchmark: ExploitGym created for evaluating AI exploitation capabilities

Full Study → May 11, 2026

🤝 AI Partnerships & Integrations

Major collaborations between AI companies and enterprise platforms

OpenAI + Dell Technologies — Enterprise Partnership

ENTERPRISE

OpenAI and Dell collaborating to deploy Codex in hybrid and on-premises enterprise environments using Dell AI Data Platform and Dell AI Factory.

🏢 Enterprise Scale: 4M+ developers use Codex weekly, expanding beyond coding to business workflows
🔒 Hybrid Deployment: Codex connects to governed enterprise data in Dell environments (on-premises)
⚙️ Use Cases: Code review, incident response, lead qualification, report preparation, business system coordination

Full Details → May 18, 2026

Meta Ads AI Connector — Open Beta

INTEGRATION

Meta launched AI Connectors enabling Claude to manage Facebook/Instagram ad campaigns via natural language commands using Model Context Protocol (MCP).

🔌 OAuth Integration: Secure connection to Meta ad accounts without API keys
🛠️ 29+ Tools: Campaign creation, performance reports, catalog management, diagnostics
💬 Natural Language: "Analyze this week's performance" or "Create campaign targeting X"

Full Details → April 29, 2026

🔥 This Week (May 18-19, 2026)

Meta Ads AI Connector — Open Beta

INTEGRATION

Meta launched AI Connectors enabling Claude to manage Facebook/Instagram ad campaigns via natural language commands using Model Context Protocol (MCP).

🔌 OAuth Integration: Secure connection to Meta ad accounts without API keys
🛠️ 29+ Tools: Campaign creation, performance reports, catalog management, diagnostics
💬 Natural Language: "Analyze this week's performance" or "Create campaign targeting X"

Full Details →

crewAI v1.14.5 — BREAKING CHANGES

BREAKING

Major breaking changes in crewAI v1.14.5 released May 18, 2026. If your existing code stopped working, this is why.

⚠️ CrewAgentExecutor DEPRECATED: Now uses AgentExecutor by default
⚠️ function_calling_llm field removed: Delete from crew configurations
⚠️ Status endpoint changed: Now /status/{kickoff_id} instead of /{kickoff_id}/status