2026 · 05 · 02 · Sat

Updates for 5/2

Major AI tools added agents that do multi-step work on your computer and across apps. We also track new security offerings, pricing tiers, and government trust signals.

A · Theme of the day

AI assistants that can do tasks, not just answer

Several products moved from chat to hands-on help with multi-step work.

Perplexity adds “Computer” agent on Max tier

PerplexityPerplexity
Compared to before

Perplexity was mainly a search-and-answer tool with sources. Now its “Computer” assistant carries out tasks for you, offered on the Max plan.

What changed

Perplexity Computer autonomous agent (19-model orchestration, Max tier)

Why it matters

Teams can shorten the gap between research and execution. Budgets may shift since the most useful workflow features sit behind a higher tier.

Perplexity “Computer” expands to 400+ app connections

PerplexityPerplexity
Compared to before

Perplexity's strengths centered on fast, cited answers. “Computer” now coordinates many AI engines and connects to hundreds of apps on the Max tier.

What changed

Perplexity Computer: autonomous agent orchestrating 19 models with 400+ app integrations (Max tier)

Why it matters

App connectivity often separates a demo from daily use. Test real tasks end-to-end and weigh the lock-in risk of tying workflows to one platform.

Claude Cowork rolls out broadly; adds Zoom connection

ClaudeClaude
Compared to before

Cowork, Claude's PC task automation, is now available to paid users. A new Zoom connection pulls meeting information into Cowork.

What changed

Claude Cowork (PC-automation AI agent) now generally available for paid plan users; Zoom connector ingests meeting data into Cowork

Why it matters

Meeting notes can turn into follow-ups and action lists faster. It signals a push toward integrated work assistants rather than standalone chat.

Mistral’s Le Chat adds Agent Mode for multi-step tasks

MistralMistral
Compared to before

Le Chat was primarily a conversational assistant. Agent Mode shifts it from “tell me” to running multi-step tasks more autonomously.

What changed

Le Chat gains Agent Mode for autonomous multi-step task execution

Why it matters

Evaluating Mistral now means workflow automation, not only model cost. Pilots should track time savings, error rates, and permitted actions.

Grok adds “Computer” for desktop task automation

GrokGrok
Compared to before

Grok was best known for fast answers and X integration. “Grok Computer” now operates a desktop, planning and running steps in parallel.

What changed

Grok Computer: autonomous desktop automation agent with parallel planning and execution

Why it matters

Desktop automation can cut routine internal work. Check reliability, audit trails, and who can run actions before using it for critical tasks.

Grok 4.3 adds “Imagine” mode for creative tasks

GrokGrok
Compared to before

Creative work in Grok used to be prompt-driven and manual. “Imagine” adds a more autonomous mode for creative and image tasks.

What changed

Grok 4.3 "Imagine" agent mode: autonomous agent for creative and image-generation tasks

Why it matters

Marketing and design teams can get faster drafts and more variations per brief. Brand consistency and rights still need human review.

B · Theme of the day

Workflows inside the tools people already use

More AI features land directly in Word, team chat, and developer tooling.

Microsoft adds AI Legal Agent inside Word

Microsoft CopilotMicrosoft Copilot
Compared to before

Copilot already worked across Office apps. Word now includes a Legal Agent that flags clauses, checks for issues, and suggests redlines.

What changed

AI Legal Agent integrated into Microsoft Word for contract review, clause checking, and redline suggestions

Why it matters

Contract cycles can speed up as first-pass review gets faster. Define what needs human sign-off before relying on suggestions.

Poe adds large group chat across 200+ AI models

PoePoe
Compared to before

Poe was already a hub for trying many AI models. It now supports group chats for up to 200 participants, moving it toward a shared workspace.

What changed

Group chat for up to 200 users collaborating across 200+ AI models

Why it matters

Teams can evaluate models together with shared context. Plan access controls and guidelines for what gets shared in-room.

Uber reports broad Claude Code adoption in engineering

Claude CodeClaude Code
Compared to before

Claude Code is a terminal-based coding assistant. Uber's numbers shift it from “possible” to “proven at scale” in a large org.

What changed

Uber: ~95% of engineers use Claude Code monthly, ~70% of commits AI-generated (as of April 2026)

Why it matters

Strong adoption suggests measurable productivity gains. Review, testing, and standards need to keep pace—invest in training and guardrails, not just licenses.

Claude Code guide emphasizes repo-wide, end-to-end help

Getting Started with Claude Code: An AI Coding Assistant from Your Terminal
Compared to before

Coding assistants were often framed as autocomplete. The updated guide highlights repo-wide understanding, multi-file changes, tests, and diff review.

What changed

Why it matters

Teams can plan for refactors, debugging, and test work. Evaluation shifts toward reliability with existing codebases, paired with verification habits.

C · Theme of the day

Enterprise trust, security, and policy signals

Updates highlight vendor approaches to security, sensitive customers, and data.

Anthropic launches Claude Security for enterprises

Claude (Anthropic)Claude (Anthropic)
Compared to before

Claude previously focused on general assistant and developer features. A dedicated security offering now gives the lineup a clearer “security buyer” story.

What changed

Claude Security launched: enterprise AI security tooling built on Mythos model capabilities, helping defenders gain AI advantage

Why it matters

Security teams may gain faster investigation and response workflows. Assess how it fits with existing security processes and reporting.

OpenAI chosen for a U.S. classified AI program

GPT (OpenAI)GPT (OpenAI)
Compared to before

OpenAI already had broad model and API availability. This adds a major government selection signal for high-stakes use.

What changed

Selected for U.S. DoD classified AI program alongside Google, Nvidia, and xAI

Why it matters

It may de-risk vendor selection for sensitive workloads. Treat it as one data point—not a substitute for due diligence.

Anthropic not selected for U.S. classified AI program

Claude (Anthropic)Claude (Anthropic)
Compared to before

Claude is widely used for reasoning and coding. It was not selected for a U.S. classified AI program, with supply-chain risk cited.

What changed

Excluded from U.S. DoD classified AI program (cited as supply-chain risk; OpenAI, Google, Nvidia, xAI were selected)

Why it matters

Regulated sectors may ask tougher sourcing questions. Performance alone isn't deciding; map requirements to vendor capabilities.

ChatGPT free tier: marketing tracking on by default in some regions

ChatGPTChatGPT
Compared to before

ChatGPT's free tier has been moving toward ads in some countries. Marketing tracking is now on by default for free users there; paid plans are exempt.

What changed

Marketing cookies enabled by default for free users in ad-serving countries (paid plans exempt; opt-out available in settings)

Why it matters

Be cautious about employees using free accounts for work topics. Defaults matter because most users never change them.

D · Theme of the day

Voice and healthcare performance milestones

Rapid progress in natural-sounding speech and medical benchmarks.

Gemini adds more expressive, low-delay speech preview

GeminiGemini
Compared to before

Gemini already covered a wide range of text tasks. A new preview adds expressive speech with emotion, pace, and style control, streaming with low delay.

What changed

Gemini 3.1 Flash TTS Preview: expressive speech synthesis with dynamic emotion/tempo/style control via Audio tags, low-latency streaming

Why it matters

Better voice quality helps call assistants, training, and accessibility. Low delay matters for live conversations, not just reading text aloud.

DeepMind reports strong results for AI medical co-clinician

Gemini (Google)Gemini (Google)
Compared to before

Medical AI claims are hard to compare across vendors. In a blinded physician simulation, DeepMind's system outperformed a leading alternative.

What changed

DeepMind AI Co-Clinician outperforms GPT-5.4 in blinded physician simulation test for medical diagnosis

Why it matters

It may accelerate pilots in triage support and documentation. Real-world deployment still requires validation, oversight, and accountability.

E · Theme of the day

Pricing clarity and reliability fixes

A few updates cover the basics: predictable pricing and reliability.

Grok introduces SuperGrok Lite at $10/month

GrokGrok
Compared to before

Grok's paid options ranged from bundled access to higher-priced tiers. A lower-cost SuperGrok Lite plan now sits between free and premium.

What changed

SuperGrok Lite: $10/mo (from 2026/3/25)

Why it matters

Lower entry pricing can expand pilots and small-team adoption. Compare what each tier includes before standardizing.

Mistral Medium 3.5 long-document issue fixed

MistralMistral
Compared to before

Early users saw degraded long-document performance in one distribution format. A third-party fix by Unsloth has restored it.

What changed

Mistral Medium 3.5 GGUF initial bug (YaRN parsing) fixed by Unsloth — long-context performance restored

Why it matters

Long documents are common in business work. Reliability fixes like this often matter more than new features when rolling into production.

Archive

Past updates

A daily archive of changes actually applied to the site.