
LayerDrops

Daily cross-vendor scan — APIs, models, SDKs, pricing, deprecations. Source-linked.

Tracked 12·Vendors 8·Breaking 1

2026-05-08

Fri·3 drops
Cursor·Cursor·2.2·Feature

Cursor 2.2 ships Debug Mode, browser visual editor, and multi-agent judging

Cursor 2.2 adds Debug Mode (instruments your app with runtime logs to find root causes), a browser sidebar visual editor, Plan Mode with inline Mermaid diagrams, automatic judging across parallel agent runs, and pinned chats in the agent sidebar.

Meta·Muse Spark·Model

Muse Spark ships with parallel Contemplating mode and 58% on Humanity's Last Exam

Meta's Superintelligence Labs released Muse Spark, a multimodal reasoning model whose Contemplating mode runs parallel reasoning agents. Listed scores are 58% on Humanity's Last Exam and 38% on FrontierScience Research. It is live at meta.ai, with a private API preview opening.

OpenAI·OpenAI API·Deprecation·BREAKING

OpenAI deprecates the legacy Chat Completions streaming format

OpenAI set August 1, 2026 as the end-of-life date for the legacy Chat Completions streaming format; migrate to the Responses API streaming events (event-typed SSE).
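For readers unfamiliar with event-typed SSE, here is a minimal sketch of parsing a Server-Sent Events stream into (event type, data) pairs using only the generic SSE wire format. The helper name `parse_sse` and the `example.*` event names are placeholders invented for illustration; they are not OpenAI's event names, and the real payloads should be taken from the vendor documentation.

```python
from typing import Iterable, Iterator, List, Tuple


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event_type, data) pairs from Server-Sent Events text lines."""
    event_type = "message"  # SSE default when no "event:" field is sent
    data_parts: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event; dispatch it if it carried data.
            if data_parts:
                yield event_type, "\n".join(data_parts)
            event_type, data_parts = "message", []
        elif line.startswith(":"):
            continue  # comment or keep-alive line
        else:
            field, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]  # one leading space after the colon is dropped
            if field == "event":
                event_type = value
            elif field == "data":
                data_parts.append(value)


# Placeholder stream: these event names are illustrative, not a vendor's schema.
sample = [
    "event: example.delta",
    'data: {"text": "Hel"}',
    "",
    "event: example.delta",
    'data: {"text": "lo"}',
    "",
    "event: example.done",
    "data: {}",
    "",
]
events = list(parse_sse(sample))
```

A client would then branch on the event type rather than inspecting each data chunk to work out what kind of message it is.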

2026-05-07

Thu·4 drops
Google·Gemma 4·Tooling

Gemma 4 multi-token prediction drafters speed up inference via speculative decoding

Google shipped multi-token prediction drafters for Gemma 4, which use speculative decoding with a shared KV cache; Google reports roughly 2x tokens/sec for the 26B model on an RTX PRO 6000 and ~2.2x for edge models on Apple Silicon.
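As background on speculative decoding in general, the toy sketch below shows the greedy draft-and-verify loop: a cheap draft model proposes a few tokens, and the target model keeps the longest prefix it agrees with. This is a generic illustration under simplifying assumptions, not Gemma 4's drafter design; it has no shared KV cache, and the `target` and `draft` functions are made-up stand-ins for real models.

```python
from typing import Callable, List

Model = Callable[[List[int]], int]  # maps a token context to the next token


def speculative_decode(target: Model, draft: Model, prompt: List[int],
                       max_new: int, k: int = 4) -> List[int]:
    """Greedy speculative decoding; output matches plain greedy decoding with target."""
    out = list(prompt)
    goal = len(prompt) + max_new
    while len(out) < goal:
        # 1. The draft model proposes up to k tokens autoregressively.
        proposal: List[int] = []
        for _ in range(min(k, goal - len(out))):
            proposal.append(draft(out + proposal))
        # 2. The target checks each position (real systems do this in one batched pass).
        accepted: List[int] = []
        for tok in proposal:
            expected = target(out + accepted)
            if tok == expected:
                accepted.append(tok)
            else:
                accepted.append(expected)  # replace the first mismatch, drop the rest
                break
        out.extend(accepted)
    return out


# Hypothetical toy models: the target counts upward mod 10, the draft errs after 4.
def target(ctx: List[int]) -> int:
    return (ctx[-1] + 1) % 10


def draft(ctx: List[int]) -> int:
    return 0 if ctx[-1] == 4 else (ctx[-1] + 1) % 10


result = speculative_decode(target, draft, [0], max_new=8)
```

The speedup comes from the accepted runs: whenever the draft is right, several tokens are committed for one verification step, while the output stays identical to what the target alone would produce.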

OpenAI·OpenAI Voice API·Model

OpenAI ships three voice API models: GPT-Realtime-2, Realtime-Translate, Realtime-Whisper

OpenAI launched three voice API models: GPT-Realtime-2 for voice reasoning, GPT-Realtime-Translate (70+ input languages, 13 output), and GPT-Realtime-Whisper for live speech-to-text.

Google·Gemini API·Model

Gemini 3.1 Pro and Gemini 3 Flash listed as preview models

Google added Gemini 3.1 Pro and Gemini 3 Flash to the Gemini API as preview models, alongside Gemini 3.1 Flash-Lite, Veo 3.1, and Gemini Robotics — no GA dates, no published pricing.

xAI·Grok Web·Integration

Grok Web ships eight workspace connectors plus a Bring-Your-Own MCP slot

xAI shipped Connectors on Grok Web — read/write for SharePoint, Outlook, OneDrive, Google Workspace, Notion, GitHub, and Linear, plus a Bring-Your-Own MCP slot; mobile coming soon.

2026-05-06

Wed·2 drops
Anthropic·Claude Code & Claude API·Pricing

Claude paid-plan rate limits doubled, Opus API headroom raised

Anthropic doubled Claude Code rate limits on Pro, Max, Team, and Enterprise plans, removed peak-hour reductions, and raised Opus API limits, backed by a new 300 MW / 220,000-GPU Colossus 1 deal with SpaceXAI.

Google·Gemini API·API

Gemini API adds webhooks, ending polling for long-running jobs

Google added event-driven webhook support to the Gemini API for long-running and batch jobs, replacing the older poll-for-status flow.
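To make the polling-to-webhook shift concrete, here is a minimal sketch of the receiving side: instead of looping on a status endpoint, the client registers a callback per job and runs it when a notification arrives. Everything named here is hypothetical, including `on_job_done`, `handle_webhook`, and the `job_id` and `state` fields; the actual Gemini API payload and any signature verification should come from Google's documentation.

```python
import json
from typing import Callable, Dict

# Hypothetical registry: job id -> callback to run when the job finishes.
callbacks: Dict[str, Callable[[dict], None]] = {}


def on_job_done(job_id: str, callback: Callable[[dict], None]) -> None:
    """Register a callback instead of polling for the job's status."""
    callbacks[job_id] = callback


def handle_webhook(body: bytes) -> int:
    """Process one webhook POST body and return an HTTP status code."""
    try:
        event = json.loads(body)
        job_id = event["job_id"]  # hypothetical field name
        state = event["state"]    # hypothetical field name
    except (ValueError, KeyError, TypeError):
        return 400  # malformed or unexpected payload
    if state != "succeeded":
        return 200  # acknowledge, but there is nothing to run yet
    callback = callbacks.pop(job_id, None)
    if callback is not None:
        callback(event)
    return 200


# Usage: collect finished jobs in a list instead of polling for them.
results: list = []
on_job_done("job-1", results.append)
status = handle_webhook(b'{"job_id": "job-1", "state": "succeeded"}')
```

In a real service `handle_webhook` would sit behind an HTTP endpoint and authenticate the sender before trusting the payload.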

2026-05-05

Tue·1 drop
Anthropic·Claude Cowork & Microsoft 365 add-ins·Feature

Anthropic ships ten Claude finance agents and Excel, Word, PowerPoint add-ins

Anthropic shipped ten Claude finance agent templates as Cowork and Managed Agent plugins, plus Claude add-ins for Excel, PowerPoint, and Word; Outlook is coming soon.

2026-04-29

Wed·1 drop
Mistral·Mistral Medium 3.5 & Vibe·3.5·Model

Mistral Medium 3.5 ships open-weight 128B with 256k context and remote Vibe agents

Mistral released Medium 3.5, an open-weight 128B dense model with a 256k context; Mistral reports 77.6% on SWE-Bench Verified. Vibe coding agents can now run remotely from the CLI or Le Chat.

2026-04-24

Fri·1 drop
DeepSeek·DeepSeek V4·V4·Model

DeepSeek V4 ships open-weight with million-token context and hybrid attention

DeepSeek released V4, an open-weight model with a million-token context, hybrid CSA+HCA attention, interleaved tool-call thinking, and a new RL sandbox called DSec — live on web, app, and API.