Author: RadarAI Editorial
Editor: RadarAI Editorial
Last updated: 2026-05-21
Review status: Editorial review pending
Brief
Official
AI Updates
Open Source
Claude Code officially integrates the 'Computer Use' capability, enabling native macOS GUI interaction [1][3]; Qwen3.5-Omni demonstrates real-time multimodal capabilities across use cases including audio-visual programming, voice-based emotional control, and trip planning; NVIDIA and LangChain announce a deep partnership, with Jensen Huang set to attend the Interrupt Conference to discuss enterprise-grade AI Agent strategy [4].
Editorial standards and source policy: Editorial standards, Team. Content links to primary sources; see Methodology.
## 🔍 Key Insights
**Claude Code** has officially integrated the 'Computer Use' capability, supporting native macOS GUI interaction [1][3]; **Qwen3.5-Omni** showcases real-time multimodal capabilities, spanning audio-visual programming, voice-driven emotional control, and travel planning; **NVIDIA and LangChain** have announced a deep strategic partnership, with Jensen Huang set to attend the Interrupt Conference to discuss enterprise AI Agent strategy [4].
## 🚀 Major Updates
- **Claude Code officially supports Computer Use** [3]: Anthropic has integrated this feature into its CLI, enabling direct interaction with applications and UI elements.
- **Claude Code's Computer Use lands in macOS Research Preview** [1]: Users can activate it via the `/mcp` command. It supports SwiftUI apps, Electron apps, and GUI tools that have no CLI.
- **Qwen3.5-Omni releases a dense series of demos** [19][20][22][23][24]: They cover real-time control of voice emotion, speed, and volume; intelligent interruption in multi-turn dialogue; trip planning; audio-visual 'vibe coding'; and timestamped audio-visual caption generation.
- **LangChain upgrades LangSmith Experiments detail view** [9]: Redesigned visualization and workflow comparison tools significantly improve AI Agent debugging efficiency.
- **Gemini Live now powered by the new Gemini 3.1 Flash Live model** [10]: Google announces the underlying model upgrade, enhancing real-time responsiveness.
- **LlamaIndex launches litesearch: a local-first RAG retrieval tool** [11]: A high-performance CLI/TUI document retrieval pipeline built on LiteParse and Bun.
- **Genspark CLI introduces AI-powered phone calling — 'Call for Me'** [12]: Enables automated dialing to complete real-world tasks such as reservations.
- **OpenRouter adds service tiers and ELO ranking integration** [14][13]: Introduces the `/fast` parameter for accelerated inference and integrates the Design Arena ELO model evaluation framework into its benchmark page.
## 🔗 Sources
[1] How to Enable Claude Code's Computer Use Feature — https://www.bestblogs.dev/status/2038663019983479234
[3] Claude Code Now Supports Computer Use — https://www.bestblogs.dev/status/2038663014098899416
[4] Jensen Huang to Attend LangChain's Interrupt Conference — https://www.bestblogs.dev/status/2038660999495217428
[9] LangSmith Experiments Detail View Fully Upgraded — https://www.bestblogs.dev/status/2038647945487167766
[10] Gemini Live Now Powered by Gemini 3.1 Flash Live — https://www.bestblogs.dev/status/2038647896300826853
[11] LlamaIndex Showcases litesearch: A Local-First RAG Retrieval Pipeline — https://www.bestblogs.dev/status/2038647600623288398
[12] Genspark CLI Launches AI-Powered Phone Calling — https://www.bestblogs.dev/status/203