AI Briefing, April 8 · Issue #187
Editorial standards and source policy: Editorial standards, Team. Content links to primary sources; see Methodology.
## 🔍 Key Insights
Domestic large models have achieved a critical breakthrough: **Zhipu’s GLM-5.1** has not only *surpassed Claude Opus 4.6* for the first time on authoritative programming benchmarks like **SWE-bench Pro**, but also demonstrated **8-hour long-horizon task capability** and efficient local deployment—setting a new open-source benchmark [0][4][13]. Meanwhile, AI interaction paradigms are rapidly evolving: **Skill-first Agent architectures** are systematically challenging traditional app-centric entry-point logic [18].
## 🚀 Highlights
- **GLM-5.1 Launch**: *First open-source model to beat Claude Opus 4.6*, unlocking **8-hour long-horizon reasoning** [0] — Zhipu’s next-gen open LLM delivers quantum leaps in complex software engineering and extended-context reasoning.
- **GLM-5.1 Integrated into Poe**, Accelerating Agent Engineering [4]: Optimized for autonomous coding and multi-step task orchestration—significantly boosting AI Agent reliability in real-world engineering.
- **VoxCPM 2 Released**: A domestic open-source 2B speech model that *recreates Guo Degang’s most demanding *guankou* (rapid-fire monologue)* [1]: Supports 48kHz high-fidelity audio, 30 languages, and 9 dialects—cross-lingual voice generation now meets commercial-grade standards.
- **Clicky Launches**: An AI tutor *permanently anchored beside your cursor* [3]: Recognizes on-screen UI elements, supports voice interaction, and precisely highlights interface components—pioneering a “see-and-learn” paradigm for real-time software education.
- **CodexBar 0.20 Released**: Adds support for **Perplexity** and **OpenCode Go** [16]: Streamlines multi-provider account switching and fixes abnormal Claude token-cost tracking.
- **Skill vs. App**: The battle for *entry-point paradigms in the AI Agent era intensifies* [18]: A QuantumBit forum deep-dive explores how Skill invocation is reshaping product design, interaction flows, and business models.
- **Personalized Podcast Tool Goes Live**: Fully open-source AI podcasting with *AI hosts + RSS subscription* [6]: Zara Zhang launches a dual-host AI podcast generator—turn any text or transcript into a subscribable audio feed, instantly.
- **DeepSeek Web Interface Adds “Expert Mode” & Visual Model Beta Testing** [23]: Dual-mode operation + multimodal capability rollout fuels strong community anticipation of an imminent V4 model release.
## 🔗 Sources
[0] Zhipu GLM-5.1 Launch: First Open-Source Model to Surpass Claude Opus 4.6, Unlocking 8-Hour Long-Horizon Task Capability — https://www.bestblogs.dev/article/773d97b6
[1] Domestic Free 2B Open-Source Speech Model Masters “The Reckless Man” — Recreates Guo Degang’s Most Difficult *Guankou* — https://www.bestblogs.dev/article/7bc4f2cd
[3] Introducing Clicky: An AI Tutor That Lives Beside Your Cursor — https://www.bestblogs.dev/status/2041753208671072708
[4] GLM-5.1 Now Available on Poe, Empowering Agent Engineering — https://www.bestblogs.dev/status/2041739682770514268
[6] Personalized Podcasting: Turn Any Content Into an RSS-Subscribable Feed With AI Hosts — https://www.bestblogs.dev/status/20417368699