## 🔍 Key Insights

AI engineering is rapidly entering a new era of '**AI building AI**': Baichuan Intelligence has launched **ForgeTrain**, the world's first production-grade pretraining framework fully written by AI, and successfully trained **MiniCPM5-1B**; meanwhile, foundational innovations, including **Domain-Specific Architectures (DSAs)**, **KV cache quantization** (e.g., OSCAR's 2-bit scheme), and the **Tao Law**, are being deployed in rapid succession, continuously breaking through compute and energy-efficiency bottlenecks [24][10][22].

## 🚀 Major Updates

- **World's First 'AI Building AI' Realized: Baichuan Launches ForgeTrain Framework & MiniCPM5-1B Model** [24]: A production-grade pretraining framework entirely authored by AI, used to train a 1B-parameter model outperforming NVIDIA's Megatron in key benchmarks.
- **OSCAR Introduces a 2-bit KV Cache Quantization Scheme for Real-World Serving** [10]: Achieves near-BF16 accuracy at ~2.28 effective bits via attention-aware rotation and alignment; already integrated into SGLang.
- **Huawei Proposes 'Tao Law': A New Paradigm for Semiconductor Evolution** [22]: Replaces transistor density with time constant τ as the core metric, redefining chip evolution through 3D stacking techniques such as logic folding.
- **Kuaishou's Keye-VL-2.0 Introduces DSA-Based Sparse Attention to Multimodal Models for the First Time** [15]: Enables lossless inference over ultra-long contexts up to 256K tokens, unlocking agent collaboration capabilities and enhancing long-video temporal understanding and tool invocation.
- **Tencent's Wukong Security Team Releases VulnGym Benchmark** [20]: Built on 3,632 real-world vulnerabilities, it addresses the rising prevalence of business-logic flaws in the AI era, offering project-level, white-box, and path-level annotated evaluation capabilities.
- **DeepSeek Unveils Its Trillion-Dollar Technical Roadmap** [9]: Centered on architectural innovations, including MoE, MLA, DSA, KV cache compression, and Engram, to systematically reduce large models' dependence on HBM and GPU memory.
- **Codex Data Reveals Viral Content Patterns on X Platform** [2]: Content categories with the highest retweet rates include tool-discovery posts and product teardowns; resource-link posts achieve a viral rate of 51%.
- **Agentic ERP Market Settles into a Three-Way Competition** [23]: The landscape is now defined by three major camps (ERP-native incumbents, agent automation platforms, and AI-native ERP vendors) vying for technological integration and strategic positioning across use cases.

## 🔗 Sources

[1] The Promise of AI Is Worthless: Who Should Pay the Price? https://www.bestblogs.dev/article/fa37a78f?utm_source=rss&utm_medium=feed&utm_campaign=resources&entry=rss_article_item

[2] Codex Analyzes Three Years of X Data: Viral Content Formulas & Optimal Posting Windows. https://www.bestblogs.dev/status/2059255732911186123?utm_source=rss&utm_medium=feed&utm_campaign=resources&entry=rss_article_item

[3] X Data Analysis Continued: Tool-Discovery Content Shows Highest Virality Rate. https://www.bestblogs.dev/status/2059260610345660694?utm_source=rss&utm_medium=feed&utm_campaign=resources&entry=rss_article_item

[4] 1,112 Images! From 'Words Fail' to 'Words Manifest': An AI Portrait Generation Handbook