This article introduces ACK GIE's precision-mode prefix cache-aware routing that maximizes KV-Cache hit rates for distributed LLM inference.
This article details the storage format and HNSW algorithm implementation behind AliSQL’s native vector indexing capability for high-dimensional AI workloads.
Alibaba Group launched "Alibaba Wonder on Ice" (AWI) at the Milano Cortina 2026, using AI and cloud computing to demonstrate next-gen virtual retail experiences.
This article introduces a dual memory-pool inference framework enabling efficient hybrid Transformer-Mamba model execution by resolving conflicting caching mechanisms.
This article introduces engineering optimizations to 3FS—KVCache's foundation layer—across performance, productization, and cloud-native management for scalable AI inference.
The Intelligent Pin Trading Station blends one of the Games’ best-loved traditions with voice- and gesture-enabled interaction.
This article introduces HiCache, a hierarchical KVCache infrastructure developed by Alibaba Cloud Tair and SGLang to optimize performance and memory capacity for long-context "agentic" LLM inference.
Ekosistem Qwen saat ini berkembang sangat pesat, mulai dari Large Language Model (LLM) hingga model multimodal yang bisa memahami teks, gambar, video,...
Quark AI Glasses joins 11.11 shopping festival to kick off online pre-sale in China starting October 24
Global cloud leader supports customers and partners including Wio Bank, ACCUMED, Byond Asia, The Game Company, and Atos to achieve business growth in .
Alibaba Cloud has signed a MoU with Wio Bank, the Middle East’s leading digital financial platform, to accelerate innovation across cloud computing, A...
Alibaba Cloud has further strengthened its longstanding collaboration with CapitaLand Group (CapitaLand), one of Asia’s largest diversified real estate groups.
The article explores Alibaba Cloud's Qwen LLM and Dify platform, showcasing their roles in developing intelligent AI systems for business automation.
This article introduces how Moonshot AI uses Alibaba Cloud's solutions to enhance data preprocessing for its large model, Kimi, focusing on stability, resource elasticity, and efficient management.
The BMW Group and Alibaba Group announced an expanded strategic partnership in China, accelerating the integration of Alibaba’s Qwen large language mo.
Alibaba Cloud has launched Qwen2.5-Omni-7B, a unified end-to-end multimodal model in the Qwen series.
This article showcases Alibaba Cloud's innovative AI models that boost efficiency and integration across modalities, setting new standards in industri...
This article discusses the challenges and strategies involved in managing resource consumption in large model applications.
This article introduces Higress.ai, highlighting its official launch and the seamless integration of new AI capabilities.
Alibaba Cloud continues to solidify its standing as a global leader in cloud computing and artificial intelligence (AI).