This article introduces engineering optimizations to 3FS—KVCache's foundation layer—across performance, productization, and cloud-native management for scalable AI inference.
This article introduces the architecture and implementation of Tair KVCache Manager, an open-source enterprise-grade global KVCache management service for scalable Agentic AI inference.
This article introduces Apache RocketMQ's strategic evolution into an AI-native message engine for long-running sessions, intelligent compute scheduling, and agent collaboration.
Qwen3.5-LiveTranslate-Flash is the latest simultaneous interpretation model in the Qwen family, built on top of Qwen3.5-Omni.
Today we introduce Qwen3.7-Max, our latest proprietary model designed for the agent era.
This article introduces building a production-ready RAG pipeline on Alibaba Cloud using Hologres for vector search and Model Studio for embeddings and LLM inference.
Alibaba on Wednesday launched its most aggressive AI push yet, unveiling a new flagship large language model, a homegrown AI chip that triples the performance of its predecessor.
This article introduces challenges in AI Agent scheduled task orchestration and presents Alibaba Cloud's MSE AI Task Scheduling as an enterprise-grade solution.
Qwen3.7-Max, upgraded cloud infrastructure and model services, and new T-Head chips announced at Alibaba Cloud Summit
RocketMQ LiteTopic enables fine-grained, per-scenario traffic governance for AI inference workloads via millisecond-level throttling and consumption suspension.
Alibaba Cloud's OpenTelemetry-based observability plugin brings full visibility to Hermes AI agent execution, enabling traceable costs, performance, and security auditing.
EventHouse, a new capability of Alibaba Cloud EventBridge, was officially launched and is now in public preview.
AgentScope Java 1.1 launches with workspace-driven persistence, pluggable filesystems, auto-context management, and secure sandbox orchestration for scalable enterprise Agents.
Alibaba has unveiled Fun-ASR1.5, a major upgrade to its end-to-end speech recognition model.
Alibaba Cloud unveiled a new AI model subscription service specifically for enterprises and developers.
An Alibaba Cloud MVP demonstrates how the AI idol group SPECTRA utilized Wan 2.7 and HappyHorse to achieve an almost zero-touch, agent-driven pipeline for autonomous music video production.
This article introduces how to build a stable, reliable, and efficient real-time speech message link architecture using the LiteTopic feature of ApsaraMQ for RocketMQ.
Alibaba Group closed fiscal year 2026 with cloud revenue surging and its AI bets delivering across the business.
TIME’s inaugural list for the AI sector highlights Alibaba’s “full-stack AI empire”
Alibaba DAMO Academy, collaborating with Guangdong Provincial People’s Hospital and other institutions, has developed COCA, an advanced AI model that detects colorectal cancer.