Recently, Qwen3.5-Max-Preview, the preview of our next-generation flagship model, has made its debut on LM Arena.
This article explores how DAS is democratizing expert-level database management for every enterprise.
This article explains how generative AI is expanding the cybersecurity attack surface and outlines AI-driven strategies to defend AI systems and enterprise workflows.
Alibaba Group reported strong progress in AI for the December quarter, with accelerating revenue growth in the Cloud Intelligence Group and significan...
This article explains how Kimi leverages Alibaba Cloud's ACK and ACS to build a secure, instantly elastic infrastructure capable of supporting hundreds of thousands of concurrent AI Agent sandboxes.
This article shows how SGLang RBG + Mooncake enable production-grade, cloud-native LLM inference with PD-disaggregation.
This article offers a framework for choosing between self-hosted GPUs and MaaS for LLM inference by weighing cost, data, engineering, and scalability tradeoffs.
This article introduces SysOM MCP, an open-source O&M assistant that enables AI Agents to perform automated system diagnostics via natural language using MCP.
Alibaba Group has supported the Olympic and Paralympic Winter Games Milano Cortina 2026 (Milano Cortina 2026) in becoming the most intelligent Games in Olympic history.
This article introduces ACK GIE's precision-mode prefix cache-aware routing that maximizes KV-Cache hit rates for distributed LLM inference.
This article introduces ACK One Fleet's multi-cluster canary release solution, integrated with Kruise Rollout, for safe AI inference deployments across hybrid and geo-distributed clouds.
This article explains how combining LLM Agents with deterministic Workflows like Argo enables controllable, production-ready AI systems.
We are delighted to announce the official release of Qwen3.5, introducing the open-weight of the first model in the Qwen3.
Qwen App, Alibaba’s consumer-facing AI application, has spurred a behavioral shift toward AI-powered shopping during its Chinese New Year (CNY) campaign.
This article introduces UModel, Alibaba Cloud's ontology that transforms observability into a unified model-driven digital twin of IT systems.
Alibaba Chairman shares his perspective at the World Government Summit 2026 on why full-stack companies maintain an advantage as open-source AI providers.
Alibaba Cloud is partnering with OBS and IOC to deploy advanced cloud and AI technologies for the Olympic and Paralympic Winter Games Milano Cortina 2026.
This article introduces a dual memory-pool inference framework enabling efficient hybrid Transformer-Mamba model execution by resolving conflicting caching mechanisms.
This article introduces engineering optimizations to 3FS—KVCache's foundation layer—across performance, productization, and cloud-native management for scalable AI inference.
This article introduces Dify's Nacos A2A plugins, enabling bidirectional agent collaboration—discovering external A2A agents and exposing Dify apps as discoverable agents via Nacos Registry.