×
LLMs

UModel Data Governance: Practice of Building an O&M World Model

This article introduces UModel, Alibaba Cloud's ontology that transforms observability into a unified model-driven digital twin of IT systems.

Joe Tsai on the Future of Open-Source AI: Why Full-Stack Companies Will Excel

Alibaba Chairman shares his perspective at the World Government Summit 2026 on why full stack companies maintains an advantage as open-source AI providers.

Alibaba Brings Cloud-Based AI Innovation to the Olympic Winter Games Milano Cortina 2026

Alibaba Cloud is partnering with OBS and IOC to deploy advanced cloud and AI technologies for the Olympic and Paralympic Winter Games Milano Cortina 2026.

Hybrid Model Support | SGLang's Support Scheme for Hybrid Architecture Models like Mamba-Transformer

This article introduces a dual memory-pool inference framework enabling efficient hybrid Transformer-Mamba model execution by resolving conflicting caching mechanisms.

Alibaba Cloud Tair KVCache Implementation Based on 3FS Enterprise-Grade Deployment, High-Availability Operations & Performance Optimization

This article introduces engineering optimizations to 3FS—KVCache's foundation layer—across performance, productization, and cloud-native management for scalable AI inference.

Dify Officially Launched the Nacos A2A Plugin, Completing Its Bidirectional Multi-agent Collaboration Capabilities

This article introduces Dify's Nacos A2A plugins, enabling bidirectional agent collaboration—discovering external A2A agents and exposing Dify apps as discoverable agents via Nacos Registry.

Rebuild Search Pipelines: An Analysis of PolarDB IMCI Capabilities

The article introduces PolarDB IMCI’s native columnar full-text indexing for efficient, integrated text and hybrid vector search—eliminating the need for external search engines.

Momentum: How Alibaba Cloud Is Leading the New AI Paradigm

At Alibaba Cloud, we're not just delivering technology. We're co-creating a new chapter of AI with the world.

Alibaba Cloud Accelerates Global AI Partner Ecosystem with New Incentives and Investments

New programs for channel, ISV, and service partners to accelerate AI adoption, service transformation, and SMB growth

Memahami Kebutuhan GPU Memory untuk LLM: Panduan Lengkap

Apakah kamu berencana melakukan deployment Large Language Model (LLM) tapi nggak tahu berapa GPU memory yang dibutuhkan? atau model AI yang kamu gunak...

Quest 1.0: Self-learning Coding Agent

The release of Quest 1.0—an autonomous agent capable of self-learning and rapid evolution was unveiled last week.

Is Your AI Agent Getting Dumber? Alibaba Cloud AnalyticDB Unveils AI Context Engineering

This article introduces AI Context Engineering, a framework on Alibaba Cloud's AnalyticDB that prevents AI agents from "getting dumber" by intelligently managing their context and memory.

From ReAct to Ralph Loop A Continuous Iteration Paradigm for AI Agents

The article introduces the Ralph Loop—a continuous, self-iterating paradigm that keeps AI programming agents working until tasks are verifiably complete.

Container Technology Evolution for LLMs and AI Agents

The article outlines how container technology is advancing to support LLMs and AI agents across data processing, training, inference, and deployment.

The AI Gateway Has Become a Symbol of AI Evolution This Year

This article introduces the evolution and impact of the AI Gateway, particularly Higress, as a crucial infrastructure for AI development and implementation in 2025.

Decoding 2025: 10 Key Insights from the Alibaba Cloud Blog Community

What are developers actually building in 2025? In this article, we summarized ten key insights derived from the top search queries on the Alibaba Cloud Community Blog in 2025.

A Practical Guide to SLS Data Masking: Securing Sensitive Data in LLM Applications

This article demonstrates using SLS data masking to protect sensitive data in an e-commerce copilot demo—without altering business logic.

Qwen-Image-2512: Finer Details, Greater Realism

We are excited to introduce Qwen-Image-2512, the December update of Qwen-Image’s text-to-image foundational model.

Alibaba Cloud Tair Partners with SGLang to Build HiCache: Constructing a New Cache Paradigm for "Agentic Inference"

This article introduces HiCache, a hierarchical KVCache infrastructure developed by Alibaba Cloud Tair and SGLang to optimize performance and memory capacity for long-context "agentic" LLM inference.

Qwen-Image-Edit-2511: Improve Consistency

This article introduces Qwen-Image-Edit-2511, an upgraded AI image-editing model with better consistency, geometric reasoning, built-in LoRA support, and industrial design enhancements.