×
Large language models

Rebuild Search Pipelines: An Analysis of PolarDB IMCI Capabilities

The article introduces PolarDB IMCI’s native columnar full-text indexing for efficient, integrated text and hybrid vector search—eliminating the need for external search engines.

Alibaba Cloud Accelerates Global AI Partner Ecosystem with New Incentives and Investments

New programs for channel, ISV, and service partners to accelerate AI adoption, service transformation, and SMB growth

Memahami Kebutuhan GPU Memory untuk LLM: Panduan Lengkap

Apakah kamu berencana melakukan deployment Large Language Model (LLM) tapi nggak tahu berapa GPU memory yang dibutuhkan? atau model AI yang kamu gunak...

MoreFun Group's Observability Upgrade: Achieve 80% Faster Fault Detection and a 40% Reduction in O&M Costs

The article introduces how MoreFun Group upgraded to a full-stack observability system on Alibaba Cloud using ARMS, SLS, Prometheus, and an AI-powered MCP Server.

Is Your AI Agent Getting Dumber? Alibaba Cloud AnalyticDB Unveils AI Context Engineering

This article introduces AI Context Engineering, a framework on Alibaba Cloud's AnalyticDB that prevents AI agents from "getting dumber" by intelligently managing their context and memory.

Qwen3-VL-Embedding and Qwen3-VL-Reranker: For the Next Generation of Multimodal Retrieval

The article introduces Qwen3-VL-Embedding and Qwen3-VL-Reranker—open-source, multimodal models for next-generation cross-modal retrieval.

The Next Evolution Toward Intelligent Editing: Qoder NEXT Model and ActionRL Preference Alignment in Practice

The article introduces Qoder NEXT, an intelligent editing model that uses AST-based simulation and ActionRL to deliver multi-step, intent-aware code suggestions beyond simple completion.

SAPO: A Stable and Performant Reinforcement Learning Method for Training Large Language Models

This article introduces SAPO, a new reinforcement learning method that stabilizes and improves policy optimization for training large language models.

Become an Agentic Enterprise Today: Salesforce on Alibaba Cloud at the Apsara Conference

The article highlights Salesforce’s showcase at the Apsara Conference, emphasizing how AI and CRM together enable enterprises to become agentic and intelligent.

Becoming an Agentic Enterprise: Highlights from Dreamforce 2025

This article introduces how Salesforce and Alibaba Cloud showcased AI-powered customer service and the concept of becoming an agentic enterprise at Dreamforce 2025.

Alibaba Qwen Wins “NeurIPS 2025 Best Paper Award” for Breakthrough in Attention Mechanisms

Research findings have already been incorporated into the Qwen3-Next model

From Visibility to Decisiveness: Operation Intelligence Redefines the Intelligent O&M Paradigm for Enterprises

This article introduces Alibaba Cloud's Operation Intelligence, an AI-native O&M paradigm shift from visibility to decisive action.

Milvus Launches on Alibaba Cloud International: Empowering Global Businesses to Accelerate Vector Search

This article introduces Alibaba Cloud's Vector Retrieval Service for Milvus, a fully managed, high-performance vector database that accelerates global...

Alibaba Cloud Boosts GPU Utilization with AI Infrastructure Breakthrough at SOSP 2025

This article introduces Aegaeon, an AI infrastructure breakthrough from Alibaba Cloud accepted at SOSP 2025, which significantly boosts GPU utilization for serving multiple AI models concurrently.

How Alibaba Cloud Calculates and Manages LLM Tokens

This article outlines the essential best practices for calculating and managing tokens on Alibaba Cloud.

RIDE the AI Lift: Alibaba Cloud CIO's Insights into Results as a Service (RaaS)

This article introduces a systematic approach for enterprises to successfully implement large language model applications based on a talk by Alibaba Cloud's CIO.

Alibaba Unveils Intelligent Cockpits, Enterprise Partnerships and AI Glasses at WAIC 2025

At the 2025 World Artificial Intelligence Conference (WAIC), Alibaba Group showcased a series of AI-driven innovation.

ACK Gateway with Inference Extension: A Practice for Optimizing Large Model Inference Service Deployed across Multiple Nodes

This article introduces how to use ACK Gateway with Inference Extension to optimize multi-node large-model inference performance.

GoShield: AI-Powered Passenger Safety Built for the Realities of Everyday Rides

This article introduces GoShield, an AI-driven system that monitors ride-hailing trips in real time to detect harassment and protect passengers.

Full Compatibility with MySQL! How to Build a RAG System Based on PolarDB

The article explains how to build a Retrieval-Augmented Generation (RAG) system on Alibaba Cloud PolarDB, leveraging its MySQL-compatible vector search and built-in AI capabilities.