×
LLMs

The Assessment Engineering Is Becoming a Key Focus of the Next Round of Agent Evolution

This article discusses the growing importance of assessment engineering in the evolution of AI agents, particularly the use of LLM-as-a-Judge tools and the open-source RM-Gallery.

Alibaba Cloud Boosts GPU Utilization with AI Infrastructure Breakthrough at SOSP 2025

This article introduces Aegaeon, an AI infrastructure breakthrough from Alibaba Cloud accepted at SOSP 2025, which significantly boosts GPU utilization for serving multiple AI models concurrently.

The Changes in the Agent Development Toolchain and the Invariance of the Application Architecture

This article introduces the rapid evolution of Agent development toolchains across four stages, contrasting it with the relatively stable underlying Agent application architecture.

How Alibaba Cloud Calculates and Manages LLM Tokens

This article outlines the essential best practices for calculating and managing tokens on Alibaba Cloud.

Alibaba's "Cloud for Youth" Wins Education Innovation Award for Bridging China’s Rural Digital Divide

"Cloud for Youth" has recently been awarded with the Global Smart Education Innovation Prize 2025.

Alibaba Open-Sources Tongyi DeepResearch LLM, Partners with S&P Global to Deliver AI Intelligence to Chinese Customers

This week’s news roundup highlights Alibaba’s advancements in AI innovation and cloud-driven enterprise solutions.

RIDE the AI Lift: Alibaba Cloud CIO's Insights into Results as a Service (RaaS)

This article introduces a systematic approach for enterprises to successfully implement large language model applications based on a talk by Alibaba Cloud's CIO.

Best Practices for High Availability of LLM Based on AI Gateway

This article introduces Alibaba Cloud AI Gateway's high availability best practices for LLM services, including fallback mechanisms, passive health ch.

LoongCollector Security Log Ingestion Practice: Standardized Log Collection for Enterprise Firewall Scenarios

This article explains how LoongCollector enables standardized, end-to-end collection and parsing of multi-vendor enterprise firewall logs.

Zero-Code Transformation! Observability in Practice with LoongSuite

This article introduces LoongSuite's zero-code transformation capabilities and its practical application in observability for AI-native applications.

AnythingLLM Builds Personal Knowledge Bases with RDS PostgreSQL's PGVector Plug-in

The article explains how AnythingLLM uses Alibaba Cloud ApsaraDB for PostgreSQL with the PGVector extension to create private, vector-based knowledge bases.

How to Deploy FlowiseAI with One Click on Alibaba Cloud?

On Alibaba Cloud, you can quickly deploy FlowiseAI by using a one-click deployment link and start using it with a few simple configuration steps.

Simplified Deployment Tutorial of the Qwen3 LLM on Alibaba Cloud Container Service for Kubernetes

The article explains how to deploy the Qwen3 large language model on Alibaba Cloud ACK and ACS serverless GPU resources.

No Increase in GPU, the First Token Latency Decreases by 50% | New Practices in LLM Service Load Balancing

This article introduces innovative load balancing strategies for LLM services that reduce first-token latency by 50% without requiring additional GPU resources.

Introducing Qwen-Image: Novel Model in Image Generation and Editing

Alibaba released Qwen-Image, a novel image generation foundation model that achieves significant breakthroughs in complex text rendering and precise image editing.

Qwen3 Undertakes Chatbot Arena Top 3; Compact Qwen3-30B Series Launches for Efficient AI Development

This article introduces Alibaba's Qwen3 models achieving a top-3 ranking on Chatbot Arena and the launch of a compact Qwen3-30B series for efficient AI development.

Building FinTrack: An AI-Powered Tax Assistant for Freelancers Using Alibaba Cloud

This article explains how to build FinTrack, an AI-powered tax assistant for freelancers, using Alibaba Cloud's generative AI platform and its Qwen large language models.

Alibaba Unveils New Qwen3 Models for Coding, Complexing Reasoning and Machine Translation

Alibaba unveiled its latest Qwen3 models, including Qwen3-Coder, Qwen3-235B, and Qwen-MT, advancing open-source AI capabilities in coding, complex reasoning, and machine translation.

Alibaba Unveils Intelligent Cockpits, Enterprise Partnerships and AI Glasses at WAIC 2025

At the 2025 World Artificial Intelligence Conference (WAIC), Alibaba Group showcased a series of AI-driven innovation.

ACK Gateway with Inference Extension: A Practice for Optimizing Large Model Inference Service Deployed across Multiple Nodes

This article introduces how to use ACK Gateway with Inference Extension to optimize multi-node large-model inference performance.