×
LLMs

Alibaba's "Cloud for Youth" Wins Education Innovation Award for Bridging China’s Rural Digital Divide

"Cloud for Youth" has recently been awarded with the Global Smart Education Innovation Prize 2025.

Alibaba Open-Sources Tongyi DeepResearch LLM, Partners with S&P Global to Deliver AI Intelligence to Chinese Customers

This week’s news roundup highlights Alibaba’s advancements in AI innovation and cloud-driven enterprise solutions.

RIDE the AI Lift: Alibaba Cloud CIO's Insights into Results as a Service (RaaS)

This article introduces a systematic approach for enterprises to successfully implement large language model applications based on a talk by Alibaba Cloud's CIO.

Best Practices for High Availability of LLM Based on AI Gateway

This article introduces Alibaba Cloud AI Gateway's high availability best practices for LLM services, including fallback mechanisms, passive health ch.

LoongCollector Security Log Ingestion Practice: Standardized Log Collection for Enterprise Firewall Scenarios

This article explains how LoongCollector enables standardized, end-to-end collection and parsing of multi-vendor enterprise firewall logs.

Zero-Code Transformation! Observability in Practice with LoongSuite

This article introduces LoongSuite's zero-code transformation capabilities and its practical application in observability for AI-native applications.

AnythingLLM Builds Personal Knowledge Bases with RDS PostgreSQL's PGVector Plug-in

The article explains how AnythingLLM uses Alibaba Cloud ApsaraDB for PostgreSQL with the PGVector extension to create private, vector-based knowledge bases.

How to Deploy FlowiseAI with One Click on Alibaba Cloud?

On Alibaba Cloud, you can quickly deploy FlowiseAI by using a one-click deployment link and start using it with a few simple configuration steps.

Simplified Deployment Tutorial of the Qwen3 LLM on Alibaba Cloud Container Service for Kubernetes

The article explains how to deploy the Qwen3 large language model on Alibaba Cloud ACK and ACS serverless GPU resources.

No Increase in GPU, the First Token Latency Decreases by 50% | New Practices in LLM Service Load Balancing

This article introduces innovative load balancing strategies for LLM services that reduce first-token latency by 50% without requiring additional GPU resources.

Introducing Qwen-Image: Novel Model in Image Generation and Editing

Alibaba released Qwen-Image, a novel image generation foundation model that achieves significant breakthroughs in complex text rendering and precise image editing.

Qwen3 Undertakes Chatbot Arena Top 3; Compact Qwen3-30B Series Launches for Efficient AI Development

This article introduces Alibaba's Qwen3 models achieving a top-3 ranking on Chatbot Arena and the launch of a compact Qwen3-30B series for efficient AI development.

Building FinTrack: An AI-Powered Tax Assistant for Freelancers Using Alibaba Cloud

This article explains how to build FinTrack, an AI-powered tax assistant for freelancers, using Alibaba Cloud's generative AI platform and its Qwen large language models.

Alibaba Unveils New Qwen3 Models for Coding, Complexing Reasoning and Machine Translation

Alibaba unveiled its latest Qwen3 models, including Qwen3-Coder, Qwen3-235B, and Qwen-MT, advancing open-source AI capabilities in coding, complex reasoning, and machine translation.

Alibaba Unveils Intelligent Cockpits, Enterprise Partnerships and AI Glasses at WAIC 2025

At the 2025 World Artificial Intelligence Conference (WAIC), Alibaba Group showcased a series of AI-driven innovation.

ACK Gateway with Inference Extension: A Practice for Optimizing Large Model Inference Service Deployed across Multiple Nodes

This article introduces how to use ACK Gateway with Inference Extension to optimize multi-node large-model inference performance.

AI Gateway Analysis: OpenRouter vs Higress

This article introduces the comparison between OpenRouter and Higress as two distinct types of AI Gateways.

Next Gen Applications: Qwen Agentic Deep Dive and Workflow Innovation Lab

The article explores Alibaba Cloud's Qwen LLM and Dify platform, showcasing their roles in developing intelligent AI systems for business automation.

ACK Gateway with AI Extension: Intelligent Routing Practice for Kubernetes Large Model Inference

This article describes how to use the ACK Gateway with AI Extension plug-in to provide production-level load balancing and intelligent routing capabilities for QwQ-32B models deployed in ACK clusters.

ACK Gateway with AI Extension: Model Canary Release Practice for Large Model Inference

This article focuses on the canary release of models after the large model inference service is deployed in the cloud and the practices of model canary release based on ACK Gateway with AI Extension.