×
LLM

DeepSeek V4-Flash ในวงกว้าง: คู่มือการนำไปใช้ที่ยึดเกณฑ์มาตรฐานเป็นหลัก

การเลือกวิธีการนำโมเดลภาษาขนาดใหญ่ไปใช้งานจริงนั้น เป็นหนึ่งในการตัดสินใจที่สำคัญที่สุดและซับซ้อนที่สุดสำหรับทีม AI

DeepSeek V4-Flash trên quy mô lớn: Hướng dẫn triển khai dựa trên điểm chuẩn

Chọn cách triển khai mô hình ngôn ngữ quy mô lớn trong môi trường thực tế là một trong những quyết định quan trọng nhất — và gây bối rối nhất — mà một đội ngũ AI có thể đưa ra.

Ontology Is Trending Again. Can It Improve My AI Agent's Performance?

This article introduces how ontology provides structured domain knowledge to enhance AI agent accuracy and explainability in enterprise O&M scenarios.

Building a RAG Pipeline on Alibaba Cloud with Vector Search

This article introduces building a production-ready RAG pipeline on Alibaba Cloud using Hologres for vector search and Model Studio for embeddings and LLM inference.

Alibaba Cloud AI Gateway Supports DeepSeek V4

Alibaba Cloud AI Gateway has been updated to support the newly released DeepSeek-V4 models, enabling users to manage, call, and integrate these models...

Using SQL to Call LLMs? Hologres + Model Studio Enables Data Developers to "Talk" Directly to AI

Discover how Alibaba Cloud Hologres + Model Studio lets data teams use plain SQL to call LLMs for PDF analysis, image understanding, and RAG—no GPU, Python, or AI engineering required.

AliSQL Vector Technology Analysis (2): Read/Write Cache and Transaction Concurrency

This article introduces AliSQL's vector cache design and transaction concurrency mechanisms for production-ready vector search.

AliSQL Vector Technology Analysis (1): Storage Format and Algorithm Implementation

This article details the storage format and HNSW algorithm implementation behind AliSQL’s native vector indexing capability for high-dimensional AI workloads.

Building Secure RAG-Based Applications with Dify on Alibaba Cloud

The article explains how to build RAG-based application with security gateway for better yet secure retrieval and generation.

MSE Nacos Prompt Management: Making the Core Configuration of AI Agent Truly Governable

This article introduces MSE Nacos Prompt Management, which governs AI Agent prompts as dynamic configuration assets with centralized storage, versioning, and hot updates.

Alibaba Group Debuts “Wonder on Ice,” an Immersive AI Experience at Milan’s Sforza Castle for Milano Cortina 2026

Alibaba Group launched "Alibaba Wonder on Ice" (AWI) at the Milano Cortina 2026, using AI and cloud computing to demonstrate next-gen virtual retail experiences.

Alibaba Cloud to Debut AI-powered Pin Trading Experience in Olympic Village at Milano Cortina 2026

The Intelligent Pin Trading Station blends one of the Games’ best-loved traditions with voice- and gesture-enabled interaction.

Pushing Qwen3-Max-Thinking Beyond its Limits

This article introduces Qwen3-Max-Thinking, a top-tier reasoning model that rivals leading AI systems and features adaptive tool use and advanced test...

Alibaba's Latest Thinking Model Excels at Adaptive Tool Use

This article introduces Qwen3-Max-Thinking, Alibaba’s latest reasoning model that excels in adaptive tool use and advanced test-time scaling to outperform leading AI systems.

Momentum: How Alibaba Cloud Is Leading the New AI Paradigm

At Alibaba Cloud, we're not just delivering technology. We're co-creating a new chapter of AI with the world.

Nacos A2A Registry: AgentScope Enables Cross-Language and Cross-Framework Interoperability

This article introduces how AgentScope leverages the A2A protocol and Nacos Registry to enable cross-language, cross-framework agent interoperability and unified service governance.

Opening an AI Milk Tea Shop with AgentScope Java

This article introduces an AI-powered milk tea shop built with AgentScope Java, showcasing multi-agent collaboration, RAG, long-term memory, and enterprise integrations like Nacos and MCP.

What? My Werewolf Game Skills Are Worse Than AI's?

This article introduces an AI-powered Werewolf game built with AgentScope Java, where agents simulate human-like reasoning, deception, and collaboration—with seamless human-AI gameplay.

MiniMax Builds a Cloud-Native Data + AI Platform with Alibaba Cloud: A Case Study in Scaling Data Infrastructure for the LLM Era

Discover how MiniMax leveraged Alibaba Cloud to build a scalable, cloud-native Data + AI platform powering multimodal LLMs and global user growth.

จากการทดลองทุกความเป็นไปได้สู่ความประณีต: วิวัฒนาการและอนาคตของโครงสร้างพื้นฐานการฝึก AI

บทความนี้จะสำรวจประวัติความเป็นมาอันน่าทึ่งของการฝึก AI ผ่าปัญหาทางตันสำคัญที่มีโอกาสขัดขวางความก้าวหน้า และเจาะลึกอนาคตของโครงสร้างพื้นฐานที่ได้รับกา...