×
Large language models

One-Click Deployment of DeepSeek-V3 and DeepSeek-R1 Models

The Model Gallery offers vLLM or BladeLLM accelerated deployment features, enabling you to deploy the DeepSeek-V3 and DeepSeek-R1 series models with a single click.

Coding Smarter, Not Harder | The True Capability of Qwen 2.5 Coder 32B Instruct

Qwen 2.5 Coder 32B Instruct is a game-changing technology that can help you coding smarter, not harder.

Alibaba Cloud Native API Gateway Helps Industries Connect to DeepSeek Safely and Reliably

Alibaba Cloud Native API Gateway enhances the secure and reliable connection of industries to DeepSeek by providing comprehensive traffic management, content security, and model deployment solutions.

Alibaba Cloud's Qwen2.5-Max Secures Top Rankings in Chatbot Arena

Alibaba Cloud's latest proprietary large language model(LLM), Qwen2.5-Max, has achieved impressive results on Chatbot Arena.

Alibaba Cloud Releases Latest AI Models For Enhanced Visual Understanding and Long Context Inputs

Alibaba Cloud has unveiled its latest visual-language model, Qwen2.5-VL, which significantly enhances its predecessor, Qwen2-VL.

The Future of Technology: Key Trends to Watch in 2025

This article introduces key technology trends in 2025, particularly focusing on the impact of Artificial Intelligence.

Alibaba Taps AI to Advance Social Good

From weather forecast, healthcare, agriculture, science discovery to education, Alibaba's AI Innovations aim to bring positive impacts.

Alibaba Cloud Unveiled Wanx 2.1: Redefining AI-Driven Video Generation

Alibaba Cloud has introduced Wanx 2.1, the latest iteration of its multimodal large model Tongyi Wanxiang (Wanx), which first debuted in July 2023.

How to Fine-Tune Large Language Models

This article introduces the basic concepts of fine-tuning and explains how to fine-tune large language models (LLMs).

Best Practices for LLM Evaluation

This article describes how to implement more comprehensive, accurate, and focused model evaluation based on specific dataset types for different user groups to achieve better results in the AI field.

Empowering AI Innovation and Celebrating Scientific Excellence

The article introduces the launch of the AEF NextGen Fund to accelerate AI innovation and support startups, along with other advancements in AI technology by Alibaba Cloud.

High Availability and Performance: Best Practices for Deploying Dify based on ACK

This article provides a detailed solution for deploying and managing Dify services that are highly available, scalable, and have high SLAs in ACK clusters.

Use Alibaba Cloud ASM LLMProxy Plug-in to Ensure User Data Security for Large Models

This article introduces how to use Wasm plug-ins to enforce global protection of LLM calls within the mesh.

GTE-Multilingual Series: A Key Model for Retrieval-Augmented Generation

This article introduces the latest GTE-multilingual models from Alibaba's Tongyi Lab.

Interpreting Data Acceleration in the Era of Large Models: Balancing Performance, Stability, and Consistency

This article explores how to balance performance, stability, and consistency in data acceleration during the era of large models.

Alibaba Cloud's Qwen 2.5 Tops OpenCompass LLM Leaderboard as the First Open-Source Champion

The article introduces Alibaba Cloud's open-source Qwen 2.5-72B-Instruct has achieved the top position on the OpenCompass large language model leaderboard.

Use Alibaba Cloud ASM to Efficiently Manage LLM Traffic Part 1: Traffic Routing

This article introduces the traffic routing capabilities of Alibaba Cloud Service Mesh.

AI ON CLOUD | Article Collections on Artificial Intelligence

In the lead-up to the Apsara Conference, we’re rolling out a series of blogs showcases highlighting AI innovations.

In-Depth Exploration of the RAG Optimization Scheme and Practice

This article introduces in detail the challenges, general paradigm, engineering practice, and optimization strategy of RAG.

Unleash the Power of Generative AI with Alibaba Cloud

Alibaba Cloud offers a comprehensive solution for Generative AI, enabling businesses to empower their operations, enhance productivity, and accelerate innovation with customized models.