×
Large Language Model

BMW and Alibaba Deepen Strategic Partnership in China, Harnessing Qwen's AI Power to Redefine Intelligent In-Car Experiences

The BMW Group and Alibaba Group announced an expanded strategic partnership in China, accelerating the integration of Alibaba’s Qwen large language mo.

Alibaba Cloud Releases Qwen2.5-Omni-7B An End-to-end Multimodal AI Model

Alibaba Cloud has launched Qwen2.5-Omni-7B, a unified end-to-end multimodal model in the Qwen series.

Alibaba Cloud's AI Revolution: Advancing the Frontier with Mixture of Experts (MoE), Advanced Reasoning Model, and End-to-end Multimodal Model

This article showcases Alibaba Cloud's innovative AI models that boost efficiency and integration across modalities, setting new standards in industri...

Kimi Large Model-based Massive Data Preprocessing Practice of Moonshot AI

This article introduces how Moonshot AI uses Alibaba Cloud's solutions to enhance data preprocessing for its large model, Kimi, focusing on stability, resource elasticity, and efficient management.

The Consumption of Tokens by Large Models Can Be Quite Ambiguous

This article discusses the challenges and strategies involved in managing resource consumption in large model applications.

Higress.ai Officially Launches: Effortlessly Unlock New AI Capabilities and Start Global Services

This article introduces Higress.ai, highlighting its official launch and the seamless integration of new AI capabilities.

Alibaba Cloud's Industry Leadership Recognized by Top Global Research Firms

Alibaba Cloud continues to solidify its standing as a global leader in cloud computing and artificial intelligence (AI).

Discovering LLMs: A Deep Dive into Large Language Models

This blog post delves into the intricacies of LLMs, exploring their architecture, capabilities, and potential impact to solve real-world problems.

One-Click Deployment of DeepSeek-V3 and DeepSeek-R1 Models

The Model Gallery offers vLLM or BladeLLM accelerated deployment features, enabling you to deploy the DeepSeek-V3 and DeepSeek-R1 series models with a single click.

Coding Smarter, Not Harder | The True Capability of Qwen 2.5 Coder 32B Instruct

Qwen 2.5 Coder 32B Instruct is a game-changing technology that can help you coding smarter, not harder.

Introducing Qwen2.5 Coder 32B Instruct | Qwen

This article introduces Qwen2.5 Coder 32B Instruct the latest version of Qwen2.5 Coder from Qwen

Accelerate Your Transformation in The GenAI-Era: Community Gathering Session Recap

This article delved into how Qwen LLM stands out in terms of performance, scalability, and versatility, making it an ideal choice for organizations looking to harness the power of generative AI.

Best Practices for Generating a Unit Test by Using Tongyi Lingma to Simplify Unit Testing

This article discusses what unit testing is, the value of unit testing, the principles of adequate unit testing, and how to write an adequate unit test.

Use EAS and Elasticsearch to Deploy a RAG-Based LLM Chatbot

This article describes the basic features provided by a RAG-based LLM chatbot and the special features provided by Elasticsearch.

Alibaba Cloud Model Studio를 사용하여 나만의 Chatbot 시스템 빠르게 구축하기

오늘 이 글에서 우리는 알리바바클라우드의 Model Studio와 Tongyi Qwen 을 활용하여 Chatbot을 구축하는 방법을 안내드리고자 합니다.

Strengthening Security in the AI Era: Alibaba Cloud Showcases Security Solutions for Diverse Cloud Environments

In response to the growing trend of organizations adopting multi-cloud and hybrid cloud environments, where data is distributed across various platforms.

Alibaba Cloud Drives AI Enhancements Across Industries in Asia

Alibaba Cloud continues to pioneer technology innovation across a diverse range of industries from technology development, imaging, travel to beauty and healthcare.

Use NVIDIA NIM to Accelerate LLM Inference in Alibaba Cloud ACK

This article introduces how to use the cloud-native AI suite to integrate open-source inference service framework KServe and quickly deploy NVIDIA NIM in an ACK cluster.

Use PAI-Blade and TensorRT Plug-Ins to Optimize a RetinaNet Model

This article describes how to use PAI-Blade to optimize a detection model whose post-processing network is built by using TensorRT plug-ins.

Quickly Deploy a Multimodal Large Language Model in EAS

This article describes how to deploy and call MLLM inference services by using PAI-EAS.