×
Model Studio

Model Studio Architecture: A Deep Dive into Alibaba Cloud’s GenAI Application Platform

Running a generative AI application in production usually means stitching together a model server, a vector database, retrieval logic, a tool layer, a...

I Tested 19 LLM API Workloads on Real Calls and Cut Costs 79% — Here's the Data

518 real API calls. $33.99 → $7.06 in a single run. The same parameter change projects $15,667/year saved on a healthcare workload — here's the exact code, the math, and every scenario I measured.

Alibaba Launches Qwen3.7-Plus, AI Swine Diagnosis Assistant and Model Studio CLI

This article introduces the launch of Qwen3.7-Plus multimodal model, an AI swine diagnosis assistant with Muyuan Group, and Model Studio's open-source CLI for AI agents.

Triển khai OpenClaw trên Alibaba Cloud ECS kèm tích hợp Telegram

Hướng dẫn từng bước để chạy tác nhân lập trình AI riêng của bạn trên đám mây

ปรับใช้ OpenClaw บน Alibaba Cloud ECS ด้วย Telegram Integration

คู่มือแนะนำทีละขั้นตอนในการรันเอเจนต์ช่วยเขียนโค้ดด้วย AI ของคุณเองในระบบคลาวด์

Deploy OpenClaw di Alibaba Cloud ECS dengan Integrasi Telegram

Panduan langkah demi langkah menjalankan AI coding agent Anda sendiri di cloud

Qwen3.7-Plus: Multimodal Agent Intelligence

This article introduces Qwen3.7-Plus — a multimodal agent model that unifies vision and language into a single, versatile agent foundation.

DeepSeek V4-Flash ในวงกว้าง: คู่มือการนำไปใช้ที่ยึดเกณฑ์มาตรฐานเป็นหลัก

การเลือกวิธีการนำโมเดลภาษาขนาดใหญ่ไปใช้งานจริงนั้น เป็นหนึ่งในการตัดสินใจที่สำคัญที่สุดและซับซ้อนที่สุดสำหรับทีม AI

DeepSeek V4-Flash trên quy mô lớn: Hướng dẫn triển khai dựa trên điểm chuẩn

Chọn cách triển khai mô hình ngôn ngữ quy mô lớn trong môi trường thực tế là một trong những quyết định quan trọng nhất — và gây bối rối nhất — mà một đội ngũ AI có thể đưa ra.

Qwen Code × Model Studio CLI: WeChat Cover Generation Guide

Finding cover images after writing is always a pain. I built a Skill that auto-picks style, generates prompts, and calls Model Studio CLI to create images from article content.

Apache RocketMQ for AI: Strategic Upgrade Ushers in the Era of AI MQ

This article introduces Apache RocketMQ's strategic evolution into an AI-native message engine for long-running sessions, intelligent compute scheduling, and agent collaboration.

Qwen3.5-LiveTranslate: From Sound to Sight, From Word to Right

Qwen3.5-LiveTranslate-Flash is the latest simultaneous interpretation model in the Qwen family, built on top of Qwen3.5-Omni.

Qwen3.7: The Agent Frontier

Today we introduce Qwen3.7-Max, our latest proprietary model designed for the agent era.

Building a RAG Pipeline on Alibaba Cloud with Vector Search

This article introduces building a production-ready RAG pipeline on Alibaba Cloud using Hologres for vector search and Model Studio for embeddings and LLM inference.

Alibaba Unveils New AI Chip, Flagship Model, and Rebuilt Cloud Stack AI for Agentic Era

Alibaba on Wednesday launched its most aggressive AI push yet, unveiling a new flagship large language model, a homegrown AI chip that triples the performance of its predecessor.

Alibaba Announces Comprehensive Full-Stack AI Upgrade for the Agentic Era

Qwen3.7-Max, upgraded cloud infrastructure and model services, and new T-Head chips announced at Alibaba Cloud Summit

Alibaba Introduces Fun-ASR1.5: Advancing Multi-language Speech Recognition

Alibaba has unveiled Fun-ASR1.5, a major upgrade to its end-to-end speech recognition model.

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

Following the launch of Qwen3.6-Plus and Qwen3.6-35B-A3B, we are excited to open-source Qwen3.6-27B.

Qwen3.6-Max-Preview: Smarter, Sharper, Still Evolving

Following the release of Qwen3.6-Plus, we are sharing an early preview of our next proprietary model: Qwen3.6-Max-Preview.

Qwen3.6-35B-A3B: Agentic Coding Power, Now Open to All

Alibaba open-sources Qwen3.6-35B-A3B, an efficient 35B/3B MoE model delivering top-tier agentic coding and multimodal performance.