Running a generative AI application in production usually means stitching together a model server, a vector database, retrieval logic, a tool layer, a...
518 real API calls. $33.99 → $7.06 in a single run. The same parameter change projects $15,667/year saved on a healthcare workload — here's the exact code, the math, and every scenario I measured.
This article introduces the launch of Qwen3.7-Plus multimodal model, an AI swine diagnosis assistant with Muyuan Group, and Model Studio's open-source CLI for AI agents.
Hướng dẫn từng bước để chạy tác nhân lập trình AI riêng của bạn trên đám mây
คู่มือแนะนำทีละขั้นตอนในการรันเอเจนต์ช่วยเขียนโค้ดด้วย AI ของคุณเองในระบบคลาวด์
Panduan langkah demi langkah menjalankan AI coding agent Anda sendiri di cloud
This article introduces Qwen3.7-Plus — a multimodal agent model that unifies vision and language into a single, versatile agent foundation.
การเลือกวิธีการนำโมเดลภาษาขนาดใหญ่ไปใช้งานจริงนั้น เป็นหนึ่งในการตัดสินใจที่สำคัญที่สุดและซับซ้อนที่สุดสำหรับทีม AI
Chọn cách triển khai mô hình ngôn ngữ quy mô lớn trong môi trường thực tế là một trong những quyết định quan trọng nhất — và gây bối rối nhất — mà một đội ngũ AI có thể đưa ra.
Finding cover images after writing is always a pain. I built a Skill that auto-picks style, generates prompts, and calls Model Studio CLI to create images from article content.
This article introduces Apache RocketMQ's strategic evolution into an AI-native message engine for long-running sessions, intelligent compute scheduling, and agent collaboration.
Qwen3.5-LiveTranslate-Flash is the latest simultaneous interpretation model in the Qwen family, built on top of Qwen3.5-Omni.
Today we introduce Qwen3.7-Max, our latest proprietary model designed for the agent era.
This article introduces building a production-ready RAG pipeline on Alibaba Cloud using Hologres for vector search and Model Studio for embeddings and LLM inference.
Alibaba on Wednesday launched its most aggressive AI push yet, unveiling a new flagship large language model, a homegrown AI chip that triples the performance of its predecessor.
Qwen3.7-Max, upgraded cloud infrastructure and model services, and new T-Head chips announced at Alibaba Cloud Summit
Alibaba has unveiled Fun-ASR1.5, a major upgrade to its end-to-end speech recognition model.
Following the launch of Qwen3.6-Plus and Qwen3.6-35B-A3B, we are excited to open-source Qwen3.6-27B.
Following the release of Qwen3.6-Plus, we are sharing an early preview of our next proprietary model: Qwen3.6-Max-Preview.
Alibaba open-sources Qwen3.6-35B-A3B, an efficient 35B/3B MoE model delivering top-tier agentic coding and multimodal performance.