×
Retrieval-Augmented Generation

Alibaba Dragonwell 21 AI Extension: Unleash Java's Performance Potential in the AI Era

This article introduces Alibaba Dragonwell 21 AI Extension—a JVM optimized for AI workloads.

GenAI on Alibaba Cloud [Part 2]: Chat with Your PDF (Building a RAG System)

This article introduces how to build a “Chat with PDF” tool using RAG so Qwen can answer questions based on private documents.

Deploying Milvus on Alibaba ACK for RAG Pipelines

This article introduces the deployment of Milvus on Alibaba Cloud ACK for RAG pipelines, highlighting its benefits for modern AI applications.

Memilih Model Qwen yang Tepat untuk Kebutuhan Anda

Ekosistem Qwen saat ini berkembang sangat pesat, mulai dari Large Language Model (LLM) hingga model multimodal yang bisa memahami teks, gambar, video,...

Development Trends and Architecture Evolution of AI Agents

This article introduces the development trends, architectural evolution, and key challenges of AI Agents, along with Alibaba's open-source contributions to AI-native applications.

Thành thạo mô hình nhúng văn bản và xếp hạng lại với Qwen3

Bài viết này giới thiệu các mô hình nhúng văn bản và xếp hạng lại tiên tiến của Qwen3, chú trọng vào tính linh hoạt, khả năng hỗ trợ đa ngôn ngữ

Menguasai Penyematan Teks dan Pemeringkat Ulang dengan Qwen3

Artikel ini memperkenalkan model penyematan teks dan pemeringkatan ulang canggih Qwen3, menyoroti dukungan multibahasa dan berbagai kemampuannya

Full Compatibility with MySQL! How to Build a RAG System Based on PolarDB

The article explains how to build a Retrieval-Augmented Generation (RAG) system on Alibaba Cloud PolarDB, leveraging its MySQL-compatible vector search and built-in AI capabilities.

Mastering Text Embedding and Reranker with Qwen3

The article introduces Qwen3's advanced text embedding and reranking models, highlighting their versatility, multilingual support

Building a Retrieval-Augmented Generation (RAG) Service on Compute Nest with Alibaba Cloud Model Studio and AnalyticDB for PostgreSQL

This article provides a step-by-step guide to setting up a Retrieval-Augmented Generation (RAG) service using Alibaba Cloud Model Studio, Compute Nest, and AnalyticDB for PostgreSQL.

What is RAG and how Alibaba Cloud Elasticsearch enhances AI search with retrieval-augmented generation

Learn how Alibaba Cloud Elasticsearch supports RAG to streamline operations in various industries.

Spring AI Alibaba: Alibaba Cloud Open Source AI Application Development Framework

This article describes the core features of Spring AI Alibaba.

Use EAS and Elasticsearch to Deploy a RAG-Based LLM Chatbot

This article describes the basic features provided by a RAG-based LLM chatbot and the special features provided by Elasticsearch.

AI-native Applications Based on Event-driven Architectures

This article describes how to handle the preceding challenges based on event-driven architectures.

Use EAS and ApsaraDB RDS for PostgreSQL to Deploy a RAG-Based LLM Chatbot

This article describes how to associate a RAG-based LLM chatbot with an ApsaraDB RDS for PostgreSQL instance when you deploy the RAG-based LLM chatbot.

Tablestore Low-Cost Vector Retrieval Services to Help AI Retrieval

This article explains how Tablestore uses its vector retrieval service to meet the needs of large-scale data retrieval, especially in terms of cost, scale, and recall rate.

GTE-Multilingual Series: A Key Model for Retrieval-Augmented Generation

This article introduces the latest GTE-multilingual models from Alibaba's Tongyi Lab.

An Overview of Methods to Effectively Improve RAG Performance

This article first introduces several papers on RAG optimization and then describes some common engineering practices for RAG.

Seamless DB+AI Transformation: Why AnalyticDB for PostgreSQL Outshines Traditional Greenplum Solutions

The article introduces the advantages of AnalyticDB for PostgreSQL compared to traditional Greenplum solutions, focusing on the seamless transformation of database and AI capabilities.

Observability of LLM Applications: Exploration and Practice from the Perspective of Trace

This article clarifies the technical challenges of observability by analyzing LLM application patterns and different concerns.