Community

Blog Events Webinars Tutorials Forum

Create Account

×

Cache

Hybrid Model Support | SGLang's Support Scheme for Hybrid Architecture Models like Mamba-Transformer

This article introduces a dual memory-pool inference framework enabling efficient hybrid Transformer-Mamba model execution by resolving conflicting caching mechanisms.

ApsaraDB February 4, 2026 1,302

Alibaba Cloud Tair KVCache Implementation Based on 3FS Enterprise-Grade Deployment, High-Availability Operations & Performance Optimization

This article introduces engineering optimizations to 3FS—KVCache's foundation layer—across performance, productization, and cloud-native management for scalable AI inference.

ApsaraDB February 4, 2026 417

Alibaba Cloud Tair Partners with SGLang to Build HiCache: Constructing a New Cache Paradigm for "Agentic Inference"

This article introduces HiCache, a hierarchical KVCache infrastructure developed by Alibaba Cloud Tair and SGLang to optimize performance and memory capacity for long-context "agentic" LLM inference.

ApsaraDB December 29, 2025 2,827

Learn Nearly Everything About Redis Through An Incident

This article summarizes almost everything you need to know about Redis from an incident.

ApsaraDB March 13, 2025 6,920

Performance Optimization | Several Methods to Make Images Load Faster

Image optimization is crucial to the performance of e-commerce web pages. This article discusses some simple and reliable image optimization methods.

Alibaba Cloud Community December 2, 2024 4,589

How to Set up and Configure Alibaba Cloud Tair (Redis® OSS-Compatible) In-Memory Database

This article explains how to set up and configure Alibaba Cloud Tair (Redis® OSS-Compatible) in-memory database

Alibaba Cloud Community March 25, 2022 3,456

Fluid Supports Tiered Locality Scheduling

This article explains how to use Fluid to implement tiered affinity scheduling and configure custom affinity based on real scenarios.

Alibaba Container Service February 27, 2024 2,633

Exploring Cache Data Consistency

This article discusses data inconsistency between cache and database and the best choice of solutions in different business scenarios.

Alibaba Cloud Community August 23, 2023 7,153

OpenAnolis White Paper: Using the EROFS Read-Only File System across the Cloud-Edge-End

This short article discusses the background and technical scheme of EROFS.

OpenAnolis June 19, 2023 5,821

Java High-Performance Local Cache Practices

This article introduces local cache technology (for general understanding) and then introduces the best-performance cache.

Alibaba Cloud Community March 15, 2023 50,575

A New Upgrade for Applicable Scenarios! Extensions for Dragonfly2 as a Distributed Cache System Architecture

This article introduces Dragonfly2 and some of its new extensions.

OpenAnolis February 27, 2023 3,626

The Mechanism behind Measuring Cache Access Latency

This article describes how to measure the access latency at different levels of memory hierarchy and introduces the mechanism behind it.

OpenAnolis September 30, 2022 27,137

An Interpretation of the Source Code of OceanBase (11): Analysis of Location Cache Module

This article explains ObTableScan design and code knowledge and introduces the analysis of the location cache module.

OceanBase September 15, 2022 2,741

A Deep-Dive into MySQL: An Exploration of the MySQL Data Dictionary

This article focuses on the data structure and implementation architecture of the data dictionary.

ApsaraDB February 22, 2022 9,261

Straight Talk about "Dynamic Planning"

This article explains dynamic planning, its concepts, and its processes.

Alibaba F(x) Team January 26, 2022 3,398

An Introduction to Subqueries

This article shares the optimization techniques of subqueries and tips on handling subqueries in distributed databases.

ApsaraDB September 30, 2021 7,111

JindoFS: Computing and Storage Separation for Cloud-native Big Data

In this blog, we'll introduce the origins of JindoFS and discuss the problems its

Alibaba EMR April 2, 2021 3,061

Memory Model and Synchronization Primitive - Part 1: Memory Barrier

Part 1 of this 2-part series explains Memory Barrier and its associated functions in depth.

Jeffle Xu March 15, 2021 19,334

New Infrastructure's Accelerating the Digital Transformation of Government Agencies and Enterprises

Learn how new technologies are accelerating the digital transformation of government agencies and enterprises.

Alibaba Clouder April 30, 2020 6,827

Advanced Apache Flink Tutorial 1: Analysis of Runtime Core Mechanism

This article describes the core mechanism of running jobs in Flink Runtime. It provides an overview of the Flink Runtime architecture and basic job running process.

Apache Flink Community December 25, 2019 13,068

Related Tags

artificial intelligence big data cloud computing