Lin Xuewei, a technical expert, gives an overview of the latest performance and efficiency optimizations that were made to TPC-DS Perf after its third submission.
In this post, we will introduce Spark-TFRecord, a new solution to enable support for native TensorFlow data format in Spark.
This year, EMR increased its computing speed to 2.2 times of that from last year, breaking the world record again in the big data sector.
This articles looks at some of the misunderstandings and frequently overlooked aspects of refactoring, proposing some best practices.
Learn how Alibaba is transforming the Java language.
This article is a list of 50 efficient Java code samples.
This article outlines how you can use Alibaba Cloud AnalyticDB to analyze server logs without needing to set up Hadoop.
In this tutorial, you will learn how to set up Hadoop and its components on a multinode cluster using Apache Ambari.
This article is based on Alibaba Cloud E-MapReduce and the entire Alibaba Cloud system. We will focus on the most important scenarios, such as live video, video stream, etc.
This article looks into how you can accelerate query speeds by using the Spark Relational Cache of Alibaba Cloud E-MapReduce.
In this article, part two of two parts, an Alibaba engineer shares everything he knows about Kafka.
In this article, part one of two parts, an Alibaba engineer shares everything he knows about Kafka.
JindoFS is a cloud-native file system that integrates the advantages of local disks and the ultra-large capacity of Object Storage Service (OSS).
This article looks at how co-location technology has been explored and developed at Alibaba into what is now a large-scale solution architecture.
This article outlines Ant Financial's financial data intelligence system, which is built on next-generation technology to address increasingly complex use cases.
Alibaba Cloud MaxCompute provides Tunnel commands for uploading and downloading of large batches of offline data.
In this article, Men Deliang of Youku shares the success of Youku's business and platform by migrating from Hadoop to Alibaba Cloud MaxCompute.
This article mainly introduces how you can use PyODPS to perform Cartesian product operations throught DataFrame APIs.
In this article, we will show you how to use Alibaba Cloud E-MapReduce (EMR) to build a Kafka cluster automatically.
This article describes how to use an HAProxy reverse proxy to access Presto service through a Gateway node.