In this article, Zhang Jianfeng, a veteran in the open-source community, explains how to evaluate whether a technology is worth learning along three key dimensions.
This article demonstrates how Alluxio simplifies running the PyTorch framework on HDFS using the Kubernetes platform to drastically improve development efficiency.
This blog takes a deep dive into securely migrating data from Apache Hadoop to the cloud.
This article looks at the big data platform that helped power last year's Double 11.
This post provides a walkthrough on how to set up Spark on MaxCompute on Alibaba Cloud.
This article outlines how you can use Alibaba Cloud AnalyticDB to analyze server logs without needing to set up Hadoop.
In this tutorial, you will learn how to set up Hadoop and its components on a multi-node cluster using Apache Ambari.
Hadoop is an open-source distributed computing framework that processes data efficiently and scalably.
As a senior technical expert at Alibaba Group, I will share my thoughts on big data: its past, present, and future.
In January, Alibaba announced that Blink would be open-sourced and contributed to the Apache Flink codebase. This has now come to fruition.
In this article, we continue with HUE, or Hadoop User Experience, an open-source web interface that makes many operations simpler and easier to complete.
In this article, we explore HUE, or Hadoop User Experience, an open-source web interface that makes many operations simpler and easier to complete.
This tutorial shows how you can set up a multi-node Hadoop cluster on Alibaba Cloud ECS instances with Ubuntu 18.04 installed.
This article shows you how to set up Docker and use it to launch a single-node Hadoop cluster inside a container on Alibaba Cloud.
In this tutorial, we will learn how to set up Apache Hadoop on a single-node cluster on an Alibaba Cloud ECS instance with Ubuntu 16.04.
With DataX-On-Hadoop, you can upload Hadoop data to MaxCompute and ApsaraDB for RDS using multiple MapReduce tasks without the need to install and deploy DataX software in advance.
In this article, we introduce data distribution and explain some new optimization measures in Alibaba Cloud MaxCompute.
To ensure the security of Hadoop clusters, user authentication and authorization must be implemented, in addition to firewalls, to contain attacks originating from the inside.
In this tutorial, we will install a single-node OpenStack on an Alibaba Cloud ECS instance with CentOS 7.
Redis is now a major component used in many Big Data applications. Redis is a favorable alternative to traditional relational database services becaus.