×
Spark SQL

Best Practices for Big Data Processing in Spark

This article is an overview of the best practices for big data processing in Spark taken from a lecture.

In-depth Review of Apache Spark: Spark + AI Summit 2020

Matei Zaharia, founder of the Spark project, gave an in-depth review of Spark at the Spark + AI Summit 2020 in conjunction with its 10-year anniversary.

Rewriting the Execution Plan in the EMR Spark Relational Cache

This article goes through the process of rewriting execution plans in the Spark Relational Cache on EMR.