This article introduces the main features in the new version of Paimon that are supported by the Spark-based computing engine.
Part 20 of this series discusses another important SQL optimization method: rule-based optimization (RBO).
Part 19 of this series discusses SQL performance optimization.
This article is an overview of the best practices for big data processing in Spark taken from a lecture.
Matei Zaharia, founder of the Spark project, gave an in-depth review of Spark at the Spark + AI Summit 2020 in conjunction with its 10-year anniversary.
This article goes through the process of rewriting execution plans in the Spark Relational Cache on EMR.