This article is an overview of the best practices for big data processing in Spark taken from a lecture.