What is Apache Spark? The Unified engine for large-scale data analytics.

Apache Spark is a distributed, in-memory and disk based optimized system which does real-time analytics using Resilient Distributed Data(RDD) Sets.Spark includes a streaming library, and a rich set of programming interfaces to make data processing and transformation easier.

Continue ReadingWhat is Apache Spark? The Unified engine for large-scale data analytics.