What is Big Data and Why it is important to understand? Introduction and Properties

The amount of data in our world has been exploding. Different Companies capture trillions of bytes of information about their customers, suppliers, and operations, and millions of networked sensors are being embedded in the physical world in devices such as mobile phones and automobiles, sensing, creating, and communicating data.

Continue ReadingWhat is Big Data and Why it is important to understand? Introduction and Properties

What is Apache Spark? The Unified engine for large-scale data analytics.

Apache Spark is a distributed, in-memory and disk based optimized system which does real-time analytics using Resilient Distributed Data(RDD) Sets.Spark includes a streaming library, and a rich set of programming interfaces to make data processing and transformation easier.

Continue ReadingWhat is Apache Spark? The Unified engine for large-scale data analytics.