What is the Best Way to Secure a Big Data cluster?

Big data refers to datasets whose size, volume and structure is beyond the ability of traditional software tools and database systems to store,process and analyze within reasonable timeframes. Big data security is a term used for the different tools and techniques used to protect data,any back end processes from outside attacks and thefts.

Continue ReadingWhat is the Best Way to Secure a Big Data cluster?

What are Big Data File Storage Formats?

One of the most important aspect of architecting a solution with Big Data is choosing a proper Data Storage options in Hadoop/Spark. Hadoop does not have a standard data storage format ,but as a standard file system ,allows for storage of data in any format ,whether it’s text,binary ,image or other.

Continue ReadingWhat are Big Data File Storage Formats?

What is Big Data and Why it is important to understand? Introduction and Properties

The amount of data in our world has been exploding. Different Companies capture trillions of bytes of information about their customers, suppliers, and operations, and millions of networked sensors are being embedded in the physical world in devices such as mobile phones and automobiles, sensing, creating, and communicating data.

Continue ReadingWhat is Big Data and Why it is important to understand? Introduction and Properties