What is Columnar Data Storage and its Types?

Post author:nitendratech
Post category:Big Data
Post comments:0 Comments
Post published:May 21, 2021

In recent modern Big Data applications, numerous databases (NoSQL) have introduced columnar data storage, which provides several benefits over traditional row-oriented databases. Many Hadoop vendors like Cloudera, Hortonworks, and MapR…

Apache HBase Data Model

Post author:nitendratech
Post category:Big Data
Post comments:1 Comment
Post published:June 18, 2020

Apache HBase is an open-source, distributed, versioned, non-relational(NoSQL) database modeled after Google's Big Table. Even though this terminology overlaps with relational databases(RDBMS), the HBase table, in reality, is a multidimensional…

What is Apache HBase? Architecture and Features

Post author:nitendratech
Post category:Big Data
Post comments:3 Comments
Post published:June 2, 2020

Apache HBase is an open-source, non-relational, distributed database modeled after Google's BigTable. It is developed as part of the Apache Software Foundation and is written in Java. It sits on…

What are Apache Pig execution Modes?

Post author:nitendratech
Post category:Big Data
Post comments:0 Comments
Post published:May 6, 2020

We can run Apache Pig Latin code and Pig statements using various modes. We will go through all of the Apache Pig execution modes in detail in this blog post.

Introduction to Apache Flume: Components and Channels

Post author:nitendratech
Post category:Big Data
Post comments:0 Comments
Post published:April 30, 2020

Apache Flume is an Apache open source project used for moving massive quantities of streaming data into HDFS. It collects log data from the web server logs files and aggregates…

What is Complex Event Processing(CEP)?

Post author:nitendratech
Post category:Big Data
Post comments:1 Comment
Post published:December 28, 2019

Complex Event Processing (CEP) is a technique for tracking, analyzing, and processing incoming streams of data in real time and generating a summarized report. Event processing-based platforms have built-in capabilities…

Apache Sqoop Tutorial

Post author:nitendratech
Post category:Big Data
Post comments:2 Comments
Post published:November 4, 2019

Sqoop is a command-line interface application for transferring data between relational databases and Hadoop.This blog post will teach you basic tasks that you can perform with Sqoop.

What is Data Lake? Feature and Architecture

Post author:nitendratech
Post category:Big Data
Post comments:3 Comments
Post published:August 1, 2019

Introduction to Data Lake A Data Lake is a centralized data-centric storage architecture that is used for persisting a variety of data in its raw, unfiltered, and untransformed format. It…

Metadata Governance for Big Data Clusters

Post author:nitendratech
Post category:Big Data
Post comments:0 Comments
Post published:April 16, 2019

Managing metadata is an integral part of the overall data governance standard. An efficient way to do this is to establish data stewardship for metadata. It will ensure that data…

What is Apache Pig? Architecture and Components

Post author:nitendratech
Post category:Big Data
Post comments:3 Comments
Post published:September 23, 2018

Apache Pig is a high level dataflow language for analyzing large data sets. The data flows in a pipeline step by step and data can be stored at any point in the pipeline