Skip to content
Technology and Trends
  • Home
  • Database
  • Big Data
  • Hadoop
  • Spark
  • Linux
  • Interviews
  • Toggle website search
Menu Close
  • Home
  • Database
  • Big Data
  • Hadoop
  • Spark
  • Linux
  • Interviews
  • Toggle website search

Big Data

  1. Home>
  2. Big Data>
  3. Page 2

What is Columnar Data Storage and its Types?

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:0 Comments
  • Post published:May 21, 2021

In recent modern Big Data applications, numerous databases (NoSQL) have introduced columnar data storage, which provides several benefits over traditional row-oriented databases. Many Hadoop vendors like Cloudera, Hortonworks, and MapR…

Continue ReadingWhat is Columnar Data Storage and its Types?

Apache HBase Data Model

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:1 Comment
  • Post published:June 18, 2020

Apache HBase is an open-source, distributed, versioned, non-relational(NoSQL) database modeled after Google's Big Table. Even though this terminology overlaps with relational databases(RDBMS), the HBase table, in reality, is a multidimensional…

Continue ReadingApache HBase Data Model

What is Apache HBase? Architecture and Features

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:3 Comments
  • Post published:June 2, 2020

Apache HBase is an open-source, non-relational, distributed database modeled after Google's BigTable. It is developed as part of the Apache Software Foundation and is written in Java. It sits on…

Continue ReadingWhat is Apache HBase? Architecture and Features

What are Apache Pig execution Modes?

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:0 Comments
  • Post published:May 6, 2020

We can run Apache Pig Latin code and Pig statements using various modes. We will go through all of the Apache Pig execution modes in detail in this blog post.

Continue ReadingWhat are Apache Pig execution Modes?

Introduction to Apache Flume: Components and Channels

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:0 Comments
  • Post published:April 30, 2020

Apache Flume is an Apache open source project used for moving massive quantities of streaming data into HDFS. It collects log data from the web server logs files and aggregates…

Continue ReadingIntroduction to Apache Flume: Components and Channels

What is Complex Event Processing(CEP)?

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:1 Comment
  • Post published:December 28, 2019

Complex Event Processing (CEP) is a technique for tracking, analyzing, and processing incoming streams of data in real time and generating a summarized report. Event processing-based platforms have built-in capabilities…

Continue ReadingWhat is Complex Event Processing(CEP)?

Apache Sqoop Tutorial

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:2 Comments
  • Post published:November 4, 2019

Sqoop is a command-line interface application for transferring data between relational databases and Hadoop.This blog post will teach you basic tasks that you can perform with Sqoop.

Continue ReadingApache Sqoop Tutorial

What is Data Lake? Feature and Architecture

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:3 Comments
  • Post published:August 1, 2019

Introduction to Data Lake A Data Lake is a centralized data-centric storage architecture that is used for persisting a variety of data in its raw, unfiltered, and untransformed format. It…

Continue ReadingWhat is Data Lake? Feature and Architecture

Metadata Governance for Big Data Clusters

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:0 Comments
  • Post published:April 16, 2019

Managing metadata is an integral part of the overall data governance standard. An efficient way to do this is to establish data stewardship for metadata. It will ensure that data…

Continue ReadingMetadata Governance for Big Data Clusters

What is Apache Pig? Architecture and Components

  • Post author:nitendratech
  • Post category:Big Data
  • Post comments:3 Comments
  • Post published:September 23, 2018

Apache Pig is a high level dataflow language for analyzing large data sets. The data flows in a pipeline step by step and data can be stored at any point in the pipeline

Continue ReadingWhat is Apache Pig? Architecture and Components
  • Go to the previous page
  • 1
  • 2
  • 3
  • Go to the next page

Categories

  • Big Data
  • Data Science
  • Database
  • Hadoop
  • Hive
  • Interview
  • Java
  • Kafka
  • Linux
  • Programming
  • Scala
  • Spark
  • Technology

Recent Post

  • Data Engineering User Guide
  • Data Observability and Its Importance in Modern Data Tech Stack
  • Understanding difference between Stateless and Stateful Systems
  • What is Data Migration?
  • What is Large Language Models(LLM)?
  • Disclaimer
  • About Us
  • Contact Me
Copyright 2025 @Nitendratech.com