Hadoop Archives - Projects Based Learning

Becoming a Data Engineer in 2025: Key Skills and Tools You Need

Becoming a Data Engineer in 2025: Key Skills and Tools You Need The data landscape in 2025 is more complex, […]

Advanced SQL Queries for Big Data Analytics: Use Cases and Examples

Introduction SQL (Structured Query Language) remains the backbone of data analytics, even in the era of big data. From relational

Bigdata Hadoop

How to Run Apache Druid on Docker Desktop (Windows OS) – A Step-by-Step Guide

Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics on large datasets. Running Druid on Docker Desktop

Bigdata Hadoop

Running Hive on Windows Using Docker Desktop: Everything You Need to Know

Apache Hive is a powerful data warehouse infrastructure built on top of Apache Hadoop, providing SQL-like querying capabilities for big

Bigdata Hadoop

The roadmap for becoming a Data Engineer

The roadmap for becoming a Data Engineer typically involves mastering various skills and technologies. Here’s a step-by-step guide: Step 1:

Bigdata Hadoop

Installing Apache Druid on the Local Machine

Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics (“OLAP” queries) on large data sets. Most often,

Bigdata Hadoop

Installing Single Node Kafka Cluster

In this tutorial, we will set up a single-node Kafka Cluster and run it using the command line. Step 1)

Bigdata Hadoop

Installing Apache Flume on Ubuntu

System Requirements: Java Runtime Environment – Java 1.8 or later Memory – Sufficient memory for configurations used by sources, channels

Bigdata Hadoop

Installing Apache Sqoop on Ubuntu

Step 1) Create a Sqoop directory by using the command mkdir sqoop so that we can download Apache Sqoop. Step

Bigdata Hadoop

Apache Hive Installation Steps on Ubuntu

With this tutorial, we will learn the complete process to install Apache Hive 3.1.2 on Ubuntu 20. The Apache Hive

Bigdata Hadoop

Apache Hadoop 3.3.1 Installation Steps on Ubuntu (Part 1)

With this tutorial, we will learn the complete process to install Hadoop 3.3.1 on Ubuntu 20. Supported Java Versions Apache

Bigdata Hadoop

Apache Hadoop 3.3.1 Installation Steps on Ubuntu (Part 2)

Use the following property in the respective files File: nano etc/hadoop/core-site.xml: <configuration> <property> <name>fs.defaultFS</name> <value>hdfs://localhost:9000</value>

Bigdata Hadoop

Top 1000+ Big Data Interview Question and Answers

With more companies turning to big data to run their business, the demand for talent is at an all-time high.

Bigdata Hadoop

Apache Hadoop 3.3.0 Single Node Installation on Windows 10 Part 1

Download Link for Apache Hadoop 3.3.0 URL : https://hadoop.apache.org/releases.html Click on the Binary it will open a new website https://www.apache.org/dyn/closer.cgi/hadoop/common/hadoop-3.3.0/hadoop-3.3.0.tar.gz (This

Bigdata Hadoop

Apache Hadoop 3.3.0 Single Node Installation on Windows 10 Part 2

We have downloaded hadoop installation files We need to move (that is cut and paste) From: Downloads Location To: C:\hadoop-3.3.0.tar