Zeppelin

Running Apache Zeppelin on Docker Desktop (Windows OS)

Running Apache Zeppelin on Docker Desktop (Windows OS)

Apache Zeppelin is an open-source web-based notebook that enables interactive data analytics. It supports multiple languages like Scala, Python, SQL, and more, making it an excellent choice for data engineers, analysts, and scientists working with big data frameworks like Apache Spark, Flink, and Hadoop.Setting up Zeppelin on a Windows system can sometimes be tricky due to dependency and configuration issues. Fortunately, Docker Desktop makes the process simple, reproducible, and fast. In this blog, we’ll walk you through how to run Apache Zeppelin on Docker Desktop on a Windows OS, step-by-step.✅ PrerequisitesBefore you begin, make sure the following are installed on…
Read More
Customer Segmentation using Machine Learning in Apache Spark

Customer Segmentation using Machine Learning in Apache Spark

Customer segmentation is the practice of dividing a company's customers into groups that reflect similarities among customers in each group. The goal of segmenting customers is to decide how to relate to customers in each segment in order to maximize the value of each customer to the business. Problem Statement or Business Problem In this project, we will perform one of the most essential applications of machine learning – Customer Segmentation. We will implement customer segmentation in Apache Spark and Scala, whenever you need to find your best customer. Customer Segmentation is one of the most important applications of unsupervised…
Read More
Apache Zeppelin with Apache Spark Installation on Ubuntu

Apache Zeppelin with Apache Spark Installation on Ubuntu

Installation Steps for Apache Zeppelin on Ubuntu Prerequisite: Need to have Java 7 or Java 8 installed on Ubuntu Operating System. The first step is to download the latest version on Apache Zeppelin and save it in one of the folder Link: http://zeppelin.apache.org/download.html The second step is to unzip the downloaded tar file (i.e) .tgz (We have stored the downloaded tar file in /home/bigdata/apachezeppelin/ (We have manually created apachezeppelin folder by using command mkdir apachezeppelin) bigdata@bigdata:~$ cd /home/bigdata/apachezeppelin/ bigdata@bigdata:~/apachezeppelin$ pwd /home/bigdata/apachezeppelin bigdata@bigdata:~/apachezeppelin$ ls -ltr total 683072 -rw-rw-r-- 1 bigdata bigdata 699455687 Aug 15 11:27 zeppelin-0.9.0-bin-netinst.tgz bigdata@bigdata:~/apachezeppelin$ tar -xvzf zeppelin-0.9.0-bin-netinst.tgz zeppelin-0.9.0-bin-netinst/…
Read More