Apache Superset is a modern data exploration and visualization platform. Superset is fast, lightweight, intuitive, and loaded with options that make it easy for users of all skill sets to explore and visualize their data, from simple line charts to highly detailed geospatial charts.
Preset Cloud (www.preset.io) is a fully hosted, hassle-free cloud service for Apache Superset™. We can start with the Starter Plan: hassle-free Superset in the cloud, best for small teams, and free forever for up to 5 users.
Features:
Unlimited dashboards and charts
No-code chart builder
Collaborative SQL editor
Over 40 visualization types
Chart and dashboard cache
Video walkthrough: https://youtu.be/49ItnEXsN7M
With this tutorial, we will learn the complete process to install Apache Spark 3.2.0 on Ubuntu 20.
Prerequisites: Spark runs on Java 8/11, Scala 2.12, Python 3.6+ and R 3.5+. Python 3.6 support is deprecated as of Spark 3.2.0, as is Java 8 prior to version 8u201. For the Scala API, Spark 3.2.0 uses Scala 2.12; you will need to use a compatible Scala version (2.12.x).
Steps for Installing Apache Spark
Step 1 - Create a directory, for example: $ mkdir /home/bigdata/apachespark
Step 2 - Move to the Apache Spark directory: $ cd /home/bigdata/apachespark
Step 3 - Download…
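A minimal sketch of how the download and environment setup typically continue, assuming the spark-3.2.0-bin-hadoop3.2 package from the Apache archive (verify the link on the official download page) and the directory created above:
# Download and unpack Spark 3.2.0
$ wget https://archive.apache.org/dist/spark/spark-3.2.0/spark-3.2.0-bin-hadoop3.2.tgz
$ tar -xvzf spark-3.2.0-bin-hadoop3.2.tgz
# Point SPARK_HOME at the unpacked directory and add its bin/ to PATH (append to ~/.bashrc to persist)
$ export SPARK_HOME=/home/bigdata/apachespark/spark-3.2.0-bin-hadoop3.2
$ export PATH=$SPARK_HOME/bin:$PATH
# Quick check of the installed version
$ spark-submit --version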
Marketing analytics consists of both qualitative and quantitative, structured and unstructured data used to drive strategic decisions in relation to brand and revenue outcomes.
Overall goal
You're a marketing analyst and you've been told by the Chief Marketing Officer that recent marketing campaigns have not been as effective as they were expected to be. You need to analyze the data set to understand this problem and propose data-driven solutions.
Section 01: Exploratory Data Analysis
Are there any null values or outliers? How will you wrangle/handle them?
Are there any variables that warrant transformations?
Are there any useful variables that you can engineer with the given data?
Do…
Are there any useful variables that you can engineer with the given data?
Review the list of feature names below, from which we can engineer:
The total number of dependents in the home ('Dependents') can be engineered from the sum of 'Kidhome' and 'Teenhome'
The year of becoming a customer ('Year_Customer') can be engineered from 'Dt_Customer'
The total amount spent ('TotalMnt') can be engineered from the sum of all features containing the keyword 'Mnt'
The total purchases ('TotalPurchases') can be engineered from the sum of all features containing the keyword 'Purchases'
The total number of campaigns accepted ('TotalCampaignsAcc') can be engineered from the sum of…
[Charts: NumStorePurchases vs MntGoldProds; MntFishProducts distribution; Campaign 1 through Campaign 5 acceptance]
Section 03: Data Visualization
[Charts: Products vs Amount Spent; Purchases]
Conclusion
Recall the overall goal: You're a marketing analyst and you've been told by the Chief Marketing Officer that recent marketing campaigns have not been as effective as they were expected to be. You need to analyze the data set to understand this problem and propose data-driven solutions.
Summary of actionable findings to improve advertising campaign success:
Advertising campaign acceptance is positively correlated with income and negatively correlated with having kids/teens
Suggested action: Create two streams of targeted advertising campaigns,…
With this tutorial, we will learn the complete process to install Apache Hive 3.1.2 on Ubuntu 20.
The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive.
Steps for Installing Apache Hive on Ubuntu
Step 1 - Create a directory, for example: $ mkdir /home/bigdata/apachehive
Step 2 - Move to the Apache Hive directory: $ cd /home/bigdata/apachehive
Step 3 - Download Apache Hive (the link will change with respect to country, so please get the download link from…
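A minimal sketch of how the download and environment setup typically continue, assuming the apache-hive-3.1.2-bin package from the Apache archive (verify the mirror link on the official download page) and the directory created above:
# Download and unpack Hive 3.1.2
$ wget https://archive.apache.org/dist/hive/hive-3.1.2/apache-hive-3.1.2-bin.tar.gz
$ tar -xvzf apache-hive-3.1.2-bin.tar.gz
# Point HIVE_HOME at the unpacked directory and add its bin/ to PATH (append to ~/.bashrc to persist)
$ export HIVE_HOME=/home/bigdata/apachehive/apache-hive-3.1.2-bin
$ export PATH=$HIVE_HOME/bin:$PATH
# Quick check of the installed version
$ hive --version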
Download a recent stable release from one of the Apache download mirrors: https://pig.apache.org/releases.html
Click on Download; a new page will open (https://www.apache.org/dyn/closer.cgi/pig). Click on the suggested mirror link, then open the latest folder and download the release archive.
We have downloaded the file into the directory /home/dataengineer/apachepig/. Unzip the file using the below command:
$ tar -xvzf pig-0.17.0.tar.gz
Add /pig-n.n.n/bin to your path. Use export (bash, sh, ksh) or setenv (tcsh, csh). For example:
$ export PATH=/<my-path-to-pig>/pig-n.n.n/bin:$PATH
Executing Pig Help Command
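With the archive unpacked, the help command gives a quick check that Pig is on the PATH; a small sketch assuming the pig-0.17.0 directory extracted above:
# Add the unpacked Pig 0.17.0 bin/ directory to PATH (append to ~/.bashrc to persist)
$ export PATH=/home/dataengineer/apachepig/pig-0.17.0/bin:$PATH
# Print the list of Pig command-line options
$ pig -help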
With this tutorial, we will learn the complete process to install Hadoop 3.3.1 on Ubuntu 20.
Supported Java Versions
Apache Hadoop 3.3 and later supports Java 8 and Java 11 (runtime only). Please compile Hadoop with Java 8; compiling Hadoop with Java 11 is not supported (HADOOP-16795 - Java 11 compile support).
Apache Hadoop from 3.0.x to 3.2.x supports only Java 8.
Apache Hadoop from 2.7.x to 2.10.x supports both Java 7 and 8.
Required software for Linux includes: Java must be installed (recommended Java versions are described at HadoopJavaVersions), and ssh must be installed and sshd must be running to use the Hadoop scripts that…
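On Ubuntu 20 the required software can be installed with apt; a minimal sketch, assuming the OpenJDK 8 and OpenSSH packages from the standard repositories:
# Install Java 8 and verify the version
$ sudo apt-get update
$ sudo apt-get install openjdk-8-jdk
$ java -version
# Install ssh (client and sshd server) and pdsh, used by the Hadoop control scripts
$ sudo apt-get install ssh pdsh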
Use the following properties in the respective files.
File: nano etc/hadoop/core-site.xml
<configuration>
    <property>
        <name>fs.defaultFS</name>
        <value>hdfs://localhost:9000</value>
    </property>
</configuration>
File: nano etc/hadoop/hdfs-site.xml
<configuration>
    <property>
        <name>dfs.replication</name>
        <value>1</value>
    </property>
</configuration>
File: nano etc/hadoop/mapred-site.xml
<configuration>
    <property>
        <name>mapreduce.framework.name</name>
        <value>yarn</value>
    </property>
    <property>
        <name>mapreduce.application.classpath</name>
        <value>$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*:$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*</value>
    </property>
</configuration>
File: nano etc/hadoop/yarn-site.xml
<configuration>
    <property>
        <name>yarn.nodemanager.aux-services</name>
        <value>mapreduce_shuffle</value>
    </property>
    <property>
        <name>yarn.nodemanager.env-whitelist</name>
        <value>JAVA_HOME,HADOOP_COMMON_HOME,HADOOP_HDFS_HOME,HADOOP_CONF_DIR,CLASSPATH_PREPEND_DISTCACHE,HADOOP_YARN_HOME,HADOOP_HOME,PATH,LANG,TZ,HADOOP_MAPRED_HOME</value>
    </property>
</configuration>
Now check that you can ssh to the localhost without…
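A minimal sketch of the passphraseless-ssh check and of formatting and starting the single-node cluster, following the standard Hadoop single-node setup (commands are run from the Hadoop installation directory; ports are the Hadoop 3 defaults):
# Set up passphraseless ssh to localhost
$ ssh-keygen -t rsa -P '' -f ~/.ssh/id_rsa
$ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
$ chmod 0600 ~/.ssh/authorized_keys
$ ssh localhost
# Format the HDFS filesystem (first run only), then start HDFS and YARN
$ bin/hdfs namenode -format
$ sbin/start-dfs.sh
$ sbin/start-yarn.sh
# Web UIs: NameNode at http://localhost:9870/, ResourceManager at http://localhost:8088/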
Installing Superset from Scratch
In Ubuntu 20.04, the following command will ensure that the required dependencies are installed:
sudo apt-get install build-essential libssl-dev libffi-dev python3-dev python3-pip libsasl2-dev libldap2-dev
Python Virtual Environment
We highly recommend installing Superset inside of a virtual environment:
pip install virtualenv
You can create and activate a virtual environment using:
# virtualenv is shipped in Python 3.6+ as venv instead of pyvenv.
# See https://docs.python.org/3.6/library/venv.html
python3 -m venv venv
. venv/bin/activate
Installing and Initializing Superset
First, start by installing apache-superset:
pip install apache-superset
Then, you need to initialize the database:
superset db upgrade
Finish installing by running through the…
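A minimal sketch of how the remaining initialization steps typically continue, following the commands in the Superset documentation (the port and flags shown are the documented defaults; adjust as needed):
# Create an admin user (you will be prompted for username, name, email and password)
superset fab create-admin
# Load some example data to play with (optional)
superset load_examples
# Create default roles and permissions
superset init
# Start a development web server on port 8088
superset run -p 8088 --with-threads --reload --debugger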