Apache Hive Installation Steps on Ubuntu

With this tutorial, we will learn the complete process to install Apache Hive 3.1.2 on Ubuntu 20.

The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive.

Steps for Installing Hadoop on Ubuntu

Step 1 – Create a directory for example

$mkdir /home/bigdata/apachehive

Step 2 – Move to hadoop directory

$cd /home/bigdata/apachehive

Step 3 – Download Apache Hive (Link will change with respect to country so please get the download link from Apache Hive website ie https://hive.apache.org/downloads.html

https://downloads.apache.org/hive/hive-3.1.2/apache-hive-3.1.2-bin.tar.gz

$wget https://downloads.apache.org/hive/hive-3.1.2/apache-hive-3.1.2-bin.tar.gz

Step 4 – Extract this tar file

$tar -xzf apache-hive-3.1.2-bin.tar.gz

Step 5 – Open the bashrc files in the nano editor using the following command:

nano .bashrc

Edit .bashrc file located in the user’s home directory and add the following parameters:

export HIVE_HOME= “home/bigdata/apachehive/apache-hive-3.1.2-bin”
export PATH=$PATH:$HIVE_HOME/bin

Press CTRL+O and enter to save changes. Then press CTRL+X to exit the editor.

Step 6 – Open the core-site.xml file in the nano editor. The file is located in /home/bigdata/hadoop/hadoop-3.3.1/etc/hadoop/ (Hadoop Configuration Directory).

This location will differ based on your Hadoop installation.

Add the following configuration property in the core-site.xml file.

<configuration>
<property>
   <name>hadoop.proxyuser.dataflair.groups</name>
   <value>*</value>
</property>
<property>
   <name>hadoop.proxyuser.dataflair.hosts</name>
   <value>*</value>
</property>
<property>
   <name>hadoop.proxyuser.server.hosts</name>
   <value>*</value>
</property>
<property>
   <name>hadoop.proxyuser.server.groups</name>
   <value>*</value>
</property>
</configuration>