In this article, I explore the sentiment of people in India during demonetization. Even with a small dataset, I was able to draw useful insights. I used Spark SQL and the built-in charts provided by Databricks.
India is the second-most populous country in the world, with over 1.271 billion people, more than a sixth of the world’s population. Let us find out how people viewed demonetization by analyzing tweets from Twitter.
data:image/s3,"s3://crabby-images/072e9/072e9bb0a0e875e3aa362f96a040578f2d02969b" alt=""
Attribute Information or Dataset Details:
col_name | data_type | comment |
---|---|---|
retweeted | string | null |
isRetweet | string | null |
retweetCount | string | null |
screenName | string | null |
statusSource | string | null |
replyToUID | string | null |
id | string | null |
replyToSID | string | null |
truncated | string | null |
created | string | null |
replyToSN | string | null |
favoriteCount | string | null |
favorited | string | null |
text | string | null |
X | bigint | null |
_c0 | bigint | null |
Table Created in Databricks Environment
data:image/s3,"s3://crabby-images/5a0c5/5a0c5e324f32dc76630cef063fde11e8deee29e0" alt=""
data:image/s3,"s3://crabby-images/9f70a/9f70a71f8c35aab9f3e074e6af207f5807e5e08b" alt=""
Technology Used
- Apache Spark
- Spark SQL
- DataFrame-based API
- Databricks Notebook
Free Account creation in Databricks
Creating a Spark Cluster
Basics about Databricks notebook
Code for Spark SQL to get India's Tweet reaction during Demonetization
%sql select sum(retweetCount) as RetweetCount,created from demonetization group by created
data:image/s3,"s3://crabby-images/dfc2e/dfc2e9993680c3f60b251c6aec96586f2bf12a25" alt=""
Plot Option for Chart
data:image/s3,"s3://crabby-images/f55d3/f55d394e4b88c0ec20d9c0eb73fa7e0b49c57eed" alt=""
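The dataset details list `retweetCount` and `created` as strings, so the query has to convert the count to a number before summing, and grouping on the raw `created` value gives one group per distinct timestamp. The following is a minimal pure-Python sketch of the same aggregation rolled up per day. It assumes `created` looks like `2016-11-23 18:40:30`. That format, the sample rows, and the helper name `retweets_per_day` are illustrative assumptions, not values taken from the dataset.

```python
from collections import defaultdict

# Hypothetical sample rows; the real table has more columns and rows.
rows = [
    {"created": "2016-11-23 18:40:30", "retweetCount": "331"},
    {"created": "2016-11-23 18:40:29", "retweetCount": "66"},
    {"created": "2016-11-22 09:15:02", "retweetCount": "12"},
]

def retweets_per_day(rows):
    """Sum retweet counts per calendar day (assumes 'YYYY-MM-DD HH:MM:SS')."""
    totals = defaultdict(int)
    for row in rows:
        day = row["created"][:10]                 # keep only the date part
        totals[day] += int(row["retweetCount"])   # counts are stored as strings
    return dict(totals)

daily = retweets_per_day(rows)
for day, total in sorted(daily.items()):
    print(day, total)
```

Grouping on the date part rather than the full timestamp is a design choice that tends to give a smoother chart. The query above groups on `created` exactly as stored.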
Code for Spark SQL to get Types of Devices used to Tweet
%sql select sum(retweetCount), substring_index(substring_index(statusSource, ">", -2),"<",1) as status_source from demonetization group by substring_index(substring_index(statusSource, ">", -2),"<",1)
data:image/s3,"s3://crabby-images/efe69/efe69253b290532ac3d2d5e1b938cb4134703545" alt=""
Plot Option for Pie Chart
data:image/s3,"s3://crabby-images/58709/587096d1c0ebfdb719c29274c70c6604e3677c2e" alt=""
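The nested `substring_index` call is easier to follow with a worked example. The sketch below re-implements the function's behaviour in plain Python and applies it to one `statusSource` value. The sample anchor tag is an assumed example of what the column holds, and `device_from_source` is a hypothetical helper name.

```python
def substring_index(s, delim, count):
    """Mimic Spark SQL substring_index: the text before the count-th
    delimiter from the left (count > 0) or after the count-th delimiter
    from the right (count < 0)."""
    if count == 0 or not delim:
        return ""
    parts = s.split(delim)
    if count > 0:
        return delim.join(parts[:count])
    return delim.join(parts[count:])

def device_from_source(status_source):
    # Step 1: keep everything after the second-to-last ">",
    #         e.g. 'Twitter for Android</a>'
    tail = substring_index(status_source, ">", -2)
    # Step 2: keep everything before the first "<"
    return substring_index(tail, "<", 1)

# Assumed example of a statusSource value (an HTML anchor tag).
sample = '<a href="http://twitter.com/download/android" rel="nofollow">Twitter for Android</a>'
print(device_from_source(sample))
```

The inner call strips the opening tag and the outer call strips the closing tag, leaving only the client name to group on.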
Code for Spark SQL to get Number of Favorites During Demonetization
%sql select sum(favoriteCount),created from demonetization group by created
data:image/s3,"s3://crabby-images/b69fa/b69fa2fe9727593e466e7fee68976442f039bec3" alt=""
Code for Spark SQL to get Reaction of People on Demonetization
%sql select CASE WHEN text like '%Respect%' THEN "POSITIVE" WHEN text like '%symptom%' THEN "POSITIVE" WHEN text like '%terrorists%' THEN "POSITIVE" WHEN text like '%National%' THEN "POSITIVE" WHEN text like '%reform%' THEN "POSITIVE" WHEN text like '%support%' THEN "POSITIVE" WHEN text like '%#CorruptionFreeIndia%' THEN "POSITIVE" WHEN text like '%respect%' THEN "POSITIVE" WHEN text like '%Gandhi%' THEN "POSITIVE" WHEN text like '%vote%' THEN "POSITIVE" WHEN text like '%fishy%' THEN "NEGATIVE" WHEN text like '%disclosure%' THEN "NEGATIVE" WHEN text like '%Reddy Wedding%' THEN "NEGATIVE" WHEN text like '%protesting%' THEN "NEGATIVE" WHEN text like '%hards%' THEN "NEGATIVE" WHEN text like '%Kerala%' THEN "NEGATIVE" WHEN text like '%hurt%' THEN "NEGATIVE" WHEN text like '%USELESS%' THEN "NEGATIVE" WHEN text like '%Disaster!%' THEN "NEGATIVE" WHEN text like '%Black%' THEN "NEGATIVE" WHEN text like '%negative%' THEN "NEGATIVE" WHEN text like '%impact%' THEN "NEGATIVE" WHEN text like '%opposing%' THEN "NEGATIVE" ELSE "NEUTRAL" END AS Reaction from demonetization
data:image/s3,"s3://crabby-images/28bb1/28bb1075acedba25e96857bc128f8a2c8fa7ca09" alt=""
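The `CASE` expression is a keyword lookup in which the first matching branch wins and the match is case-sensitive, which is why both `Respect` and `respect` appear. A minimal Python sketch of the same rule, using only a few of the keywords from the query, could look like this. The sample tweets are made up for illustration, and `RULES` and `classify` are hypothetical names.

```python
from collections import Counter

# (keyword, label) pairs in the same order as the CASE branches (abbreviated).
RULES = [
    ("Respect", "POSITIVE"),
    ("reform", "POSITIVE"),
    ("support", "POSITIVE"),
    ("fishy", "NEGATIVE"),
    ("protesting", "NEGATIVE"),
    ("USELESS", "NEGATIVE"),
]

def classify(text):
    """Return the label of the first keyword found in text, else NEUTRAL."""
    for keyword, label in RULES:
        if keyword in text:          # case-sensitive, like SQL LIKE '%kw%'
            return label
    return "NEUTRAL"

# Hypothetical tweets, not taken from the dataset.
samples = [
    "Respect for the decision",
    "Something fishy here",
    "I support it but it looks fishy",
    "Long queues at the ATM today",
]

counts = Counter(classify(t) for t in samples)
print(counts)
```

Because evaluation stops at the first match, a tweet containing both a positive and a negative keyword is labelled by whichever branch comes first, so the order of the branches affects the resulting counts.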