In this article, We have explored Census data for India to understand changes in India’s demographics, population growth, religion distribution, gender distribution, and sex ratio, etc. Even by using small data, I could still gain a lot of valuable insights about the country. I have used Spark SQL and Inbuild graphs provided by Databricks.
India is the second-most populous country in the world, with over 1.271 billion people, more than a sixth of the world’s population. Already containing 17.5% of the world’s population, India is projected to be the world’s most populous country by 2025, surpassing China, its population reaching 1.6 billion by 2050. Its population growth rate is 1.2%.
Attribute Information or Dataset Details:
Attribute Information or Dataset Details:
col_name | data_type | comment |
---|---|---|
SerialNo | string | null |
State | string | null |
District | string | null |
Persons | bigint | null |
Males | bigint | null |
Females | bigint | null |
Growthin1991to2001 | float | null |
Rural | bigint | null |
Urban | bigint | null |
ScheduledCastepopulation | bigint | null |
Showing 1 to 10 of 82 entries
Table Created in Databricks Environment

Technology Used
Technology Used
- Apache Spark
- Spark SQL
- DataFrame-based API
- Databricks Notebook
Free Account creation in Databricks
Free Account creation in Databricks
Creating a Spark Cluster
Creating a Spark Cluster
Basics about Databricks notebook
Basics about Databricks notebook
Code for Spark SQL to get India's States with Number of Districts

Plot Option for Chart
Plot Option for Chart
