District Level Report Navsari

Author

Ayush Patel

Published

August 23, 2022

Introduction - Navsari

This report aims to provide a birds eye view of the district through the lens of village amenities data released by census in 2011. Before moving towards the descriptive insights from the census data, here is what pops up when a search is executed for Navsari district Gujarat on wikipedia.



Summary Statistics

Total Number of Villages: 372

Total Number of Gram Panchayat: 337

Total Number of Sub Districts: 5

Total Population : 9.20535^{5}

Statistical Summaries at the subdistrict level

Sub District Total Population Total SC Population Total ST Population Total Area of Vilalges Total Area Sown (Net)
Bansda 217156 836 203425 59345.99 31137.26
Chikhli 302852 5658 218860 57352.37 47559.66
Gandevi 154764 3268 57057 26863.18 16404.56
Jalalpore 136071 3431 30884 47232.14 22637.12
Navsari 109692 4155 61586 23299.94 18900.92

Population and Geographical Area

It is of interest to look into which are the most densely populated villages. We can do this by creating a simple scatter plot between population of village and the total geographical area of a village.


Irrigation for Agriculture

The census provides the net area sown (hectares) in a village along with area irrigated with water source in hectares. The area under irrigation may be affected by several factors.

Area Sown vs Area under Irrigation


A distribution for the percentage of area irrigated will be interesting to look at.

Understanding what drives area under irrigation

Much is heard about rain fed agriculture in India. There are several factors that can affect area under irrigation - ranging from government supports, demographics, distance from urban clusters and several known and unknown variables. With the given data we can check if the following variables have any relation with area under irrigation:

  • Percentage of Marginalised group population in village
  • Distance from Major government offices
  • Distance from urban center
  • Total population of a village

A simple Linear regression to see if the above explanation has any merit

Dependent variable:
perc_irrigated_over_net_sown
total_population_of_village -0.0002
(0.001)
perc_marginalised_pop 0.137**
(0.064)
district_head_quarter_distance_in_km -0.512***
(0.122)
sub_district_head_quarter_distance_in_km -1.313***
(0.404)
nearest_statutory_town_distance_in_km 1.023**
(0.441)
sub_district_nameChikhli 11.297**
(4.545)
sub_district_nameGandevi 23.818***
(6.601)
sub_district_nameJalalpore 15.440*
(8.169)
sub_district_nameNavsari 35.895***
(7.422)
Constant 50.800***
(9.589)
Observations 370
R2 0.465
Adjusted R2 0.452
Residual Std. Error 23.417 (df = 360)
F Statistic 34.793*** (df = 9; 360)
Note: p<0.1; p<0.05; p<0.01

Model Diagnostic plots

Distribution of Redsiduals


Residuals vs Fitted

Note

This is to serve as a minimal example of creating parameterised reports with .rmd/.qmd files. This document is in no way analytically or statistically rigorous.