District Level Report Tapi

Author

Ayush Patel

Published

August 23, 2022

Introduction - Tapi

This report aims to provide a birds eye view of the district through the lens of village amenities data released by census in 2011. Before moving towards the descriptive insights from the census data, here is what pops up when a search is executed for Tapi district Gujarat on wikipedia.



Summary Statistics

Total Number of Villages: 488

Total Number of Gram Panchayat: 283

Total Number of Sub Districts: 5

Total Population : 7.27535^{5}

Statistical Summaries at the subdistrict level

Sub District Total Population Total SC Population Total ST Population Total Area of Vilalges Total Area Sown (Net)
Nizar 129969 2261 105043 35729.47 21058.63
Songadh 190084 368 185295 117988.76 40891.03
Uchchhal 88416 114 86693 46098.46 9632.94
Valod 90566 988 64868 20226.13 17087.02
Vyara 228500 1565 215330 80457.36 46233.03

Population and Geographical Area

It is of interest to look into which are the most densely populated villages. We can do this by creating a simple scatter plot between population of village and the total geographical area of a village.


Irrigation for Agriculture

The census provides the net area sown (hectares) in a village along with area irrigated with water source in hectares. The area under irrigation may be affected by several factors.

Area Sown vs Area under Irrigation


A distribution for the percentage of area irrigated will be interesting to look at.

Understanding what drives area under irrigation

Much is heard about rain fed agriculture in India. There are several factors that can affect area under irrigation - ranging from government supports, demographics, distance from urban clusters and several known and unknown variables. With the given data we can check if the following variables have any relation with area under irrigation:

  • Percentage of Marginalised group population in village
  • Distance from Major government offices
  • Distance from urban center
  • Total population of a village

A simple Linear regression to see if the above explanation has any merit

Dependent variable:
perc_irrigated_over_net_sown
total_population_of_village 0.0004
(0.001)
perc_marginalised_pop -0.166
(0.129)
district_head_quarter_distance_in_km 0.100
(0.117)
sub_district_head_quarter_distance_in_km -0.402**
(0.194)
nearest_statutory_town_distance_in_km -0.121
(0.182)
sub_district_nameSongadh -12.770
(10.955)
sub_district_nameUchchhal -6.028
(9.241)
sub_district_nameValod 29.963**
(11.815)
sub_district_nameVyara 15.284
(12.276)
Constant 57.969***
(17.702)
Observations 440
R2 0.314
Adjusted R2 0.300
Residual Std. Error 24.382 (df = 430)
F Statistic 21.916*** (df = 9; 430)
Note: p<0.1; p<0.05; p<0.01

Model Diagnostic plots

Distribution of Redsiduals


Residuals vs Fitted

Note

This is to serve as a minimal example of creating parameterised reports with .rmd/.qmd files. This document is in no way analytically or statistically rigorous.