Data Visualization and Analysis, Part 2/3 – United States Department of Agriculture

February 24, 2016

Author: Qi Chen

1. Introduction

In the previous part, we introduced some data visualization techniques and showed the relationship among GDP, healthcare investment, and life expectancy by country. In this article, we focus on poverty and obesity, and analyze a dataset from the United States Department of Agriculture. Specifically, by applying data visualization techniques to the dataset, we want to explore the following questions:

1. How does the poverty rate vary across racial and ethnic groups?
2. Does the milk/soda price ratio influence obesity rate?
3. Are the obesity and diabetes rates higher in fast-paced states like New York than in agricultural states like Wisconsin?

The dataset has been aggregated and can be downloaded from this link.

2. Analysis

2.1 Software used

In this article, we will demonstrate the use of Tableau for data visualization.

2.2 Approaches

To explore the relation between poverty and ethnic groups, we rank the counties by poverty rate from highest to lowest. In Graph 1, the x-axis represents the FIPS (Federal Information Processing Standard) code of each county, which is a unique numeric identifier, and the y-axis displays the percentage of each population group in the county. The y-axis variables include:

POVRATE10:
Poverty rate of counties in 2010

PCT_NHWHITE10, PCT_NHBLACK10, PCT_NHASIAN10, PCT_NHPI10, PCT_NHNA10:
Percentage of the county population in each racial and ethnic group in 2010

From Graph 1, we find that the poverty rate tends to be higher in counties with larger Black and Native American population shares.
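One way to put a number on the visual pattern in Graph 1 is to compute the correlation between the poverty rate and each group's population share. The sketch below is only an illustration and is not the Tableau workflow used in this article: the column names come from the list above, but the three rows are made-up placeholders, not values from the USDA dataset, and the helper names `pearson` and `poverty_correlations` are hypothetical.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Made-up placeholder rows, NOT values from the USDA dataset.
counties = [
    {"POVRATE10": 30.0, "PCT_NHWHITE10": 40.0, "PCT_NHBLACK10": 50.0},
    {"POVRATE10": 20.0, "PCT_NHWHITE10": 60.0, "PCT_NHBLACK10": 30.0},
    {"POVRATE10": 10.0, "PCT_NHWHITE10": 80.0, "PCT_NHBLACK10": 10.0},
]

def poverty_correlations(rows, group_columns):
    """Correlate the poverty rate with each population-share column."""
    poverty = [row["POVRATE10"] for row in rows]
    return {
        col: pearson(poverty, [row[col] for row in rows])
        for col in group_columns
    }

result = poverty_correlations(counties, ["PCT_NHWHITE10", "PCT_NHBLACK10"])
for column, r in result.items():
    print(f"{column}: r = {r:+.2f}")
```

A positive coefficient means the group's share tends to rise with the county poverty rate. Because the data are county-level aggregates, such a coefficient describes counties, not individuals.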

Graph 1: Ethnic groups vs. poverty rate

Next, we examine the relation between the milk/soda price ratio and the obesity rate. In Graph 2, the x-axis represents the FIPS code of each county and the y-axis represents the milk/soda price ratio. The obesity rate of each county is encoded by color: the closer the color is to red, the higher the county's obesity rate. Graph 2 shows that counties on the left are more likely to be red, which suggests that a higher milk/soda price ratio is associated with a higher obesity rate.
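The reading of Graph 2 can be cross-checked without a chart by sorting counties on the price ratio and comparing the average obesity rate of the higher-ratio half with that of the lower-ratio half. The following is a minimal sketch under stated assumptions: the field names `milk_soda_ratio` and `obesity_rate`, the FIPS strings, and all values are hypothetical placeholders, not the dataset's actual columns or figures.

```python
# Hypothetical field names and made-up values, NOT taken from the USDA dataset.
counties = [
    {"fips": "00001", "milk_soda_ratio": 1.10, "obesity_rate": 33.0},
    {"fips": "00002", "milk_soda_ratio": 1.05, "obesity_rate": 31.0},
    {"fips": "00003", "milk_soda_ratio": 0.90, "obesity_rate": 27.0},
    {"fips": "00004", "milk_soda_ratio": 0.85, "obesity_rate": 25.0},
]

def mean_obesity_by_ratio_half(rows):
    """Sort counties by price ratio (highest first) and average obesity per half."""
    ranked = sorted(rows, key=lambda row: row["milk_soda_ratio"], reverse=True)
    mid = len(ranked) // 2
    high_half, low_half = ranked[:mid], ranked[mid:]

    def mean_rate(group):
        return sum(row["obesity_rate"] for row in group) / len(group)

    return mean_rate(high_half), mean_rate(low_half)

high_mean, low_mean = mean_obesity_by_ratio_half(counties)
print(f"higher-ratio half: {high_mean:.1f}, lower-ratio half: {low_mean:.1f}")
```

If the higher-ratio half shows a clearly larger mean on the real data, that would be consistent with the pattern described for Graph 2. An association of this kind does not by itself show that the price ratio influences obesity.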

Graph 2: Milk/soda price ratio vs. obesity rate

Graph 3 and Graph 4 show the obesity rate and the diabetes rate, respectively, for each state on a map. Red indicates a higher rate of obesity or diabetes, and green indicates a lower rate. The two graphs suggest that the two rates are generally positively correlated. States in the east are more likely to have higher obesity and diabetes rates.
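A green-to-red encoding like the one in these maps can be thought of as a linear interpolation between two colors. The sketch below shows that idea only; it assumes a plain two-color ramp and is not Tableau's actual palette or implementation. The function name `rate_to_rgb` and the rate range are hypothetical.

```python
def rate_to_rgb(rate, low, high):
    """Linearly map a rate in [low, high] to an RGB tuple from green to red."""
    if high <= low:
        raise ValueError("high must be greater than low")
    # Clamp so that out-of-range rates still receive a valid color.
    t = min(max((rate - low) / (high - low), 0.0), 1.0)
    red = round(255 * t)
    green = round(255 * (1 - t))
    return (red, green, 0)

# Made-up range endpoints, NOT values from the USDA dataset.
lowest = rate_to_rgb(20.0, low=20.0, high=35.0)   # pure green
highest = rate_to_rgb(35.0, low=20.0, high=35.0)  # pure red
print(lowest, highest)
```

With such a ramp, the color of a state depends on where its rate falls between the chosen minimum and maximum, so two maps are only directly comparable by color if their ranges are chosen with that in mind.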

Graph 3: Obesity Rate in U.S.

Graph 4: Diabetes Rate in U.S.

2.3 Results and Summary

In this demonstration article, we explore the poverty rate with respect to different population groups, as well as the obesity and diabetes rates for different states.

This article is based on a course project in Industrial Data Analytics, offered by Prof. Kaibo Liu at the University of Wisconsin-Madison in Spring 2015. I would like to thank Prof. Liu for his instruction and my teammates Li Xie and Sowmya Shankar for their initial work.

3. Source

http://www.ars.usda.gov/Services/docs.htm?docid=8964

One thought on “Data Visualization and Analysis, Part 2/3 – United States Department of Agriculture”

  1. Annie

    It would be very interesting to additionally look at ethnicity vs. obesity. This data has brought a lot of questions to mind! Will there be a follow-up piece comparing race and health metrics?
