Data Sources

The data sources used on this project can be found at the following locations.

Virginia Longitudinal Data Sytem

The Virginia Longitudinal Data System is a tool for extracting and analyzing insightful education and workforce development data within a secure environment. The data for this project was sourced specifically from The Virginia Department of Education website for data researchers and developers. The data can be found under the Test Data subsection.

United States Censu Bureau

The data on county level economic factors was sourced from:

U.S. Census Bureau; American Community Survey, 2009-2013 5-Year American Community Survey, Table DP03;
generated by Erik Nylander; using American FactFinder ; http://factfinder2.census.gov; (15 April 2012).

Data Cleaning and Aggregation

To gather the data for this project we needed to combine information from the US Census Bureau and sixty plus CSV files. Python was used to complete this task and an IPython notebook with the code and documentiation can be found here