For this assignment, you will need to download three different data sets:
https://www.newyorkfed.org/research/college-labor-market/college-labor-market_compare-majors.html
(scroll down to "School, Police, Health & Fire" and export as geoJSON, called schoolDistricts.json).
(If you have troubles downloading, here's the file: schoolDistricts.json
http://schools.nyc.gov/Accountability/data/TestResults/ELAandMathTestResults
We will use these data sets for later homework assignments. Since scraping the data takes time, save these data sets to use again for the future programs.
The work to be submitted is the same for the undergraduate and graduate versions of the course.
CMP 464/788 Homework: | |
---|---|
#1-3 |
Analyse the NY Fed's Labor Market Data for Recent Graduates (see link above) using a Principal
Components Analysis. There are three parts to this exercise:
#1: Submit your Python program as a .py file. #2: Submit a text file or screen shot that includes the covariance matrix. #3: Submit a screen shot of the graphics window containing the plot. |
#4-5 |
OPTIONAL (FOR EXTRA CREDIT)Using folium, create a map of the New York City School Districts (elementary and
middle school) and shade each districts by borough (that is, all districts in the Bronx
will be the same color; the districts in Brooklyn will be another color, etc.).
#4: Submit your Python program as a .py file. #5: Submit a screen shot of the graphics window containing the plot. |
#6-7 |
Using the New York City data for district test scores, shade your map above by percentage
of students proficient in mathematics (i.e. scored a 3 or 4 on the exam-- the last column in the CSV file).
#6: Submit your Python program as a .py file. #7: Submit a screen shot of the graphics window containing the plot. |