Homework (Due Feb 7-14)

Homework Week 1 (Due Feb 7)

Since we got a late start and you don’t have a full week, I have spread the homework out over several deadlines.

Send to me by 5 PM on Thursday:

URLs for three data sets that interest you. Use the subject “Homework
Week 1” and I’ll definitely see it.

By 9:30 AM Friday:

Install Tabula. If you get an error like “Tabula is damaged and can’t be opened. You should move it to the Trash,” the solution is not at all intuitive: you have to change your Privacy and Security settings to allow applications downloaded from “Anywhere” (it’s on the “General” tab).

Read Cairo, The Functional Art, Reading part 1 (pages 25-31 and 36-44), on thinking through a visualization as a tool for the reader: what graphical form best serves the goal? It is on e-reserve (access details are on the syllabus).

Skim http://perceptualedge.com/articles/ie/the_right_graph.pdf and http://www.jiscinfonet.ac.uk/infokits/data-visualisation/type-of-charts/

Due 5 PM Monday:

Write a short blog post that describes the provenance of each of your three data sets (who maintains it?) and where the data can be found (include a link), and explains, in fewer than 200 words each, why the data is interesting.

Due Feb 14:

Register for a Magellan account on CartoDB (use http://cartodb.com/academic to get the discount)

Make sure Firefox is installed on your computer with the Web Developer Toolbar extension.

Begin a scrapbook on WordPress, Tumblr, Pinterest or some other
aggregation service. Send me the URL.

4 thoughts on “Homework (Due Feb 7-14)”

  1. Minda

    So this is a very large data set, but I could focus on one specific city department (I’m thinking DHS) and look at the number of that department’s projects that are less than a certain percentage complete, or that have spent only a certain percentage of the funds awarded.


    I’m interested in stats about violent crime in NYC public schools but for this particular dataset I’m not sure what all of the column headings are detailing.



    Department of Buildings license fees: https://data.cityofnewyork.us/Business/DOB-License-Fees/vi6e-zw9u

    Department of Buildings complaints received: https://nycopendata.socrata.com/Housing-Development/DOB-Complaints-Received/eabe-havv?

    Building permits: https://nycopendata.socrata.com/Housing-Development/Building-Permits/tj4w-skrs?


    The musiXmatch dataset
    Built in partnership with the Million Song Dataset, a dataset created by the Echo Nest for developers who are looking to create music-related digital tools and apps.

    Food Scrap Drop-Off sites
    A list of sites (primarily greenmarkets) that accept food scrap drop-offs. The city does not maintain all of these.

    Piracy Data

    Data about the year’s Oscar-nominated films and how long they took to leak onto piracy networks.


    1. https://nycopendata.socrata.com/Housing-Development/DOB-Complaints-Received/eabe-havv?
    Elevators, boilers and lots of other vital stuff in buildings that may not be working.

    2. Patients ranking their hospital. Also lots of reviews on social media.

    3. How Much Does a Broken Leg Cost in Your Neighborhood?
    One issue with this dataset: costs are averaged together, as are insurance charges.

    4. Would love to do a side-by-side comparison of travel fatalities across car/train/plane incidents. Only issue is that these are not .xml files but query systems.
    car data
    train data
    plane data


    Social Media Usage:

    Projected Population:

    2013 Campaign Contributions:

  2. Irina

    Link for first two sets:

    First set, starting on p. 30, lists the number of financial assistance applications that hospitals approved with some comparison metrics.

    Second set, starting on p. 40, lists the payments hospitals received to provide financial assistance and the liens they put on the homes of patients who were unable to pay.

    Third set, on a slightly different topic, has data on cardiac surgeons in NYS and their work. https://health.data.ny.gov/Health/Cardiac-Surgery-by-Surgeon-Beginning-2008/dk4z-k3xb

    1. Vacant publicly owned land. (I would like to find vacant / undeveloped privately owned land as well.) https://data.cityofnewyork.us/Housing-Development/Vacant-Publicly-Owned-Land/c8vj-e9jc I might use it in combination with this landcover data and look at the environmental benefits of

    2. Federal Stimulus Data for NYC with Funding Category and Funding Source https://data.cityofnewyork.us/Business/Federal-Stimulus-Data/ivix-m77e

    3. I read a piece about a new Starbucks uptown and the economic changes this signifies (http://www.crainsnewyork.com/article/20140119/RETAIL_APPAREL/301199992/chain-reaction-in-inwood). I want to look at Starbucks locations in New York City in the context of other economic information. http://www.starbucks.com/store-locator/search/location/10033

    Here you are:
