As stated in the note from the Sunlight Foundation's Board Chair, as of September 2020 the Sunlight Foundation is no longer active. This site is maintained as a static archive only.

Fun with Google Spreadsheets and Fusion Tables

I've been having a lot of fun with Google tools today, and I wanted to share. This morning I wanted to generate this pie chart from the Data.gov data in my last post, but first I needed to get all the data out of the Data.gov Data Catalog.

Google Spreadsheets actually makes this really easy -- if you know what you're doing:

Step 1: Create a new spreadsheet, and put this little line in a cell:

=importHtml("http://www.data.gov/catalog#raw","table",2)

Step 2: There is no step 2. You're done.
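
For anyone curious what that one-liner is doing, here is a rough Python sketch of the same idea: pull the Nth table out of a page and return its rows. This is only an illustration, not how Google implements importHtml. The names `TableExtractor` and `import_html_table` and the sample HTML are made up for the example, and the sketch does not handle nested tables.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the cell text of every <table> in an HTML document."""

    def __init__(self):
        super().__init__()
        self.tables = []   # each table is a list of rows
        self._row = None   # row currently being filled, if any
        self._cell = None  # text fragments of the current cell, if any

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def import_html_table(html, index):
    """Return the index-th table (counting from 1) as rows of strings."""
    parser = TableExtractor()
    parser.feed(html)
    return parser.tables[index - 1]


# Hypothetical page: the first table is layout, the second holds the data.
SAMPLE = """
<table><tr><td>navigation</td></tr></table>
<table>
  <tr><th>Title</th><th>Agency</th></tr>
  <tr><td>Toxics Release Inventory</td><td>EPA</td></tr>
</table>
"""

rows = import_html_table(SAMPLE, 2)
print(rows)
```

The `2` plays the same role as the last argument in the spreadsheet formula: it picks the second table on the page.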

Cool, huh? You've now got a spreadsheet of all the data in Data.gov. But that's not what I wanted. What I wanted was a count of data sources by agency, to see who was providing the most data. The answer here was Google's new Fusion Tables. In Fusion Tables, I can import the data from my Google Spreadsheet, create an aggregation, and get the counts.

Google Fusion Tables (Pre-Alpha)

Easy data analysis without a lick of code.
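
If you would rather do the aggregation in code, a count by agency is only a few lines of Python. This is a minimal sketch, not what Fusion Tables does internally. The function name `count_by_agency`, the "Agency" column name, and the sample rows are all assumptions for illustration.

```python
from collections import Counter


def count_by_agency(rows, agency_column="Agency"):
    """Count records per agency; the first row is treated as the header."""
    header, *records = rows
    col = header.index(agency_column)
    return Counter(record[col] for record in records)


# Hypothetical rows, shaped like a spreadsheet export with a header row.
sample_rows = [
    ["Title", "Agency"],
    ["Dataset A", "EPA"],
    ["Dataset B", "USGS"],
    ["Dataset C", "EPA"],
]

counts = count_by_agency(sample_rows)
for agency, n in counts.most_common():
    print(agency, n)
```

`most_common()` returns the agencies sorted from most to fewest records, which is the ordering a pie chart or leaderboard would want.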

Surge of EPA data in Data.gov

Late afternoon yesterday, Data.gov went from 81 feeds to 261, and the EPA overtook the USGS as the agency providing the most data. The EPA added 180 new data files: the Toxics Release Inventory data for each state and territory, as well as for federal agencies, for 2005, 2006 and 2007.

This data is interesting stuff: dozens of CSV files (still in .exe compressed archives, ick) that speak to where corporations and government are managing toxic chemicals. But it isn't a clear win. The files are poorly documented, byte-delimited text. We do have some headers to get us started, but no real description of the actual files.

If you do end up working with this data for your Apps for America 2: The Data.gov Challenge entry, make some notes on how you parsed the data and let's create our own documentation for this data source.
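
One possible way to start those notes is to let a short script list each column with a non-empty count and an example value. This is a sketch under loud assumptions: the delimiter, the column names, and the sample rows below are invented, since the real file layout is exactly what is undocumented. `summarize_delimited` is a hypothetical helper name.

```python
import csv
import io


def summarize_delimited(text, delimiter):
    """Return {column: {"non_empty": count, "example": first value}}."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    rows = list(reader)
    fields = reader.fieldnames or []
    summary = {}
    for name in fields:
        # Skip empty strings and cells missing from short rows.
        values = [row[name] for row in rows if row[name]]
        summary[name] = {
            "non_empty": len(values),
            "example": values[0] if values else None,
        }
    return summary


# Made-up sample; the real column names and delimiter may differ.
SAMPLE = "FACILITY\tSTATE\tCHEMICAL\nPlant A\tOH\tLEAD\nPlant B\t\tMERCURY\n"

summary = summarize_delimited(SAMPLE, delimiter="\t")
for column, info in summary.items():
    print(column, info)
```

Pasting that output next to a sentence or two about what each column seems to mean would already be more documentation than the files ship with.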

Here's a breakdown of the data in Data.gov as of today:
