Here are list of data and software projects that I’m working on or have worked on in the past. Consider these side projects.
Along with my friend Gaurav Sood, I’ve collected cricket match data for research and journalism analysis, including an article on ESPNCricInfo and a white paper, both covering the impact of winning the pre-match coin toss (full code here). Another product of these efforts is the Python library python-espncricinfo, which wraps the undocumented JSON API provided by the site. I’ve also written a Ruby wrapper, cricketer.
A Ruby on Rails app that, given a URL of a news story from The New York Times, The Washington Post and ProPublica, will find tweets containing the URL but not the title of the story.
I’ve developed a Ruby wrapper for washingtonpost.com articles and blog posts, including their comments. Potentially suitable for building custom feeds of Washington Post content, in the event that you don’t want to actually visit washingtonpost.com. Like ESPNCricInfo, the Post site has an undocumented API.
- Extractor - A proof-of-concept Python app for extracting text from URLs.
- LinkChecker - Web app to find links to Wikipedia in URLs.
- Paper of Record - A tiny JS app for tracking mentions of The New York Times in the Congressional Record.
- Binya - The Ruby wrapper for the Federal Reserve.
- USA Today Census - Ruby wrapper for the USA Today Census API