Most Popular
1500 questions
6
votes
3 answers
Restrict search to open datasets on CKAN's Data Hub?
The Data Hub, powered by CKAN, is an impressive collection of datasets available on the web. Currently it lists more than 6000 datasets — however, not all of them are available under an Open Data license.
Is there any way to only search for or list…
Patrick Hoefler
- 5,790
- 4
- 31
- 47
6
votes
1 answer
Requirements of the Open Data Commons Attribution License
When OCLC released their WorldCat data under Open Data Commons Attribution License last year, there was a question in the library community as to exactly what would be required for adequate attribution if you were to use the data in a mashup.
OCLC…
Joe
- 4,445
- 1
- 18
- 40
6
votes
2 answers
Is there an API or global source for US ballot information?
I'd like to be able to query to discover what's on the ballot - from local measures to Presidential elections.
Is there a source that combines all of this data? Or is there a standard that's relatively well implemented in the majority of locations…
TurtlePowered
- 163
- 4
6
votes
1 answer
CC-BY vs MIT or BSD licenses regarding re-use?
My understanding is that the MIT license allows sublicensing while the BSD license does not, meaning that license restrictions can be added to things like mashups and not to the original data as a whole (this view taken largely from Larry Rosen's…
Chris Travers
- 671
- 7
- 9
6
votes
1 answer
Search for deaths by a specific drug
I am not a programmer, so I am trying to understand the search methods to use when looking at OpenFDA. I am currently using the generic form for Reported Reactions. How do you get more the 10 adverse events and how do you get deaths to show up?
user2990
- 61
- 1
6
votes
3 answers
Are there any open datasets for Wrestling statistics
I'm looking for freestyle, greco, and folkstyle statistics. At the "professional" and college level.
To expand on my question (and in response to the comment below), for example, I'm looking for match results from the recent US Open held in Las…
ganders
- 161
- 5
6
votes
2 answers
Video game meta-data (supplement for Steam API)
Steam offers a REST-like API (details here and here) enabling a registered developer to obtain informations on its users (games owned, time passed on each...). However, I didn't find useful informations on the games themselves.
As a consequence, I'm…
merours
- 413
- 4
- 11
6
votes
2 answers
Redistributable Twitter-like data
For a tool that I'm working on, I'd like to to distribute an example data set that is Twitter-like. Twitter's terms of service preclude redistributing their data, so I know I can't use them directly. Here is what I need:
On the order of tens of…
Tom Panning
- 161
- 3
6
votes
1 answer
Domain Name System Record A database
Searching for a IP to TLD database lookup that is from an source that provides updates & archives. Possible this might require a number of sources, since I'm looking for all TLDs.
blunders
- 317
- 2
- 6
6
votes
2 answers
Where to contribute pictures of election leaftlets?
In most countries before elections, every citizen receives by mail an envelope containing paper leaflets printed by each political group, each describing its political program.
I think that these documents have a great historical value.
I have…
Nicolas Raoul
- 8,426
- 5
- 28
- 61
6
votes
1 answer
Dataset of adulteration incidents
I'm looking for a dataset of adulteration incidents containing as many following fields as possible:
name of the adulterant(s) (an adulterant is a substance found within other substances such as food, beverages, fuels, although not allowed for…
Franck Dernoncourt
- 7,780
- 9
- 39
- 86
6
votes
2 answers
african newpapers for the past 20 years for machine learning
I am starting a project looking at textual analysis (topic analysis and sentiment analysis) for newspapers African countries, including Gambia, Nigeria, and Tanzania. However, I was wondering if anyone knew of good repositories or archives of…
krishnab
- 459
- 2
- 12
6
votes
1 answer
Accurately Using Census Tract Data and Total Population
I am trying to gather the total population of Pittsburgh and overlay it onto a Google Map. I took the Census Tract .shp files from the U.S. Gazetteer Files for 2010 and put them in a fusion table and then ran the census API for the total…
user3271518
- 195
- 1
- 4
6
votes
2 answers
IRS Statistics of Income, machine-readable
Does anyone know of anyone else who has compiled a machine-readable set of the IRS's SOI Tax Stats - Historical Table 2, i.e., these?
The .xls files available from the IRS are crosstabbed and contain multiple subcategories. Difficult to translate…
JMcClure
- 293
- 1
- 7
6
votes
1 answer
Source for photographs of sample checks
I'm working on an OCR functionality around checks, specifically around a user taking a photograph of a check, and I'm looking to acquire a data set to quantify the success of our various features.
Does anyone know of such a set already in existence…
thomasmurphycodes
- 63
- 3