Most Popular
1500 questions
10
votes
8 answers
Is there a roadmap for opening all the data for a city or municipality?
In Chicago, a lot of data has been released on the City's Data Portal, but I have no idea how much is left to be opened.
The strategy I've noticed governments seem to follow is to start with a department head that is convinced open data is good and…
Derek Eder
- 101
- 4
10
votes
4 answers
Needed: Dataset for Outlier / Anomaly Detection
I would like to demonstrate outlier / anomaly detection and for that I need a real-life dataset.
I found the post
http://www.researchgate.net/post/What_are_anomaly_detection_benchmark_datasets
helpful, but I search for something simple and obvious…
guest
- 101
- 1
- 3
10
votes
2 answers
Public database of book titles?
I want my users to set their previously read books in my app.
Is there any database with easily accessible API where I can get up-to-date book titles?
János
- 899
- 8
- 20
10
votes
4 answers
Public domain paintings database
I'm looking for an API that would allow me to get a lot of painting on the public domain.
For example, Google's art project contains a lot of what I am looking for, but you can't easily download paintings (I will need something like 1000 of…
Rogue
- 201
- 2
- 4
10
votes
4 answers
A table mapping from US county or ZIP to Nielsen Designated Market Area (DMA)
Nielsen has come up with this notion:
A DMA region is a group of counties that form an exclusive geographic area[...]
DMAs partition all US counties, and I want to know which counties belong to which DMAs.
There are public GIS shape files for DMAs…
Frank
- 203
- 1
- 2
- 6
10
votes
1 answer
Phones dataset for speech recognition (not telephone number)
In phonetics a phone is a unit of speech sound.
I am looking for a dataset of phone waveforms for speech recognition. There are about 17 vowels and 17 consonants for English phones. I tried looking everywhere, but couldn't find a reliable…
Kevin
- 209
- 1
- 2
10
votes
2 answers
Transactional data over multiple years (Customer ID, Date, Price)
To build models to predict customer behavior, I am searching for transactional data over multiple years (i.e. > 3 years). My focus is to assess the quality of long-term predictions, thus the longer the time period the better.
Minimum requirement in…
majom
- 269
- 2
- 9
10
votes
1 answer
Interest of double licenses CC-BY-SA + ODbL for SVG maps
This is a split refactored from an older post containing too many questions about licenses in SVG Tiny 1.2 using RDFa as suggested by Patrick Hoefler. See also question Embed two licenses within SVG.
Introduction
LittleMap.org aims to allow anyone…
oHo
- 315
- 1
- 8
10
votes
2 answers
Data Licenses for US Government Data not in data.gov
I want to know when commercial use of US government data is permitted and for what uses. Below, there are 5 questions that I have for 4 different primary data sources.
So far, I have seen the data policy page at…
respectPotentialEnergy
- 1,550
- 1
- 10
- 11
10
votes
4 answers
Any Open (Structured) Datasets for the World Factbook (Public Domain Country Profiles Published by the CIA)?
The World Factbook [1][2] published by the Central Intelligence Agency (CIA) offers 267 free country 'n' territories profiles (incl. flags 'n' overview and locator maps) in the public domain (that is, no copyright(s), no rights…
Gerald Bauer
- 890
- 2
- 7
- 16
10
votes
1 answer
Dataset on donations to charities?
Is there a feed/database on individuals/companies that have donated to IRS verified charities?
I have done quite some Googling, and cannot seem to find anything :/
felix2018
- 101
- 3
10
votes
2 answers
High resolution, small area, maps that characterise natural terrain
I would like to collate a library of terrain types and features, suitable as reference material for generating small scale artificial versions (for gaming or concept art). For example, I might wish to view a typical part of the Mojave Desert, or…
Neil Slater
- 265
- 2
- 11
10
votes
3 answers
What quantified self products have open data behind them?
Are there quantified self products (fitbit, withings, nike fuelband, zeo, etc) that share bulk aggregate data with the public via API or download? For example: runs per zipcode, steps per county, or anonymized sleep quality logs?
Clay Johnson
- 927
- 7
- 8
10
votes
3 answers
Population movement datasets?
Are there any datasets available that contain real or simulated movement data of individuals over a period of time, or at least some metadata about population movements that might be used to build simulated models.
In particular, I'm looking for…
Bradley Dwyer
- 201
- 1
- 3
10
votes
2 answers
American English SMS Text Message Corpora
I'm seeking corpora of American English SMS (<=160 characters) text messages. What corpora are available?
Dan
- 520
- 4
- 15