Most Popular

1500 questions
15
votes
6 answers

What is the best way to request machine readable data from a FOIA request?

Maybe there is no way to request a specific file type, but what is the best way to request a file (paper or digital) that can be converted most easily?
cyclondude
  • 266
  • 2
  • 12
15
votes
4 answers

How do I share Open Data with others on this SE site?

I have some data which is Open (free from copyright, patents or other control mechanisms) and I want to share it with the users here. How do I do it without having to host it on some other server and share a link here? As links are destined to…
Nagendra Rao
  • 253
  • 1
  • 7
15
votes
3 answers

What are the available tools for managing crowdsourced data-cleaning tasks?

The Zooniverse team has had amazing success with the OldWeather project to crowdsource coding of old ships' logs to extract climate data. And I believe Sunlight did some crowdsourcing (with Mechanical Turk?) to build the Political Party Time…
skyebend
  • 903
  • 8
  • 11
15
votes
4 answers

Do you know any open/standard resume format?

I would like to know if there is any kind of open/standard resume or Curriculum Vitae format that allows resumes to be interchanged between systems. Any hints?
Diego Rosado
  • 153
  • 4
14
votes
7 answers

Should data APIs require registration and API keys?

When publishing a data API, there seems to be a conflict between requiring a user to sign up for an API key and the principle of open access. What is best practice in publishing an open data API whilst avoiding overuse/abuse? Details When you…
D Read
  • 2,361
  • 2
  • 16
  • 22
14
votes
4 answers

Paraphrase data sets

I am looking for paraphrase data sets. I am aware of the following: PPDB: The Paraphrase Database (Ganitkevitch, Juri, Benjamin Van Durme, and Chris Callison-Burch. "PPDB: The Paraphrase Database." HLT-NAACL. 2013.). Its English portion,…
Franck Dernoncourt
  • 7,780
  • 9
  • 39
  • 86
14
votes
7 answers

Dataset of major newspapers content

I'm looking for the materials of major newspapers, such as The New York Times, Washington Post, and The Economist. A random sample of their articles or headlines would suffice, but each newspaper has complications. The New York Times has an article…
Anton Tarasenko
  • 3,641
  • 4
  • 20
  • 34
14
votes
2 answers

API giving ship positions worldwide

ShipAIS has a great database and web interface showing what ships are in a particular area. Unfortunately, it is limited to Northern Europe. Is there a similar database for ships in the whole world? Free unlimited API availability strongly…
Nicolas Raoul
  • 8,426
  • 5
  • 28
  • 61
14
votes
2 answers

What are some OpenData torrents to seed?

BitTorrent can be an efficient way to "host" large data sets that are static. Also, BitTorrent is a good mechanism to distribute a large data set as part of an open data project without funding or official government recognition. As mentioned…
philshem
  • 17,647
  • 7
  • 68
  • 170
14
votes
3 answers

Where can I get bulk access to IRS 990 filings for US non-profits?

There's another question about a list of all nonprofits, and this answer is kind of buried in the answers to that one. Also, this question came up today on the NICAR-L mailing list. It's a common question, so it seemed worth documenting here. From…
Joe Germuska
  • 5,488
  • 20
  • 46
14
votes
1 answer

Large list of quotes

I am looking for a large database / list of quotes I can use. Preferably famous quotes / quotes by famous people. I have tried looking for a brainyquote API or something, but couldn't find one. If there isn't a large downloadable list of data, what…
TwoShorts
  • 409
  • 3
  • 8
14
votes
4 answers

Is there an exoplanet API or dataset?

I am looking for an API or dataset that has information about the known exoplanets. The properties of this dataset would include things like mass, orbital period, atmosphere, temperature, etc.
CaesiumFarmer
  • 2,088
  • 1
  • 14
  • 26
14
votes
4 answers

How does one parse weather data?

Weather data is often cited as a "huge success" for the open data community. Where does this data sit, and how does one parse the data into something readable, like the 5-day forecast?
Clay Johnson
  • 927
  • 7
  • 8
14
votes
5 answers

How can I get a list of all non-geodata datasets on Data.gov?

Data.gov has a lot of datasets, and a supermajority of them are mapping data. Is there a way to generate a list of all datasets other than geodata?
Waldo Jaquith
  • 363
  • 2
  • 7
14
votes
3 answers

Where can I find a dataset of songs labeled with their genre, BPM and key?

I am mostly interested in electronic songs. I need both the songs' waveforms and their labels (genre, BPM and key). If you know of a similar data set that doesn't contain all 3 labels, please still share. If possible, I want open data and downloading…
Franck Dernoncourt
  • 7,780
  • 9
  • 39
  • 86