Most Popular

1500 questions
12
votes
3 answers

Dataset of personal names

I'm looking for a dataset of personal names containing, for each name, as many of the following labels as possible: first name(s), middle name if any, last name(s), nationality, country of residence, country of birth, age, sex, is married, number of siblings, number…
Franck Dernoncourt
12
votes
6 answers

Where can I find a database of hotel property locations?

Looking for an open database of lat/long co-ordinates for hotels. I am aware of sites like Factual.com but I'm looking for something more open.
Conor
12
votes
2 answers

Is there an open data api or service for dates of official government holidays?

Is there a service or open API that provides the dates of holidays in the US, including observed holidays? Unfortunately, from my Google searches I've only found paid services, timeanddate.com and bank-holidays.com, from…
chrisjlee
12
votes
5 answers

Has any MOOC (Coursera, edX, Udacity or others) publicly released some of their student data?

I'm looking for educational datasets from MOOCs. PSLC DataShop contains some learning interaction data, but not from MOOCs. I'm especially interested in logs tracking students' activities such as browsing the website or submitting answers.
Franck Dernoncourt
12
votes
2 answers

Job satisfaction data

Are there any datasets available freely online without restrictions that include individual level responses to questions about job satisfaction? Ideally, I'd be interested in a datafile that included multiple questions about job satisfaction as well…
Jeromy Anglim
12
votes
2 answers

GitHub license for code written by US Government Employee

When I open a new project on GitHub, I see a selection of licenses in a drop-down list, which currently looks like this: None, Apache v2 License, MIT license, Affero GPL, Artistic License 2.0, BSD (3-clause) License, BSD 2-clause license, Eclipse Public…
Rich Signell
12
votes
6 answers

Where can I find open data on historical forex rates for financial reporting purposes?

I am looking for historical foreign exchange (forex) data for financial reporting purposes. For example, I would like to distribute a data source that could be used for determining forex gains and losses on floated amounts within open source…
Chris Travers
11
votes
3 answers

Is public transport data for Germany freely available?

I wonder if there are any up-to-date websites with data for S-Bahn, U-Bahn and other means of transport in German cities. I found some before but am not sure whether they are updated regularly. Later I will add some links for those but others are welcomed…
Ewoks
11
votes
4 answers

Examples of scraping from "real-world" data sources using OCR, etc?

Are there any examples of open data that have been scraped from "real-world" sources? To explain what I mean by "real-world", here are some (made up) examples and then some criteria: Train departure times from automated OCR of crowdsourced photos…
Croad Langshan
11
votes
4 answers

Netflix Data set

One of the canonical examples of a big data competition was the Netflix prize data set. It seems to have disappeared from the Internet. Is that the case, or is it still accessible somewhere?
Sycorax
11
votes
1 answer

Statistics on car life length for each car model?

The authorities in each country register cars when they are bought or scrapped. Are there open statistics on the average lifetime of each car model?
cars
11
votes
2 answers

A standard format for English vocabulary

Is there a standard format for English vocabulary lists (for example, a standard XML), and are there data sources in that format? For example, SAT vocabulary or Basic English words, including definitions, examples, phonetics, ... in that format.
Ahmad
11
votes
2 answers

Data on refugee migration

Are any current data sets on the migration of refugees in Europe available? As information seems to be sparse, I don't really care what kind of data. For example, data on migration routes or registration data would be interesting, preferably…
Eekhoorn
11
votes
2 answers

Series of integers to test sorting algorithms

That may sound trivial, but do any of you know where I could find a database of test cases for sorting algorithms? It's common to test sorting algorithms on corner cases or with datasets with specific patterns (sorted values, decreasing values, pipe…
Morwenn
11
votes
2 answers

Predictive Maintenance Data

I'm eager to experiment further with Microsoft Azure Machine Learning and would like to find a data set to build a use case around predictive manufacturing. Microsoft already offers a data set (semiconductor) for a use case like this, but I would…
user4985694