Most Popular
1500 questions
12 votes, 3 answers
Dataset of personal names
I'm looking for a dataset of personal names containing, for each name, as many of the following labels as possible:
first name(s)
middle name if any
last name(s)
nationality
country of residence
country of birth
age
sex
is married
number of siblings
number…
Franck Dernoncourt
12 votes, 6 answers
Where can I find a database of hotel property locations?
Looking for an open database of lat/long co-ordinates for hotels. I am aware of sites like Factual.com but I'm looking for something more open.
Conor
12 votes, 2 answers
Is there an open data API or service for dates of official government holidays?
Is there a service or open API that provides the dates of holidays in the US, including observed holidays?
Unfortunately, from my Google searches I've just found the paid services timeanddate.com and bank-holidays.com from…
chrisjlee
12 votes, 5 answers
Has any MOOC (Coursera, edX, Udacity or others) publicly released some of their student data?
I'm looking for educational datasets from MOOCs. PSLC DataShop contains some learning interaction data, but not from MOOCs. I'm especially interested in logs tracking students' activities such as browsing the website or submitting answers.
Franck Dernoncourt
12 votes, 2 answers
Job satisfaction data
Are there any datasets available freely online without restrictions that include individual level responses to questions about job satisfaction?
Ideally, I'd be interested in a datafile that included multiple questions about job satisfaction as well…
Jeromy Anglim
12 votes, 2 answers
GitHub license for code written by US Government Employee
When I open a new project on GitHub, I see a selection of licenses in a drop-down list, which currently looks like this:
None
Apache v2 License
MIT license
Affero GPL
Artistic License 2.0
BSD (3-clause) License
BSD 2-clause license
Eclipse Public…
Rich Signell
12 votes, 6 answers
Where can I find open data on historical forex rates for financial reporting purposes?
I am looking for historical foreign exchange (forex) data for financial reporting purposes. For example, I would like to distribute a data source that could be used for determining forex gains and losses on floated amounts within open source…
Chris Travers
11 votes, 3 answers
Is public transport data for Germany freely available?
I wonder if there are any up-to-date websites with data for S-Bahn, U-Bahn, and other means of transport in German cities. I found some before, but I'm not sure whether they are updated regularly. Later I will add some links to those, but others are welcome…
Ewoks
11 votes, 4 answers
Examples of scraping from "real-world" data sources using OCR, etc?
Are there any examples of open data that have been scraped from "real-world" sources? To explain what I mean by "real-world", here are some (made up) examples and then some criteria:
Train departure times from automated OCR of crowdsourced photos…
Croad Langshan
11 votes, 4 answers
Netflix Data set
One of the canonical examples of a big data competition was the Netflix prize data set. It seems to have disappeared from the Internet. Is that the case, or is it still accessible somewhere?
Sycorax
11 votes, 1 answer
Statistics on car life length for each car model?
The authorities in each country register cars when they are bought or scrapped. Are there open statistics on the average lifetime of each car model?
cars
11 votes, 2 answers
A standard format for English vocabulary
Is there a standard format for English vocabulary lists, and are there data sources in that format? (For example, a standard XML format.)
For example, SAT vocabulary or Basic English words, including definitions, examples, phonetics, ... in that format.
Ahmad
11 votes, 2 answers
Data on refugee migration
Are there any current data sets on the migration of refugees in Europe available? As information seems to be sparse, I don't really care what kind of data. For example, data on migration routes or registration data would be interesting, preferably…
Eekhoorn
11 votes, 2 answers
Series of integers to test sorting algorithms
That may sound trivial, but do any of you know where I could find a database of test cases for sorting algorithms? It's common to test sorting algorithms on corner cases or with datasets with specific patterns (sorted values, decreasing values, pipe…
Morwenn
11 votes, 2 answers
Predictive Maintenance Data
I'm eager to experiment further with Microsoft Azure Machine Learning and would like to find a data set to build a use case around predictive manufacturing. Microsoft already offers a data set (semiconductor) for a use case like this, but I would…
user4985694