Most Popular

1500 questions
9
votes
1 answer

Where to find key log data for keyboard usage?

I am wondering if theres a dataset of key logs for keyboards of computers or even smartphones. It only needs to have the time and the key character for each key pressed. Such as: 9:00 4/2/2014, a 9:01 4/2/2014, ! I know it could be very sensitive…
Yulong
  • 518
  • 2
  • 5
9
votes
2 answers

Where can I find a large list of English books published in the last 50 years?

I need a dataset of books by title with their respective authors. I have tried the library of congress, but it seems as though you need a subscription to access its data? Are there any other website that would offer a dataset similar to this? I do…
TwoShorts
  • 409
  • 3
  • 8
9
votes
2 answers

publicly available spam dataset of social networks

I am trying to create a naive bayes spam filter for social networks and i need a spam dataset (containing posts/tweets etc.) of popular social networks like twitter/facebook. Is there any publicly available dataset which i can use to train the…
user3412336
  • 91
  • 1
  • 2
9
votes
4 answers

Cannabis Data Set

Looking for a data set of cannabis information. Already found the genome info off of amazon, but would be interested to know if there are any other botanical datasets out there like season cycles, nutrient sensitivity, etc. They do not need to be…
user2416
  • 91
  • 2
9
votes
4 answers

Tool to extract the main concepts/topics from web pages

I need to find the main themes or topics for each of 50,000 website urls. For example, http://austinzencenter.org/ should return topics such as zen, monastic, austin, retreat, buddha, etc. I'm guessing I can use services such as alchemyapi,…
dwenaus
  • 289
  • 2
  • 5
9
votes
4 answers

What is the best source for finding what businesses have opened/exist/closed in a given geographical area?

Is there a a standard source for finding out what businesses (with brick and mortar locations) exist in a geographical area (particularly in the U.S.)? This doesn't necessarily have to be a geographically-based query, but I am looking for…
batpigandme
  • 641
  • 1
  • 6
  • 17
9
votes
3 answers

Is there an open data format for screen/play scripts?

I'd like to write an app that would utilize dialog from a play script, but I don't want to reinvent the wheel. Is there an open format for encapsulating characters, locations, dialog, scenes, acts, etc? I was sure there must be, but all I see is…
Arian Kulp
  • 191
  • 3
9
votes
1 answer

Customer Service (Call Center) Audio datasets

I am looking for audio/video samples of conversations between customer service agents and customers. The background noise and accents are not a problem here so long as the conversation is in English. The dataset must be in an audio/video format and…
9
votes
1 answer

Text message corpus for American english

I am doing a research project to identify some patterns in how people interact with each other using text messages. The scope of the project is confined to American users. Can someone suggest a good place to find a corpus for American SMS/text…
Darth.Vader
  • 191
  • 2
9
votes
1 answer

Where can I find high-precision cartographic data of French rural areas?

While OpenStreetMap is rather good for cartographic information (and rendered maps) in urban areas of France, it is rather unsatisfactory in rural areas (except in the area where I live, where I improved the map!) The IGN local maps (1 : 25,000) are…
F'x
  • 193
  • 1
  • 12
9
votes
2 answers

Data Standards for Campaign Finance

While there are some nice APIs for campaign finance at the federal level, they differ…
fgregg
  • 5,108
  • 16
  • 37
9
votes
2 answers

Dictionary of misspelled words

I am looking for a dictionary of common misspelled words that can be used to highlight errors. I am not looking for a normal dictionary as I don’t want to hit things like names. It cannot include words like 'affect', which although is a common…
thewade
  • 91
  • 2
9
votes
3 answers

World gas/petrol prices at the pump

Is anyone aware of a source for data related to the world gas/petrol prices at the pump? The websites I've seen are all either selling data or just dealing with the global market price. I'm interested in the actual sale price per litre/gallon at…
geotheory
  • 459
  • 3
  • 9
9
votes
3 answers

How to get daily updates from Wikipedia?

I installed media-wiki on ec2 machine and loaded the Wikipedia page-articles dump content and other relevant data into that. I want to update the data regularly(daily) but I didn't get any resource for that. Is there any source from where I can get…
vinod
  • 91
  • 3
9
votes
2 answers

Historical values for the German "Sonntagsfrage"?

In Germany, there are several surveys on voter's preference ("If there were an election next sunday, which party would you vote for"), the so-called "Sonntagsfrage". Are historical values available? I found this graphical data, but I am looking for…
Karsten W.
  • 940
  • 5
  • 15