Most Popular

1500 questions
4
votes
1 answer

What encoding does the Census use in TIGER data?

What encoding does the Census distribute TIGER data in? The example on page 88 of PostGIS in Action, Second Edition seems to indicate latin1. shp2pgsql -s 4269 -g geom_4269 -I -W "latin1" ➥ "tl_2012_states" staging.tl_2012_states | ➥ psql -h…
Evan Carroll
  • 912
  • 1
  • 8
  • 21
4
votes
1 answer

Publishing (structured?) data about products: information about products, barcodes, photos (incl. packaging)

Is it legal to publish photos of products and their packaging? Can such be done anonymously on some open license? (which would be the best for such data to be open for public use, as products are generally available to see in shops for…
4
votes
0 answers

Dataset/database for the evolution of world's political entities (countries) along the years?

What would be the most accurate or complete source of data about the boundaries of states or other political entities, at different times in human history ? (In order to represent how the boundaries evolved along those years.)
wip
  • 141
  • 2
4
votes
2 answers

Where can I find machine readable transcribed text of the 2016 Presidential speeches and debates?

I'm looking for a machine-readable repository of transcribed Presidential debates and speeches from the 2016 general election. Structured data with a common format is desirable, as well as updates for the remaining speeches before the election.
philshem
  • 17,647
  • 7
  • 68
  • 170
4
votes
1 answer

Where can I find the training logs of (as many as possible) athletes?

Where can I find the training logs of (as many as possible) athletes? I would like to have that first-hand data, to be able to do some statistical research. Training logs of real athletes, from which perhaps very useful information could be obtained…
Mephisto
4
votes
1 answer

Satellite Images on OPeNDAP

I am looking for global satellite images(as recent as possible - say three hours ago) of meteorological data that can be accessed through OPeNDAP protocol i.e. subsetting via latitude, longitude and time. Meteosat does have them but only upto 2015 -…
user5883
4
votes
1 answer

Where can I find the graduate school college scorecard data

I want to know the data and information about graduate schools. Where can I find it?
olivia
  • 41
  • 2
4
votes
1 answer

Bittorrent usage for distributing open science-data?

Writing a grant proposal. - I am looking for information about the usage of the bittorrent protocol for open science data. Does anyone know of a recent paper, blog post or study? Specifically I am interested in the usage of science data from the…
knb
  • 1,227
  • 7
  • 17
4
votes
1 answer

How can I convert xy line-plots to textual data values?

I want to reverse engineer a simple XY-line graph to scalar data values. An example of such a plot is linked below, water temperature as a function of datetime. As far as I know, the raw data (sensor readings) and the processed numeric…
knb
  • 1,227
  • 7
  • 17
4
votes
1 answer

Historical land cover (land use) in Central Asia

I would be curious to know if there is GIS data of the historical land cover (land use) in Central Asia? It has to be finer grained than half a (spatial) degree like the DAAC dataset to be useful for my (socio-economic) analysis. I suppose that…
moezelot
  • 153
  • 7
4
votes
1 answer

Has CKAN been used as a data portal for any Open Science initiatives by an academic institution?

CKAN is the leading data portal solution, and is typically used by governments and their agencies. I am wondering if this software has been used by scientific institutions and universities for Open Science projects. Links to example sites and/or…
ted.strauss
  • 357
  • 1
  • 10
4
votes
7 answers

Where can I find a dataset containing legal documents?

I have a machine learning task I wish to pursue. For the task I will need several hundred sample legal documents of the following types: Employment contract, service contract, sale contract, rental contract/lease, loan contract, confidentiality…
Joshua
  • 151
  • 1
  • 5
4
votes
1 answer

User complaint/review description for running nlp-ner on telecom(preferably) data

I am looking for Consumer complaint/review description data which is not generated from automatic logs but manually entered by the end-user/customer-support. The objective is to run the Stanford NLP-NER tool on the user description for determining…
4
votes
2 answers

Bikesharing datasets

Im trying to obtain historical datasets of public transports like train, bus and also bikesharing provider. While api.citybik.es provides almost all APIs of these providers, i struggle to actually find a database with compiled data throughout a…
Sempui22
  • 43
  • 3
4
votes
0 answers

Existence of a diseases/symptoms database

For a personal project, I am looking for a database containing (possibly) every disease with its associated possible symptoms. Do you know if such a database is available somewhere?