Most Popular
1500 questions
5
votes
0 answers
Free quad polarization SAR images of San Francisco in bands X and L?
To compare the results in my scientific article, I need quad-polarized sample data for San Francisco in bands C (RADARSAT-2), L (probably ALOS-PALSAR), and X (probably TerraSAR-X).
Up to now I have downloaded band C of this data from GEOSPATIAL…
Sepideh Abadpour
- 161
- 2
5
votes
1 answer
Government standards, guidelines, and practices for collecting data
By standards, guidelines, and practices for collecting data I mean the rules that facilitate:
Practical value: Collecting data with a specific goal in mind
Data integrity: Minimizing omissions and biases
Infrastructure: Integrating the data into…
Anton Tarasenko
- 3,641
- 4
- 20
- 34
5
votes
1 answer
MIMIC-III. Select only the first ICU admission
How can I select data for only the first ICU stay of each patient? In MIMIC-II, there was a column with the sequence of ICU stays (first, second) and another column with the icustay_id. In MIMIC-III, icustay_id does not appear to be a good…
alexandreliborio
- 367
- 2
- 5
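One way to approach the MIMIC-III question above is to order each patient's ICU stays by admission time and keep the earliest. The following is a minimal pandas sketch, not a confirmed answer from the thread. It assumes the ICU stays have been loaded into a DataFrame with `subject_id`, `icustay_id`, and `intime` columns; the helper name `first_icu_stays` and the toy rows are hypothetical.

```python
import pandas as pd


def first_icu_stays(icustays: pd.DataFrame) -> pd.DataFrame:
    """Return one row per patient: the ICU stay with the earliest intime.

    Assumes columns subject_id, icustay_id, and intime (an assumption
    about the table layout, not something stated in the question).
    """
    stays = icustays.copy()
    stays["intime"] = pd.to_datetime(stays["intime"])
    # Sort so each patient's earliest ICU admission comes first,
    # then keep only that first row per patient.
    stays = stays.sort_values(["subject_id", "intime"])
    return stays.drop_duplicates(subset="subject_id", keep="first").reset_index(drop=True)


# Toy rows made up for illustration only.
toy = pd.DataFrame({
    "subject_id": [1, 1, 2],
    "icustay_id": [200002, 200001, 200003],
    "intime": [
        "2101-10-25 08:00:00",
        "2101-10-20 19:10:00",
        "2150-03-01 12:00:00",
    ],
})
first = first_icu_stays(toy)
```

Sorting by time rather than by `icustay_id` avoids relying on the identifier being assigned in chronological order.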
5
votes
1 answer
Is there a dataset on the existing flow of people between US cities? (Could be car, bus, train, or plane)
I am working on a project and would love to see how many people travel from one US city to another in a year. I know this is fairly extensive and might require putting together multiple datasets.
Emmanuel Brown
- 53
- 3
5
votes
1 answer
Historical NWS GFS forecast data
The National Weather Service (NWS) makes the results from its Global Forecast System model (GFS) available for download as GRIB2 files.
While accessing current (or recent) forecasts is straightforward, I am looking for the archived forecasts over…
Max
- 51
- 2
5
votes
1 answer
Statistics on the number of seats per car
I am looking for data on the (average) number of seats per car. I know one could assume this number to be 5, but I would like some statistics to support that assumption.
Filippo Bistaffa
- 151
- 2
5
votes
2 answers
Cost of 1-bedroom apartment rental in US by zip code?
I'm looking for the cost of housing by zip code; even better if I can get data on 1-bedroom apartments.
For clarification: I would like to query over many thousands of zip codes, and not just one at a time.
arschie
- 129
- 1
- 1
- 3
5
votes
0 answers
Publicly available dataset of physician notes
I am looking for a publicly available dataset of physician notes that describe medical reasoning, ideally free text indexed by the author's level of training.
MIMIC-III has physiologic data, and nursing and procedure notes. Procedure notes do…
mac389
- 151
- 6
5
votes
0 answers
Free UPC database?
I'm working with a food bank/soup kitchen and I'd like to be able to scan in donated food cans, packages, and other items. A short trip around the Google block seemed to offer UPC data of two sorts: good (but you must pay a lot) and bad (free). Is…
147pm
- 161
- 1
5
votes
1 answer
Dataset suggestions for teaching data science in a for-profit setting
I'm developing an ebook for a publishing company on Data Science. I'm hunting for a dataset that would be appropriate for this. I've seen many tutorials use iris, but I don't want to - I want to use a larger dataset that allows the audience to have…
Navaneethan Santhanam
- 161
- 4
5
votes
2 answers
Database or download of Amtrak station codes?
Is there an easily digestible source (i.e., CSV or something similar) for Amtrak rail station codes? They have a code page here, but I'm looking for something that could be included in a database lookup, without scraping and building it myself.
Peter Tirrell
- 253
- 1
- 7
5
votes
2 answers
Huge Biomedical Corpus for Unsupervised Experiments
I am looking for a very large collection (> 10 GB) of biomedical text documents to run some unsupervised experiments with (for recognizing drug names, etc.). Do we have access to such a thing for free? Or is it legal to crawl PubMed and use the…
user3639557
- 153
- 3
5
votes
1 answer
How do you maintain continuity in race definitions across time?
If you're looking to track enrollment of minority students over time, what's the best way to accommodate changes in IPEDS definitions after 2009?
I am presuming UGDS_WHITE (after 2009) maps to UGDS_WHITENH and UGDS_BLACK maps to UGDS_BLACKNH.
Looks…
arm5077
- 53
- 3
5
votes
2 answers
Larger data sets with random treatment (Randomized Trial Data)
We are looking for data sets which are divided into a treatment and control group and where a "treatment effect" can be identified.
It is important only that the sample is "large", since we want to be able to run computations on sub-samples. "Large"…
sheß
- 1,179
- 5
- 24
5
votes
1 answer
Tools and steps for converting 3-star data to 4-star
I am working on a project to upgrade a set of 3-star open data (mostly in CSV and JSON formats) to the 4-star level. Are there any guidelines for doing so? What tools may be needed to speed up the process? Examples of the…
Nexus
- 51
- 1