| A | B | C | D | E | F | G | H | I | J | K | L | M | N | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | uuid | createdAt | significance | recommended_approach | file_types | estimated_size | url | title | harvest_method | harvest_url | bag_url | ckan_url | ||
2 | B2034437-B4FE-4B9F-A117-FEC4EDE41084 | 2017-02-11T20:45:12.485Z | This url is part of this larger one : http://www.archivers.space/urls/B59D4BAC-EE63-4E01-8F3B-8689B220ABD2 (7 datasets) GPM_3IMERGHHL: GPM L3 IMERG Late Half Hourly 0.1 degree x 0.1 degree Precipitation V03 Relevance/High Level Description: Satellite collected precipitation data that feeds into a climate prediction model and cloud classification model Specific Description: These are provided to both the Climate Prediction Center (CPC) Morphing-Kalman Filter (CMORPH-KF) Lagrangian time interpolation scheme and the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks Cloud Classification System (PERSIANN-CCS) re-calibration scheme. | You can find the ftp server address in the first xml file (click on online archive on the right). From then wget can download everything Please make a pdf with all information from the landing page and download associated documents : -Requires specific citation listed on landing page. Remember to save this along with each data file. -Documentation also located on landing page. Save this as well (PDF format) -Product Summary also contains important information regarding spatial coverage and relevant time range. Please ping me on slack (@tek) if you need help | NC4, XML, PDF | ~8 GB | https://disc.sci.gsfc.nasa.gov/uui/datasets/GPM_3IMERGDL_V03/summary | GPM_3IMERGDL: GPM (IMERG) Late Precipitation L3 1 day 0.1 degree x 0.1 degree V03 | Data was scraped using slightly modified [instructions](https://disc.sci.gsfc.nasa.gov/recipes/?q=recipes/How-to-Download-Data-Files-from-HTTP-Service-with-wget) offered by Goddard Earth Sciences Data and Information Services Center. * Create ~/.netrc and content like this: `machine urs.earthdata.nasa.gov login SOMELOGIN password SOMEPASS` * Creating a directory to handle cookies as ~/.urs_cookies The actual recursive fetch tries to flatten the directories. I think researchers will find the hierarchy more helpful, so maintained it with the --force-directories switch `wget --load-cookies ~/.urs_cookies --save-cookies ~/.urs_cookies --keep-session-cookies -r -c -nH -nd -np -A nc4,xml "https://gpm1.gesdisc.eosdis.nasa.gov/data/GPM_L3/GPM_3IMERGDL.03/" --force-directories` | https://s3.amazonaws.com/drp-upload/remote/B2034437-B4FE-4B9F-A117-FEC4EDE41084_1.zip | ||||
3 | E359DC1A-E56A-4085-A905-0EC99016CECA | 2017-02-11T22:41:41.175Z | https://tidesandcurrents.noaa.gov/inundation | |||||||||||
4 | 6C798F00-1BEE-4DBF-A3E4-2DF626E90F6D | 2017-02-17T19:43:09.847Z | https://triticeaetoolbox.org | |||||||||||
5 | 08304452-EC9E-4DB9-A077-150B78919325 | 2017-02-21T22:50:04.338Z | http://cdiac.ornl.gov/data_catalog.html | |||||||||||
6 | BEC2DC29-36AA-4026-9809-9376396B754B | 2017-02-11T18:44:59.966Z | http://climate.nasa.gov/vital-signs/arctic-sea-ice | |||||||||||
7 | 3CD7B737-E302-4503-BA3F-E79E78BE5656 | 2017-02-11T20:20:37.126Z | https://epa.gov/cleanups/cleanups-my-community | |||||||||||
8 | 1A505A57-2598-4D6A-A619-8510C0609075 | 2017-02-11T19:58:44.958Z | https://epa.gov/ejscreen/download-ejscreen-data | |||||||||||
9 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | http:///%22dir%22%20:%20%22../data/howdo_weknow/National%20Science%20Foundation/To%20What%20Degree%20-%20How%20Do%20We%20Know/%22, | |||||||||||
10 | 85803555-ABA8-4590-992D-DB9C9DF7D883 | 2017-02-11T23:32:07.525Z | The purpose of the PPS is to process, analyze and archive data from the Global Precipitation Measurement (GPM) mission, partner satellites and the TRMM mission. The PPS also supports TRMM by providing validation products from TRMM ground radar sites. All GPM, TRMM and Partner public data products are available to the science community and the general public from the TRMM/GPM FTP Data Archive | Registration required for access to data archive. Once registered, access is made available to a basic FTP with direct downloads to data. note: tried registering an email (registration immediate) for ftp on Feb 18, 2017 and was unable to connect to ftp://arthurhou.pps.eosdis.nasa.gov 2/19: Able to register and access FTP server. Many of the files/directories are actually shortcuts to other directories. | HDF.tar, .png, .html, .xml, .txt | >1TB | https://pps.gsfc.nasa.gov | NASA Precipitation Processing System | ||||||
11 | B711D258-EF09-4206-912D-DA672D50E190 | 2017-02-11T19:28:29.988Z | https://epa.gov/caa-permitting/electronic-permit-submittal-system-region-9 | |||||||||||
12 | D8FA88D5-99CE-479E-8F9E-142ED1F5B22E | 2017-02-11T04:28:33.901Z | ftp://arthurhou.pps.eosdis.nasa.gov | |||||||||||
13 | AC344CB4-0A5B-4E77-94AB-D583C1D69043 | 2017-02-21T20:58:51.994Z | https://acf.hhs.gov/ocs/resource-library/search?area[2106]=2106&sort=recent | |||||||||||
14 | 5A911807-B941-4662-86B0-5C00DA6F246E | 2017-02-04T18:09:38.205Z | broken link | http://ready.noaa.gov/READYmetdata.php | ||||||||||
15 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | http:///%22url%22%20:%20%22https:/www.youtube.com/playlist?list=PL0ujJTaPsv3erTG6jVPLIhYZ3TsYPDKGR", | |||||||||||
16 | 29C5D29E-FB79-4AA6-B39E-0EBCDEF2DDD3 | 2017-02-11T19:12:00.635Z | - Coal Production and number of mines by state, number of employees by mine, etc. - High significance | PDF, CSV | http://eia.gov/coal/annual | Annual Coal Report | python scraper | https://drp-upload.s3.amazonaws.com/remote/29C5D29E-FB79-4AA6-B39E-0EBCDEF2DDD3.zip | https://drp-upload-bagger.s3.amazonaws.com/remote/29C5D29E-FB79-4AA6-B39E-0EBCDEF2DDD3.zip | https://www.datarefuge.org/dataset/annual-coal-report | ||||
17 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | http:///%22url%22%20:%20%22https:/www.youtube.com/user/NASAClimate/videos%22, | |||||||||||
18 | 811C0BA2-B8E5-4D18-B1AA-502A964A6B36 | 2017-02-11T19:05:11.398Z | https://waterwatch.usgs.gov | |||||||||||
19 | 5A98CDB1-51F9-482E-9EF2-815F9E535613 | 2017-02-11T19:41:18.128Z | Power generation and greenhouse gas and pollutant emissions data across the US | .zip containing .txt, .xls and .xlsx, .pdf, .jpg, | 125 MB | https://epa.gov/energy/emissions-generation-resource-integrated-database-egrid | Emissions & Generation Resource Integrated Database (eGRID) | clicking "download" button | https://drp-upload.s3.amazonaws.com/remote/5A98CDB1-51F9-482E-9EF2-815F9E535613.zip | https://drp-upload-bagger.s3.amazonaws.com/remote/5A98CDB1-51F9-482E-9EF2-815F9E535613 2.zip | https://www.datarefuge.org/dataset/https-epa-gov-energy-emissions-generation-resource-integrated-database-egrid | |||
20 | 30BA41F7-F8B7-4CC1-B058-6A9E3160BD32 | 2017-02-11T19:37:12.808Z | https://ncdc.noaa.gov/data-access/model-data | |||||||||||
21 | 38AC23BB-B595-4DFA-B584-E85F5859F0C8 | 2017-02-11T19:05:06.197Z | https://epa.gov/risk/risk-tools-and-databases | |||||||||||
22 | 8B4D8EA4-C6C6-4479-95F3-DF1168571C0B | 2017-02-19T20:06:49.718Z | https://dropbox.com/sh/e5is592zafvovwf/AABVLBCyQEM0FQ3BadWwOfFka/CRU-TS%20v3.10.01?dl=0 | |||||||||||
23 | 4822A7AA-9174-415A-BC0C-CA8A7C9BEEAD | 2017-02-11T18:53:21.448Z | https://epa.gov/research/human-health-risk-assessment-research-methods-models-tools-and-databases | |||||||||||
24 | 833DD7B3-12F9-4447-A291-B27CAB76E5CD | 2017-02-17T23:02:55.068Z | https://ncdc.noaa.gov/cdr | |||||||||||
25 | 9F3975B0-5D34-458A-83C5-B4F45DA19C34 | 2017-02-04T17:14:22.784Z | wget | nc, nas, iso, zip, pdf, sgp, csv, xls, doc, pptx, doc | 200,000 | ftp://ftp.cmdl.noaa.gov | ||||||||
26 | 789399CC-2DC6-4011-BEDD-1C2082862055 | 2017-02-11T22:41:41.175Z | https://tidesandcurrents.noaa.gov/astronomical.html | |||||||||||
27 | 607D3252-2664-417E-928B-45DAC02D885B | 2017-02-04T17:21:31.148Z | Harvestable by IA? | XLSX | http://afdc.energy.gov/data | Python Request | https://drp-upload.s3.amazonaws.com/remote/607D3252-2664-417E-928B-45DAC02D885B.tar | |||||||
28 | 7F02E2F0-45FF-405A-B439-25EE72C5BD5D | 2017-02-18T18:19:21.763Z | "The Data Pool is an on-line data cache that provides FTP access to select ASDC data products." | ftp download (note: symlinks involved are confusing wget. Surprisingly can't find a standard tool that will mirror ftp following symlinks?) Unclear to me if the "OPeNDAP Access" links are the same data as the FTP links or different. | https://eosweb.larc.nasa.gov/datapool | ASDC Data Pool | ||||||||
29 | CBDBE862-EE16-4B35-B54E-54BE4CB3727E | 2017-02-18T20:35:17.942Z | https://nasa.gov/content/space-station-view-of-us-national-parks | Space Station View of U.S. National Parks | ||||||||||
30 | 8393A9FE-61D2-4AD6-87DD-EAC897794226 | 2017-02-11T22:38:05.217Z | Datum is a reference needed to interpret local tide data. This info is critical for interpreting long time series data | https://tidesandcurrents.noaa.gov/api/. Note, the API will only scrape current epoch data (post 1970). A custom scraper is needed to get superseded (1968-1978) data. | csv | https://tidesandcurrents.noaa.gov/stations.html?type=Datums | Datums - Station Selection | API, BeautifulSoup | https://drp-upload.s3.amazonaws.com/remote/8393A9FE-61D2-4AD6-87DD-EAC897794226 2_1.zip | |||||
31 | 0C87975E-C222-4BD2-8516-B4E623EB67CB | 2017-02-11T23:15:44.383Z | SCAN, SNOTEL, and water forecasting | Crawl Daily! Updated live weather station data (Not in the GHCN) and snow and water data | Mostly text files | GBs | https://www.wcc.nrcs.usda.gov/ftpref/data/ | Natural Resources Conservation Service - National Water and Climate Center | ||||||
32 | BA25099E-74FE-457F-B5B9-7EE666FA8FDB | 2017-02-11T19:45:42.074Z | https://iaspub.epa.gov/apex/waters/f?p=ASKWATERS:MAIN_MENU | |||||||||||
33 | 4AA0EAFC-300E-4B22-A328-AA1C0E30A6AD | 2017-02-18T15:22:14.931Z | This server contains research data from NASA's Gravity Recovery And Climate Experiment (GRACE). | Any recursive FTP approach will work, wget or other FTP tree script. Note this is an e | .tar.gz, .gz, .pdf, .txt | ~1TB | ftp://podaac-ftp.jpl.nasa.gov/allData/grace | NASA GRACE Satellite Data | wget -m ftp://podaac-ftp.jpl.nasa.gov/allData/grace | |||||
34 | 32EFDA12-4EDB-4122-9AE1-3FBA7AEBF7D4 | 2017-02-18T18:27:08.983Z | https://eosweb.larc.nasa.gov/project/misr/cmare_table | |||||||||||
35 | C5238DFE-56A7-43C2-8F90-3DCD30A01534 | 2017-02-11T19:26:43.940Z | https://ozoneaq.gsfc.nasa.gov/data/reflectivity | |||||||||||
36 | A187C3EB-FA09-4285-98DF-AD51821CBF4E | 2017-02-04T16:29:39.848Z | https://minerals.usgs.gov/minerals/pubs | |||||||||||
37 | 32EFDA12-4EDB-4122-9AE1-3FBA7AEBF7D4 | 2017-02-18T18:27:08.983Z | https://eosweb.larc.nasa.gov/project/mopitt/mopitt_table | |||||||||||
38 | 2B11C77E-7A56-44A8-8BC5-93D89CF33BE6 | 2017-02-17T19:14:08.716Z | https://epa.maps.arcgis.com/apps/webappviewer/index.html?extent=-146.2334,13.1913,-46.3896,56.5319&id=5f239fd3e72f424f98ef3d5def547eb5 | |||||||||||
39 | 23FD5067-BC1C-4173-988F-0CD194E56AB5 | 2017-02-11T19:28:00.971Z | https://ozoneaq.gsfc.nasa.gov/data/omps | |||||||||||
40 | 1F98FB6B-60F5-412D-9628-E7EF4FB7BCD4 | 2017-02-11T20:05:23.689Z | https://gispub.epa.gov/arcgis/rest/services | |||||||||||
41 | D334A6FB-9E3A-4D0D-89E9-EFBF8D3345BD | 2017-02-11T20:16:10.016Z | https://federalregister.gov/documents/search | |||||||||||
42 | 33CADAEC-116D-427B-99EE-B4FD8793F4C6 | 2017-02-17T22:26:24.191Z | https://data.giss.nasa.gov | |||||||||||
43 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | Submitted to Data Refuge site by Project ClimateScienceSave as potentially vulnerable videos that are on the subject of climate change. | YouTube-DL | xml, json, mkv, mp4, .description | 6210 | http:///[%20%7B | |||||||
44 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | http:///%22dir%22%20:%20%22../data/nasa_visualizations/NASA%20Scientific%20Visualization%20Studio/Earth/%22, | |||||||||||
45 | 8E495676-DD0F-449F-9E7A-6F6C60E02136 | 2017-02-04T16:31:49.542Z | https://cfpub.epa.gov/dmr/adv_search.cfm | |||||||||||
46 | 13085BD3-F379-4E81-B99A-C3F59EB228FD | 2017-02-04T17:47:31.345Z | EPA's Report on the Environment (ROE) shows how the condition of the U.S. environment and human health is changing over time. The ROE presents the best available indicators of national trends in five theme areas: Air, Water, Land, Human Exposure and Health, and Ecological Condition. | https://cfpub.epa.gov/roe | ||||||||||
47 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | http:///%22agency%22%20:%20%22Interior%22, | |||||||||||
48 | C9C6C6DC-14D3-421C-B758-FB7AFB1B5509 | 2017-02-04T16:29:39.916Z | https://minerals.usgs.gov/products/index.html | |||||||||||
49 | 6AEC71EF-335A-4C37-9C94-00798F47F301 | 2017-02-11T20:49:19.670Z | This url is part of this larger one : http://www.archivers.space/urls/B59D4BAC-EE63-4E01-8F3B-8689B220ABD2 (7 datasets) GPM_3IMERGHHL: GPM L3 IMERG Late Half Hourly 0.1 degree x 0.1 degree Precipitation V03 Relevance/High Level Description: Satellite collected precipitation data that feeds into a climate prediction model and cloud classification model Specific Description: These are provided to both the Climate Prediction Center (CPC) Morphing-Kalman Filter (CMORPH-KF) Lagrangian time interpolation scheme and the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks Cloud Classification System (PERSIANN-CCS) re-calibration scheme. | You can find the ftp server address in the first xml file (click on online archive on the right). From then wget can download everything Please make a pdf with all information from the landing page and download associated documents : -Requires specific citation listed on landing page. Remember to save this along with each data file. -Documentation also located on landing page. Save this as well (PDF format) -Product Summary also contains important information regarding spatial coverage and relevant time range. Please ping me on slack (@tek) if you need help | HDF-5, PDF | ~70 GB | https://disc.sci.gsfc.nasa.gov/uui/datasets/GPM_3IMERGM_V03/summary | GPM_3IMERGM: GPM L3 IMERG Final 1 month 0.1 degree x 0.1 degree precipitation V03 | Used the JS script (in /tools) to generate lists of URLs, then downloaded them with wget | |||||
50 | B56AA4A6-2065-404B-9114-C102811D063D | 2017-02-18T21:06:32.136Z | http://ndbc.noaa.gov/station_page.php?station=46071 | |||||||||||
51 | B7EE77A0-D1F8-4D19-B7FC-524FC0519ECE | 2017-02-11T23:24:10.328Z | https://lib.noaa.gov/collections/imgdocmaps/daily_weather_maps.html | |||||||||||
52 | F64C268A-6016-4DA6-A2B2-187B3D90FA65 | 2017-02-11T20:28:37.146Z | /!\ I am working on this one, please take another one (tek - 11/02 12h15) This url is part of this larger one : http://www.archivers.space/urls/B59D4BAC-EE63-4E01-8F3B-8689B220ABD2 (7 datasets) GPM_3IMERGHHL: GPM L3 IMERG Late Half Hourly 0.1 degree x 0.1 degree Precipitation V03 Relevance/High Level Description: Satellite collected precipitation data that feeds into a climate prediction model and cloud classification model Specific Description: These are provided to both the Climate Prediction Center (CPC) Morphing-Kalman Filter (CMORPH-KF) Lagrangian time interpolation scheme and the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks Cloud Classification System (PERSIANN-CCS) re-calibration scheme. | You can find the ftp server address in the first xml file (click on online archive on the right). From then wget can download everything Please make a pdf with all information from the landing page and download associated documents : -Requires specific citation listed on landing page. Remember to save this along with each data file. -Documentation also located on landing page. Save this as well (PDF format) -Product Summary also contains important information regarding spatial coverage and relevant time range. | HDF-5, PDF | ~70 GB | https://disc.sci.gsfc.nasa.gov/uui/datasets/GPM_3IMERGHHE_V03/summary | GPM_3IMERGHHE: GPM L3 IMERG Early Half Hourly 0.1 degree x 0.1 degree Precipitation V03 | wget on the FTP server here : ftp://gpm1.gesdisc.eosdis.nasa.gov/data/s4pa/GPM_L3/GPM_3IMERGHHE.03/ | |||||
53 | 21507C5F-BC76-4DD0-9770-7463DEE0B05D | 2017-02-18T19:41:41.552Z | https://science.nasa.gov/earth-science/decadal-surveys | Earth Decadal Survey | ||||||||||
54 | FDD2C5E8-5DBD-4FD4-8C91-C6FD5D9C3C5C | 2017-02-18T21:02:24.890Z | Link is acting strangely, just click the Data tab on top, to the right of the Home tab. | https://omg.jpl.nasa.gov/portal/browse | Oceans Melting Greenland | |||||||||
55 | 3E0F0A49-060D-40D8-B1AC-2097537DE254 | 2017-02-18T21:04:35.169Z | http://ndbc.noaa.gov/station_page.php?station=welm1 | |||||||||||
56 | 7C8BCEE6-7D74-4729-8F61-FD4FFC0027B6 | 2017-02-17T12:53:49.474Z | Air Quality Data - Raw data files, zips, notes, entire folder of ftp site | Pudasaini zip containing BP_ .csv files | 57 | ftp://newftp.epa.gov/AIR_QUALITY_DATA | newftp.epa.gov/AIR_QUALITY_DATA/ | |||||||
57 | 52A3EAD6-A01A-49AA-B84F-C6D3EE6C3CAB | 2017-02-11T18:53:39.827Z | Use API/REST as for https://www2.usgs.gov/water/ | https://waterdata.usgs.gov/nwis | ||||||||||
58 | FBB2A3CD-2A68-47DA-9B11-A107D4BCC2AB | 2017-02-11T22:35:42.972Z | historical data of ti | Not sure if there are historical versions of this or it is just the up-to-the-moment predictions | https://tidesandcurrents.noaa.gov/tide_predictions.html | NOAA Tide Predictions | Python- Beautiful Soup to get station IDs, requests to get txt files | |||||||
59 | E078AA97-8B36-49B8-B395-81CC5AE03A19 | 2017-02-11T22:09:36.854Z | http://nrel.gov/aim/npc.html | |||||||||||
60 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | http:///%22dir%22%20:%20%22../data/nsf_watercycle/National%20Science%20Foundation/To%20What%20Degree%20-%20The%20Water%20Cycle/%22, | |||||||||||
61 | 8FBD6DBB-2832-4941-A37B-5A0CFE4E798A | 2017-02-11T20:36:25.786Z | This url is part of this larger one : http://www.archivers.space/urls/B59D4BAC-EE63-4E01-8F3B-8689B220ABD2 (7 datasets) GPM_3IMERGHHL: GPM L3 IMERG Late Half Hourly 0.1 degree x 0.1 degree Precipitation V03 Relevance/High Level Description: Satellite collected precipitation data that feeds into a climate prediction model and cloud classification model Specific Description: These are provided to both the Climate Prediction Center (CPC) Morphing-Kalman Filter (CMORPH-KF) Lagrangian time interpolation scheme and the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks Cloud Classification System (PERSIANN-CCS) re-calibration scheme. | You can find the ftp server address in the first xml file (click on online archive on the right). From then wget can download everything Please make a pdf with all information from the landing page and download associated documents : -Requires specific citation listed on landing page. Remember to save this along with each data file. -Documentation also located on landing page. Save this as well (PDF format) -Product Summary also contains important information regarding spatial coverage and relevant time range. Please ping me on slack (@tek) if you need help | HDF-5, PDF | ~70 GB | https://disc.sci.gsfc.nasa.gov/uui/datasets/GPM_3IMERGDE_V03/summary | GPM_3IMERGDE: GPM (IMERG) Early Precipitation L3 1 day 0.1 degree x 0.1 degree V03 | wget | |||||
62 | 555BAE6B-5B5C-4677-91D3-CD48F525FD70 | 2017-02-11T22:05:05.526Z | The Soil Climate Analysis Network (SCAN) began as a soil moisture/soil temperature pilot project of the Natural Resources Conservation Service in 1991. The system is designed to provide data to support natural resource assessments and conservation activities. The SCAN system focuses on agricultural areas of the U.S. and is composed of over 200 stations. A typical SCAN site monitors soil moisture content at several depths, air temperature, relative humidity, solar radiation, wind speed and direction, liquid precipitation, and barometric pressure. | See https://github.com/antonpaquin/DataRescue-NRCS for a python script that does it The main bottleneck is on their end. | Smallish | https://www.wcc.nrcs.usda.gov/scan/ | Soil Climate Analysis Network (SCAN) Data & Products | |||||||
63 | B2F5923D-5119-4F6A-B9BC-BA96CFD41F82 | 2017-02-11T21:49:31.224Z | https://iaspub.epa.gov/sor_internet/registry/substreg/automatedservices/index.jsp | |||||||||||
64 | 46C864E3-316F-42BF-B2C2-06A86E28B3E4 | 2017-02-04T17:15:50.891Z | CSV | http://ftp.epa.gov | ||||||||||
65 | 5924B7B9-AD3B-44D0-9A51-162A90AB4AB3 | 2017-02-17T22:58:04.967Z | https://drought.gov/drought/search/data | |||||||||||
66 | BD2205C2-63EB-4B2B-836B-6B10F7EB9BFD | 2017-02-19T19:25:03.506Z | https://nomads.ncdc.noaa.gov/data/cfsr | |||||||||||
67 | C685462F-224B-4CC6-82D0-C53FFAA82DD1 | 2017-02-11T22:29:48.769Z | Field Campaign, Land Validation, Global and Regional Data. Field Campaign Data: ABOVE AirMOSS BOREAS and BOREAS Follow-On Carbon Monitoring System CARVE FIFE and FIFE Follow-On LBA NACP OTTER SAFARI 2000 Superior National Forest (SNF) Land Validation Data: ACCP BigFoot EOS Land Validation FLUXNET Data Sets FLUXNET Web Site MODIS Land Subsets PROVE Regional and Global Data: Climate Collections Daymet Hydroclimatology Collections ISLSCP II Project Net Primary Productivity (NPP) River Discharge (RIVDIS) Russian Land Cover (RLC) Soil Collections TransCom 3 Vegetation Collections VEMAP | Crawl this data periodically. Data is not updated every day. If you crawl this ftp site 4 times a year that could be sufficient. This ftp site is not widely known. | all kinds of text, vector, raster, and db data. Lots of netcdf | TBs | https://thredds.daac.ornl.gov/thredds/catalog.html | Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC) | Not harvested. Too much data | |||||
68 | D35A2258-3B0F-4096-9DD4-AD69524F1B97 | 2017-02-11T18:57:42.395Z | https://java.epa.gov/castnet/datatypepage.do?reportTypeId=REP_001&reportTypeLabel=Measurement (Raw Data) | |||||||||||
69 | 7A4F74E4-A84A-4987-80DD-F54B9D7AAC16 | 2017-02-11T19:00:41.573Z | https://java.epa.gov/castnet/datatypepage.do?reportTypeId=REP_006&reportTypeLabel=Cloud Deposition Data | |||||||||||
70 | 935BCC89-6F2D-464B-B47B-6838E1DB9213 | 2017-02-12T00:07:05.095Z | https://iaspub.epa.gov/sor_internet/registry/substreg/searchandretrieve/searchbylist/search.do | |||||||||||
71 | 9A2D2BEA-65AA-4FB8-98FE-305CA9CEF54C | 2017-02-11T18:28:08.240Z | Monthly updated estimates of global surface temperature change, using current data from NOAA GHCN v3 (meteorological stations), ERSST v4 (ocean areas), and SCAR (Antarctic stations). | TXT,CSV | 0.05 | https://data.giss.nasa.gov/gistemp | GISS Surface Temperature Analysis (GISTEMP) | Python web scraper | https://drp-upload.s3.amazonaws.com/remote/9A2D2BEA-65AA-4FB8-98FE-305CA9CEF54C_2.zip | https://drp-upload-bagger.s3.amazonaws.com/remote/9A2D2BEA-65AA-4FB8-98FE-305CA9CEF54C_2.zip | https://www.datarefuge.org/dataset/giss-surface-temperature-analysis-gistemp | |||
72 | AE18429A-0C7B-49CF-B6DD-8D6AB7F6C913 | 2017-02-19T01:28:04.609Z | Information about the EPA center for Computational Toxicology and some of their research is here - https://www.epa.gov/aboutepa/about-national-center-computational-toxicology-ncct | Download via FTP | ftp://newftp.epa.gov/COMPTOX | (FTP repository) - data relating to Computational Toxicology | ||||||||
73 | 27BA8E02-C78B-455F-90DA-2D999110997C | 2017-02-04T18:31:28.087Z | http://cloudsat.cira.colostate.edu/resources/interactive-plots | |||||||||||
74 | 39520C31-A16D-4C0B-96E9-FB6587B3AA25 | 2017-02-22T16:31:39.488Z | http://fec.gov/data | |||||||||||
75 | D48932BE-189A-4704-BA83-BFAAD1F39A81 | 2017-02-16T17:04:03.667Z | Search page for chemical assessments in the IRIS program. The broader IRIS website also includes other documents such as drafts for peer review and public comment. See parent item for full IRIS website: https://epa.gov/iris, UUID: 14BDABB8-F78D-41BC-A160-C6E631E39081 | Check status of parent: https://epa.gov/iris, UUID: 14BDABB8-F78D-41BC-A160-C6E631E39081 -- content of all assessments accessible through this search tool will be harvested there. | https://cfpub.epa.gov/ncea/iris/search/index.cfm | Integrated Risk Information System - Advanced Search | ||||||||
76 | 30C62098-C36C-4D27-831F-234262AD933B | 2017-02-18T21:06:50.864Z | http://ndbc.noaa.gov/station_page.php?station=21414 | |||||||||||
77 | F6222255-F453-4C26-A31A-5FC800D3716A | 2017-02-11T19:52:57.617Z | http://echo.epa.gov | |||||||||||
78 | 585E1383-373E-4437-A8AF-FC673247F41B | 2017-02-04T18:24:16.577Z | Download everything that can be accessed through the APIs listed here | http://earthdata.nasa.gov/api | ||||||||||
79 | 9D39BB37-051A-40BA-957E-0AF58BC3B56D | 2017-02-04T19:06:27.519Z | http://svs.gsfc.nasa.gov/cgi-bin/search.cgi?contentType=a | |||||||||||
80 | 5A911807-B941-4662-86B0-5C00DA6F246E | 2017-02-04T18:09:38.205Z | https://st.nmfs.noaa.gov/sisPortal | |||||||||||
81 | B4220F46-C9A9-4CE5-875F-65DD8E66D58A | 2017-02-08T14:32:37.386Z | This is covered by http://www.eia.gov/opendata/ 1EA13C4E-B1E3-4B53-9E20-EAF33BFB2CCA | https://eia.gov/consumption/residential | ||||||||||
82 | 97A18421-395A-4B57-86CC-1DC690F5C037 | 2017-02-11T19:29:37.725Z | https://watersgeo.epa.gov/beacon2 | |||||||||||
83 | 4A8854D3-17F8-42A1-A1E1-3FD13D302E3C | 2017-02-11T20:18:08.952Z | https://energy.gov/eere/femp/federal-energy-management-program | |||||||||||
84 | FB1E5A21-6658-4905-82F3-F683D6B958F8 | 2017-02-11T20:37:39.186Z | https://nsrdb.nrel.gov | |||||||||||
85 | F6EE4536-5CD7-4B43-9CA4-CEB54B83ECFC | 2017-02-11T19:15:53.184Z | https://epa.gov/air-emissions-inventories/air-pollutant-emissions-trends-data | |||||||||||
86 | 2DBEB724-432E-4764-9F4A-D7684F8FC348 | 2017-02-04T19:18:15.557Z | https://maps.bts.dot.gov/Transit | |||||||||||
87 | 32A58536-4746-4D67-9447-13CDAAF4C49F | 2017-02-11T20:18:49.747Z | https://epa.gov/waterdata/waters-web-services | |||||||||||
88 | D2B6060E-813F-4CB4-8EA3-4FE552A6F122 | 2017-02-11T19:16:02.419Z | This site serves USGS water data (https://waterdata.usgs.gov/nwis) via automated means using web services and extensible markup language (XML), as well as other popular media types. Services are invoked with the REST protocol. These services are designed for high fault tolerance and very high availability. | Use API/REST: https://waterservices.usgs.gov/rest/ There are six API services to access different data, each with its own documentation. Read the FAQ for info about request limits and why your IP might get blocked. Identical data to: https://waterdata.usgs.gov | XML, CSV | https://waterservices.usgs.gov | USGS Water Services | |||||||
89 | 6DD8BFC7-5482-478D-887D-EA677B48C737 | 2017-02-11T19:54:41.890Z | Daily weather station data for the globe but mostly for the USA. | CRAWL DAILY IF POSSIBLE! Live data is updated every day. | txt, csv | https://www1.ncdc.noaa.gov/pub/data/ghcn/daily | Global Historical Climatology Network - Daily (GHCN-Daily) (NOAA ftp site) | |||||||
90 | 2193925F-5CF5-4F0C-AF43-FB7F47C75742 | 2017-02-20T06:37:00.825Z | https://nodc.noaa.gov/cgi-bin/OC5/woa13/woa13.pl | |||||||||||
91 | E3637C06-5282-4064-8D65-64A020849441 | 2017-02-11T20:08:55.322Z | Complete collection of meteorological and solar irradiance data sets for the United States and a growing list of international locations. Link was pasted incorrectly but just use up until the first comma (https://nsrdb.nrel.gov/). Parent url (with links to 6 NREL databases) is C837F320-C5B4-47B4-B8B4-F33DA519BD51. | https://nsrdb.nrel.gov/,%20http:/www.nrel.gov/rredc/,%20https:/maps.nrel.gov/nsrdb-viewer | National Solar Radiation Database | |||||||||
92 | 31FC5979-285F-44AE-9261-2BC6E75B33B5 | 2017-02-11T22:41:41.175Z | https://opendap.co-ops.nos.noaa.gov | |||||||||||
93 | CFC872BC-794B-4BD0-91D4-A3BC6E0A6DDC | 2017-02-11T19:57:11.419Z | Contains tables for projected outlooks (2015 and beyond) for total energy supply, production and disposition. | - Direct download from "download" buttons. - "Publications and Tables" on the upper right also provides links for outlooks of other types and earlier years. | CSV | http://eia.gov/outlooks/aeo/data/browser | Annual Energy Outlook 2017 | |||||||
94 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | http:///%22url%22%20:%20%22https:/www.youtube.com/user/statevideo/search?query=climate", | |||||||||||
95 | 1CDC8FE9-B636-4BBF-94DC-3D74166CCD1C | 2017-02-22T16:37:12.963Z | https://beta.fec.gov/data/advanced | |||||||||||
96 | B13FF1BD-7653-4248-BAD3-F583818E6994 | 2017-02-11T19:25:35.928Z | https://ozoneaq.gsfc.nasa.gov/data/trace-gases | |||||||||||
97 | 77FE0540-051E-4064-BA3F-52BC266F9ADF | 2017-02-11T15:57:25.109Z | http:///%22dir%22%20:%20%22../data/nsf_carboncycle/National%20Science%20Foundation/To%20What%20Degree%20-%20The%20Carbon%20Cycle/%22, | |||||||||||
98 | 414DF865-91DF-424E-8D09-E9A6378B49F3 | 2017-02-08T14:37:31.306Z | Contains data on the features of all products that currently qualify for EPA's ENERGY STAR label. | via existing API | https://data.energystar.gov | |||||||||
99 | F1826B1E-6950-4E50-A73E-6087D31E11D8 | 2017-02-09T18:42:48.515Z | The BLM compiles a large amount of statistical information relating to oil and gas leasing on Federal lands. Below are links to tables and spreadsheets with data that include the numbers of BLM-administered oil and gas leases, applications for permit to drill, and oil and gas wells. Because the Federal Onshore Oil and Gas Leasing Reform Act of 1987 set the competitive lease requirement for public lands, 1988 is the first year for which some of this data is available. | 16 tables and a chart. Directly downloadable after expanding the section. | XLSX PDF | files in KB | https://blm.gov/programs/energy-and-minerals/oil-and-gas/oil-and-gas-statistics | Oil and Gas Statistics | ||||||
100 | FCA25F90-01BF-489A-A84D-A1212E3083A4 | 2017-02-17T22:44:04.659Z | http://ospo.noaa.gov/Products/index.html |