COVID-19 Hackathons: Data Resources

  1. Accessing the NERC Environmental Data Service (EDS)
    1. How can I access the data?
    2. Who can access the data?
    3. What can I do with it?
    4. Is there any charge?
    5. I still haven’t found what I’m looking for
  2. Other Key Environmental Data Sources
    1. European Space Agency (ESA) and the Copernicus Data Service
    2. Met Office
    3. Ordance Survey OpenData mapping
  3. COVID-19 Data Sources and Resources
  4. Specific Useful Key Document Sources
  5. Tools sources

Datasheet resources thumbnail
Download as PDF

The NERC COVID-19 Hackathons set out a number of key societal challenges related to the COVID-19 Pandemic, to be addressed over the duration of the Hackathon events. In these challenges, participants will draw together and manipulate a range of datasets, for example from the physical and environmental sciences, health and epidemiology, social science. We leave the choice for the specific datasets to you to select to help deliver your specific Digital Solution. There is a vast array of datasets now made available, and your solution may use any data sources as you see fit. The purpose of this document is to signpost you to some of the more significant data resources. Most importantly, NERC, as part of UKRI, are making the collections of datasets held by the NERC Environmental Data Service available to participants. Advice on that is provided below:

Top of page

Accessing the NERC Environmental Data Service (EDS)

NERC operate a network of environmental data centres (https://nerc.ukri.org/research/sites/environmental-data-service-eds/) that provide a focal point for NERC’s scientific data and information. These centres hold data from environmental scientists working in the UK and around the world. The data centres are responsible for maintaining environmental data and making them available to all users, not just NERC researchers but others from science, commerce, government and education as well as the general public. NERC supports five data centres covering a range of discipline areas:

  • British Oceanographic Data Centre (BODC) (Marine)
  • Centre for Environmental Data Analysis (CEDA) (Atmospheric, Earth Observation, and Solar and space physics)
  • Environmental Information Data Centre (EIDC) (Terrestrial and freshwater)
  • National Geoscience Data Centre (NGDC) (Geoscience)
  • Polar Data Centre (PDC) (Polar and cryosphere)

In addition, NERC contracts the Archaeology Data Service to manage and make available data arising from NERC funded research in Science Based Archaeology.

The NERC Data Catalogue Service provides an integrated, searchable catalogue of the data holdings of NERC’s data centres. It can be used to find information on what data the NERC data centres hold and how to access these data.

For accessing NERC datasets, the following Q&A should help:

How can I access the data?

The NERC Data Catalogue Service (https://data-search.nerc.ac.uk) provides a central point of access to search NERC’s data holdings and information products. An alternative approach is to visit https://nerc.ukri.org/research/sites/data and follow the links to the individual domain NERC Data Centres noted above.

Who can access the data?

For the specific purpose of the hackathons, anyone can apply for access to information and data held by NERC. The relevant Data Centres will be able to provide more information.

What can I do with it?

For the specific purpose of the hackathons, the NERC data can be used for any purpose and is supplied under the Open Government Licence.

Note that there is held within the EDS some data from third parties. This is released only under licence terms. A discussion with the relevant Data Centre will help provide appropriate guidance.

Is there any charge?

For the specific purpose of the hackathons, access to the NERC data shall be provided free of charge. However, that there may be a charge for some information products containing third party data – something to discuss with the Centre staff.

I still haven’t found what I’m looking for

OK Bono, try contacting the relevant Data Centre for specific data enquiries. They have specialist domain knowledge on suitability of the NERC data sets.

Top of page

Other Key Environmental Data Sources

European Space Agency (ESA) and the Copernicus Data Service

Another key source of environmental data is the Copernicus Data Service. Of particular use for example could be the ERA5/ERA5t data that are provided by ECMWF. Those are available through the Copernicus Data Service. The NERC CEDA service have a limited selection of those data so it may be advised to make use of the data from the CDS as the primary source rather than the CEDA archive at present. This data is made available under the following licence. This means that Hackathon participants can make good use of them. Note there is an API to access the data from the CDS too.

For a summary of the ESA satellite platforms (Copernicus) and some ideas, see – https://www.esa.int/Applications/Observing_the_Earth/COVID-19_how_can_satellites_help

Met Office

The UK Met Office offer the COVID-19 data lab – https://covid19-response.informaticslab.co.uk/hub/login

To gain access to the data platform is free but you need a GitHub account and then that is linked to the logon for the site. There are instructions there to help you. At the moment there is data from 1st Jan this year as focus was on data from during the pandemic. It includes:

  • Global gridded hourly surface meteorological data – this is on a 10km grid from our global predictions
    • UK gridded hourly surface meteorological data – this is on a 1.5 km grid from our UK predictions
    • This data is also aggregated up to daily gridded average, min, max values. Done to reduce data volume and as lots of other people’s data is on coarse temporal scale and so forcing them to use hourly is not very ‘user friendly’
    • The Met Office have also further aggregated the data spatially onto health reporting regions, i.e. ‘get average temperature for a day for a county’. Again, this is because a lot of the health data is region based. This is done based on areas defined in shape files and easy to do for any areas. Example python code to do this is also included on the platform.
    • The Met Office have also added air quality gridded model data now and are keen to explore other data.
    • The platform uses Pangeo and the Met Office have set it up on Microsoft Azure with some compute power behind it and so this allows people to run analysis using a range of tools and languages (Python and R are fully set up).

Users have not only been from the health area but also form behaviourally science and policy areas, i.e. wanting to look at adherence to lock down rules in relation other weather.

Ordnance Survey

The UK Ordnance Survey offer a range of open datasets that you may find of interest. See https://www.ordnancesurvey.co.uk/opendatadownload/products.html.

Top of page

COVID-19 Data Sources and Resources

In addition to the environmental datasets outlined above, there are also a broad range of publicly accessible COVID-19 related data resources which may aid your work.

Listed here are some of these principal sources:

Top of page

Specific Useful Key Document Sources

You may find it helpful to review the following further documents as you develop your plans: 

Top of page

Tools sources

The following tools may be of use to you in developing the digital solutions you put forward:

Also, see our blog on Data Visualisation Tools

Leave a Reply

Your email address will not be published. Required fields are marked *