11 Data Sources
Below you will find some online resources that you might want to explore when sourcing data for your coursework assignment and/or your dissertation. This is by no means an exhaustive list, but it summarises the data used in some of the practicals alongside some additional data sources.
Note
You are not limited to using these datasets for your coursework assignment and/or your dissertation.
11.1 Open Data
The following websites contain Open Data or link to Open Data from several reputable data providers:
- Google Dataset Search
- Tesco Store Data (London)
- NHS Data (ready for R)
- US City Open Data Census
- nomis
- ONS Geoportal
- UK Data Service
- ONS
- Edina (e.g. OS MasterMap)
- Open Topography
- USGS Earth Explorer
- Geofabrik (OSM data)
- Global Weather Data
- London Data Store
- AirBnB Data
- NASA SocioEconomic Data and Applications Center (SEDAC)
- UN Environmental Data Explorer
- WorldPop
- WorldPop GitHub
- DIVA-GIS
- DEFRA
- US Census Data
- TfL Open Data
- TfL Cycling Data
- EU Tourism Data
- NASA EARTHDATA
- Camden Air Action
- King’s College Data on Air Pollution
- Uber Travel Time Data
- Eurostat
- London Tube PM2.5 Levels
- Bike Docking Data (ready for R)
- UK COVID19 Data
- Johns Hopkins COVID19 Data (ready for R)
- Tidy Tuesday Data (not exclusively spatial data)
11.2 CDRC Data
Undergraduate students can apply to the CDRC for access to some of its Safeguarded data. To access any safeguarded CDRC dataset, you will need to follow the application process detailed on the CDRC website. It normally takes 4-5 weeks for an application to be granted.
As part of the process, you will need to explain in your application why you want that specific dataset and what you are going to do with it. You will also need to have considered the ethical implications of using that data and to include these considerations with your data application (alongside your standard ethics application).
In terms of some of the specific datasets available, you can apply for:
- Bicycle Sharing System Docking Station Observations
- CDRC Modelled Ethnicity Proportions - LSOA Geography
- NHS Hospital Admission Rates by Ethnic Group and other Characteristics
- Local Data Company - SmartStreetSensor Footfall Data – Research Aggregated data
- Speedchecker Broadband Internet Speed Tests
- FCA Financial Lives Survey
Note
Given that the application can take several weeks, the safeguarded CDRC datasets may be useful for your dissertation work but not for the GEOG0030 coursework assignment. Do keep in mind that some of the CDRC datasets are marked as Open Data and do not require this application process. In these cases, you can just register on the site and download the dataset directly.
11.3 Other Data
Some other webpages to explore:
- Free GIS data: a long list of GIS datasets on many different topics and covering many different areas.
- Awesome Public Datasets: a general collection of datasets, not limited to spatial data.