4 Session 3: Big Data Quality
Dealing with uncertainty: Quality and Representation in the context of Big Data
- Dr Justin van Dijk, University College London
Over the past decade or so, the usage of large and novel data sources has become widespread within Geography and the wider social sciences. Where the usage of social media data has probably reached its peak, the usage of consumer data sources is now well established. Applications within the United Kingdom, for instance, range from using electoral roll data to get insight into population dynamics (cf. Lansley et al. 2019; Van Dijk et al. 2021) to using mobile phone traces to understand day-to-day mobility (cf. Trasberg and Cheshire 2021). Other popular applications are found within the domain of the sharing economy such as bicycle sharing systems (see Todd et al. 2021a) and AirBnB (see Todd et al. 2021b).
Whilst insightful, there is a clear interest in exploring the quality and representation of these data sets. One of more pertinent questions is what is the provenance of various sources of these novel data sources? What, if any, is the relationship between specific data source (e.g. provision of information about travel, retail services, public services) and fitness for purpose in socially inclusive analyses? How can we reconcile new Big Data with conventional data sources? This session seeks to be a platform to discuss exactly the challenges, opportunities, and barriers to do this.