Analysing linked data through a web-based browser

This project has developed a robust proof of concept for using a web-based browser to access and interrogate data.

Using state-of-the-art statistical approaches, it has shown that a web application can link and analyse different datasets in an accurate, rigorous and user-friendly manner.

Bringing together Met Office data on climate variables, daily ozone measurements and ONS mortality data for London over a five-year period, the project initially sought to replicate research that linked temperature, air quality and death rates using time series regression analysis.
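
As a rough illustration of what linking daily datasets and fitting a regression involves, the sketch below joins three daily series on their shared dates and fits a straight line of deaths against temperature. It is a deliberately simplified stand-in for time series regression, not the project's actual method. The function names (link_by_date, fit_line) and all the data values are hypothetical and exist only for this example.

```python
from datetime import date

# Hypothetical daily records keyed by date; every value is made up for illustration.
temperature = {
    date(2010, 7, 1): 18.0,
    date(2010, 7, 2): 21.0,
    date(2010, 7, 3): 24.0,
    date(2010, 7, 4): 27.0,
}
ozone = {
    date(2010, 7, 1): 40.0,
    date(2010, 7, 2): 55.0,
    # 3 July is missing from this source, so that day drops out of the linked data.
    date(2010, 7, 4): 70.0,
}
deaths = {
    date(2010, 7, 1): 136,
    date(2010, 7, 2): 142,
    date(2010, 7, 3): 148,
    date(2010, 7, 4): 154,
}


def link_by_date(*series):
    """Inner-join daily series: keep only dates present in every series."""
    shared = set(series[0]).intersection(*series[1:])
    return [(d, *(s[d] for s in series)) for d in sorted(shared)]


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Each linked row is (date, temperature, ozone, deaths).
linked = link_by_date(temperature, ozone, deaths)
slope, intercept = fit_line([row[1] for row in linked], [row[3] for row in linked])
print(f"{len(linked)} linked days, {slope:.2f} extra deaths per degree")
```

A real analysis of daily death counts would normally also adjust for season, long-term trend and other pollutants, which this sketch leaves out.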

The approach was to provide user-friendly input screens that guide users through the process, so they can perform their analysis without the technical skills that some statistical software packages (e.g. Stata) demand.

Having successfully replicated the research, the browser was extended to incorporate additional datasets such as pollen and daily emergency hospital admissions data – demonstrating how the application could be readily adapted to combine datasets from different sources and give users the ability to explore new hypotheses.

The flexibility of the tool also allows users to assess the impact of time lags on any relationships they might uncover.

For example, the effects of high or low temperatures on mortality levels can be assessed over a period of time, providing an insight into the amount of time authorities have to prevent deaths after an extreme temperature event.
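
One simple way to picture a lag analysis is to compare temperature on one day with deaths a fixed number of days later, and see which delay gives the strongest relationship. The sketch below does this with a plain Pearson correlation. It is a minimal illustration under stated assumptions, not the tool's implementation: the helper names are hypothetical, and the synthetic deaths series is built to respond to temperature exactly two days earlier.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)


def lagged_correlation(exposure, outcome, lag):
    """Correlate exposure on day t - lag with outcome on day t."""
    if lag:
        exposure, outcome = exposure[:-lag], outcome[lag:]
    return pearson(exposure, outcome)


# Synthetic daily temperatures (made up for illustration).
temperature = [20, 22, 31, 33, 24, 21, 20, 30, 32, 22, 20, 21]
# Synthetic deaths: baseline of 100, plus 2 per degree above 20 two days earlier.
deaths = [100, 100] + [100 + 2 * (t - 20) for t in temperature[:-2]]

correlations = {lag: lagged_correlation(temperature, deaths, lag) for lag in range(5)}
best_lag = max(correlations, key=correlations.get)
print(f"strongest association at a lag of {best_lag} days")
```

Because the synthetic data were constructed with a two-day delay, the strongest correlation appears at a lag of two days. Real data would show a noisier and often more spread-out pattern across several lags.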

It is hoped that, by accurately modelling thresholds and lag times, the tool will help to evaluate and improve current extreme temperature early warning systems, providing powerful guidance for Public Health England, local health authorities, healthcare professionals and vulnerable individuals.
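
To make the idea of a temperature threshold concrete, the sketch below assumes that deaths stay flat up to some temperature and then rise linearly above it. It tries a handful of candidate thresholds and keeps the one whose fit leaves the smallest squared error. This is one common way to model a threshold, offered as an assumption rather than a description of how the tool works. The function names, the candidate values and the data are all hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = intercept + slope * x; returns the squared error too."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return slope, intercept, sse


def heat_excess(temps, threshold):
    """Degrees above the threshold; zero on days at or below it."""
    return [max(0.0, t - threshold) for t in temps]


# Synthetic data: deaths are flat up to 24 degrees, then rise by 3 per degree.
temps = [16, 18, 20, 22, 24, 26, 28, 30]
deaths = [100, 100, 100, 100, 100, 106, 112, 118]

# Candidates stay below the hottest day so the excess term is never all zeros.
candidates = [18, 20, 22, 24, 26, 28]
fits = {c: fit_line(heat_excess(temps, c), deaths) for c in candidates}
best_threshold = min(fits, key=lambda c: fits[c][2])
slope, intercept, sse = fits[best_threshold]
print(f"threshold {best_threshold} degrees, {slope:.1f} extra deaths per degree above it")
```

The same search could be run at each lag to estimate a threshold and a lag time together, at the cost of many more candidate fits.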

Importantly, this research project has established an ongoing link between Met Office data and a human health database. In the future, it is hoped this will enable ongoing surveillance of links between weather and health, with the potential to provide rapid hypothesis testing and the ability to identify health risk ‘hotspots’.