More information at http://www.ke.tu-darmstadt.de/lehre/ss20/pr-ki
An outbreak refers to the rapid spread of an infectious disease, with an
epidemic usually referring to a more severe and intense form.
The monitoring and surveillance of (especially contagious) diseases and the identification of current or future outbreaks is of high importance for ensuring the public health. Data mining and machine learning with their different methods can contribute to the support of public health authorities in performing this sensitive task.
In this practical course, we will investigate the usage of different data mining techniques to detect outbreaks and other outbreak related events. We will be working on a real data set provided by the Robert-Koch-Institute which provides information for Germany about infected cases over time and locations. This data set may also be combined with external data sources like weather information.
Outbreak detection involves different tasks. For instance, it includes the nowcasting of outbreaks but also the prediction of distributions of expected disease cases (forecasting). We will also put a special interest on interpretable models which allow to find valuable human-understandable relationships in the data which are relevant for outbreak detection.
Participants will explore and apply different techniques from data mining and machine learning such as time series analysis, outlier detection, auto-regression, weak learning and interpretable machine learning techniques. But also data preprocessing, including data cleansing and homogenization, are necessary tasks when working with real data.
- Dozent*in: Eneldo Loza Mencia