Abstract
The discovery of subsets of data that are characterized by models that differ significantly from the entire dataset, is the goal of exceptional model mining. With the increasing availability of temporal data, this task has clear relevance in discovering deviating temporal subprocesses that can bring insight into industrial processes, medical treatments, etc. As temporal data is often noisy, high-dimensional and has complex statistical dependencies, discovering such temporal subprocesses is challenging for current exceptional model mining methods. In this paper, we introduce Temporal Exceptional Model Mining to capture multiple and complex relationships among temporal variables of a dataset in a principled way. Our contributions are as follows: (i) we define the new task of temporal exceptional model mining; (ii) we characterize the discovery of exceptional temporal submodels using dynamic Bayesian networks by means of a new distance measure, (iii) we introduce a search procedure for exceptional dynamic Bayesian networks optimized by properties of the proposed distance, and (iv) the practical value of the proposed method is demonstrated based on simulated data and process data of funding applications and by comparisons with other exceptional model mining methods.
Original language | English |
---|---|
Title of host publication | Advanced Analytics and Learning on Temporal Data |
Subtitle of host publication | 5th ECML PKDD Workshop, AALTD 2020 |
Editors | Vincent Lemaire, Simon Malinowski, Anthony Bagnall, Thomas Guyet, Romain Tavenard, Georgiana Ifrim |
Publisher | Springer International Publishing AG |
Pages | 97-112 |
Number of pages | 16 |
ISBN (Electronic) | 978-3-030-65742-0 |
ISBN (Print) | 978-3-030-65741-3 |
DOIs | |
Publication status | Published - 2020 |
Event | 5th ECML/PKDD Workshop on Advanced Analytics and Learning on Temporal Data - Online, Ghent, Belgium Duration: 18 Sept 2020 → 18 Sept 2020 https://project.inria.fr/aaltd20/ |
Publication series
Series | Lecture Notes in Computer Science (LNCS) series |
---|---|
ISSN | 0302-9743 |
Workshop
Workshop | 5th ECML/PKDD Workshop on Advanced Analytics and Learning on Temporal Data |
---|---|
Abbreviated title | AALTD 2020 |
Country/Territory | Belgium |
City | Ghent |
Period | 18/09/20 → 18/09/20 |
Internet address |
Keywords
- Bayesian networks
- Exceptional model mining
- Graphical models
- Machine learning
- Subgroup discovery
- Temporal data