Robust Deep Hawkes Process under Label Noise of Both Event and Occurrence

Read original: arXiv:2407.17164 - Published 7/30/2024 by Xiaoyu Tan, Bin Li, Xihe Qiu, Jingjing Huang, Yinghui Xu, Wei Chu

Robust Deep Hawkes Process under Label Noise of Both Event and Occurrence

Overview

Explains a method for making deep Hawkes process models more robust to label noise in both event and occurrence data
Proposes a framework to handle noise in event timing, event type, and whether an event occurred at all
Demonstrates improved performance on real-world datasets compared to baseline models

Plain English Explanation

The paper presents a new approach for training deep Hawkes process models that can better handle noisy data. Hawkes processes are a type of machine learning model used to predict the timing and types of future events based on historical event data.

In real-world datasets, the event data can be inaccurate or incomplete - for example, the timestamp of an event may be off, the event type may be mislabeled, or some events may be missing entirely. This "label noise" can significantly degrade the performance of standard Hawkes process models.

The proposed framework introduces several techniques to make the model more robust to these various types of noise. For example, it learns to estimate the true, underlying event data from the noisy observations. It also uses regularization methods to prevent the model from overfitting to the noise.

The authors demonstrate that their "Robust Deep Hawkes Process" model outperforms standard Hawkes models on several real-world datasets with different types of label noise. This suggests the techniques could be valuable for applying Hawkes processes to real-world applications with imperfect data.

Technical Explanation

The paper introduces the "Robust Deep Hawkes Process" (RDHP), a framework for training deep Hawkes process models that can handle noisy event data. Hawkes processes are a class of temporal point process models used to predict the timing and types of future events based on historical data.

The key innovation is that RDHP incorporates several mechanisms to make the model more robust to different types of label noise, including:

Estimating true event data: RDHP learns to estimate the true, underlying event data (timestamps, types, and occurrences) from the noisy observations. This helps the model focus on the true patterns in the data rather than the noise.
Noise modeling: RDHP models the noise distributions for event timestamps, types, and occurrences. This allows the model to account for and adapt to different types of noise.
Regularization: RDHP uses regularization techniques like dropout and L1/L2 regularization to prevent the model from overfitting to the noise in the training data.

The authors evaluate RDHP on several real-world event datasets with varying levels and types of label noise. The results show that RDHP outperforms standard Hawkes process models in terms of predictive accuracy, robustness to noise, and other key metrics.

Critical Analysis

The paper makes a valuable contribution by addressing an important practical challenge in applying Hawkes processes to real-world data, which is often noisy and imperfect. The proposed RDHP framework provides a principled way to handle different types of label noise, which is a common issue in event data.

One potential limitation is that the noise modeling approach relies on assuming specific parametric forms for the noise distributions (e.g., Gaussian for timestamps, categorical for event types). While this works well in the experiments, in practice the noise characteristics may be more complex and difficult to model. An avenue for future research could be exploring more flexible, nonparametric noise modeling techniques.

Additionally, the paper does not provide much insight into the interpretability of the RDHP model. Understanding how the noise estimation and robustness mechanisms affect the learned model parameters and their interpretations could be an interesting area for further study.

Overall, the Robust Deep Hawkes Process is a promising approach that addresses an important real-world challenge in a principled way. The experimental results demonstrate its practical value, and the techniques could potentially be applied to other types of temporal point process models beyond just Hawkes processes.

Conclusion

This paper presents the Robust Deep Hawkes Process (RDHP), a framework for training deep Hawkes process models that are more resilient to label noise in event data. By incorporating techniques to estimate true event data, model noise distributions, and use regularization, RDHP can outperform standard Hawkes models on real-world datasets with various types of noise.

The work addresses a critical challenge in applying Hawkes processes to practical applications, where event data is often imperfect or incomplete. The proposed methods show promising results and could help enable the wider adoption of Hawkes process models in domains with noisy data, such as finance, social media, and healthcare. Further research into more flexible noise modeling and the interpretability of RDHP could build on these contributions.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Robust Deep Hawkes Process under Label Noise of Both Event and Occurrence

Xiaoyu Tan, Bin Li, Xihe Qiu, Jingjing Huang, Yinghui Xu, Wei Chu

Integrating deep neural networks with the Hawkes process has significantly improved predictive capabilities in finance, health informatics, and information technology. Nevertheless, these models often face challenges in real-world settings, particularly due to substantial label noise. This issue is of significant concern in the medical field, where label noise can arise from delayed updates in electronic medical records or misdiagnoses, leading to increased prediction risks. Our research indicates that deep Hawkes process models exhibit reduced robustness when dealing with label noise, particularly when it affects both event types and timing. To address these challenges, we first investigate the influence of label noise in approximated intensity functions and present a novel framework, the Robust Deep Hawkes Process (RDHP), to overcome the impact of label noise on the intensity function of Hawkes models, considering both the events and their occurrences. We tested RDHP using multiple open-source benchmarks with synthetic noise and conducted a case study on obstructive sleep apnea-hypopnea syndrome (OSAHS) in a real-world setting with inherent label noise. The results demonstrate that RDHP can effectively perform classification and regression tasks, even in the presence of noise related to events and their timing. To the best of our knowledge, this is the first study to successfully address both event and time label noise in deep Hawkes process models, offering a promising solution for medical applications, specifically in diagnosing OSAHS.

7/30/2024

Unmixing Noise from Hawkes Process to Model Learned Physiological Events

Guillaume Staerman, Virginie Loison, Thomas Moreau

Physiological signal analysis often involves identifying events crucial to understanding biological dynamics. Traditional methods rely on handcrafted procedures or supervised learning, presenting challenges such as expert dependence, lack of robustness, and the need for extensive labeled data. Data-driven methods like Convolutional Dictionary Learning (CDL) offer an alternative but tend to produce spurious detections. This work introduces UNHaP (Unmix Noise from Hawkes Processes), a novel approach addressing the joint learning of temporal structures in events and the removal of spurious detections. Leveraging marked Hawkes processes, UNHaP distinguishes between events of interest and spurious ones. By treating the event detection output as a mixture of structured and unstructured events, UNHaP efficiently unmixes these processes and estimates their parameters. This approach significantly enhances the understanding of event distributions while minimizing false detection rates.

6/26/2024

Mamba Hawkes Process

Anningzhe Gao, Shan Dai, Yan Hu

Irregular and asynchronous event sequences are prevalent in many domains, such as social media, finance, and healthcare. Traditional temporal point processes (TPPs), like Hawkes processes, often struggle to model mutual inhibition and nonlinearity effectively. While recent neural network models, including RNNs and Transformers, address some of these issues, they still face challenges with long-term dependencies and computational efficiency. In this paper, we introduce the Mamba Hawkes Process (MHP), which leverages the Mamba state space architecture to capture long-range dependencies and dynamic event interactions. Our results show that MHP outperforms existing models across various datasets. Additionally, we propose the Mamba Hawkes Process Extension (MHP-E), which combines Mamba and Transformer models to enhance predictive capabilities. We present the novel application of the Mamba architecture to Hawkes processes, a flexible and extensible model structure, and a theoretical analysis of the synergy between state space models and Hawkes processes. Experimental results demonstrate the superior performance of both MHP and MHP-E, advancing the field of temporal point process modeling.

7/9/2024

💬

RoTHP: Rotary Position Embedding-based Transformer Hawkes Process

Anningzhe Gao, Shan Dai

Temporal Point Processes (TPPs), especially Hawkes Process are commonly used for modeling asynchronous event sequences data such as financial transactions and user behaviors in social networks. Due to the strong fitting ability of neural networks, various neural Temporal Point Processes are proposed, among which the Neural Hawkes Processes based on self-attention such as Transformer Hawkes Process (THP) achieve distinct performance improvement. Although the THP has gained increasing studies, it still suffers from the {sequence prediction issue}, i.e., training on history sequences and inferencing about the future, which is a prevalent paradigm in realistic sequence analysis tasks. What's more, conventional THP and its variants simply adopt initial sinusoid embedding in transformers, which shows performance sensitivity to temporal change or noise in sequence data analysis by our empirical study. To deal with the problems, we propose a new Rotary Position Embedding-based THP (RoTHP) architecture in this paper. Notably, we show the translation invariance property and {sequence prediction flexibility} of our RoTHP induced by the {relative time embeddings} when coupled with Hawkes process theoretically. Furthermore, we demonstrate empirically that our RoTHP can be better generalized in sequence data scenarios with timestamp translations and in sequence prediction tasks.

5/14/2024