Multivariate Time-series Anomaly Detection via Gra

# Multivariate Time-series Anomaly Detection via Graph Attention Network Created: February 16, 2021 8:50 AM URL: https://arxiv.org/abs/2009.02040 Year: 2020 # Abstract Anomaly detection on multivariate time-series is of great importance in both data mining research and industrial applications. Recent approaches have achieved significant progress in this topic, but there is remaining limitations. One major limitation is that they do not capture the relationships between different time-series explicitly, resulting in inevitable false alarms. In this paper, we propose a novel self-supervised framework for multivariate time-series anomaly detection to address this issue. Our framework considers each univariate time-series as an individual feature and includes two graph attention layers in parallel to learn the complex dependencies of multivariate time-series in both temporal and feature dimensions. In addition, our approach jointly optimizes a forecasting-based model and are construction-based model, obtaining better time-series representations through a combination of single-timestamp prediction and reconstruction of the entire time-series. We demonstrate the efficacy of our model through extensive experiments. The proposed method outperforms other state-of-the-art models on three real-world datasets. Further analysis shows that our method has good interpretability and is useful for anomaly diagnosis. 多変量時系列上の異常検出は、データマイニング研究と産業応用の両方において非常に重要である。最近のアプローチはこのトピックで大きな進歩を遂げていますが、限界が残っています。大きな制限の一つは、異なる時系列間の関係を明示的に捉えていないため、結果的に誤報が発生してしまうことである。本論文では、この問題に対処するために、多変量時系列異常検出のための新しい自己教師付きフレームワークを提案する。我々のフレームワークでは、各一変量時系列を個々の特徴とみなし、2つのグラフ注目層を並列に含み、時間的次元と特徴的次元の両方で多変量時系列の複雑な依存性を学習する。さらに、我々のアプローチは予測ベースのモデルと構築ベースのモデルを共同で最適化し、単一タイムスタンプの予測と時系列全体の再構築を組み合わせることで、より優れた時系列表現を得ることができる。このモデルの有効性を大規模な実験で実証した。提案手法は、3つの実世界のデータセットにおいて、他の最新のモデルよりも優れた性能を示した。さらなる解析により、本手法が優れた解釈可能性を持ち、異常診断に有用であることが示された。 # メモ異常検知手法の提案論文だが，評価のデータセットにWebシステムの障害データセットを使っている． ## 背景 - 多変量時系列データの異常検知は多くの手法が提案されているが，既存の手法は多変量相関を明示的に捉える問題を扱ったものではない． - そのため時系列間の関係を適切にモデル化することで，既存手法の性能を改善する余地があると主張する ## 提案 - 本論文では，多変量時系列異常検出のための新しい自己教師付きフレームワークMTAD-GAT (Multivariate Tim-series Anomaly Detection via Graph Attention Network)を提案する ![[Multivariate Time-series Anomaly Detection via Gra/Untitled.png]] - モデルの全体像は上図の通り 1. 1 次元畳み込みを適用し，各時系列の高レベルの特徴を抽出する 2. 構造的特徴と時間的特徴を捉える二つのGAT層を並列で処理（グラフ構造は事前情報として与えるのでなく完全グラフで与えている．システムのコンポーネントが増えた場合にGAT層の計算コストが問題になりそう） 3. 長期の時系列性を捉えるためのGRU層 4. 出力層は，予測ベースのモデル（全結合層）と再構成ベースのモデル（VAE）に並行して実行し，両者を考慮した異常度スコアに変換する閾値はPeak Over Threshold (POT) を用いて自動的に選択する（POT知らなかったけど使えそう） ## 評価 - データセットは以下の3つ 1. SMAP (Soil Moisture Active Passive Satellite) 2. MSL (Mars Science Laboratory rover) 3. TSA (Time Series Anomaly detection system) - 1と2はNASAが提供している宇宙船のデータセット，3はFlink（[https://flink.apache.org/](https://flink.apache.org/)）の時系列異常検出システムから収集したデータセット - ベースラインは，Omni- Anomaly，LSTM-NDT，KitNet，DAGMM，GAN-Li，MAD-GAN，LSTM-VAE - 検知精度 ![[Multivariate Time-series Anomaly Detection via Gra/Untitled 1.png]] - 原因診断 - 原因診断は予測ベースや再構成誤差ベースの誤差の大きいものをTop-Kで取り出すようなやり方でやっている． > The ability of anomaly diagnosis is much owing to the graph attention layer leveraged in the model - （診断の結果はGAT層が効いていると書いているけどあくまでこれは学習段階で正常時のみのデータから学習したAttention weightだから異常時の相関などは考慮できていない．つまり，正常時に関連が強いものは異常時も関連が強いというような仮定を置いていることなり，異常時に想定していない伝搬が起きた場合は取り扱えない．この点は [[Graph Neural Network-Based Anomaly Detection in Multivariate Time Series]] も同じ） ![[Multivariate Time-series Anomaly Detection via Gra/Untitled 2.png]]