
Dataset Information


FRaC: a feature-modeling approach for semi-supervised and unsupervised anomaly detection.


ABSTRACT: Anomaly detection involves identifying rare data instances (anomalies) that come from a different class or distribution than the majority (which are simply called "normal" instances). Given a training set of only normal data, the semi-supervised anomaly detection task is to identify anomalies in the future. Good solutions to this task have applications in fraud and intrusion detection. The unsupervised anomaly detection task is different: Given unlabeled, mostly-normal data, identify the anomalies among them. Many real-world machine learning tasks, including many fraud and intrusion detection tasks, are unsupervised because it is impractical (or impossible) to verify all of the training data. We recently presented FRaC, a new approach for semi-supervised anomaly detection. FRaC is based on using normal instances to build an ensemble of feature models, and then identifying instances that disagree with those models as anomalous. In this paper, we investigate the behavior of FRaC experimentally and explain why FRaC is so successful. We also show that FRaC is a superior approach for the unsupervised as well as the semi-supervised anomaly detection task, compared to well-known state-of-the-art anomaly detection methods, LOF and one-class support vector machines, and to an existing feature-modeling approach.
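The feature-modeling idea in the abstract (learn from normal instances how each feature relates to the others, then flag instances that disagree with those models) can be illustrated with a small sketch. This is not the authors' implementation: the linear least-squares predictors, the Gaussian error model, the summed negative log-likelihood score, the synthetic data, and the names `fit_feature_models` and `anomaly_scores` are all assumptions made for illustration.

```python
import numpy as np

def fit_feature_models(X_normal):
    """Fit one linear model per feature, predicting it from all other features.

    Assumption: linear predictors with a Gaussian error model, chosen only
    to keep the sketch short; the paper's own choices may differ.
    """
    n, d = X_normal.shape
    models = []
    for j in range(d):
        others = np.delete(X_normal, j, axis=1)
        A = np.hstack([others, np.ones((n, 1))])  # append intercept column
        w, *_ = np.linalg.lstsq(A, X_normal[:, j], rcond=None)
        resid = X_normal[:, j] - A @ w
        sigma = max(resid.std(), 1e-6)  # residual spread on the normal data
        models.append((w, sigma))
    return models

def anomaly_scores(models, X):
    """Sum, over features, how surprising each observed value is to its model."""
    n, _ = X.shape
    scores = np.zeros(n)
    for j, (w, sigma) in enumerate(models):
        others = np.delete(X, j, axis=1)
        A = np.hstack([others, np.ones((n, 1))])
        err = X[:, j] - A @ w
        # Gaussian negative log-likelihood of the observed feature value
        scores += 0.5 * (err / sigma) ** 2 + np.log(sigma) + 0.5 * np.log(2 * np.pi)
    return scores

# Hypothetical synthetic data: in normal instances x1 is about 2*x0 and x2 is about -x0.
rng = np.random.default_rng(0)

def make_normal(n):
    x0 = rng.normal(size=n)
    x1 = 2 * x0 + rng.normal(scale=0.1, size=n)
    x2 = -x0 + rng.normal(scale=0.1, size=n)
    return np.column_stack([x0, x1, x2])

train = make_normal(200)                      # normal-only training set
test_normal = make_normal(20)                 # unseen normal instances
test_anomaly = np.array([[1.0, -2.0, 1.0]])   # violates both relationships

models = fit_feature_models(train)
normal_scores = anomaly_scores(models, test_normal)
anomaly_score = anomaly_scores(models, test_anomaly)[0]
print("highest normal score:", normal_scores.max())
print("anomaly score:", anomaly_score)
```

Training on normal-only data corresponds to the semi-supervised setting described above. Under the same assumptions, fitting and scoring on a single unlabeled, mostly-normal set would correspond to the unsupervised setting.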

SUBMITTER: Noto K 

PROVIDER: S-EPMC3359096 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature


Publications

FRaC: a feature-modeling approach for semi-supervised and unsupervised anomaly detection.

Noto K, Brodley C, Slonim D

Data Mining and Knowledge Discovery, 2011-09-08, issue 1



Similar Datasets

| S-EPMC10049350 | biostudies-literature
| S-EPMC9601423 | biostudies-literature
| S-EPMC3193936 | biostudies-literature
| S-EPMC3956069 | biostudies-literature
| S-EPMC4556708 | biostudies-literature
| S-EPMC7592391 | biostudies-literature
| S-EPMC8289984 | biostudies-literature
| S-EPMC10803081 | biostudies-literature
| S-EPMC11302674 | biostudies-literature
| S-EPMC3197694 | biostudies-literature