Obiekt

Tytuł: Comparison of Machine Learning and Statistical Approaches of Detecting Anomalies Using a Simulation Study

Tytuł odmienny:

Uczenie maszynowe i statystyczne metody wykrywania anomalii – porównawcza analiza symulacyjna

Autor:

Lenart, Klaudia

Opis:

Econometrics = Ekonometria, 2024, Vol. 28, No. 4, s. 23-31

Abstrakt:

Aim: An anomaly is an observation or a group of observations that is unusual for a given dataset. Anomaly detection has many applications, not only as a step of data preparation but also, for example, as a way of identifying credit card fraud detection, network intrusions and much more. There are diverse methods of anomaly detection. In particular two groups of methods have been developed independently – statistical methods and machine learning algorithms. Those methods are rarely compared. While statistical methods focus on formulating a measure of the abnormality of the observations, supervised machine learning makes it possible to use data about typical observations and previously identified anomalies. The aim of this paper was to compare the two approaches by conducting a simulation study. Methodology: A simulation study was conducted, during which the data was generated using copula functions. For the purpose of generating different types of anomalies, marginal distributions of the variables were manipulated. The effectiveness of each method was evaluated based on measures of classification model performance. Results: While the accuracy of the statistical methods was dependent on the precise prediction of the percentage of the anomalies that would occur in the data, the machine learning algorithms’ recall was significantly lower when the change in the marginal distribution of the value parameters was smaller. Implications and recommendations: For the statistical methods included in the study, knowledge about the distribution of the variables was crucial while the supervised machine learning algorithms required acquiring a training dataset. Unlike machine learning algorithms, the statistical methods performed with similar accuracy even when the change in the marginal distribution parameters’ value was smaller Originality/value: The two approaches to anomaly detection presented in the paper are not often compared, usually used by two separate groups of researchers – statisticians and machine learning or data science specialists.

Wydawca:

Publishing House of Wroclaw University of Economics and Business

Miejsce wydania:

Wroclaw

Data wydania:

2024

Typ zasobu:

artykuł

Identyfikator zasobu:

doi:10.15611/eada.2024.4.02 ; oai:dbc.wroc.pl:131964

Język:

eng

Powiązania:

Econometrics = Ekonometria, 2024, Vol. 28, No. 4

Prawa:

Pewne prawa zastrzeżone na rzecz Autorów i Wydawcy

Prawa dostępu:

Dla wszystkich zgodnie z licencją

Licencja:

CC BY-SA 4.0

Lokalizacja oryginału:

Uniwersytet Ekonomiczny we Wrocławiu

Tytuł publikacji grupowej:

Ekonometria = Econometrics

Obiekty Podobne

×

Cytowanie

Styl cytowania:

Ta strona wykorzystuje pliki 'cookies'. Więcej informacji