Anomaly detection using machine learning techniques
Downloads
DOI:
https://doi.org/10.26637/MJM0804/0139Abstract
Anomaly represents deviation from the normal behavior of an event. Detection of anomaly provides means to take appropriate countermeasures in various domains. Examples include detection of fraudulent transaction in banking or financial domain, detection of cyber-attacks in networking environment, detection of abnormal behavior of vital signs of patient in healthcare domain. Also, detection of anomalies with respect to time of arrival of data is a crucial in deciding the accomplishment of successful countermeasures. Selection of suitable algorithm or method for detection of anomaly is also equally important for successful detection of anomalies. In this paper it is proposed to compare the performance of two different algorithms, namely, Isolation Forest (unsupervised) and Random Forest (supervised) by varying the operating parameters of the algorithms. Experiment is carried out using benchmark dataset that belongs to healthcare domain. The data is preprocessed for missing values and then detection accuracy of algorithm is analyzed with respect to number of records. Results are discussed.
Keywords:
Anomaly detection, Random Forest, Isolation Forest, batch processing, healthcareMathematics Subject Classification:
Mathematics- Pages: 2144-2148
- Date Published: 01-10-2020
- Vol. 8 No. 04 (2020): Malaya Journal of Matematik (MJM)
Nisha P. Shetty, Jayashree Shetty, Rohil Narula and KushagraTandona, Comparison study of machine learning classifiers to detect anomalies, International Journal of Electrical and Computer Engineering (IJECE), 10(5)(2020), 5445-5452.
LuisePufahl and Mathias Weske, Requirements Framework for Batch Processing in Business Processes, International Conference on Exploring Modeling Methods for Systems Analysis and Design, 2017.
Simon D. Duque Anton, Sapna Sinha and Hans Dieter Schotten, Anomaly-based Intrusion Detection in Industrial Data with SVM and Random Forests, 27th International Conference on Software, Telecommunications and Computer Networks (SoftCOM), 1(2019).
Haoran Ma, Benyamin Ghojogh, Maria N. Samad, Dongyu Zheng, Mark Crowley, Isolation Mondrian Forest for Batch and Online Anomaly Detection, IEEE International Conference on Systems, Man, and Cybernetics (SMC), (2020).
Kathrin Melcher, Fraud Detection Using Random Forest, Neural Autoencoder, and Isolation Forest Techniques, AI, ML & Data Engineering, 2019.
RifkiePrimartha and Bayu Adhi Tama, Anomaly Detection using Random Forest: A Performance Revisited, 2017 International Conference on Data and Software Engineering (IcoDSE), (2018).
Rashmi H Roplekar and Prof. N. V. Buradkar, Survey of Random Forest Based Network Anomaly Detection Systems, International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), 6(12)(2017), 95-98.
Lekha R. Nair and Sujala D. Shetty, Research in Big Data and Analytics: An Overview, International Journal of Computer Applications, 108(14)(2014), 19-23.
Mansi Shah and Vatika Tayal, Future of Big Data beyond Batch Processing, International Journal for Scientific Research & Development (IJSRD), 3(01)(2015), 217-220.
P. Sai Pranavi, H. D. Sheethal, Sharanya S Kumar, Sonika Kariappa and B. H. Swathi, Analysis of Vehicle Insurance Data to Detect Fraud using Machine Learning, International Journal for Research in Applied Science & Engineering Technology (IJRASET), 8(7)(2020), 2033-2038.
Vrushali Y Kulkarni and Pradeep K Sinha, Effective Learning and Classification using Random Forest Algorithm, International Journal of Engineering and Innovative Technology (IJEIT), 3(11)(2014), 267-273.
Leo Breiman, Random Forests, Machine Learning, 45(2001), 5-32.
Sahand Hariri, Matias Carrasco Kind and Robert J. Brunner, Extended Isolation Forest, 3(2020), 1-10.
- NA
Metrics
Published
How to Cite
Issue
Section
License
Copyright (c) 2020 MJM
This work is licensed under a Creative Commons Attribution 4.0 International License.