Anomaly detection using machine learning techniques

Downloads

DOI:

https://doi.org/10.26637/MJM0804/0139

Abstract

Anomaly represents deviation from the normal behavior of an event. Detection of anomaly provides means to take appropriate countermeasures in various domains. Examples include detection of fraudulent transaction in banking or financial domain, detection of cyber-attacks in networking environment, detection of abnormal behavior of vital signs of patient in healthcare domain. Also, detection of anomalies with respect to time of arrival of data is a crucial in deciding the accomplishment of successful countermeasures. Selection of suitable algorithm or method for detection of anomaly is also equally important for successful detection of anomalies. In this paper it is proposed to compare the performance of two different algorithms, namely, Isolation Forest (unsupervised) and Random Forest (supervised) by varying the operating parameters of the algorithms. Experiment is carried out using benchmark dataset that belongs to healthcare domain. The data is preprocessed for missing values and then detection accuracy of algorithm is analyzed with respect to number of records. Results are discussed.

Keywords:

Anomaly detection, Random Forest, Isolation Forest, batch processing, healthcare

Mathematics Subject Classification:

Mathematics
  • Chellammal Suriyanarayanan Department of Computer Science, Bharathidasan University Constituent Arts and Science College, Affiliated to Bharathidasan University, Navalurkuttapattu, Tiruchirappalli, Tamil Nadu, India.
  • Saranya Kunasekaran Department of Computer Science, Bharathidasan University Constituent Arts and Science College, Affiliated to Bharathidasan University, Navalurkuttapattu, Tiruchirappalli, Tamil Nadu, India.
  • Pages: 2144-2148
  • Date Published: 01-10-2020
  • Vol. 8 No. 04 (2020): Malaya Journal of Matematik (MJM)

Nisha P. Shetty, Jayashree Shetty, Rohil Narula and KushagraTandona, Comparison study of machine learning classifiers to detect anomalies, International Journal of Electrical and Computer Engineering (IJECE), 10(5)(2020), 5445-5452.

LuisePufahl and Mathias Weske, Requirements Framework for Batch Processing in Business Processes, International Conference on Exploring Modeling Methods for Systems Analysis and Design, 2017.

Simon D. Duque Anton, Sapna Sinha and Hans Dieter Schotten, Anomaly-based Intrusion Detection in Industrial Data with SVM and Random Forests, 27th International Conference on Software, Telecommunications and Computer Networks (SoftCOM), 1(2019).

Haoran Ma, Benyamin Ghojogh, Maria N. Samad, Dongyu Zheng, Mark Crowley, Isolation Mondrian Forest for Batch and Online Anomaly Detection, IEEE International Conference on Systems, Man, and Cybernetics (SMC), (2020).

Kathrin Melcher, Fraud Detection Using Random Forest, Neural Autoencoder, and Isolation Forest Techniques, AI, ML & Data Engineering, 2019.

RifkiePrimartha and Bayu Adhi Tama, Anomaly Detection using Random Forest: A Performance Revisited, 2017 International Conference on Data and Software Engineering (IcoDSE), (2018).

Rashmi H Roplekar and Prof. N. V. Buradkar, Survey of Random Forest Based Network Anomaly Detection Systems, International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), 6(12)(2017), 95-98.

Lekha R. Nair and Sujala D. Shetty, Research in Big Data and Analytics: An Overview, International Journal of Computer Applications, 108(14)(2014), 19-23.

Mansi Shah and Vatika Tayal, Future of Big Data beyond Batch Processing, International Journal for Scientific Research & Development (IJSRD), 3(01)(2015), 217-220.

P. Sai Pranavi, H. D. Sheethal, Sharanya S Kumar, Sonika Kariappa and B. H. Swathi, Analysis of Vehicle Insurance Data to Detect Fraud using Machine Learning, International Journal for Research in Applied Science & Engineering Technology (IJRASET), 8(7)(2020), 2033-2038.

Vrushali Y Kulkarni and Pradeep K Sinha, Effective Learning and Classification using Random Forest Algorithm, International Journal of Engineering and Innovative Technology (IJEIT), 3(11)(2014), 267-273.

Leo Breiman, Random Forests, Machine Learning, 45(2001), 5-32.

Sahand Hariri, Matias Carrasco Kind and Robert J. Brunner, Extended Isolation Forest, 3(2020), 1-10.

  • NA

Metrics

Metrics Loading ...

Published

01-10-2020

How to Cite

Chellammal Suriyanarayanan, and Saranya Kunasekaran. “Anomaly Detection Using Machine Learning Techniques”. Malaya Journal of Matematik, vol. 8, no. 04, Oct. 2020, pp. 2144-8, doi:10.26637/MJM0804/0139.