Fraud Detection and Machine Learning in Auditing:A Systematic Literature Review

Authors

  • Angel II P. Esmeralda Nusa Putra University, Sukabumi, Indonesia Author
  • Nur Hidayah K Fadhilah Nusa Putra University, Sukabumi, Indonesia Author

DOI:

https://doi.org/10.60036/y8p1k791

Keywords:

Machine Learning, Fraud Detection, Auditing, Systematic Literature Review, Deep Learning, Explainable AI, Financial Statement Fraud

Abstract

This systematic literature review examines the application of machine learning (ML) techniques in fraud detection within the auditing domain, synthesizing findings from peer-reviewed studies published between 2019 and 2024. Following the PRISMA 2020 guidelines, this review analyzed 85 articles from Scopus, Web of Science, IEEE Xplore, and Google Scholar databases. The Kitchenham methodology was employed to ensure rigorous screening, extraction, and synthesis of relevant literature. The review reveals that ensemble methods, particularly Random Forest and XGBoost, demonstrate superior performance in fraud detection tasks. Deep learning architectures, including Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks, show promising results for complex fraud patterns. Key challenges identified include imbalanced datasets, model interpretability, and regulatory compliance. The emergence of Explainable AI (XAI) techniques, such as SHAP and LIME, addresses transparency concerns in audit applications. This review provides a comprehensive synthesis of ML applications in fraud detection specifically within the auditing context, offering a research agenda for future investigations and practical implications for audit practitioners and regulators.

References

Achakzai, M.A.K., & Peng, J. (2023). Detecting financial statement fraud using dynamic ensemble machine learning. International Review of Financial Analysis, 89, 102827.

Bao, Y., Ke, B., Li, B., Yu, Y.J., & Zhang, J. (2020). Detecting accounting fraud in publicly traded U.S. firms using a machine learning approach. Journal of Accounting Research, 58(1), 199-235.

Cecchini, M., Aytug, H., Koehler, G.J., & Pathak, P. (2010). Detecting management fraud in public companies. Management Science, 56(7), 1146-1160.

Chawla, N.V., Bowyer, K.W., Hall, L.O., & Kegelmeyer, W.P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321-357.

Chen, Y., et al. (2025). Year-over-year developments in financial fraud detection via deep learning: A systematic literature review. arXiv preprint arXiv:2502.00201.

Craja, P., Kim, A., & Lessmann, S. (2020). Deep learning for detecting financial statement fraud. Decision Support Systems, 139, 113421.

Cressey, D.R. (1953). Other people’s money: A study of the social psychology of embezzlement. Free Press.

Crowe, H. (2011). Why the fraud triangle is no longer enough. Horwath, Crowe LLP.

Hajek, P., & Henriques, R. (2017). Mining corporate annual reports for intelligent detection of financial statement fraud. Knowledge-Based Systems, 128, 139-152.

Kirkos, E., Spathis, C., & Manolopoulos, Y. (2007). Data mining techniques for the detection of fraudulent financial statements. Expert Systems with Applications, 32(4), 995-1003.

Lundberg, S.M., & Lee, S.I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30, 4765-4774.

Mutemi, A., & Bacao, F. (2024). E-commerce fraud detection based on machine learning techniques: Systematic literature review. Big Data Mining and Analytics, 7(2), 419-444.

Ngai, E.W.T., Hu, Y., Wong, Y.H., Chen, Y., & Sun, X. (2011). The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature. Decision Support Systems, 50(3), 559-569.

Page, M.J., et al. (2021). The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ, 372, n71.

Perols, J. (2011). Financial statement fraud detection: An analysis of statistical and machine learning algorithms. Auditing: A Journal of Practice & Theory, 30(2), 19-50.

Ribeiro, M.T., Singh, S., & Guestrin, C. (2016). Why should I trust you? Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135-1144.

Rodriguez-Perez, G., et al. (2024). Financial fraud detection through the application of machine learning techniques: A literature review. Humanities and Social Sciences Communications, 11, 1130.

Sharma, A., & Panigrahi, P.K. (2022). Explainable artificial intelligence (XAI) in auditing. International Journal of Accounting Information Systems, 46, 100572.

West, J., & Bhattacharya, M. (2016). Intelligent financial fraud detection: A comprehensive review. Computers & Security, 57, 47-66.

Yang, W., & Wu, D. (2024). AI driven fraud detection models in financial networks: A comprehensive systematic review. IEEE Access, 12, 98765-98790.

Downloads

Published

2026-01-14

How to Cite

Fraud Detection and Machine Learning in Auditing:A Systematic Literature Review. (2026). Educational Studies and Research Journal, 3(1), 37-46. https://doi.org/10.60036/y8p1k791