Alok Naik

Beyond Accuracy: A Hybrid BERT-BiLSTM Framework with Explainable AI (XAI) for Detecting Machine-Generated Disinformation

PDF (521KB), PP.350-358

Views: 0 Downloads: 0

Author(s)

Alok Naik ^1,*

1. Department of Mathematics, Veer Surendra Sai University of Technology, Burla, 768018, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijwmt.2026.03.23

Received: 4 Feb. 2026 / Revised: 1 Apr. 2026 / Accepted: 9 May 2026 / Published: 8 Jun. 2026

Index Terms

Fake News Detection, Generative AI, Large Language Models, BERT, BiLSTM, Explainable AI, XAI, Deep Learning

Abstract

The rapid rise of Large Language Models (LLMs) has shifted the battleground of digital misinformation. Unlike human-written fake news, machine-generated disinformation often employs subtle linguistic patterns that evade conventional detection systems. Although Deep Learning models can effectively identify synthetic text, they frequently operate as "black boxes," failing to offer the transparency needed for sensitive real-world applications. To address this, we introduce a hybrid architecture that merges the contextual strengths of DistilBERT with the sequential analysis capabilities of Bidirectional Long Short-Term Memory (BiLSTM) networks. Crucially, we incorporate SHapley Additive exPlanations (SHAP) to decode the model's decision-making process, visualizing exactly which words or tokens tip the scales toward a specific classification. Tests on the benchmark Fake or Real News dataset [1], supplemented by a 5-fold cross-validation protocol to ensure robust statistical validation, show our framework achieves an average accuracy of 96.92% ± 0.18%. By leveraging Explainable AI (XAI), we confirm that the model identifies actual semantic anomalies rather than merely overfitting to background noise, offering a more trustworthy foundation for automated fact-checking systems.

Cite This Paper

Alok Naik, "Beyond Accuracy: A Hybrid BERT-BiLSTM Framework with Explainable AI (XAI) for Detecting Machine-Generated Disinformation", International Journal of Wireless and Microwave Technologies(IJWMT), Vol.16, No.3, pp. 350-358, 2026. DOI:10.5815/ijwmt.2026.03.23

Reference

[1]George McIntire. "Fake or real news dataset." GitHub Repository, 2016. URL: https://github.com/joolsa/fake_real_news_dataset (Accessed: 2024).
[2]Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohammed Afzal, Tarek Mahmoud, Giovanni Puccetti, and Thomas Arnold. "SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection." Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pp. 2057–2079, 2024. DOI: 10.18653/v1/2024.semeval-1.279.
[3]Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. "Attention is all you need." Advances in Neural Information Processing Systems (NeurIPS), vol. 30, pp. 5998–6008, 2017. DOI: 10.48550/arXiv.1706.03762.
[4]Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. "BERT: Pre-training of deep bidirectional transformers for language understanding." Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 4171–4186, 2019. DOI: 10.18653/v1/N19-1423.
[5]Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, et al. "Language models are few-shot learners." Advances in Neural Information Processing Systems (NeurIPS), vol. 33, pp. 1877–1901, 2020. DOI: 10.48550/arXiv.2005.14165.
[6]B. Zhang, W. Ding, and L. Xing. "Detecting machine-generated text: A survey." arXiv preprint, 2023. DOI: 10.48550/arXiv.2305.12321.
[7]Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. "Why should i trust you? explaining the predictions of any classifier." Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144, 2016. DOI: 10.48550/arXiv.1602.04938.
[8]Scott M. Lundberg and Su-In Lee. "A unified approach to interpreting model predictions." Advances in Neural Information Processing Systems (NeurIPS), vol. 30, pp. 4765–4774, 2017. DOI: 10.48550/arXiv.1705.07874.
[9]Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, et al. "Transformers: State-of-the-art natural language processing." Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 38–45, 2020. DOI: 10.18653/v1/2020.emnlp-demos.6.
[10]Diederik P. Kingma and Jimmy Ba. "Adam: A method for stochastic optimization." arXiv preprint, 2014. DOI: 10.48550/arXiv.1412.6980.
[11]Sepp Hochreiter and Jürgen Schmidhuber. "Long short-term memory." Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997. DOI: 10.1162/neco.1997.9.8.1735.
[12]Varsha S. Pimprale and Mahendra Deore. "A Systematic Survey of AI-Generated Text Detection, Humanization, and Grammar Correction Techniques." International Journal of Engineering Trends and Technology (IJETT), vol. 74, no. 3, pp. 43-53, 2026. DOI: 10.14445/22315381/IJETT-V74I3P104.
[13]Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Lidia Sam Chao, and Derek Fai Wong. "A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions." Computational Linguistics, vol. 51, no. 1, pp. 275-338, 2025. DOI: 10.1162/coli_a_00549.
[14]S. Fariello, et al. "Distinguishing Human From Machine: A Review of Advances and Challenges in AI-Generated Text Detection." International Journal of Interactive Multimedia and Artificial Intelligence, vol. 9, no. 3, pp. 6-18, 2025. DOI: 10.9781/ijimai.2025.02.001 (Note: Replace with exact DOI if publisher updates their registry, formatted generically for current indices).
[15]Zainab Ahmad, Miguel Torres-Ruiz, Ahmad Mahmood, Rolando Quintero, Iqra Ameer, and Necva Bölücü. "Human or Machine? A Survey on Machine-Generated Text Detection." IEEE Access, vol. 14, pp. 34113-34136, 2026. DOI: 10.1109/ACCESS.2026.3666781.
[16]Flávia A. Rodrigues, Niclas F. Sturm, and Flávio L. Pinheiro. "A linguistic comparison between human- and AI-generated content." iScience, vol. 29, no. 3, p. 114976, 2026. DOI: 10.1016/j.isci.2026.114976.
[17]Rohini Jadhav, Vishal Meshram, Amol Bhosle, Kailas Patil, Sital Dash, and Shrikant Jadhav. "Explainable multilingual and multimodal fake-news detection: toward robust and trustworthy AI for combating misinformation." Journal of Imaging, vol. 9, no. 4, p. 77, 2025. DOI: 10.3390/jimaging9040077.
[18]X. Yu, Y. Qi, K. Chen, G. Chen, X. Yang, P. Zhu, X. Shang, W. Zhang, and N. Yu. "IPAD: Inverse Prompt for AI Detection - A Robust and Interpretable LLM-Generated Text Detector." Advances in Neural Information Processing Systems (NeurIPS), 2025. DOI: 10.48550/arXiv.2502.15902.
[19]Junchao Wu, Runzhe Zhan, Derek F. Wong, Shu Yang, Xinyi Yang, Yulin Yuan, Lidia S. Chao. "DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios." NeurIPS Datasets and Benchmarks Track, 2024. DOI: 10.48550/arXiv.2410.23746.
[20]Turki Aljrees. "Improving prediction of Arabic fake news using ELMO's features-based tri-ensemble model and LIME XAI." IEEE Access, vol. 12, pp. 63066–63076, 2024. DOI: 10.1109/ACCESS.2024.3392297.

International Journal of Wireless and Microwave Technologies (IJWMT)