Work place: Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, North Macedonia
E-mail: enes.bajrami@students.finki.ukim.mk
Website:
Research Interests: Artificial Intelligence
Biography
Enes Bajrami is a PhD candidate and researcher in Computer Science at Ss. Cyril and Methodius University- Faculty of Computer Science and Engineering. His research focuses on software engineering and artificial intelligence, encompassing deep reinforcement learning, microservices, green computing, cloud orchestration, and medical AI applications. In addition to his academic work, he has extensive practical experience as a web developer. He currently serves as the Coordinator of the Broadband Implementation Unit at the state-owned enterprise National Broadcasting, where he leads the deployment of broadband infrastructure, bridging academic research with real-world technological applications.
By Enes Bajrami Florim Idrizi Shpend Ismaili
DOI: https://doi.org/10.5815/ijitcs.2026.01.03, Pub. Date: 8 Feb. 2026
Reinforcement Learning (RL) is a successful and established Artificial Intelligence (AI) method, particularly with recent groundbreaking progress in Deep Reinforcement Learning (DRL). Reinforcement learning is very well suited for sequential decision-making tasks, wherein a learned agent learns an optimal policy after many interactions with an environment. The present paper examines the application of reinforcement learning for automating screening of literature in academic research, particularly in the fields of computer science and e-learning. Keyword filtering techniques, while predominantly applied, are found to be inflexible as well as unable to capture the dynamic nature of research themes. To overcome such constraints, we present a Deep Q-Network (DQN)-based reinforcement learning model that combines reinforcement learning with the Semantic Scholar API to enhance research paper classification based on dynamically acquired decision rules. The proposed reinforcement learning model was trained and tested with a dataset of 8,934 research papers, accessed by systematic searching. The agent filtered and picked 11 effective papers depending on improved selection criteria like publication date, keyword relevance, and scholarly topic provided. The model iteratively optimizes the decision-making process through reward-based learning and therefore maximizes categorization accuracy over time. Test experiments demonstrate utilization of RL-based suggested framework yields classification accuracy at 91.5%, recall at 86.3%, and precision at 89.7%. A comparison test demonstrates that the approach performs 12.5% better on recall and 9.8% better on accuracy compared to traditional keyword-filtering methods. The finding confirms the power of the model in minimizing false positives and false negatives for screening literature, hence proving the scalability and adaptability of reinforcement learning in managing high academic data. This work offers a scalable, cognitive approach to conducting systematic reviews of literature through the application of reinforcement learning to programmatically execute work in academic research. The work shows the promise of reinforcement learning to further enhance research methodology, make literature reviews more effective, and facilitate more knowledgeable decision-making in fast-changing scientific disciplines. Further research will be focused on incorporating hybrid AI models with multi-agent systems of reinforcement learning for responsiveness and classification enhancements.
[...] Read more.Subscribe to receive issue release notifications and newsletters from MECS Press journals