Ankush Joshi

Work place: Department of Computer Science and Applications, College of Smart Computing, COER University Roorkee India

E-mail: ankushjoshi1987@gmail.com

Website:

Research Interests:

Biography

Ankush Joshi is an accomplished academic and researcher with over 14 years of experience in the field of Computer Science and Engineering. He holds an MCA, MTech (CSE), and a Ph.D. (CSE), and is currently serving as an Assistant Professor at COER University, Roorkee, India. Dr. Joshi specializes in Artificial Intelligence (AI), Machine Learning (ML), and Data Analysis, with a strong focus on interdisciplinary applications of these technologies. He has made significant contributions to academia, with more than 20 research papers and book chapters to his credit. Dr. Joshi has edited one book with IGI Global Publications and authored another, showcasing his expertise in the field. He is an active member of the academic community, having served as a reviewer for several IEEE conferences and IGI Global publications. Additionally, he has chaired sessions at prestigious IEEE and Springer conferences, further solidifying his reputation as a thought leader in the domain of computer science and engineering. His work continues to inspire students and researchers alike, bridging the gap between theoretical knowledge and practical applications in AI, ML, and data analysis.

Author Articles
Lightweight 3DCNN-BiLSTM Model for Human Activity Recognition using Fusion of RGBD Video Sequences

By Vijay Singh Rana Ankush Joshi Kamal Kant Verma

DOI: https://doi.org/10.5815/ijitcs.2025.06.10, Pub. Date: 8 Dec. 2025

Over the past two decades, the automatic recognition of human activities has been a prominent research field. This task becomes more challenging when dealing with multiple modalities, different activities, and various scenarios. Therefore, this paper addresses activity recognition task by fusion of two modalities such as RGB and depth maps. To achieve this, two distinct lightweight 3D Convolutional Neural Network (3DCNN) are employed to extract space time features from both RGB and depth sequences separately. Subsequently, a Bidirectional LSTM (Bi-LSTM) network is trained using the extracted spatial temporal features, generating activity score corresponding to each sequence in both RGB and depth maps. Then, a decision level fusion is applied to combine the score obtained in the previous step. The novelty of our proposed work is to introduce a lightweight 3DCNN feature extractor, designed to capture both spatial and temporal features form the RGBD video sequences. This improves overall efficiency while simultaneously reducing the computational complexity. Finally, the activities are recognized based the fusion scores. To assess the overall efficiency of our proposed lightweight-3DCNN and BiLSTM method, it is validated on the 3D benchmark dataset UTKinectAction3D, achieving an accuracy of 96.72%. The experimental findings confirm the effectiveness of the proposed representation over existing methods.

[...] Read more.
Other Articles