Karthik K.

Work place: School of Computer Science and Engineering Vellore Institute of Technology, Vellore, TamilNadu, India

E-mail: k.karthik@vit.ac.in

Website: https://orcid.org/0000-0003-0846-2982

Research Interests:

Biography

Karthik K. is an Assistant Professor Sr. Grade 1 in the Department of Analytics at the School of Computer Science and Engineering, Vellore Institute of Technology (VIT) Vellore. He gained his doctoral degree from National Institute of Technology Karnataka, Surathkal under the supervision of Dr. Sowmya Kamath S. He has also worked as research fellow for a funded project titled "A Framework for Deep Learning based Analytics for Intelligent Healthcare Applications" by SERB, Department of Science and Technology, Govt. of India. His research interest includes computer vision, machine learning, deep learning, data mining and analytics. With a strong background in analytics, he contributes to the advancement of the department through his expertise and dedication to teaching and research. As a valued member of the VIT faculty, he is committed to fostering a learning environment that promotes innovation and excellence in the field of computer science and engineering.

Author Articles
Leveraging Deep Learning Approach for the Detection of Human Activities from Video Sequences

By Preethi Salian K. Karthik K.

DOI: https://doi.org/10.5815/ijigsp.2025.06.05, Pub. Date: 8 Dec. 2025

Using deep learning approaches, recognizing human actions from video sequences by automatically deriving significant representations has demonstrated effective results from unprocessed video information. Artificial intelligence (AI) systems, including monitoring, automation, and human-computer interface, have become crucial for security and human behaviour analysis. For the visual depiction of video clips during the training phase, the existing action identification algorithms mostly use pre-trained weights of various AI designs, which impact the characteristics discrepancies and perseverance, including the separation among the visual and temporal indicators. The research proposes a 3-dimensional Convolutional Neural Network and Long Short-Term Memory (3D-CNN-LSTM) network that strategically concentrates on useful information in the input frame to recognize the various human behaviours in the video frames to overcome this problem. The process utilizes stochastic gradient descent (SGD) optimization to identify the model parameters that best match the expected and observed outcomes. The proposed framework is trained, validated, and tested using publicly accessible UCF11 benchmark dataset. According to the experimental findings of this work, the accuracy rate was 93.72%, which is 2.42% higher compared to the state-of-the-art previous best result. When compared to several other relevant techniques that are already in use, the suggested approach achieved outstanding performance in terms of accuracy. 

[...] Read more.
Other Articles