Sureshkumar Nagarajan

Work place: Department of CSE, School of Computing, Kalasalingam Academy of Research and Education, Krishnankovil, TamilNadu, India

E-mail: sureshkumar@klu.ac.in

Website:

Research Interests:

Biography

Dr. Sureshkumar Nagarajan https://orcid.org/0000-0001-9484-4965  (ORCID ID), completed his Ph.D. in the area of “Genetic based classification Algorithms for combined LANDSAT and ENVISAT Images” from VIT University Vellore in 2016.He has published fifty research articles in reputed peer reviewed journals, and his Area of interest includes vision computing, Computational intelligent and big data analytics. He is now working as a professor in the Department of Computer science and Engineering with Kalasalingam academy of research and education (Deemed to be university) Krishnankoil, TamilNadu.

Author Articles
Enhanced Deep Learning Framework for Tamil Slang Classification with Multi-task Learning and Attention Mechanisms

By Ramkumar. R. Sureshkumar Nagarajan Dinesh Prasanth Ganapathi

DOI: https://doi.org/10.5815/ijitcs.2025.06.02, Pub. Date: 8 Dec. 2025

In Artificial Intelligence, voice categorization is important for various applications. Tamil, being one of the oldest languages in the world, comprises rich regional slang differing in tone, pronunciation, and emotive expression. These slang words are difficult to categorize because they are informal and there is limited annotated audio data. This study proposes an enhanced deep learning framework for Tamil slang classification using a balanced audio corpus. The framework integrates data-specific pre-processing techniques, including Mel spectrograms, Chroma features and spectral contrast, to capture the nuanced characteristics of Tamil speech. A DenseNet backbone, combined with LSTM and GRU layers, models both temporal and spectral information. The suggested FRAE-PSA module is an innovative application of the Pyramid Split Attention (PSA) mechanism adapted to support regional and affective variations of speech. Different from current PSA or Transformer-based approaches, FRAE-PSA splits the audio frequency spectrum and adapts attention weights dynamically based on auxiliary tasks. A multi-branch architecture is employed to fuse temporal and spectral features effectively and multi-task learning is used to enhance regional accent and emotion detection. Custom loss functions and lightweight networks optimize model efficiency. Experimental results show up to a 15% improvement in classification accuracy over baseline models, demonstrating the framework's effectiveness for real-world Tamil slang classification tasks.

[...] Read more.
Other Articles