Work place: Centre for Data Science and Artificial Intelligence, University of Massachusetts, 740 N. Pleasant St., A205, Amherst, MA 01003, USA
E-mail: Dganapathi@umass.edu
Website:
Research Interests:
Biography
Mr. Dinesh Prasanth Ganapathi https://orcid.org/0009-0005-3273-7132 (ORCID ID) is a technology leader with over a decade of experience in software engineering, AI, cloud computing, and product strategy. He has built impactful solutions across industries—from avionics and enterprise networking to cloud observability and generative AI. Dinesh has held key roles at companies like Microsoft, Riverbed Technology, and Honeywell, leading innovations that range from AI integrations with Bing Search to multi-cloud visibility platforms. He currently serves as Senior Product Manager for Generative AI at Advisor360 and Industry Advisor for the Center for Data Science and AI at UMass Amherst. He holds advanced degrees and certifications in Computer Engineering, AI, and Product Management from Rutgers, UT Austin, Product School, and UC Berkeley. Known for blending technical depth with strategic foresight, Dinesh is passionate about building future-ready, responsible technology.
By Ramkumar. R. Sureshkumar Nagarajan Dinesh Prasanth Ganapathi
DOI: https://doi.org/10.5815/ijitcs.2025.06.02, Pub. Date: 8 Dec. 2025
In Artificial Intelligence, voice categorization is important for various applications. Tamil, being one of the oldest languages in the world, comprises rich regional slang differing in tone, pronunciation, and emotive expression. These slang words are difficult to categorize because they are informal and there is limited annotated audio data. This study proposes an enhanced deep learning framework for Tamil slang classification using a balanced audio corpus. The framework integrates data-specific pre-processing techniques, including Mel spectrograms, Chroma features and spectral contrast, to capture the nuanced characteristics of Tamil speech. A DenseNet backbone, combined with LSTM and GRU layers, models both temporal and spectral information. The suggested FRAE-PSA module is an innovative application of the Pyramid Split Attention (PSA) mechanism adapted to support regional and affective variations of speech. Different from current PSA or Transformer-based approaches, FRAE-PSA splits the audio frequency spectrum and adapts attention weights dynamically based on auxiliary tasks. A multi-branch architecture is employed to fuse temporal and spectral features effectively and multi-task learning is used to enhance regional accent and emotion detection. Custom loss functions and lightweight networks optimize model efficiency. Experimental results show up to a 15% improvement in classification accuracy over baseline models, demonstrating the framework's effectiveness for real-world Tamil slang classification tasks.
[...] Read more.Subscribe to receive issue release notifications and newsletters from MECS Press journals