Key Term Extraction using a Sentence based Weighted TF-IDF Algorithm

T. Vetriselvi 1, N. P. Gopalan 2, G. Kumaresan 2,*

1. Department of Computer Science and Engineering, K. Ramakrishnan College of Technology, Tiruchirappalli, India

2. Department of Computer Applications, National Institute of Technology, Tiruchirappalli, India

Received: 6 Nov. 2018 / Revised: 25 Jan. 2019 / Accepted: 15 Feb. 2019 / Published: 8 Jul. 2019

Index Terms

Similarity Matrix, Term Count, WordNet


Keyword ranking with similarity identification is an approach for finding the significant keywords in a corpus using a Variant Term Frequency-Inverse Document Frequency (VTF-IDF) algorithm. Some of these keywords may be similar in meaning, and such keywords are reduced to a single term using WordNet. The proposed approach, which does not require any test or training set, assigns sentence-based weights to the keywords (terms) and is found to be effective. Its suitability is analyzed on several data sets using precision and recall as metrics.
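To make the idea concrete, the following is a minimal illustrative sketch of a sentence-weighted TF-IDF ranking, not the VTF-IDF algorithm itself. Several details are assumptions made only for illustration: each occurrence of a term contributes 1/(sentence length) to its weight, the IDF is taken as log(N/df) + 1, and a small hypothetical dictionary `SYNONYMS` stands in for the WordNet lookup so that the example is self-contained. The function names are likewise hypothetical.

```python
import math
import re
from collections import defaultdict

# Hypothetical stand-in for a WordNet lookup (assumption): maps a word to
# the single term that represents its group of similar words.
SYNONYMS = {"automobile": "car", "vehicle": "car"}


def split_sentences(document):
    """Split a document into non-empty sentences on ., ! and ?."""
    return [s for s in re.split(r"[.!?]+", document) if s.strip()]


def tokenize(sentence):
    """Lowercase, keep alphabetic tokens, and merge similar terms."""
    return [SYNONYMS.get(w, w) for w in re.findall(r"[a-z]+", sentence.lower())]


def sentence_weighted_tf(document):
    """Accumulate a per-term weight sentence by sentence.

    Assumed weighting: each occurrence contributes 1 / (sentence length),
    so an occurrence in a short sentence counts more than one in a long one.
    """
    weights = defaultdict(float)
    for sentence in split_sentences(document):
        tokens = tokenize(sentence)
        for term in tokens:
            weights[term] += 1.0 / len(tokens)
    return dict(weights)


def rank_keywords(corpus):
    """Return, for each document, (term, score) pairs sorted by score."""
    tfs = [sentence_weighted_tf(doc) for doc in corpus]
    n_docs = len(corpus)

    # Document frequency: number of documents containing each term.
    df = defaultdict(int)
    for tf in tfs:
        for term in tf:
            df[term] += 1

    ranked = []
    for tf in tfs:
        # Assumed IDF form: log(N / df) + 1, so no term scores exactly zero.
        scores = {t: w * (math.log(n_docs / df[t]) + 1.0) for t, w in tf.items()}
        # Sort by descending score; ties are broken alphabetically.
        ranked.append(sorted(scores.items(), key=lambda kv: (-kv[1], kv[0])))
    return ranked
```

Calling `rank_keywords` on a list of document strings returns one ranked list per document. No training or test set is involved, since the scores depend only on counts within the corpus.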

