Dr. R. Subha received the B.E. degree in Computer Science and Engineering from Periyar University in 2002 and the M.E. degree in Software Engineering from Anna University, Chennai, in 2006, and completed the Ph.D. degree in software engineering in 2014. At present, she is working as an Assistant Professor in the Department of Computer Science & Engg, Sri Krishna College of Technology, Coimbatore. She is currently pursuing a Ph.D. under Anna University, Coimbatore. Her research interests include Software Engineering, Computer Architecture and NLP.

Similar Words Identification Using Naive and TF-IDF Method

By Divya K.S., R. Subha, S. Palaniswami

DOI: https://doi.org/10.5815/ijitcs.2014.11.06, Pub. Date: 8 Oct. 2014

Requirement satisfaction is one of the most important factors in the success of software. Every requirement specified by the customer should be satisfied in each phase of software development. Satisfaction assessment determines whether each component of a requirement has been addressed in the design document. The objective of this paper is to implement two methods for identifying the satisfied requirements in the design document. To identify the satisfied requirements, the words that are similar in the two documents are determined. Two methods, Naive satisfaction assessment and TF-IDF satisfaction assessment, are used to determine the similar words present in the requirements document and the design documents. Porter’s stemming algorithm is used to perform the stemming. The two methods are evaluated on the basis of their precision and recall values. The final result gives an accurate picture of requirement satisfaction, so that defects can be detected at an early stage of software development. Because the defects are detected at an early stage, the cost of correcting them is low.
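
The abstract does not give the exact formulas, so the following Python code is only a minimal sketch of how the two kinds of assessment and their evaluation might look. Everything in it is an assumption made for illustration: the function names, the use of term overlap as the naive score, the idf weighting log(N / df) + 1, cosine similarity as the TF-IDF score, and the sample sentences. In particular, crude_stem is a simplified suffix stripper standing in for Porter's stemming algorithm, not an implementation of it.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def crude_stem(word):
    """Hypothetical stand-in for Porter's stemmer: strips a few suffixes only."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def terms(text):
    """Tokenize the text and stem every token."""
    return [crude_stem(token) for token in tokenize(text)]


def naive_score(requirement, design):
    """Fraction of distinct requirement terms that also occur in the design text."""
    req_terms = set(terms(requirement))
    design_terms = set(terms(design))
    return len(req_terms & design_terms) / len(req_terms) if req_terms else 0.0


def tfidf_vectors(documents):
    """Return one {term: tf * idf} dict per document (idf = log(N / df) + 1)."""
    term_lists = [terms(doc) for doc in documents]
    n_docs = len(term_lists)
    doc_freq = Counter(t for term_list in term_lists for t in set(term_list))
    vectors = []
    for term_list in term_lists:
        counts = Counter(term_list)
        vectors.append({
            t: (c / len(term_list)) * (math.log(n_docs / doc_freq[t]) + 1.0)
            for t, c in counts.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def precision_recall(predicted, actual):
    """Precision and recall of predicted satisfied requirements against a reference set."""
    predicted, actual = set(predicted), set(actual)
    true_positives = len(predicted & actual)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Illustrative data only; these sentences are not taken from the paper.
requirement = "The system shall encrypt stored passwords"
design = "Passwords are encrypted before being stored in the database"
unrelated = "Reports are exported as PDF files"

print("naive:", naive_score(requirement, design))
vectors = tfidf_vectors([requirement, design, unrelated])
print("tf-idf:", cosine(vectors[0], vectors[1]))
```

In a sketch like this, a requirement would be marked as satisfied when its score exceeds a chosen threshold, and precision_recall would then compare the marked set against a manually prepared answer set. The threshold and the answer set are left out here because the abstract does not specify them.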

