An Integrated Approach for Measuring Semantic Similarity between Words and Sentences using Web Search Engine
Kavitha A
Manonmaniam Sundaranar University, India
Abstract: Semantic similarity measures play a vital role in Information
Retrieval (IR) and Natural Language Processing. Despite the usefulness of these
measures in various applications, robustly measuring the semantic similarity
between two words remains a challenging task. Here, three semantic similarity
measures are proposed that use the information available on the web to measure
similarity between words and sentences. The proposed approach exploits page
counts and text snippets returned by a web search engine. We use indirect
associations between words, in addition to direct ones, to estimate their
similarity. Evaluation results on different data sets show that our methods
outperform several competing methods.
Keywords: Semantic similarity, web search engine, higher-order association
mining, support vector machine.
Received October 29, 2012; Accepted February 27, 2013