Loading...

Proceedings of

International Conference on Recent Trends in Computing and Communication Engineering RTCCE 2013

"AN EFFECTIVE STEMMER IN DEVANAGARI SCRIPT"

ABHISHEK TYAGI MONIKA DOGRA UPENDRA MISHRA
DOI
10.15224/978-981-07-6184-4-05
Pages
22 - 25
Authors
3
ISBN
978-981-07-6184-4

Abstract: “In today’s word of internet web search engines are developing the techniques to make the surfing faster. Stemming is a technique used by web search engines for prefix and suffix removal from the derived word. Stemming provides the way to store similar documents together. This research work aims at the development of Hindi stemmer based on Devanagari script for stripping both prefixes as well as suffixes from derived word to provide better stemming than previous stemmers. Proposed stemmer uses the hybrid approach which is the combination of lookup algorithm, suffix stripping algorithm and prefix removal algorithm.”

Keywords: natural language processing, stemming, overstemming, under-stemming, inflected word, Information Retrieval –IR, conflation

Download PDF