Proceedings of

International Conference on Advances in Computing, Control and Communication CCN 2012

"PHRASE MATCHING IN (S,C) DENSE CODE COMPRESSED FILES"

JAGADISH DHARANIKOTA, SUNEETA AGARWAL
DOI
10.15224/978-981-07-2579-2-451
Pages
49 - 52
Authors
2
ISBN
978-981-07-2579-2

Abstract: “Growing data sizes and limited network bandwidth create a need to compress data files. Compression saves storage, and compressed data can be transferred faster over the network. Pattern matching on compressed files is one of the requirements of information retrieval applications. Files compressed with (s,c) dense code can be searched directly, which significantly reduces search time because the file does not have to be decompressed to find the pattern. In this paper we propose an approach for phrase matching in compressed files by modifying standard string matching algorithms such as the Horspool and Sunday algorithms. Search engines can use this phrase matching to retrieve documents relevant to a given query. Pattern matching on (s,c) dense code compressed files has many advantages, along with better compression ratios than other standard compression algorithms. Searching text in the compressed files is up to 8 times faster.”
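The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' modified algorithm, only an assumption-laden illustration of the general technique: words are ranked by frequency, each rank is encoded with the usual (s,c) dense code scheme (byte values below s are stoppers that end a codeword, the remaining c = 256 - s values are continuers), and a phrase is found by running a plain Horspool search for its concatenated codewords over the compressed bytes. The function names, the toy vocabulary, and the choice s = 2 are all hypothetical and chosen only to keep the example small.

```python
def encode_rank(rank: int, s: int, c: int) -> bytes:
    """Encode a 0-based frequency rank as an (s,c) dense codeword.

    Assumed convention: bytes 0..s-1 are stoppers, bytes s..s+c-1 are continuers.
    A codeword is zero or more continuers followed by exactly one stopper.
    """
    out = [rank % s]              # last byte is always a stopper
    x = rank // s
    while x > 0:
        x -= 1
        out.append(s + x % c)     # preceding bytes are continuers
        x //= c
    return bytes(reversed(out))


def horspool_phrase_search(data: bytes, pattern: bytes, s: int) -> list[int]:
    """Return byte offsets in `data` where `pattern` occurs on a codeword boundary."""
    m, n = len(pattern), len(data)
    if m == 0 or m > n:
        return []
    # Horspool bad-character table built from all pattern bytes except the last.
    shift = {b: m - 1 - i for i, b in enumerate(pattern[:-1])}
    hits = []
    pos = 0
    while pos <= n - m:
        # A byte-level match is accepted only if it starts right after a stopper
        # (or at offset 0); otherwise it begins inside a longer codeword.
        if data[pos:pos + m] == pattern and (pos == 0 or data[pos - 1] < s):
            hits.append(pos)
        pos += shift.get(data[pos + m - 1], m)
    return hits


# Hypothetical toy setup: vocabulary already sorted by decreasing frequency.
S, C = 2, 254                      # s + c must equal 256
vocab = ["the", "of", "search", "engine", "data"]
code = {w: encode_rank(i, S, C) for i, w in enumerate(vocab)}

text = "the search engine of the data search engine".split()
compressed = b"".join(code[w] for w in text)

phrase = b"".join(code[w] for w in ["search", "engine"])
phrase_hits = horspool_phrase_search(compressed, phrase, S)
word_hits = horspool_phrase_search(compressed, code["the"], S)
```

Here `phrase_hits` holds the byte offsets of both occurrences of "search engine" in the compressed stream, found without decompressing it. The stopper check matters for `word_hits`: the single-byte codeword for "the" also appears as the final byte of longer codewords, and those occurrences are rejected because the preceding byte is a continuer.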

Keywords: Frequency based codeword searching; Phrase Matching; (s,c) Dense Code
