Efficient Parallel Compression and Decompression for Large XML Files

Efficient Parallel Compression and Decompression for Large XML Files

Mohammad Ali and Minhaj Khan

Department of Computer Science, Bahauddin Zakariya University, Pakistan

Abstract: eXtensible Markup Language (XML) is gaining popularity and is being used widely on internet for storing and exchanging data. Large XML files when transferred on network create bottleneck and also degrade the query performance. Therefore, efficient mechanisms of compression and decompression are applied to XML files. In this paper, an algorithm for performing XML compression and decompression is suggested. The suggested approach reads an XML file, removes tags, divides the XML file into different parts and then compresses each different part on a separate core for achieving efficiency. We compare performance results of the proposed algorithm with parallel compression and decompression of XML files using GZIP. The performance results show that the suggested algorithm performs 24%, 53% and 72% better than the parallel GZIP compression and decompression on Intel Xeon, Intel core i7 and Intel core i3 based architectures respectively. 

Keywords: XML, distributed computing, XML compression, GZIP, performance.

Received May 26, 2014; accepted January 27, 2015

 

Read 2003 times Last modified on Monday, 07 September 2015 08:13
Share
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…