Referential DNA Data Compression using Hadoop Map Reduce Framework

Referential DNA Data Compression using Hadoop Map Reduce Framework

Raju Bhukya and Sumit Deshmuk

Department of Computer Science and Engineering, National Institute of Technology, India

Abstract: The indispensable knowledge of Deoxyribonucleic Acid (DNA) sequences and sharply reducing cost of the DNA sequencing techniques has attracted numerous researchers in the field of Genetics. These sequences are getting available at an exponential rate leading to the bulging size of molecular biology databases making large disk arrays and compute clusters inevitable for analysis.In this paper, we proposed referential DNA data compression using hadoop MapReduce Framework to process humongous amount of genetic data in distributed environment on high performance compute clusters. Our method has successfully achieved a better balance between compression ratio and the amount of time required for DNA data compression as compared to other Referential DNA Data Compression methods.

Keywords: Compression, map reduce sequences, dna sequences.

Received August 12, 2017; accepted April 17, 2018
https://doi.org/10.34028/iajit/17/2/8

Full text      

Read 950 times Last modified on Wednesday, 26 February 2020 05:49
Share

Upcoming courses

  • Diploma Courses
  • Business and Enterprise
  • Digital Literacy & IT
  • Health Literacy
  • Business Literacy

Free courses

Starting from Jun. 14 2016

the degree finder

in 3 easy steps
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…