EncCD: A Framework for Efficient Detection of Code Clones
Minhaj Khan
Department of Computer Science, Bahauddin Zakariya
University, Pakistan
Abstract: Code clones represent similar snippets of code
written for an application. The detection of code clones is essential for
maintenance of a software as modification to multiple snippets with a similar
bug becomes cumbersome for a large software. The clone detection techniques
perform conventional parsing before final match detection. An inefficient
parsing mechanism however deteriorates performance of the overall clone
detection mechanism. In this paper, we propose a framework called Encoded Clone
Detector (EncCD), which is based on encoded pipeline processing for efficiently
detecting clones. The proposed framework makes use of efficient labelled
encoding followed by tokenization and match detection. The experimentation
performed on the Intel Core i7 and Intel Xeon processor based systems shows
that the proposed EncCD framework outperforms the widely used JCCD and CCFinder
frameworks by producing a significant performance improvement.
Keywords: Clone detection, Software Engineering,
Software Maintenance, Optimization, Speedup.