Scalable Self-Organizing Structured P2P Information Retrieval Model Based on Equivalence Classes

Scalable Self-Organizing Structured P2P Information Retrieval Model Based on Equivalence Classes

Yaser A. Al-Lahham and Mohammad Hassan
Faculty of Science and Information Technology, Zarqa University, Jordan
 
Abstract: This paper proposes a new autonomous self-organizing content-based node clustering peer to peer information retrieval (P2PIR) model. This model uses incremental transitive document-to-document similarity technique to build Local Equivalence Classes (LECes) of documents on a source node. Locality Sensitive Hashing scheme is applied to map a representative of each LEC into a set of keys which will be published to hosting node(s). Similar LECes on different nodes form Universal Equivalence Classes (UECes), which indicate the connectivity between these nodes. The same LSH scheme is used to submit queries to subset of nodes that most likely have relevant information. The proposed model has been implemented. The obtained results indicate efficiency in building connectivity between similar nodes, and correctly allocate and retrieve relevant answers to high percentage of queries. The system was tested for different network sizes and proved to be scalable as efficiency downgraded gracefully as the network size grows exponentially.

Keywords: Peer-to-Peer systems, Information retrieval, Node clustering, Equivalence class, Mapping, Incremental transitivity.
 
Received February 5, 2012; accepted May 22, 2012
  

Full Text

Read 3113 times Last modified on Tuesday, 18 February 2020 04:00
Share
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…