Mining Frequent Sequential Rules with An Efficient Parallel Algorithm
Nesma Youssef1,3, Hatem Abdulkader3, and Amira Abdelwahab2,3
1Department of Information System, Sadat Academy for Management Science, Egypt
2Department of Information Systems, King Faisal University, Saudi Arabia
3Department of Information Systems, Menoufia University, Egypt
Abstract: Sequential rule mining is one of the most common data mining techniques. It intends to find desired rules in large sequence databases. It can decide the essential information that helps acquire knowledge from large search spaces and select curiously rules from sequence databases. The key challenge is to avoid wasting time, which is particularly difficult in large sequence databases. This paper studies the mining rules from two representations of sequential patterns to have compact databases without affecting the final result. In addition, execute a parallel approach by utilizing multi core processor architecture for mining non-redundant sequential rules. Also, perform pruning techniques to enhance the efficiency of the generated rules. The evaluation of the proposed algorithm was accomplished by comparing it with another non-redundant sequential rule algorithm called Non-Redundant with Dynamic Bit Vector (NRD-DBV). Both algorithms were performed on four real datasets with different characteristics. Our experiments show the performance of the proposed algorithm in terms of execution time and computational cost. It achieves the highest efficiency, especially for large datasets and with low values of minimum support, as it takes approximately half the time consumed by the compared algorithm.
Keyword: Non-redundant rule, multi-core processors, dynamic bit vector, closed sequential patterns, sequential generator pattern.
Received December 6, 2020; accepted April 28, 2021