Discovering Top-k Spatial High Utility Itemsets in Very Large Quantitative Spatiotemporal databases

P Pallikila, P Veena, RU Kiran, R Avatar… - … Conference on Big …, 2021 - ieeexplore.ieee.org
P Pallikila, P Veena, RU Kiran, R Avatar, S Ito, K Zettsu, PK Reddy
2021 IEEE International Conference on Big Data (Big Data), 2021ieeexplore.ieee.org
Spatial High Utility Itemset Mining (SHUIM) is an important knowledge discovery technique
with many real-world applications. It involves discovering all itemsets that satisfy the user-
specified m inimum u tility (minUtil) inaq uantitative spatiotemporal database. The popular
adoption and the successful industrial application of this technique have been hindered by
the following two limitations:(i) Since the rationale of SHUIM is to find all itemsets that satisfy
the minUtil constraint, it often produces too many patterns, most of which may be redundant …
Spatial High Utility Itemset Mining (SHUIM) is an important knowledge discovery technique with many real-world applications. It involves discovering all itemsets that satisfy the user-specified m inimum u tility (minUtil) i n a q uantitative spatiotemporal database. The popular adoption and the successful industrial application of this technique have been hindered by the following two limitations: (i) Since the rationale of SHUIM is to find all itemsets that satisfy the minUtil constraint, it often produces too many patterns, most of which may be redundant or uninteresting to the user. (ii) Specifying a right minUtil value is an open research problem in SHUIM. This paper tackles these two problems by proposing a novel model of top-k spatial high utility itemsets that may exist in a database. A new constraint, called dynamic minimum utility (dMinUtil), was explored to reduce the search space effectively. This constraint is based on a greedy search, where we raise its value through five thresholdraising strategies. An efficient single scan algorithm that employs depth-first search to find all top-k spatial high utility itemsets was also presented in this paper. Experimental results demonstrate that our algorithm is memory and runtime efficient. We will also demonstrate the usefulness of our algorithm with two real-world case studies.
ieeexplore.ieee.org