COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH

Authors

M. Sheshikala¹, D. Rajeswara Rao², and Md. Ali Kadampur³
¹,³ S.R Engineering College; ² KL University

Abstract

In this paper we investigate the colocation mining problem in the context of uncertain data. Uncertain data is data that is only partially complete. Much real-world data is uncertain, for example demographic data, sensor network data, and GIS data. Handling such data is a challenge for knowledge discovery, particularly in colocation mining. One straightforward method is to find the probabilistic prevalent colocations (PPCs). This method tries to find all colocations that are likely to be generated from a random world. To do this, we first allow an approximation error when finding the PPCs, which reduces the computation. Next, we find all the possible worlds, split them into two different worlds, and compute the prevalence probability. This probability is compared with a minimum probability threshold to decide whether a colocation is a PPC or not. Experimental results on the selected data set show a significant improvement in computational time compared with some of the existing methods used in colocation mining.
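
To make the terms in the abstract concrete, the following is a minimal sketch of how a prevalence probability could be computed by brute-force enumeration of possible worlds and then compared with a minimum probability threshold. It is not the approximation method of the paper. The toy instances, the neighbor radius, the use of the participation index as the prevalence measure, the restriction to two-feature colocations, and all function names are assumptions made for illustration.

```python
from itertools import product
from math import dist

# Hypothetical toy data: (feature, (x, y), existence probability).
INSTANCES = [
    ("A", (0.0, 0.0), 0.9),
    ("A", (5.0, 5.0), 0.5),
    ("B", (0.5, 0.0), 0.8),
    ("B", (9.0, 9.0), 0.6),
]


def participation_index(world, feat_a, feat_b, radius):
    """Participation index of the pair {feat_a, feat_b} in one certain world."""
    a_pts = [loc for f, loc, _ in world if f == feat_a]
    b_pts = [loc for f, loc, _ in world if f == feat_b]
    if not a_pts or not b_pts:
        return 0.0
    # Count instances of each feature that have a neighbor of the other feature.
    a_hit = sum(any(dist(a, b) <= radius for b in b_pts) for a in a_pts)
    b_hit = sum(any(dist(a, b) <= radius for a in a_pts) for b in b_pts)
    return min(a_hit / len(a_pts), b_hit / len(b_pts))


def prevalence_probability(instances, feat_a, feat_b, radius, min_prev):
    """Sum the probabilities of all possible worlds in which the pair is prevalent.

    Assumes instances exist independently. Enumerates 2^n worlds, so this is
    only usable on very small inputs.
    """
    total = 0.0
    for mask in product([False, True], repeat=len(instances)):
        world_prob = 1.0
        world = []
        for present, inst in zip(mask, instances):
            world_prob *= inst[2] if present else 1.0 - inst[2]
            if present:
                world.append(inst)
        if participation_index(world, feat_a, feat_b, radius) >= min_prev:
            total += world_prob
    return total


def is_ppc(instances, feat_a, feat_b, radius, min_prev, min_prob):
    """Decide whether the pair is a probabilistic prevalent colocation."""
    prob = prevalence_probability(instances, feat_a, feat_b, radius, min_prev)
    return prob >= min_prob


if __name__ == "__main__":
    p = prevalence_probability(INSTANCES, "A", "B", radius=1.0, min_prev=0.5)
    print(f"prevalence probability of {{A, B}}: {p:.3f}")
    print("PPC:", is_ppc(INSTANCES, "A", "B", 1.0, 0.5, min_prob=0.7))
```

Exhaustive enumeration grows exponentially with the number of uncertain instances, which is the kind of cost that an approximate method aims to reduce.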

Keywords

Probabilistic Approach, Colocation Mining, Uncertain Data Sets