IGSNRR OpenIR  > 历年回溯文献
DECODE: a new method for discovering clusters of different densities in spatial data
Pei T.; Jasra A.; Hand D. J.; Zhu A. X.; Zhou C. H.
Source PublicationData Mining and Knowledge Discovery
KeywordData Mining Mcmc Point Process Reversible Jump Nearest Neighbor Earthquake Chain Monte-carlo Point-processes Neighbor Features Earthquakes Foreshock Algorithm Sequence
AbstractWhen clusters with different densities and noise lie in a spatial point set, the major obstacle to classifying these data is the determination of the thresholds for classification, which may form a series of bins for allocating each point to different clusters. Much of the previous work has adopted a model-based approach, but is either incapable of estimating the thresholds in an automatic way, or limited to only two point processes, i.e. noise and clusters with the same density. In this paper, we present a new density-based cluster method (DECODE), in which a spatial data set is presumed to consist of different point processes and clusters with different densities belong to different point processes. DECODE is based upon a reversible jump Markov Chain Monte Carlo (MCMC) strategy and divided into three steps. The first step is to map each point in the data to its mth nearest distance, which is referred to as the distance between a point and its mth nearest neighbor. In the second step, classification thresholds are determined via a reversible jump MCMC strategy. In the third step, clusters are formed by spatially connecting the points whose mth nearest distances fall into a particular bin defined by the thresholds. Four experiments, including two simulated data sets and two seismic data sets, are used to evaluate the algorithm. Results on simulated data show that our approach is capable of discovering the clusters automatically. Results on seismic data suggest that the clustered earthquakes, identified by DECODE, either imply the epicenters of forthcoming strong earthquakes or indicate the areas with the most intensive seismicity, this is consistent with the tectonic states and estimated stress distribution in the associated areas. The comparison between DECODE and other state-of-the-art methods, such as DBSCAN, OPTICS and Wavelet Cluster, illustrates the contribution of our approach: although DECODE can be computationally expensive, it is capable of identifying the number of point processes and simultaneously estimating the classification thresholds with little prior knowledge.
Indexed BySCI
Document TypeSCI/SSCI论文
Recommended Citation
GB/T 7714
Pei T.,Jasra A.,Hand D. J.,et al. DECODE: a new method for discovering clusters of different densities in spatial data. 2009.
Files in This Item: Download All
File Name/Size DocType Version Access License
Pei-2009-DECODE_ a n(1901KB) 开放获取LicenseView Download
Related Services
Recommend this item
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Pei T.]'s Articles
[Jasra A.]'s Articles
[Hand D. J.]'s Articles
Baidu academic
Similar articles in Baidu academic
[Pei T.]'s Articles
[Jasra A.]'s Articles
[Hand D. J.]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Pei T.]'s Articles
[Jasra A.]'s Articles
[Hand D. J.]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: Pei-2009-DECODE_ a new method.pdf
Format: Adobe PDF
All comments (0)
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.