Skip to main content
SHARE
Publication

Modeling Spatial Dependencies and Semantic Concepts in Data Mining...

by Ranga R Vatsavai
Publication Type
Conference Paper
Publication Date
Publisher Location
United States of America
Conference Name
International Conference on Computing for Geospatial Research and Applications (Com.Geo)
Conference Location
DC, District of Columbia, United States of America
Conference Sponsor
ACM, Microsoft
Conference Date
-

Data mining is the process of discovering new patterns and relationships in large datasets. However, several studies have shown that general data mining techniques often fail to extract meaningful patterns and relationships from the spatial data owing to the violation of fundamental geospatial principles. In this tutorial, we introduce basic principles behind explicit modeling of spatial and semantic concepts in data mining. In particular, we focus on modeling these concepts in the widely used classification, clustering, and prediction algorithms. Classification is the process of learning a structure or model (from user given inputs) and applying the known model to the new data. Clustering is the process of discovering groups and structures in the data that are ``similar,'' without applying any known structures in the data. Prediction is the process of finding a function that models (explains) the data with least error. One common assumption among all these methods is that the data is independent and identically distributed. Such assumptions do not hold well in spatial data, where spatial dependency and spatial heterogeneity are a norm. In addition, spatial semantics are often ignored by the data mining algorithms. In this tutorial we cover recent advances in explicitly modeling of spatial dependencies and semantic concepts in data mining.