A Preview on Subspace Clustering of High Dimensional Data

Authors

  • Sajid Nagi St. Edmund’s College, Shillong – 793001
  • Dhruba Kumar Bhattacharyya Tezpur University, Napaam – 784028
  • Jugal K. Kalita University of Colorado, Colorado Springs CO 80918

DOI:

https://doi.org/10.24297/ijct.v6i3.4466

Keywords:

Challenges, clustering survey, gene expression data, high dimensional data, issues, subspace clustering

Abstract

When clustering high dimensional data, traditional clustering methods are found to be lacking since they consider all of the dimensions of the dataset in discovering clusters whereas only some of the dimensions are relevant. This may give rise to subspaces within the dataset where clusters may be found. Using feature selection, we can remove irrelevant and redundant dimensions by analyzing the entire dataset. The problem of automatically identifying clusters that exist in multiple and maybe overlapping subspaces of high dimensional data, allowing better clustering of the data points, is known as Subspace Clustering. There are two major approaches to subspace clustering based on search strategy. Top-down algorithms find an initial clustering in the full set of dimensions and evaluate the subspaces of each cluster, iteratively improving the results. Bottom-up approaches start from finding low dimensional dense regions, and then use them to form clusters. Based on a survey on subspace clustering, we identify the challenges and issues involved with clustering gene expression data.

Downloads

Download data is not yet available.

Author Biographies

  • Sajid Nagi, St. Edmund’s College, Shillong – 793001
    Department of Computer Science
  • Dhruba Kumar Bhattacharyya, Tezpur University, Napaam – 784028
    Department of Computer Science and Engineering
  • Jugal K. Kalita, University of Colorado, Colorado Springs CO 80918
    Department of Computer Science

Downloads

Published

2013-05-21

Issue

Section

Research Articles

How to Cite

A Preview on Subspace Clustering of High Dimensional Data. (2013). INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 6(3), 441-448. https://doi.org/10.24297/ijct.v6i3.4466

Similar Articles

21-30 of 239

You may also start an advanced similarity search for this article.