Cluster analysis - Variable selection for k-means clustering

September 15, 2012

I'm wondering if there are any methods for selecting variables for the k-means algorithm. I am trying to do market segmentation using this algorithm, and I have a dataset with dozens of potential variables. To get results that are easy to interpret, I should limit the number of variables to at most 5-6. I am particularly interested in solutions that can be implemented in SPSS Statistics or Weka. Also, is there a method/algorithm for getting the optimal number of variables for clustering (i.e. how many of the 'good' variables I should use)?

Try factor analysis; it should help. The number of factors to use depends on how many factors have an eigenvalue >= 1. After finding the number of factors, use the fa() function to find the loadings, then decide which variables to keep and which to discard. This also helps in removing highly multicollinear variables.
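The steps in this answer can be sketched in Python with NumPy. This is only an illustration under assumptions, not the fa() function itself: it uses principal-component loadings from the correlation matrix as a stand-in for a proper factor analysis, the helper name `kaiser_select` and the synthetic data are hypothetical, and keeping the single highest-loading variable per factor is just one simple selection rule.

```python
import numpy as np

def kaiser_select(X, names):
    """Apply the eigenvalue >= 1 rule and pick one variable per retained factor.

    Hypothetical helper: loadings are principal-component loadings of the
    correlation matrix, an approximation of what a factor analysis would give.
    """
    corr = np.corrcoef(X, rowvar=False)          # variables are columns
    eigvals, eigvecs = np.linalg.eigh(corr)      # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]            # sort descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]

    n_factors = int(np.sum(eigvals >= 1.0))      # eigenvalue >= 1 rule
    # Loading of variable i on factor j: eigenvector entry * sqrt(eigenvalue)
    loadings = eigvecs[:, :n_factors] * np.sqrt(eigvals[:n_factors])

    # Keep the variable with the largest absolute loading on each factor
    picked = [names[int(np.argmax(np.abs(loadings[:, j])))]
              for j in range(n_factors)]
    return n_factors, loadings, picked

# Synthetic data (assumption): 7 observed variables driven by 2 latent factors
rng = np.random.default_rng(0)
f1, f2 = rng.normal(size=(2, 500))
cols = [f1 + 0.3 * rng.normal(size=500) for _ in range(4)]
cols += [f2 + 0.3 * rng.normal(size=500) for _ in range(3)]
X = np.column_stack(cols)
names = ["a1", "a2", "a3", "a4", "b1", "b2", "b3"]

n_factors, loadings, picked = kaiser_select(X, names)
print(n_factors, picked)
```

Variables that load heavily on the same factor are highly correlated with each other, so keeping one representative per factor reduces both the number of variables and the multicollinearity among them. On real data the same variable can top more than one factor, so the picked list may need manual review.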

cluster-analysis data-mining weka k-means spss
