cluster analysis - Variable selection for k-means clustering -



cluster analysis - Variable selection for k-means clustering -

i'm wondering if there methods selecting variables k-means algorithm. trying market segmentation using algorithm , have dataset dozens of potential variables. have results easy interpret, should limit number of variables max. 5-6. particularly interested in solutions can implemented in spss statistics or weka. also, there method/algorithm getting optimal number of variables clustering (i.e. how many of 'good' variables should use)?

try factor analysis, should help. no. of factors utilize depend on number of variables having eigen value >= 1. after finding no of factors, utilize fa() function find loadings value , decide variables need maintain , discard. help in removing highly multicollinear variables.

cluster-analysis data-mining weka k-means spss

Comments

Popular posts from this blog

xslt - DocBook 5 to PDF transform failing with error: "fo:flow" is missing child elements. Required content model: marker* -

mediawiki - How do I insert tables inside infoboxes on Wikia pages? -

Local Service User Logged into Windows -