AHC, k-means preprocessing parameter
Add a new parameter "Maximum number of columns" which drives the "kk" HCPC option :
kk: "An integer corresponding to the number of clusters used in a Kmeans prepro- cessing before the hierarchical clustering; the top of the hierarchical tree is then constructed from this partition. This is very useful if the number of individuals is high. Note that consolidation cannot be performed if kk is different from Inf and some graphics are not drawn. Inf is used by default and no preprocessing is done, all the graphical outputs are then given"
The default value is "1000" ?
Tooltip : "If the number of columns to process is higher than this value, a k-means preprocessing is done to reduce the number of columns"
Validation test
-
Compute an AHC on Voeux/annees
-
Tree inertias(62) are : 0.02946824000810865, 0.017009658614705714, 0.013538737561990003, 0.010700709101563509, 0.008729597905087566, 0.005001531187365394,...
-
set Kmeans processing limit to 50
-
re-compute
-
Tree inertias (49) are differents : 9.304126234174934E-4, 5.885272728059069E-4, 3.014134718653397E-4, 1.5932409005018803E-4, 1.1263315529493199E-4, 9.816819124949387E-5,...