¿Where can I find the references that describes the methods used in n_clusters?
              
              #984
            
            
              
                Unanswered
              
          
                  
                    
                      luifrancgom
                    
                  
                
                  asked this question in
                Q&A
              
            Replies: 0 comments
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
        
    
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
When using
n_clustersin the documentation it is mention that " ... The mainn_clustersfunction proposes to run all of them, and find out the number of clusters that is suggested by the majority of methods ..." But there is not a description about what methods are used.Using the example pointed out you can extract all the methods used in the following way:
Created on 2024-06-23 with reprex v2.1.0
However, there is not a direct reference for each method. For example in the case of Elbow I was curious about how this method was implemented because Elbow method is a visual heuristic without a strict mathematical formula.
Checking the source code of
n_clustersthis is the way it is calculated:Created on 2024-06-23 with reprex v2.1.0
However, in
factoextrathe following plot is pointed out:And in the
fviz_nbclustdocumentation the following reference is pointed out but there is not a description about a strict mathematical formula.I understand in the case of Elbow the logic used in
n_clustersbut without a description in the function documentation it is difficult to understand the method without checking the source code.¿Where can I find the references for the rest of the 28 methods used in
n_clusterswithout checking the source code?Beta Was this translation helpful? Give feedback.
All reactions