Questions tagged [hierarchical-clustering]

Hierarchical cluster analysis is a method of cluster analysis that builds, step by step, a hierarchy of clusters, usually displayed as a dendrogram. The most popular variant is agglomerative hierarchical clustering (HAC), which starts from individual objects and merges them into bigger and bigger clusters.
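
As a rough illustration of the agglomerative procedure described above, here is a minimal pure-Python sketch. It is a naive implementation for 1-D points only, not how any particular library does it; the function name `agglomerate`, its `linkage` argument, and the sample data are all made up for this example.

```python
from itertools import combinations


def agglomerate(points, linkage="single"):
    """Naive agglomerative clustering of 1-D points (illustrative only).

    Returns the merges in order as (cluster_a, cluster_b, distance) tuples,
    where each cluster is a sorted tuple of point indices.
    """
    # "single" uses the closest pair between two clusters,
    # "complete" uses the farthest pair.
    pick = {"single": min, "complete": max}[linkage]

    def cluster_distance(a, b):
        return pick(abs(points[i] - points[j]) for i in a for j in b)

    # Start from individual objects: every point is its own cluster.
    clusters = [(i,) for i in range(len(points))]
    merges = []
    while len(clusters) > 1:
        # Find the pair of clusters with the smallest linkage distance.
        a, b = min(combinations(clusters, 2),
                   key=lambda pair: cluster_distance(*pair))
        merges.append((a, b, cluster_distance(a, b)))
        # Replace the two clusters with their union.
        merged = tuple(sorted(a + b))
        clusters = [c for c in clusters if c not in (a, b)] + [merged]
    return merges


# Hypothetical data: five points with steadily growing gaps.
points = [0.0, 1.0, 2.2, 3.6, 5.2]
single_merges = agglomerate(points, "single")
complete_merges = agglomerate(points, "complete")

for name, merges in (("single", single_merges), ("complete", complete_merges)):
    print(name)
    for a, b, d in merges:
        print(f"  merge {a} + {b} at distance {d:.1f}")
```

On this toy data the two linkages produce different hierarchies: single linkage keeps absorbing the nearest point into one growing chain and only joins the last point at the final step, whereas complete linkage finishes by merging the clusters {0, 1} and {2, 3, 4}.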

475 questions
3 votes, 1 answer

Hierarchical clustering on defined labeled clusters

So I have these data points and I know their ground-truth labels. I want to use hierarchical clustering on the dataset, with the constraint that all points sharing a label are clustered together. I know this somewhat defeats the…
mtber75
  • 261
1 vote, 3 answers

How to implement hierarchical clustering in $O(N^2)$ instead of $O(N^3)$

First, a theoretical question. I know that, implemented naively, a hierarchical clustering algorithm has complexity cubic in the number of samples N. This is because in each iteration one has to go over the entire distance matrix to find the…
idoda
  • 217
1 vote, 1 answer

How to interpret hierarchical clustering results

I have a data frame "Customer_original" with customer data; it has around 50 mixed-type variables. I standardize "Customer_original" and put the standardized values into the "Customer_standardize" data frame. I then compute Gower's distance on…
1 vote, 0 answers

Interpreting description of hierarchical agglomerative clustering application

I'm trying to replicate a clustering methodology described in a paper as follows: We define the heterogeneity metric within a cluster to be the average of all-pair Jaccard distances, and at each step merge two clusters if the heterogeneity of the resultant…
1 vote, 0 answers

A name/reference for small top-level clusters in hierarchical clustering

When you perform hierarchical clustering, some outliers often "cluster out" at the top level into tiny clusters. I need to discuss this effect in my paper, and I'm not sure what to call it. For example, in the dendrogram below, the top…
1 vote, 0 answers

What are the methods to find the hierarchical relationships between features/variables?

How do I find the hierarchical interrelationships between features/variables? I made a test input file with p = 30 and n_samples = n = 569 from a pre-made dataset: from sklearn.datasets import load_breast_cancer; cancer =…
Quesop
  • 73
1 vote, 0 answers

Trying to understand Ward's algorithm implementation for clustering

I've been out of academia for the past few years, so my knowledge of statistics is very rusty and I would love some help with my interpretation. I'm trying to validate my hierarchical clustering solution produced by using Ward's/squared Euclidean…
ashamc
  • 53
  • 1
  • 6
1 vote, 0 answers

Why is the single-link method in hierarchical clustering not good at handling round/elliptical shapes?

I don't fully understand why the single-link technique in hierarchical clustering is not good at handling (discovering) clusters of round or elliptical shapes. Simple examples would help me a lot. Thanks.
Mark
  • 141
  • 1
  • 2
  • 11
0 votes, 0 answers

hierarchical clustering linkage

I just want to verify my understanding of hierarchical clustering. Let's assume that you use Euclidean distance as the dissimilarity measure. Let's say that you have three clusters. Each cluster has many observations. If you measure dissimilarity by single…
0 votes, 0 answers

How to Determine Cluster Count with Hierarchical Clustering

I have analyzed the following data set in R using hierarchical analysis, and plotted the results. I've been asked to find and highlight the number of clusters present, but how exactly do you determine this? It seems it all depends on where you cut the…
FEAR
  • 1
0 votes, 1 answer

Hierarchical Cluster Analysis

I came across answers to many questions about cluster analysis on this platform. One area where I still need clarification concerns missing cases. I obtained DHS data and extracted sub-regional variables that my study considered useful. I further…
0 votes, 1 answer

I have 20 symptoms and need to do a hierarchical cluster analysis. Which method is appropriate?

Thank you in advance for any suggestions. I have 20 binary symptoms (for example, anxiety yes/no, depression yes/no), and I have to do a cluster analysis for my research project in order to see which symptoms cluster together. However, I find…
user163672
-2 votes, 3 answers

Agglomerative Hierarchical Clustering "complete linkage" as opposed to "single linkage" dendrogram

Will any dataset clustered via each of the following methods: (1) Agglomerative Hierarchical Clustering using the "complete linkage" method, (2) Agglomerative Hierarchical Clustering using the "single linkage" method, have the same dendrogram structure? If yes,…