I am working on a research project in which I need to compare several distance-based methods, say TF-IDF-based KNN for classification and K-means for clustering. Suppose I use cosine similarity for one and cosine distance for the other: how will that affect the evaluations?

silent_dev

1 Answer


If by cosine distance you meant Euclidean distance, note that Euclidean distance is sensitive to each vector's L2 norm (its magnitude) and not only to its direction. That is, vectors with quite different directions can end up clustered together because their distances from the origin are similar.
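A minimal sketch of the point above, in case it helps. It assumes the common convention that cosine distance is defined as 1 minus cosine similarity; the toy vectors and helper names are hypothetical and only for illustration. Under that assumption, ranking neighbours by highest cosine similarity and by lowest cosine distance gives the same result, while Euclidean distance on unnormalized vectors can prefer a vector with a different direction but similar magnitude.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def norm(a):
    return math.sqrt(dot(a, a))


def cosine_similarity(a, b):
    return dot(a, b) / (norm(a) * norm(b))


def cosine_distance(a, b):
    # Assumed convention: cosine distance = 1 - cosine similarity.
    return 1.0 - cosine_similarity(a, b)


def euclidean_distance(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Hypothetical toy vectors (not from the original post).
query = [1.0, 0.0]
candidates = {
    "same_direction_long": [10.0, 0.0],   # same direction, larger magnitude
    "other_direction_short": [0.0, 1.0],  # orthogonal, same magnitude as query
}

# Nearest neighbour under each measure.
nearest_by_similarity = max(
    candidates, key=lambda k: cosine_similarity(query, candidates[k])
)
nearest_by_cosine_distance = min(
    candidates, key=lambda k: cosine_distance(query, candidates[k])
)
nearest_by_euclidean = min(
    candidates, key=lambda k: euclidean_distance(query, candidates[k])
)

print(nearest_by_similarity)
print(nearest_by_cosine_distance)
print(nearest_by_euclidean)
```

Here the two cosine-based choices agree, and the Euclidean choice differs because it is driven by magnitude. If the vectors are first scaled to unit L2 norm, squared Euclidean distance equals 2 times the cosine distance, so the rankings coincide.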

To see the effects, you can try it out yourself, referring to the answers to this question.

iamgr007