Working with data that use different dimensions, you do not want that one dimension dominate.
This means feature scaling! A very intuitive way is to use min-max scaling so you scale everything between 0 to 1.
What I do not understand and what is not intuitive for me at all is to use z-score for feature scaling.
Why is z-score used? What is the motivation to not use min-max and to use z-score? Why is it a good idea to scale your data in standard deviations from the mean? What was the motivation to use z-score for scaling? Why is min-max not used all the time? What problem does z-score solve what min-max does not solve?
hope someone can help me and make it somehow clear.
Mahalanobis distance:
the Mahalanobis distance makes sense for me to detect outliers, but I do not understand how you can use it to motivate your feature scaling with z-score
range stability:
you said that rang is one of the least stable in statistics. What do you mean by that? What stability do you mean? What does it mean to be stable? Why is the range not stable and on what sense? Why are standard deviation unit more stable?
– JaySmi Oct 08 '21 at 15:22