2

Further to my previous post on the Hellinger Distance, there was one comment raised about there being different expressions of the Hellinger Distance. This has intrigued me.

In the Encyclopedia of Distances, 3rd Ed (p266), the Hellinger Distance is given as: enter image description here

(I have used a picture to show the equation as given in the above reference)

However, I can’t get this to equate. Where have I gone wrong...assuming I have?

So from the above reference we have...the left-hand expression is: $$\left[\sum_x \left (\sqrt{p_1 (x)}-\sqrt{p_2 (x)}\right)^{2}\right]^\frac{1}{2}$$ Re-writing as a radical $$\sqrt{\sum_x \left (\sqrt{p_1 (x)}-\sqrt{p_2 (x)}\right)^{2}}$$ Expanding the square $$\sqrt{\sum_x \left(\sqrt{p_1(x)}\right)^2-2\sqrt{p_1(x)}\sqrt{p_2(x)}+\left(\sqrt{p_2(x)}\right)^2}$$ Simplifying the radicals $$\sqrt{\sum_x {p_1(x)}-2\sqrt{p_1(x)p_2(x)}+{p_2(x)}}$$ Sum of a probability distribution equals 1, so $\sum_x p(x)=1$; $$\sqrt{\sum_x 1-2\sqrt{p_1(x)p_2(x)}+1}$$ Simplifying $$\sqrt{\sum_x 2-2\sqrt{p_1(x)p_2(x)}}$$ Factorising $$\sqrt{2\left(\sum_x 1-\sqrt{p_1(x)p_2(x)}\right)}$$ Moving the constant outside the radical and the minuend outside the summation $$\sqrt2\sqrt{1-\sum_x \sqrt{p_1(x)p_2(x)}}$$

So I get a constant $\sqrt{2}$. That is okay as I am aware some expressions of the Hellinger distance have a bound of $[0,\sqrt{2}]$. However, I can't see where a constant of $2$ comes from as given in the above reference (which would give a bound of $[0,2]$).

Any help appreciated.

anna6931
  • 121
  • 2
    Great question and well presented. I have raised similar inconsistencies with the Hellinger Distance in a previous post. It's important to find these errors - not only to improve the integrity of literature but to also reduce ambiguity in math/stats. – Mari153 Oct 09 '22 at 10:38

1 Answers1

2

From $[\rm I], $

enter image description here

So, it's a typo.


Reference:

$\rm [I]$ Encyclopedia of Distances, Michel Marie Deza, Elena Deza, Springer-Verlag Berlin Heidelberg, $2016, $ p. $268.$

User1865345
  • 8,202
  • 1
    Thanks for the confirmation. At least it shows my math was right! Unfortunately, this typo error seems to have gained traction in some articles, and even in questions on this site. Guess it shows the necessity to check and go back to first principles - as my advisor keeps telling me. – anna6931 Oct 07 '22 at 21:51
  • One thing I would suggest would be to follow the latest edition of any literature. – User1865345 Oct 07 '22 at 22:04
  • 2
    The interesting thing is that over the four editions of the "Encyclopedia of Distances", there have been four different expressions of the Hellinger Distance, including the 'typo'. This has clearly inaccurately influenced literature. The latest one is more standard but it still varies from the conventional expression that has a constant of $\frac{1}{2}$ which gives the bounds of $[0,1]$ as shown here. – Mari153 Oct 09 '22 at 10:29