Is there a calculation or test to determine if the occurrence of a surname in a population in a specific geographical area is statistically significant or just a random distribution?
For example, in a surname study of the most common 42 surnames in a geographic region, the surname Steele occurs 3249 times (3rd most common). The total number of families reporting a surname in the top 42 most common is 79408 and the total population in the geographic region is 119000. Obviously, 3249/119000*100= 2.71% of the population have the Steele surname, but is this statistically significant in population studies?