I have a file with many duplicated entries like this:
Jon Corzine
Jon S. Corzine
Hudson
Jon S. Corzine
Hudson
Corzine
Richard J. Codey
Corzine
Corzine
Bernard F. Kenny Jr.
Corzine
Corzine
Codey
Corzine
Codey
Codey
James E. McGreevey
Codey
Corzine
Codey
Codey
Corzine
Codey
Corzine
Robert E. Andrews
Codey
Codey
Kenny
Barry P. Sarkisian
Joseph Doria
Codey
Albio Sires
Louis Manzo
Laura Mansnerus
Lorne Michaels
Maya Rudolph
Each entity there is also associated with one or more IDs.
I want to end up with each entity listed only once, but because of disambiguation a single name can map to multiple IDs. For something like Hudson, there could be one ID for the river, a different one for the bay, another for the town, and so on.
I guess the best way to do this would be a hash map where the name is the key and the collection of IDs associated with that name is the value. Is that right?
Is there a way to output a hash map in JSON format or some other highly malleable data representation?
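To make the idea concrete, here is a minimal Python sketch of the structure I have in mind. It assumes the file can be read as (name, ID) pairs. The ID strings and the `group_ids` helper are made up for illustration, since my real IDs and file layout are not shown above. Each name maps to its distinct IDs, and the result is dumped with the standard `json` module.

```python
import json
from collections import defaultdict

# Hypothetical (name, ID) pairs; the real IDs and file format are assumptions.
pairs = [
    ("Hudson", "id_river"),
    ("Hudson", "id_bay"),
    ("Corzine", "id_1"),
    ("Corzine", "id_1"),  # repeated entry, should collapse to one ID
    ("Codey", "id_2"),
]


def group_ids(pairs):
    """Map each name to a sorted list of the distinct IDs seen with it."""
    grouped = defaultdict(set)
    for name, entity_id in pairs:
        grouped[name].add(entity_id)
    # Sets are not JSON serializable, so convert each one to a sorted list.
    return {name: sorted(ids) for name, ids in grouped.items()}


grouped = group_ids(pairs)
print(json.dumps(grouped, indent=2))
```

Using a set per name drops the duplicate entries automatically, and converting to lists at the end keeps the output valid JSON.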