I have a lot of images in a folder that I would like to put in order based on how similar they are. The images are histological slides from a tissue microarray (for example: http://www.proteinatlas.org/images/36302/112008_A_6_6.jpg)
What I have tried: I take the first image, compare it to all and match it up with the best match. Then i take the closest match and do the same thing all over. To calculate similarity I have tried Structural similarity index and also Hashing (pHash, dHash) none of them work very well, but the hashing seems to be a bit better.
To put the images in order, I basically use a number of renaming and moving of images from one folder to another and deleting the ones I have already matched up.
What could I use instead of hashing or similarity index and how could I sort them without renaming, moving the files around or deleting them?