Finding more Lightroom duplicates

Written by: | Posted on:

I have a problem: I've put all my photos I've taken the past ten years into Lightroom. To ensure that I didn't miss any, I've put in all from my server, all from my backups etc etc etc. In short: duplicate galore. Now, I've sorted by filename and removed duplicates. I've sorted by capture time and removed duplicates. That made me go from 88k to 74k. But I still have duplicates. They have different times because some were posted in my galleries. They have different sizes because some were thumbnails. They have different names because some were exported and sent via mail. And some are more odd, or perhaps a combination of them all. But in 74k photos, finding duplicates and deciding which one to keep is hard!

I have made a solution: a little Python script that will go through the Lightroom database, generate a 9x9 thumbnail of the photo, and compare it to all other photos. It takes a couple of hours to run on my 74k photos, helped me clean out 2k duplicates with only two false positives. That's not bad!

What it does is that it sets the label with a string representing the thumbnail, and then you sort your grid view by label and voila, you're good to go. Since it goes for the raw files, it will find duplicates even if they have been corrected or worked on afterwards:

Another example is that if you've changed the colours a bit around, it will still find it:

So that's all very nice. :-) Here's the source code, under BSD license. Put it in your Lightroom directory together with Lightroom.lrdata. And oh, btw, if it blows up anything at all, do tell me, but don't hold me responsible. This hasn't been tested on much. But if you read the code, I think you'll find it can do very limited harm

Challenges ahead:

  • RAW files not supported yet
  • Doesn't work too well on the different size problem, must find a better solution
  • Faster run times?

Hope you can use it as well as it helped me

Finding friends in Social Networking sites

Written by: | Posted on:

I need Adium for Social Networks. Adium is an IM tool that has 'em all and runs nicely under OS X. Social Networks are exploding. So I thought it would be fun to look for some of my friends, but after trying to find a few it hit me: there are so many networks. I would have to search each one of them. And what if they join tomorrow? I'd have to search again unless they search for me. I have an account at Orkut, Facebook, Blink and probably quite a few others. I usually don't log in to them. So I would need something that searches for my friends in all of them when I search, and which could do that over again whenever I wish to check if they've showed up. Then I'd probably need some interface to keep my information on all of them up-to-date, and to be able to manage them one way or another. Any ideas for an Adium-like client for social networks?

Image Image

Father, husband, software developer, musician and photographer.


Get to know what I'm up to

© 2020 Niklas Saers Contact Me