top | item 21431620

(no title)

jdmcnugent | 6 years ago

I just dealt with this crap storm of a project last month. We had 40k photos scattered across three laptops, old hard drives, and sd cards. First I just crudely copied all of the folders on to an external hard drive and ran a freeware duplicate remover to clean out about 20% of them. Then I used a python script to go through this giant pile of pics and copy them in to folders by year and month based on the created date. It also added yyyy-mm-dd to the beginning of each file name. Now we are slowly going through month by month and adding simple tags in the file name (event, location, names). It’s far from perfect, but I didn’t want to deal with keeping everything synced in a database or locked in to a certain OS or app, plus it should still be searchable in 15 years when we are all running Windows 30 and Mac OS Ozarks or whatever.

discuss

order

mceachen|6 years ago

May I ask, how are you saving the tags? Are you writing to sidecars?

Be careful with overwriting your originals. Many years ago I used jpegtran to rotate losslessly, but didn't realize it was removing all the metadata as well.

I added a bunch of heuristics to PhotoStructure to infer missing tags based on sibling files, specifically because I'd borked so many of my own photos.

FWIW, I've tried to make design decisions that will hopefully allow libraries to be very long-lived. PhotoStructure can copy unique (by SHA) originals into a dated subdirectory, and has what may be the most advanced duplicate image detection around (just added in the newest version). Your library is cross-platform (for example, stored on your NAS, created on your mac, then opened on your Windows box, and everything just works). The sqlite database is a straightforward schema.

jdmcnugent|6 years ago

I just put the tags in the file name, like “2019-12-25_xmas_bob_grandma.jpg”. Obviously you can’t go crazy with a bunch of tags, but I think I can get by with 2 or 3 tags at most. I was afraid to use sidecar info or xattr because I think that data can be lost if the files get moved between file systems (ie eventually moved from the current hdd to the nas I have yet to buy, etc). I definitely kept my raw unorganized folder on the ext hdd for now, but I’ve run several scripts to make sure I didn’t inadvertently overwrite or miss anything.

rajesh-s|6 years ago

Exactly! For me the pile keeps building up. I feel this is a problem that's still not addressed well. Sure you could save compressed ones to Instagram/Google Photos, but manually saving this is still a big hurdle for later retrieval. Need a simpler approach to tag and sort them into albums rather than just folders