I think this bug should be considered completely separately from how unwise it is to use a cloud service as the sole storage of important files.
Regardless of the circumstances, losing user files against the wishes of the user is the absolute worst thing a cloud backup provider can do.
Even for files that are deleted intentionally and unambiguously by the user, I'm astonished that Dropbox actually deletes the files at the end of the 30-day restore window. I would expect them to keep the files for some multiple of the publicly-stated restore deadline where the multiple >= 2, if for no other reason than as a goodwill generator. There is no more evangelical advocate for your company than the customer you email to say "Yes, you intentionally deleted this file six weeks ago. The 30 day deletion deadline has passed, but I have managed to restore the most recent version of your dissertation. Thank you for using Dropbox."
For files that aren't intentionally deleted by the user but are "de-synced", it is disgraceful and appalling that there is no contingency system in place. Keeping user files when the user assumes or wishes them to be kept safe is the core competency of a service like Dropbox.
"The user should have kept multiple redundant copies" is not an excuse for a poorly managed online backup service. "Keep multiple backups of everything important" is good advice for a user, but "Keep user files safe when the user thinks they are safe" is the most essential advice imaginable for the CEO of an online backup service.
I would expect them to keep the files for some multiple of the publicly-stated restore deadline where the multiple >= 2, if for no other reason than as a goodwill generator.
On the other hand, keeping copies of the user's data after you say you've deleted it doesn't sound too ethical to me. I don't expect immediate purging, but I also don't expect potentially sensitive files to linger for months after they were supposed to be deleted.
I think a better system would be to warn about big deletion events (e.g. send an email) to allow the user to revert it within the 30-day period.
The 30 day deletion deadline has passed, but I have managed to restore the most recent version of your dissertation.
Then you have the opposite problem; privacy-centered users complaining "Dropbox keeps your data even when they say it's permanently deleted! Here's proof!!"
Edit to add: Dropbox's sole purpose is to sync files between different physical machines. Which, now that I think about it, seems sort of anachronistic.
In cloud storage, the authoritative version of the file lives in the cloud and each device just accesses it. But in Dropbox, the authoritative version is on every machine. By default, every copy is authoritative, and actions are synced.
This sort of "many originals" architecture seems to confuse people. The article author here is clearly thinking of Dropbox like cloud storage--he checked the web interface, saw his files, and thought it was all good.
But in Dropbox the web interface is not authoritative, the local copy is.
Is Dropbox a backup service? Nope.
Dropbox is folder synch, a network version of a pendrive. Dropbox makes no guarantees about the durability of the data on their servers, which are merely a browsable cache.
Use proper backup services, instead, like Memopal.
"I think this bug should be considered completely separately from how unwise it is to use a cloud service as the sole storage of important files."
It is unwise to use anything as the sole storage of important files. Personally, I think cloud backup is safer then one backing up files to a DVD or tape backup but I just won't keep personal files on any cloud service.
Key assumption to your argument: Dropbox is a backup service.
I don't think that they are. More like the modern incarnation of a file server. Like a fileserver, they have resiliency against service failure or individual storage system failure (via S3). Cool stuff, but it ain't backup.
> I think this bug should be considered completely separately from how unwise it is to use a cloud service as the sole storage of important files.
YES: from DropBox's point of view - this is a serious bug even ignoring what other actions the users have/not taken to protect their data.
And NO: this sort of thing is one of the key reasons not to use such a service as a sole backup - linking the two when trying to educate users on the importance of thinking about data safety, so we can't separate the two as one is important when hammering the other point home...
> Even for files that are deleted intentionally and unambiguously by the user, I'm astonished that Dropbox actually deletes the files at the end of the 30-day restore window. I would expect them to keep the files for some multiple of the publicly-stated restore deadline
That would cause significant consternation in some areas. If I explicitly delete a file I might explicitly want it gone within the stated window and no later and would therefore be unhappy if I found the data still available (even if I had properly encrypted it before it touched the cloud service so noone else could read it).
> For files that aren't intentionally deleted by the user but are "de-synced",
I definitely agree there - it sounds like that process needs some transactional workflow wrapped around it so that:
1. Client and server don't do anything until they've both agreed what to do (then they can both go away and do it even if the connection drops, safe in the knowledge that the action is correct).
2. The client and server record what has been agreed as part of that transaction protected process, so in the case of an unclean stop during the process it can be resumed/retried.
3. Some process may need to be in place for one side rolling back if the other detects a failure applying the agreed actions.
Of course this is a fair bit of work for something that won't happen often, particularly when you consider that there might be more than one active connection to a set of files at any given time so it might not be a simple two-way merge operation, so DB may have other priorities - but the bad PR from it happening and not being cleanly dealt with is something that they need to consider if making that sort of decision.
> "The user should have kept multiple redundant copies" is not an excuse for a poorly managed online backup service
Agreed. But "the single backup service failed" is similarly not something that is going to encourage me to have sympathy for the user!
Sync services are excellent secondary backups, and they satisfy the "off site" requirement that a great many home and small-office users neglect (I know people who worked for a small company which died because the backups were in the same building as the active data so one fire took out the lot), but no one should trust them as their only backup. I even recommend against multiple sync services as your only backups as that in itself can cause problems (when bugs or other peculiarities from one interact badly with similar behaviours in another).
Using just a sync service means you have no off-line (or even semi-offline) backup, which is as bad as not having any off-site copies.
Of course getting this involves educating users as to the risks they are taking, which is a point notoriously difficult to get across, so while I don't expect a service like DB to be perfectly bug free (and therefore wouldn't trust it as an only backup) I think such services need to help the education process by being a little less indigenous about the safety issue and making sure their users know that an accidental delete will be propagated everywhere and be unrecoverable after a time. This will never happen though: "please use some other backup option too in case ours goes wrong" is not a sentiment marketing departments or investors will want to see publicly stated!
" from how unwise it is to use a cloud service as the sole storage of important files."
I disagree (a little bit)
There are some issues here:
- Using a free service
- Using a service that "thinks" when you want to add or remove stuff. Yes, it's easier to use, but it's not explicit, and it usually fails spectacularly.
- Thinking that this "magical save thing" is equivalent to a backup.
My strategy: a big external drive used for Time Machine, and a subscription to Backblaze. Both of these are all about retaining multiple versions, recovering from accidental deletion, and continuously backing up in the background. Dropbox is about syncing stuff between computers.
Dropbox keeps deleted files and previous versions of files for 30 days. With Packrat, a free feature for all paid accounts, Dropbox keeps deleted files for the lifetime of the account.
How is Dropbox not a backup provider? It may not be a very good backup provider, depending on your point of view, but it clearly markets itself as a backup provider ("your stuff is always safe in Dropbox and can be restored in a snap"), is used by its users as a backup provider, and has the features of a backup provider.
I noticed there hasn't been any mention of attempting to recover your deleted files from your local disk. If you haven't tried it yet it is worth a shot. There are a few software solutions out there that will scan your disk for files that have been deleted. On Windows I have used Piriform's Recuva[1] and on OS X I have used PhotoRec[2], both have worked rather well. It is worth noting that the longer you wait to try the more likely the sectors are to be overwritten with new data. (And you should back up any files currently on the disk). I know it is a long shot at this point but some chance is better than none. Good luck.
> From all this information it seems that Dropbox client first deletes files locally before it informs the server about the new selective sync settings. Consequently, if the client crashes or is killed before the server is contacted, the files remain deleted without any trace. After the client restarts again, it only sees there are some files missing and syncs this new state with the server.
It concerns me that the top several comments are some variation of "you shouldn't trust the cloud with your data." How did software services become so totally insulated from the expectations we have of every other service that holds on to our valuable property?
> It concerns me that the top several comments are some variation of "you shouldn't trust the cloud with your data."
No, it's "you shouldn't trust one single thing with your data". Hardware fails. OSes have bugs (OS X's "space in your drive name and we delete your user folder.") Services have unexpected gotchas. This is why you never store anything in only one place.
A sync service by definition is not a separate place (unless he had found a bug in the Packrat feature)
This is actually the point that makes no sense to me - after removing a folder from selective sync, the default (in my mind) should be to leave the folder alone. At the very least the user should be prompted or an option setting should be available to delete or leave the folder. 'Security' and 'disk space' are only two of the many reasons to not sync a folder.
I trust file operations "move", "copy" and "remove".
But "sync" could mean anything: which side is syncing to which side? If something is not present on one side, will it delete it on the other?
I wish the term didn't exist, and apps/clients/... would copy or even just load or show things, and if you want to "sync", then rsync with clear source and destination. Same with mail for example: I don't want to "sync" email messages, I just want to see them, the most recent ones first.
Here "sync" means what Dropbox says it means. The answers to your questions are well defined in their software, even if to some other software, they might be different.
I wish the term didn't exist, and apps/clients/... would copy or even just load or show things, and if you want to "sync", then rsync with clear source and destination.
The source and destination is always clear. It's to the server and from the server. Dropbox uses a star topology, not P2P.
You'd get the same result if you used rsync with source and destination, if you passed --delete, which is equivalent to what Dropbox does.
In today's digital age, I believe lots and lots of people will lose a lot of valuable information like photos because they don't have a good understanding of what backing up means.
I've warned my friend who is computer illiterate that he is risking losing his childrens' complete photo collection for their entire lives because he is only keeping a single copy of his data on his hard drive. This is probably the situation with millions of people in the US alone, and when that hard drive failure event happens, which will happen, then they will lose everything.
This is becoming even more important with SSDs. I used to do a lot of data recovery, and photos were of course a very common type of data that customers wanted to recover.
If the drive was spinning, I could usually recover most of the data. Even if the drive was completely dead, there was still some chance at recovery of I had the correct logic board, and for a hefty price I could send it out to a company with more resources to get the platters spinning again.
SSDs can be much harder recover from, since you need information stored in the controller to map out the data, and many consumer level hard drives brick themselves on failure to prevent silent data corruption.
Would it have been more effective to just provide him with a list of stuff - a choice of two drives; a choice of two softwares (with really easy URLs to buy and download) and instructions for using and testing.
I use Dropbox for sharing pictures, but would never even dream to use a cloud based service for backup purposes.
Granted, Jan's case is a bit more complex and I'm really sorry for his loss.
Stories like that should really be a lesson to everybody never to completely trust a cloud based service as your main backup.
On a side note: I agree that archiving of digital files is a hard problem. The smartest librarians of the world are thinking about how to achieve this for, literally, decades and I'm not sure they even have a good solution to the problem.
My personal strategy is redundancy: I buy new hard disks every couple of years and copy all important files, twice. One hard disk is kept off site.
It's not perfect, but it's the best I can come up with. Reading horror stories, like Jan's, indicates that it's the better solution. Despite the messiness.
When your main business is storing files (and you also just raised yet another shitload of cash), not offering better safeguards against file loss is inadmissible. I left Dropbox for Google Drive when they announced the Condolezza Rice move, and I haven't looked back since.
If you are using Dropbox as a sole backup of your files, think again.
Simplify to If you only have a sole backup, think again. Points 4 and 5 should be you always have more than one backup of important material and at least one backup should be on physical media that you own.
This is clearly a problem for the reputation of dropbox.
On their website they market the service as:
"Even if your phone goes for a swim, your stuff is always safe in Dropbox and can be restored in a snap."
With this message in mind, and a valuation of roughly >>10$ BILLION<<.
I think, you can have very very high expectations regarding the data retention of their service.
Also, the whole point of using a service like dropbox is to remove friction/time investment regarding the standard backup/sync tasks.
If you suggests to handle multiple harddisk backups locally+offsite, that is fine except that not everyone wants this kind of time investment and cost associated with a do-it-yourself backup service and rather depend on a service provider that they can trust.
As I said to someone the other day after dealing with yet another failure by Dropbox to do the safe thing or even the reasonably-recoverable thing with our data: "One reason I'm not worth a billion dollars is I would never have let Dropbox ship like this!"
Doing sync on top of an existing filesystem, making it work with existing apps, maintaining solid transaction semantics, and having a simple, understandable user model -- you just can't do all four. The Dropbox experiment has now shown that "correctness" doesn't matter, commercially speaking.
We live in a convenient age, but also a fickle one. I laugh when my wife prints 100 hundreds of digital photos although her disaster recovery process is more stable than mine.
My wife does the same; she has a Snapfish account. She will have them print and mail her the most important pictures from a set, while trusting the ones she doesn't care as much about to our home server and Facebook (the former for long-term storage, the latter for convenient access). Since she has to order a certain number of prints every year to keep the account open, and it comes with unlimited free storage, it works out well for her.
The only exception was our wedding photos. We have a set of prints directly from the studio hired to take the pictures, as well as a digital copy on DVD from them. We also have those digital files backed up on our home server, a computer at my parents' house, Dropbox, Snapfish, and both of our Facebook accounts. The photography company also has long-term storage that we would have access to if all else failed. That's about as redundant as we can get without spending a ton of money, but it's more than most people seem to have.
I laughed at this when I first read it, but it's seriously true. For people that aren't computer literate, physical copies can sometimes be the best form of a backup for digital data.
Dropbox has one of the worst websites on the Internet. Seriously!
E.g. people have mentioned that "packrat" would have helped in this situation. But try finding that service from the pricing page. Hint: NOT THERE. Someone here linked to some explanation in their "help" section. Seriously? Such an important feature is only mentioned under "help"?
Even if you go to sign up for the paid version, Dropbox Pro, packrat isn't mentioned [1]. Their horrid website alone is enough for me to not want to do business with them.
Someone else here has succinctly summarized the situation [2]:
One reason I'm not worth a billion dollars
is I would never have let Dropbox ship like this!
As many others mentioned already "sync is not backup".
If you really want true backup then on a Mac I will suggest to have a setup like:
1- local backup for quick recovery with Time Machine
2- remote backup with a true backup solution like CrashPlan (or similar)
I personally have such setup with a local one with my Time Machine, and having CrashPlan running all the time, it did save me a couple times when after many weeks I deleted a wrong file or folder... Got everything back without problem.
If you don't have a Time Capsule, then you can always consider an alternative with just a smaller subset of what you want to archive by using a JetDrive by Transcend.
I recently got a 128GB JetDrive and it is my local destination for my MBP for TimeMachine... Yes 128GB is not enough but knowing that CrashPlan is constantly running make me comfortable enough with that setup.
I use my Time Capsule's USB port to attach an external drive that is shared so that Crashplan can backup to a folder on it. That's in addition to using Crashplan Central for cloud backups and Time Machine.
What about in 10, 20 years? Photo libraries will keep inflating. Local storage will not. As of now I backup from a SSD Mac. What happens when I don't have a computer anymore?
Interestingly, people don't value "bits" or information. We value moments and emotions and work and art. There's no successful current consumer business model for people to store and backup photos (Backblaze is mainly prosumer).
And so aren't social networks the real backups by now? The redundancy of publishing on multiple services means some photos will stick and the rest will fade, somehow like former printed pictures I guess. Publish it or lose it?
External backup drives continue to increase in size at a falling cost per GB.
It is cheaper and lower latency to store/retrieve a large amount of data locally than remotely, unless you believe that the current glut of "near-free cloud storage" will continue indefinitely. Market cycles suggest otherwise.
> Publish it or lose it
Why do social networks store your photos "for free"?
> And so aren't social networks the real backups by now?
I hope not. I cringe when I see people who treat them like they are. What do you do when your account password gets cracked(it happens to people on Facebook a lot), or the site itself goes away(MySpace)? Also, you're not able to store the images in their original form; I'm pretty sure sites like these will have limitations on what you can upload. I'm not a pro-photographer by any means, but there's no way I could upload a RAW file to Facebook.
I had an issue similar to this (but fortunately not nearly as bad) about two and a half or three years ago and that was what convinced me to pay of the Packrat service add-on.
I agree with the OP that that should be a built-in option for all paying customers -- or at least make it more visible as an add-on. I've had instances in the past where it was months later that I realized something was either deleted or didn't sync and I had to look through Events and use Packrat to restore. It can be scary - even if you do make backups of your backups.
Once you deleted your own local copy, the folder of photos on Dropbox was no longer a "backup." You just moved your one copy elsewhere. You essentially had NO backup. And everyone should know, you should always have at least two backups. (Again, you had zero.) I wouldn't even consider Dropbox (without Packrat) a valid backup destination either, because any changes get synced.
Sorry this happened to you, but better backup practices could have avoided it easily.
My buddy Tom says "If its not on your current computer, its lost". Imagine when Dropbox closes one day (as every company ultimately does), how much will be lost.
It really sucks that we all have to keep learning this lesson over and over. Everyone I've ever spoken to that has a good backup strategy in place has it because they have lost irreplaceable files.
Have real backups. Syncing is not a backup strategy, raid is not a backup strategy, etc. 3-2-1 At least three copies in 2 different (storage) formats and at least one copy offsite. It sounds like overkill but you have to decide how much your data is really worth.
[+] [-] gabemart|11 years ago|reply
Regardless of the circumstances, losing user files against the wishes of the user is the absolute worst thing a cloud backup provider can do.
Even for files that are deleted intentionally and unambiguously by the user, I'm astonished that Dropbox actually deletes the files at the end of the 30-day restore window. I would expect them to keep the files for some multiple of the publicly-stated restore deadline where the multiple >= 2, if for no other reason than as a goodwill generator. There is no more evangelical advocate for your company than the customer you email to say "Yes, you intentionally deleted this file six weeks ago. The 30 day deletion deadline has passed, but I have managed to restore the most recent version of your dissertation. Thank you for using Dropbox."
For files that aren't intentionally deleted by the user but are "de-synced", it is disgraceful and appalling that there is no contingency system in place. Keeping user files when the user assumes or wishes them to be kept safe is the core competency of a service like Dropbox.
"The user should have kept multiple redundant copies" is not an excuse for a poorly managed online backup service. "Keep multiple backups of everything important" is good advice for a user, but "Keep user files safe when the user thinks they are safe" is the most essential advice imaginable for the CEO of an online backup service.
[+] [-] icebraining|11 years ago|reply
On the other hand, keeping copies of the user's data after you say you've deleted it doesn't sound too ethical to me. I don't expect immediate purging, but I also don't expect potentially sensitive files to linger for months after they were supposed to be deleted.
I think a better system would be to warn about big deletion events (e.g. send an email) to allow the user to revert it within the 30-day period.
[+] [-] colanderman|11 years ago|reply
Then you have the opposite problem; privacy-centered users complaining "Dropbox keeps your data even when they say it's permanently deleted! Here's proof!!"
[+] [-] snowwrestler|11 years ago|reply
Edit to add: Dropbox's sole purpose is to sync files between different physical machines. Which, now that I think about it, seems sort of anachronistic.
In cloud storage, the authoritative version of the file lives in the cloud and each device just accesses it. But in Dropbox, the authoritative version is on every machine. By default, every copy is authoritative, and actions are synced.
This sort of "many originals" architecture seems to confuse people. The article author here is clearly thinking of Dropbox like cloud storage--he checked the web interface, saw his files, and thought it was all good.
But in Dropbox the web interface is not authoritative, the local copy is.
[+] [-] ai_ja_nai|11 years ago|reply
Use proper backup services, instead, like Memopal.
[+] [-] 01Michael10|11 years ago|reply
It is unwise to use anything as the sole storage of important files. Personally, I think cloud backup is safer then one backing up files to a DVD or tape backup but I just won't keep personal files on any cloud service.
Never just have one point of failure!
[+] [-] Spooky23|11 years ago|reply
I don't think that they are. More like the modern incarnation of a file server. Like a fileserver, they have resiliency against service failure or individual storage system failure (via S3). Cool stuff, but it ain't backup.
[+] [-] dspillett|11 years ago|reply
YES: from DropBox's point of view - this is a serious bug even ignoring what other actions the users have/not taken to protect their data.
And NO: this sort of thing is one of the key reasons not to use such a service as a sole backup - linking the two when trying to educate users on the importance of thinking about data safety, so we can't separate the two as one is important when hammering the other point home...
> Even for files that are deleted intentionally and unambiguously by the user, I'm astonished that Dropbox actually deletes the files at the end of the 30-day restore window. I would expect them to keep the files for some multiple of the publicly-stated restore deadline
That would cause significant consternation in some areas. If I explicitly delete a file I might explicitly want it gone within the stated window and no later and would therefore be unhappy if I found the data still available (even if I had properly encrypted it before it touched the cloud service so noone else could read it).
> For files that aren't intentionally deleted by the user but are "de-synced",
I definitely agree there - it sounds like that process needs some transactional workflow wrapped around it so that:
1. Client and server don't do anything until they've both agreed what to do (then they can both go away and do it even if the connection drops, safe in the knowledge that the action is correct).
2. The client and server record what has been agreed as part of that transaction protected process, so in the case of an unclean stop during the process it can be resumed/retried.
3. Some process may need to be in place for one side rolling back if the other detects a failure applying the agreed actions.
Of course this is a fair bit of work for something that won't happen often, particularly when you consider that there might be more than one active connection to a set of files at any given time so it might not be a simple two-way merge operation, so DB may have other priorities - but the bad PR from it happening and not being cleanly dealt with is something that they need to consider if making that sort of decision.
> "The user should have kept multiple redundant copies" is not an excuse for a poorly managed online backup service
Agreed. But "the single backup service failed" is similarly not something that is going to encourage me to have sympathy for the user!
Sync services are excellent secondary backups, and they satisfy the "off site" requirement that a great many home and small-office users neglect (I know people who worked for a small company which died because the backups were in the same building as the active data so one fire took out the lot), but no one should trust them as their only backup. I even recommend against multiple sync services as your only backups as that in itself can cause problems (when bugs or other peculiarities from one interact badly with similar behaviours in another).
Using just a sync service means you have no off-line (or even semi-offline) backup, which is as bad as not having any off-site copies.
Of course getting this involves educating users as to the risks they are taking, which is a point notoriously difficult to get across, so while I don't expect a service like DB to be perfectly bug free (and therefore wouldn't trust it as an only backup) I think such services need to help the education process by being a little less indigenous about the safety issue and making sure their users know that an accidental delete will be propagated everywhere and be unrecoverable after a time. This will never happen though: "please use some other backup option too in case ours goes wrong" is not a sentiment marketing departments or investors will want to see publicly stated!
[+] [-] unknown|11 years ago|reply
[deleted]
[+] [-] raverbashing|11 years ago|reply
I disagree (a little bit)
There are some issues here:
- Using a free service
- Using a service that "thinks" when you want to add or remove stuff. Yes, it's easier to use, but it's not explicit, and it usually fails spectacularly.
- Thinking that this "magical save thing" is equivalent to a backup.
[+] [-] egypturnash|11 years ago|reply
Sync is not backup.
Sync is not backup.
My strategy: a big external drive used for Time Machine, and a subscription to Backblaze. Both of these are all about retaining multiple versions, recovering from accidental deletion, and continuously backing up in the background. Dropbox is about syncing stuff between computers.
[+] [-] gabemart|11 years ago|reply
How is Dropbox not a backup provider? It may not be a very good backup provider, depending on your point of view, but it clearly markets itself as a backup provider ("your stuff is always safe in Dropbox and can be restored in a snap"), is used by its users as a backup provider, and has the features of a backup provider.
[+] [-] bruceboughton|11 years ago|reply
[+] [-] unknown|11 years ago|reply
[deleted]
[+] [-] zperrault|11 years ago|reply
[1] http://www.piriform.com/recuva [2] http://www.cgsecurity.org/wiki/PhotoRec
[+] [-] rayiner|11 years ago|reply
> From all this information it seems that Dropbox client first deletes files locally before it informs the server about the new selective sync settings. Consequently, if the client crashes or is killed before the server is contacted, the files remain deleted without any trace. After the client restarts again, it only sees there are some files missing and syncs this new state with the server.
It concerns me that the top several comments are some variation of "you shouldn't trust the cloud with your data." How did software services become so totally insulated from the expectations we have of every other service that holds on to our valuable property?
[+] [-] kalleboo|11 years ago|reply
No, it's "you shouldn't trust one single thing with your data". Hardware fails. OSes have bugs (OS X's "space in your drive name and we delete your user folder.") Services have unexpected gotchas. This is why you never store anything in only one place.
A sync service by definition is not a separate place (unless he had found a bug in the Packrat feature)
[+] [-] achr2|11 years ago|reply
[+] [-] Aardwolf|11 years ago|reply
I trust file operations "move", "copy" and "remove".
But "sync" could mean anything: which side is syncing to which side? If something is not present on one side, will it delete it on the other?
I wish the term didn't exist, and apps/clients/... would copy or even just load or show things, and if you want to "sync", then rsync with clear source and destination. Same with mail for example: I don't want to "sync" email messages, I just want to see them, the most recent ones first.
[+] [-] icebraining|11 years ago|reply
I wish the term didn't exist, and apps/clients/... would copy or even just load or show things, and if you want to "sync", then rsync with clear source and destination.
The source and destination is always clear. It's to the server and from the server. Dropbox uses a star topology, not P2P.
You'd get the same result if you used rsync with source and destination, if you passed --delete, which is equivalent to what Dropbox does.
[+] [-] steven2012|11 years ago|reply
I've warned my friend who is computer illiterate that he is risking losing his childrens' complete photo collection for their entire lives because he is only keeping a single copy of his data on his hard drive. This is probably the situation with millions of people in the US alone, and when that hard drive failure event happens, which will happen, then they will lose everything.
[+] [-] ominous_prime|11 years ago|reply
If the drive was spinning, I could usually recover most of the data. Even if the drive was completely dead, there was still some chance at recovery of I had the correct logic board, and for a hefty price I could send it out to a company with more resources to get the platters spinning again.
SSDs can be much harder recover from, since you need information stored in the controller to map out the data, and many consumer level hard drives brick themselves on failure to prevent silent data corruption.
[+] [-] DanBC|11 years ago|reply
Would it have been more effective to just provide him with a list of stuff - a choice of two drives; a choice of two softwares (with really easy URLs to buy and download) and instructions for using and testing.
That could be a gift for birthday or Xmas?
[+] [-] CaptainZapp|11 years ago|reply
Granted, Jan's case is a bit more complex and I'm really sorry for his loss.
Stories like that should really be a lesson to everybody never to completely trust a cloud based service as your main backup.
On a side note: I agree that archiving of digital files is a hard problem. The smartest librarians of the world are thinking about how to achieve this for, literally, decades and I'm not sure they even have a good solution to the problem.
My personal strategy is redundancy: I buy new hard disks every couple of years and copy all important files, twice. One hard disk is kept off site.
It's not perfect, but it's the best I can come up with. Reading horror stories, like Jan's, indicates that it's the better solution. Despite the messiness.
[+] [-] simi_|11 years ago|reply
[+] [-] petercooper|11 years ago|reply
Simplify to If you only have a sole backup, think again. Points 4 and 5 should be you always have more than one backup of important material and at least one backup should be on physical media that you own.
[+] [-] paulmalenke|11 years ago|reply
[+] [-] zedadex|11 years ago|reply
> Simplify to If you only have a sole backup, think again.
Yep. 3-2-1: "At least three copies, in two different formats, with one of those copies off-site."
[+] [-] SunboX|11 years ago|reply
[+] [-] jabiko|11 years ago|reply
[+] [-] je42|11 years ago|reply
On their website they market the service as:
"Even if your phone goes for a swim, your stuff is always safe in Dropbox and can be restored in a snap."
With this message in mind, and a valuation of roughly >>10$ BILLION<<.
I think, you can have very very high expectations regarding the data retention of their service.
Also, the whole point of using a service like dropbox is to remove friction/time investment regarding the standard backup/sync tasks.
If you suggests to handle multiple harddisk backups locally+offsite, that is fine except that not everyone wants this kind of time investment and cost associated with a do-it-yourself backup service and rather depend on a service provider that they can trust.
[+] [-] wrs|11 years ago|reply
Doing sync on top of an existing filesystem, making it work with existing apps, maintaining solid transaction semantics, and having a simple, understandable user model -- you just can't do all four. The Dropbox experiment has now shown that "correctness" doesn't matter, commercially speaking.
[+] [-] restlessmedia|11 years ago|reply
[+] [-] morganvachon|11 years ago|reply
The only exception was our wedding photos. We have a set of prints directly from the studio hired to take the pictures, as well as a digital copy on DVD from them. We also have those digital files backed up on our home server, a computer at my parents' house, Dropbox, Snapfish, and both of our Facebook accounts. The photography company also has long-term storage that we would have access to if all else failed. That's about as redundant as we can get without spending a ton of money, but it's more than most people seem to have.
[+] [-] ToastyMallows|11 years ago|reply
[+] [-] nly|11 years ago|reply
[+] [-] PhantomGremlin|11 years ago|reply
E.g. people have mentioned that "packrat" would have helped in this situation. But try finding that service from the pricing page. Hint: NOT THERE. Someone here linked to some explanation in their "help" section. Seriously? Such an important feature is only mentioned under "help"?
Even if you go to sign up for the paid version, Dropbox Pro, packrat isn't mentioned [1]. Their horrid website alone is enough for me to not want to do business with them.
Someone else here has succinctly summarized the situation [2]:
[1] https://www.dropbox.com/upgrade [2] https://news.ycombinator.com/item?id=8105548[+] [-] yoda_sl|11 years ago|reply
I personally have such setup with a local one with my Time Machine, and having CrashPlan running all the time, it did save me a couple times when after many weeks I deleted a wrong file or folder... Got everything back without problem.
If you don't have a Time Capsule, then you can always consider an alternative with just a smaller subset of what you want to archive by using a JetDrive by Transcend. I recently got a 128GB JetDrive and it is my local destination for my MBP for TimeMachine... Yes 128GB is not enough but knowing that CrashPlan is constantly running make me comfortable enough with that setup.
[+] [-] slantyyz|11 years ago|reply
[+] [-] pierreio|11 years ago|reply
What about in 10, 20 years? Photo libraries will keep inflating. Local storage will not. As of now I backup from a SSD Mac. What happens when I don't have a computer anymore?
Interestingly, people don't value "bits" or information. We value moments and emotions and work and art. There's no successful current consumer business model for people to store and backup photos (Backblaze is mainly prosumer).
And so aren't social networks the real backups by now? The redundancy of publishing on multiple services means some photos will stick and the rest will fade, somehow like former printed pictures I guess. Publish it or lose it?
[+] [-] walterbell|11 years ago|reply
External backup drives continue to increase in size at a falling cost per GB.
It is cheaper and lower latency to store/retrieve a large amount of data locally than remotely, unless you believe that the current glut of "near-free cloud storage" will continue indefinitely. Market cycles suggest otherwise.
> Publish it or lose it
Why do social networks store your photos "for free"?
[+] [-] artmageddon|11 years ago|reply
I hope not. I cringe when I see people who treat them like they are. What do you do when your account password gets cracked(it happens to people on Facebook a lot), or the site itself goes away(MySpace)? Also, you're not able to store the images in their original form; I'm pretty sure sites like these will have limitations on what you can upload. I'm not a pro-photographer by any means, but there's no way I could upload a RAW file to Facebook.
[+] [-] icebraining|11 years ago|reply
Wifi-enabled drives are already common; no computer needed. They are even targeted at mobile device users ("for iPad/iPhone").
[+] [-] filmgirlcw|11 years ago|reply
I agree with the OP that that should be a built-in option for all paying customers -- or at least make it more visible as an add-on. I've had instances in the past where it was months later that I realized something was either deleted or didn't sync and I had to look through Events and use Packrat to restore. It can be scary - even if you do make backups of your backups.
[+] [-] Fofer|11 years ago|reply
Sorry this happened to you, but better backup practices could have avoided it easily.
[+] [-] JoeAltmaier|11 years ago|reply
[+] [-] therealmocker|11 years ago|reply
[+] [-] profsnuggles|11 years ago|reply
Have real backups. Syncing is not a backup strategy, raid is not a backup strategy, etc. 3-2-1 At least three copies in 2 different (storage) formats and at least one copy offsite. It sounds like overkill but you have to decide how much your data is really worth.
[+] [-] arethuza|11 years ago|reply
Though I have seen it used (to my horror) as a software release strategy.... :-)
[+] [-] catshirt|11 years ago|reply
what's currently the most convenient and simple way to store multiple local backups?