top | item 45487994

(no title)

jonah-archive | 4 months ago

Hi, I run the datacenter/infrastructure team at the Internet Archive! We would love to see you at our various events this fall but if paying for the ticket is difficult for you, please email me (in bio) and we'll get you in (if possible).

discuss

order

psychoslave|4 months ago

Are they distributed events all around the world of just in wherever the team is gathered (San Francisco I guess?)

By the way, thank you all the teams in IA, what you provide is such an important thing for humanity.

zhynn|4 months ago

Thanks for helping to run my favorite library on earth.

moralestapia|4 months ago

Hey, Q., so what's the size of the internet archive?

textfiles|4 months ago

For the purposes of ballpark, between 150-200 petabytes of unique data, probably on the lowish end of that last I checked.

metalman|4 months ago

it is large enough that I am wondering if the data captured by the actual physical magnetic charges has a heft, that a person could feel. obviously the hardware would fill a house or something, but at what point does the worlds data become a discernable physical reality, at least in theory

southernplaces7|4 months ago

Most of all, i'm curious about how you reliably and securely store or host so many archived pages. Would you mind briefly explaining such a huge undertaking? Also, total congratulations on the fantastic achievement of this. You guys are my go-to for so much information.

Edit: And how many terabytes it all amounts to.

WhereIsTheTruth|4 months ago

We all know the NSA has access to servers hosted in the U.S. How are you protecting the archive from malicious tampering? Are you using any form of immutable storage? Is it post-quantum secure?

gosub100|4 months ago

Why would they do that? Have you previously seen a case where they "maliciously tampered" with anyone's website?

vettyvignesh|4 months ago

would love technical details around this feat. ex: how you even crawl to begin with, storage, etc