top | item 23871258

(no title)

wikiman | 5 years ago

I'm not going to lie, the comment about "if they can use git" is the core of my problem with the arctic storage. I'm still wondering what the point is other than to make some meta point about how committed github is to preservation

discuss

order

brabel|5 years ago

As GitHub explains in their blog post[1]:

_It will also include works which explain the many layers of technical foundations that make software possible: microprocessors, networking, electronics, semiconductors, and even pre-industrial technologies. This will allow the archive’s inheritors to better understand today’s world and its technologies, and may even help them recreate computers to use the archived software._

[1] https://github.blog/2020-07-16-github-archive-program-the-jo...

Deimorz|5 years ago

There's no git usage, they're just storing a snapshot of each repo's current HEAD in a TAR file (and then compressed and QR-encoded).

> The snapshot will consist of the HEAD of the default branch of each repository, minus any binaries larger than 100KB in size—depending on available space, repos with more stars may retain binaries. Each repository will be packaged as a single TAR file.

The guide included on each reel about how to access the data is public, git isn't involved in the process: https://github.com/github/archive-program/blob/master/GUIDE....

dgellow|5 years ago

Haha, yes, I have this image in mind of future scientists trying to understand the mess the git CLI is! I'm sure they will have a bunch of theories in which git is considered a religious relic from past humans, used to summon some gods by writing weird incantations, that's already how it feels to me some time :)

berkes|5 years ago

The git-cli is an inconsistent mess, agreed.

The data-model, however, is not. It is actually a very smart, mathematical (cryptographical) tree-structure[1].

I'm pretty sure some far future generation will be able to figure out the math behind a "git database". I'm actually more confident they will be able to do so, than your average "excel" or "winrar backup".

-- [1] I really started liking git and understanding what it all is about (and how to use the weird commandline --flags - or was is -flags or -f or -F or flags) after reading the Git From the Bottom Up book: https://jwiegley.github.io/git-from-the-bottom-up/

ocdtrekkie|5 years ago

My understanding is that the vault also includes instructions on how to access the stored projects. I would guess this has to at least explain the minimum ability to replicate git or understand git's format?

throwaway744678|5 years ago

The git source repo is probably included in the archive itself. So meta.

ericlewis|5 years ago

Pretty sure the repos are stored plaintext in tar balls