I had never thought about this before but from this repo I can see all these smart engineers at the big tech firms live within their own walled garden (or prison?).
If it weren’t for such a vibrant open source world then they’d really be stuck if they tried to apply their knowledge outside of their companies.
It is almost the other way around. Many of these open-source projects, have roots with engineers leaving these big companies and wanting to use the same tools, so they create them. Or, by these big companies releasing papers and then folks creating open-source versions. Here's some examples of the latter: Dapper (w/ OpenTracing) [1], Spanner (w/ CockroachDB) [2], Borg (w/ Mesos) [3], Dremel (w/ Drill) [4]. I'm sure there are tons more examples but these are just off the top of my head.
A lot more than just ex-FAANG folks would be stuck without the OSS technologies listed in the right column. I daresay, considering much of this software was authored by Xooglers or inspired by Google papers, that it's actually everyone else who would be stuck. The folks who have seen how these problems can be solved at scale would just reimplement the solutions, as indeed they have done.
With Microsoft, this has changed substantially in the past decade, and there are many teams running entirely on public online services (GitHub, Azure DevOps, Travis etc) and with end-to-end open source tooling and dependencies. So it's much less of a technological bubble than it used to be. Probably less so than Google at this point. Can't say about Amazon.
They don’t, and if you’ve got the G-factor, pun intended, to get an offer at Google you can get an offer anywhere. Which, honestly, makes this repository all the more confusing to me.
I’m sure there are some people that have lived only within this ecosystem, but that’s the possibly the height of privilege.
This is pretty misleading: a lot of the presented open source equivalents are nothing like the original. For example, comparing HDFS to Colossus is... optimistic? Having used / worked on/with both, the similarity is roughly that they both store bytes.
This would be a lot less misleading if there were annotations and if the "open source/real world" didn't mix together other large companies' products with open source. For example protobuf is open source, but this makes it look like it isn't.
If you needed something Colossus-like in the open source world, what would you use? Is there anything available that's better than HDFS for the "general-purpose, big, distributed file system" use-case?
Sourcegraph CEO here. Cool to see we are listed as a CodeSearch alternative. We assembled some links to Google internal docs and studies about how they use their Google internal CodeSearch tool. See the links at the top of
https://docs.sourcegraph.com/user/search if interested.
What does Google (or others) use for an intranet or team documentation? We're using Quip and it is just a black hole where documents go to die, never be seen again no matter how badly you want to find it.
Amazon had a giant internal wiki that they used for team information, playbooks, design docs, etc. Formatting was kind of ugly, but the barrier to making an edit was super low, and having everything in one searchable place was super useful. From experience, I think any system that requires more than a few minutes to make an edit ends up not being updated and going stale pretty quickly.
It used to be Google Sites (then part of Google Apps for Business which is now GSuite), Docs and Slides which were indexed by an internal Google search engine called MOMA (which I believe was powered by the Google Search Appliance, but my recollection on this last point is faint)
An equivalent of this for Microsoft is pretty much impossible because there isn't one single way of doing any of these things at the company. Different departments/orgs/teams use different tools, some self-built and some off-the-shelf (including most of the stuff on this list).
I can't speak to Amazon, but at Microsoft the equivalents to the Google offering are going to have the same OSS competitors. Having worked on services there, the only thing I really miss is Kusto; it's now available as an Azure service, but I'm at a company all in on AWS :(
Not sure why you think free food is traded for free time. Free food helps me use my time at work more effectively. I usually have breakfast and lunch with teammates, and it's a mix of socialization and work-related conversations. Half an hour in the free cafe > 45-90 minutes going someplace to buy food.
We (Sourcegraph) are interested in building an alternative to Critique at some point in the future, or at least enhancing existing code review tools to offer many of the favorite features of Critique.
What are your favorite/must-have Critique features?
(Also would love to jump on a video chat if you or anyone else is interested, and we could live-stream/share it on YouTube for others interested. Email me at [email protected] if interested.)
I've tried Eclipse Che (listed in the article), and it's pretty good. I'm looking forward to Che switching its editor to Eclipse Theia, which promises to be "the VS Code experience in the browser".
I recently discovered Coder's open source version of their web-based IDE (https://github.com/codercom/code-server). It's pretty much VS Code in the browser, which is almost exactly what I was looking for. You just download the binary, point it to a folder and it works exactly the way you would expect (including the terminal running under the account the server runs as so you get to keep your shell of choice and all that fun stuff).
Where AWS/GitHub is mentioned, there should be the equivalent Azure stuff, especially for bug tracking, CI/CD, repo subfolder "owners", code review, there's a ton of stuff equivalent or better to buganizer, OWNERS file, etc .
In services there should be vitess since it's both a YouTube and an open source project.
It seem to have a new transactional messaging feature with message sending and Ack: https://vitess.io/docs/advanced/messaging/, which seem to fill a slightly different niche than rabbitmq/pubsub/etc.
There is no listed equivalent of RecordIO. What do people use for high-reliability journals?
When I needed something like RecordIO to store market data, I couldn't find anything. So I implemented https://github.com/romkatv/ChunkIO. I later learned of https://github.com/google/riegeli (work in progress), which could've saved me a lot of time if only I found it earlier. I think my ChunkIO is a better though.
Well I guess I’m “that guy” who finds it highly interesting as an outsider with a very high interest in search. Never knew what tools Google’s using (never bothered to look it up either) and this list is a feast of “Aha!”’s for me :)
Hopefully that answers your question ;-) Same as with a discussion about patents the other day. It’s fascinating to learn / read / understand how big businesses work as someone who’s never worked at one nor aspires to work at one, but does want to build one ;-)
Just winging it here, but maybe because these are some of the most influential companies on the face of the Earth, they employ tens of thousands of HN users, they are the envy of maybe half of all employees on the planet, their programming systems effect everyone reading HN, and many people here are genuinely interested in what happens at ginormous corporations that employ other hackers.
[+] [-] jarym|7 years ago|reply
If it weren’t for such a vibrant open source world then they’d really be stuck if they tried to apply their knowledge outside of their companies.
Thanks for sharing the repo!
[+] [-] WestCoastJustin|7 years ago|reply
[1] https://ai.google/research/pubs/pub36356.pdf
[2] https://ai.google/research/pubs/pub39966.pdf
[3] https://ai.google/research/pubs/pub43438.pdf
[4] https://ai.google/research/pubs/pub36632.pdf
[+] [-] asdfasgasdgasdg|7 years ago|reply
[+] [-] int_19h|7 years ago|reply
[+] [-] akhilcacharya|7 years ago|reply
I’m sure there are some people that have lived only within this ecosystem, but that’s the possibly the height of privilege.
[+] [-] smueller1234|7 years ago|reply
[+] [-] puzzle|7 years ago|reply
[+] [-] alexhutcheson|7 years ago|reply
[+] [-] sqs|7 years ago|reply
[+] [-] mherdeg|7 years ago|reply
[+] [-] mdeeks|7 years ago|reply
[+] [-] bitskits|7 years ago|reply
Pros: Get's checked in like code, gets reviewed, and leaves a nice paper trial of edits.
Cons: Doesn't invite comments/discussions as well as Docs does.
[+] [-] alexhutcheson|7 years ago|reply
Source: https://www.xwiki.com/en/references/amazon
[+] [-] bauerd|7 years ago|reply
[+] [-] verst|7 years ago|reply
[+] [-] unknown|7 years ago|reply
[deleted]
[+] [-] dsl|7 years ago|reply
[+] [-] alpb|7 years ago|reply
[+] [-] myroon5|7 years ago|reply
[+] [-] paxys|7 years ago|reply
[+] [-] nickbarnwell|7 years ago|reply
[+] [-] tyingq|7 years ago|reply
Google -> Real World
free food -> :(
With:
free food -> free time
[+] [-] QuercusMax|7 years ago|reply
[+] [-] paxys|7 years ago|reply
[+] [-] dekhn|7 years ago|reply
[+] [-] djtriptych|7 years ago|reply
[+] [-] sqs|7 years ago|reply
What are your favorite/must-have Critique features?
(Also would love to jump on a video chat if you or anyone else is interested, and we could live-stream/share it on YouTube for others interested. Email me at [email protected] if interested.)
[+] [-] kaspnilsson|7 years ago|reply
[+] [-] jaytaylor|7 years ago|reply
Now that Amazon took cloud9 (c9.io) out of the market, it seems like there is a sizeable void.
I want web-based Spacemacs or Evil! Or even just intelli-j.
[+] [-] anton_kosyakov|7 years ago|reply
IDE is based on Theia (https://theia-ide.org), i.e. VS Code with cloud-first mindset.
[+] [-] tsechin|7 years ago|reply
[+] [-] HuggableSquare|7 years ago|reply
[+] [-] kajecounterhack|7 years ago|reply
[+] [-] fhrow4484|7 years ago|reply
In services there should be vitess since it's both a YouTube and an open source project.
It seem to have a new transactional messaging feature with message sending and Ack: https://vitess.io/docs/advanced/messaging/, which seem to fill a slightly different niche than rabbitmq/pubsub/etc.
[+] [-] draw_down|7 years ago|reply
[deleted]
[+] [-] romka2|7 years ago|reply
When I needed something like RecordIO to store market data, I couldn't find anything. So I implemented https://github.com/romkatv/ChunkIO. I later learned of https://github.com/google/riegeli (work in progress), which could've saved me a lot of time if only I found it earlier. I think my ChunkIO is a better though.
[+] [-] shereadsthenews|7 years ago|reply
Edit: it even includes an open source implementation of Cord, which they've renamed to Chain for some reason.
[+] [-] mempko|7 years ago|reply
https://riak.com/assets/bitcask-intro.pdf
[+] [-] unknown|7 years ago|reply
[deleted]
[+] [-] antimora|7 years ago|reply
For example, I see Pipelines could be replaced by CodeDeploy. I hear nix is similar to the brazil build system. Any other similar mapping by others?
[+] [-] unknown|7 years ago|reply
[deleted]
[+] [-] simula67|7 years ago|reply
[+] [-] stingraycharles|7 years ago|reply
[+] [-] q3k|7 years ago|reply
[+] [-] puzzle|7 years ago|reply
[+] [-] malkia|7 years ago|reply
[+] [-] noncoml|7 years ago|reply
[+] [-] dstick|7 years ago|reply
Hopefully that answers your question ;-) Same as with a discussion about patents the other day. It’s fascinating to learn / read / understand how big businesses work as someone who’s never worked at one nor aspires to work at one, but does want to build one ;-)
[+] [-] tomcam|7 years ago|reply
[+] [-] adamnemecek|7 years ago|reply
https://news.ycombinator.com/item?id=19576092
Are people really still jerking about a company that's been around for like a 1/4 of a century?