This is badass, thanks for sharing @carride. In celebration that it's Friday, I'm going to test it out right now.
In case the author happens to see this and can respond, how did you figure all of this out? Would really enjoy a deep dive covering your process. This is brilliant.
I also wonder if it could be extended to Nim, Zig, or any other less-straightforward hookable languages [than C/C++].
Edit: Apologies, maybe this is not that interesting of a wonderment. I did a little research and both Nim and Zig interface with libc.
I'm not yet clear on whether node.js or Deno use libc, if any fellow HNers know please leave a reply! If they don't use libc, they could be interesting targets.
Thanks for the kind words. One of the authors here :)
Actually this blog post started from a Twitter thread (that I forgot to link in the post https://twitter.com/Aviramyh/status/1544964265961979905) - if you have any more questions, feel free to ask. We didn't want to go "too low level" so the common dev would enjoy this :)
regarding extending, probably possible - I think it'd be easier with Zig (no weird stacks AFAIK) but we don't need it as they probably just use libc like most languages.
P.S would love hearing your thoughts about mirrord
Hey, One of the writers of the article here.
This is a very controversial topic in Go ecosystem, and following all the discussions that led to this decision is very hard.
In short, I think main reasons are:
1. libc is considered hazard - security, comfort, runtime. It needs to maintain support for so many flows and setups so it's hard to make things better. For example, having `errno` as a global is quite weird (instead of just returning it, which is a whole discussion of it's own)
2. Golang devs like to re-invent the wheel, in a kind of Apple-ish way - all other are doing it wrong, we're going to do it better. ofc it's debatable whether they're correct with their approach.
3. Given they use Plan 9 system design, doing FFI from Go is very expensive (need to switch stack, save context, etc on each call)
This is mostly due to ease of distribution. The binary is self-contained, no need to ship anything but the binary itself. You can also compile locally and then run it elsewhere that is a completely different flavor of Linux. Quite handy when experimenting. Finally, no dependency on libc enables easy cross-compilation. You might not even have libc installed for the architecture you are targeting!
Isn't there something important missing here? My understanding is that Go's non-standard ABI doesn't guarantee much available stack space or the existence of guard pages. IIUC, this places a standard-ABI-like stack frame on the Go stack for calling into Rust, but what guarantee is there that the Rust/libc code won't overflow the small stack? What happens if it does? AFAICT, the answer are "none" and "memory corruption".
> "Goroutine stack is dynamic, i.e. it is constantly expanding/shrinking depending on the current needs. This means any common code that runs in system stack assumes it can grow as it wishes (until it exceeds max stack size) while actually, it can’t unless using Go APIs for expanding. Our Rust code isn’t aware of it, so it uses parts of the stack that aren’t actually usable and causes stack overflow."
tl;dr - yes, you're correct and we replace Go stack with system stack for the duration of the call, just like cgo does.
How big is your Go codebase versus your Rust codebase? Why not rewrite bits in Rust? This is very cool, but seems like a lot of effort and ongoing maintenance.
It’s not the case actually.
We’re working on a dev tool called mirrord that lets you create local processes in context of a remote environment so the ergonomics would be of local setup with the benefits of leveraging real cloud environments.
The way we accomplish that is we hook sys calls and then choose what happens locally and what happens remotely.
If you have the source code for both you should ideally use something like GRPC or another way to communicate. Or maybe cgo if you want to wet your hands into that.
The approach in this blog works with compiled binaries.
> Go used to do raw system calls on macOS, and binaries were occasionally broken by kernel updates. Now Go uses libc on macOS, and binaries are forward compatible with future macOS versions just like any other C/C++/ObjC/swift program. OS X 10.10 (Yosemite) is the current minimum supported version.
They tried but macOS doesn’t have any user<>kernel stable API so you have to rely on libsystem to provide it. (It broke very often so they changed it to use libsystem)
[+] [-] metadat|3 years ago|reply
In case the author happens to see this and can respond, how did you figure all of this out? Would really enjoy a deep dive covering your process. This is brilliant.
I also wonder if it could be extended to Nim, Zig, or any other less-straightforward hookable languages [than C/C++].
Edit: Apologies, maybe this is not that interesting of a wonderment. I did a little research and both Nim and Zig interface with libc.
I'm not yet clear on whether node.js or Deno use libc, if any fellow HNers know please leave a reply! If they don't use libc, they could be interesting targets.
[+] [-] aviramha|3 years ago|reply
regarding extending, probably possible - I think it'd be easier with Zig (no weird stacks AFAIK) but we don't need it as they probably just use libc like most languages.
P.S would love hearing your thoughts about mirrord
[+] [-] infiniteregrets|3 years ago|reply
[+] [-] generichuman|3 years ago|reply
Zig does not depend on libc on Linux, for example: https://github.com/ziglang/zig/blob/master/lib/std/os/linux....
You can choose to link libc though.
[+] [-] zasdffaa|3 years ago|reply
Could someone explain this. I'm not familiar with low level linux stuff. Why would you choose not to use libc, what are the implications?
[+] [-] aviramha|3 years ago|reply
1. libc is considered hazard - security, comfort, runtime. It needs to maintain support for so many flows and setups so it's hard to make things better. For example, having `errno` as a global is quite weird (instead of just returning it, which is a whole discussion of it's own)
2. Golang devs like to re-invent the wheel, in a kind of Apple-ish way - all other are doing it wrong, we're going to do it better. ofc it's debatable whether they're correct with their approach.
3. Given they use Plan 9 system design, doing FFI from Go is very expensive (need to switch stack, save context, etc on each call)
[+] [-] rapidlua|3 years ago|reply
[+] [-] scottlamb|3 years ago|reply
[+] [-] aviramha|3 years ago|reply
> "Goroutine stack is dynamic, i.e. it is constantly expanding/shrinking depending on the current needs. This means any common code that runs in system stack assumes it can grow as it wishes (until it exceeds max stack size) while actually, it can’t unless using Go APIs for expanding. Our Rust code isn’t aware of it, so it uses parts of the stack that aren’t actually usable and causes stack overflow."
tl;dr - yes, you're correct and we replace Go stack with system stack for the duration of the call, just like cgo does.
[+] [-] jgavris|3 years ago|reply
[+] [-] aviramha|3 years ago|reply
[+] [-] nedsma|3 years ago|reply
[+] [-] pijiskhan|3 years ago|reply
[+] [-] aviramha|3 years ago|reply
[+] [-] arriu|3 years ago|reply
Does anyone have any recommendations on how to do something like this but in reverse? Calling a go function from rust?
[+] [-] aviramha|3 years ago|reply
Do you mean for an already compiled Go binary? if you can recompile you can use cgo.
If not the following requirements come to mind (there are more probably):
1. Allocate g and m if doesn’t exist in current process. 2. Switch to go stack before call
Btw there’s the cgocallback routine that manages C code that calls into go so that’s a good look to see what’s needed.
[+] [-] IceWreck|3 years ago|reply
The approach in this blog works with compiled binaries.
[+] [-] ithrow|3 years ago|reply
[+] [-] infiniteregrets|3 years ago|reply
also found a discussion - https://news.ycombinator.com/item?id=18439100 that points to why this might be the case
[+] [-] aviramha|3 years ago|reply