Tell HN: YC companies scrape GitHub activity, send spam emails to users
675 points| miki123211 | 3 days ago
I recently noticed that an YC company (Run ANywhere, W26) sent me the following email:
From: Aditya <aditya@buildrunanywhere.org>
Subject: Mikołaj, think you'd like this
[snip]
Hi Mikołaj,
I found your GitHub and thought you might like what we're building.
[snip]
I have also received a deluge of similar emails from another AI company, Voice.AI (doesn't seem to be YC affiliated). These emails indicate that those companies scrape people's Github activity, and if they notice users contributing to repos in their field of business, send marketing emails to those users without receiving their consent. My guess is that they use commit metadata for this purpose. This includes recipients under the GDPR (AKA me).
I've sent complaints to both organizations, no response so far.
I have just contacted both Github and YC Ethics on this issue, I'll update here if I get a response.
martinwoodward|3 days ago
The fundamental nature of Git makes this pretty easy for folks to scrape data from open source repositories. It's against our terms of service and those folks might want to talk with some lawyers about doing it - but as every Git commit contains your name and email address in the commit data it's not technically difficult even if it is unethical.
From the early days we've added features to help users anonymise their email addresses for commits posted to GitHub. Basically, you configure your local Git client to use your 'no-reply' email address in commits and that still links back to your GitHub account when you push: https://docs.github.com/en/account-and-profile/reference/ema...
I think that's still probably the best route. We want to keep open source data as open as possible, so I don't think locking down API's etc is the right route. We do throttle API requests and scraping traffic, but then again there have been plenty of posts here over the years from people annoyed at hitting those limits so it's definitely a balancing act. Love to know what folks here think though.
david_allison|3 days ago
This isn't my experience. I requested that you looked into a spammer in July 2025, you ignored my reply and the account is still active.
----
Thank you so much for the report. We're sorry to hear you're receiving unwanted emails, but it's always a possibility when your public contact information is listed on the web. You can keep your email address private if you wish by following the steps here:
Setting your commit email address
We do expect our users to comply with our Terms of Service, which prohibits transmitting using information from the GitHub (whether scraped, collected through our API, or obtained otherwise) for spamming purposes. I'm happy to look into it further to see if we can contact the reported user and let them know that this type of activity is not allowed.
Please let us know if you have any other questions or concerns.
----
My reply which was ignored:
----
I understand it will happen from time to time. I'd rather be contactable (I've received legitimate emails today because my email is on my profile).
Please take further action. My email is public with the expectation that the ToS will be enforced. If GitHub isn't discouraging spammers then it makes it much harder to justify being contactable.
All the best, David
retlehs|3 days ago
I even wrote about a specific example of a YC company spamming me from my GitHub email at https://benword.com/dont-tolerate-unsolicited-spam
koito17|3 days ago
It's one thing to offer anonymous e-mail addresses, but it's also awesome that GitHub can help prevent mistakes that would otherwise leak a user's e-mail address. I am not sure how many people try to be privacy conscious on GitHub, but I assume most users don't, so it's nice seeing this little feature exist.
ayhanfuat|3 days ago
skwashd|3 days ago
danesparza|3 days ago
How do I report that person, though? Your support page about reporting abuse assumes I know the person's Github account: https://docs.github.com/en/communities/maintaining-your-safe...
blobbers|3 days ago
I think it's pretty clear you need to use an anonymization scheme in the way commits are handled so that it links back to your github account and the email addresses are kept private.
Privacy centric companies like Apple do this for users offering hashed emails, on a per login basis.
I'm sure this would not work in a world of scraping, but having that kind of ability to figure out bad actors would be nice. You could require authenticated users for certain kinds of requests, and block user information from non-authenticated requests.
realityloop|2 days ago
Tonho<tonho@tonho.wtf>
Hey, I found your GitHub profile and thought you might find this useful.
I've been building Omniget, a desktop downloader that works with YouTube, Telegram, Udemy, Hotmart and 1000+ other sites. It's open source and built with Rust and Tauri.
The part I'm most proud of: you don't even need to open the app. Just press a hotkey and it grabs whatever video you're watching.
I've been working on this for a while now, even got an artist to design a mascot. I'm shaping the app based on feedback from people who actually use it, so if you have any thoughts I'd love to hear them.
Here's the repo: https://github.com/tonhowtf/omniget
Thanks for your time!
Tonho
AznHisoka|3 days ago
If someone wants to message someone, it goes through github notifications or github emails them
Also banning an account doesnt seem like a heavy punishment, given they can simply move to gitlab, bitbucket etc
just6979|3 days ago
I did a quick scan of the ToS and all I could find was D8 that states that autmated access (scraping) used for "AI" applies a reciprocal license that prevents the scraper from restricting GitHub's access to the data (the whole model? the weights?) resulting from the scraping.
This makes it sound like any model trained on GitHhub content cannot be commercialized, because charging for access to the output would be a "technical or other limit"... So you're obviously not really enforcing this, otherwise MS would be suing every big commercial model out there!
shawmakesmagic|2 days ago
nickphx|2 days ago
ericol|3 days ago
"What you are doing is against Github's TOS"
TheSaifurRahman|3 days ago
Foxboron|3 days ago
dent9|3 days ago
trympet|3 days ago
miki123211|3 days ago
blibble|3 days ago
kettle, pot, black?
I received the following offical spam last week from GitHub:
> Build AI agents with the new GitHub Copilot SDK
despite never granting consent for marketing material
(and yes, there's a GDPR complaint now working its way through the national regulator)
moomoo11|3 days ago
I will pay more for GitHub if you go hard on these mfs.
observationist|3 days ago
Mind fixing lucidrains account? Something happened without notice or recourse. He's one of, if not the most well known open source AI researchers on the planet, with implementations and explanations of papers and ideas that are wonderful. If you could bring some sanity to that situation and take it out of whatever kafkaesque account purgatory it fell into, you'd be doing the work of angels.
Thanks!
scottydelta|3 days ago
otherayden|3 days ago
cassonmars|3 days ago
wslh|3 days ago
ls-a|3 days ago
[deleted]
keiferski|3 days ago
Cold emailing rarely works by itself. Cold emailing developers via emails you pulled from their GitHub accounts? At that point, you're actively harming your brand, and may as well just send them spam diet pill ads.
RandallBrown|3 days ago
If it's obviously just a bot scraping emails and sending generic job requests, that's very different.
polishdude20|3 days ago
You searched for people who do what you need to have done, found me, looked at what I've worked on and determined I'd be a good fit and you reached out? That's the number one way to get me to want to work for you.
an0malous|3 days ago
unfunco|3 days ago
unfunco|3 days ago
foldr|3 days ago
armchairhacker|3 days ago
https://news.ycombinator.com/item?id=9332418 (11 years ago)
https://news.ycombinator.com/item?id=20660624 (7 years ago)
https://news.ycombinator.com/item?id=27855152 (5 years ago)
https://news.ycombinator.com/item?id=30900237 (4 years ago)
Seems it’s a reoccurring issue
cyann|3 days ago
From: henry@joincactuscompute.com
Hey,
I hope all is well with you, just reaching out as you seem to be interested in on-device speech models.
Cactus is a low-latency AI engine for consumer devices like phones, Macs, wearables, Raspberry Pis, etc.
We support transcription models like Whisper & Parakeet, benchmarks available in the attached GitHub repo.
GitHub: https://github.com/cactus-compute/cactus
We are keen to get your feedback, and star if feeling generous.
Thanks a million
ignoramous|3 days ago
A 419 scam?
mattpal21|2 days ago
elwebmaster|3 days ago
From: james@techglobal.website Quick note – your GitHub profile Hi X,
I came across your profile on GitHub. Given you're based in the US, I thought it might be relevant to reach out.
Profile:
I run a technical team (full-stack, cloud, DevOps) that delivers for clients. We're looking to work with an engineer based in the US on client-facing coordination—discovery, requirements, alignment—while we handle delivery. If that might be relevant, I'd be glad to set up a short call.
Regards, James
If I had to guess, "James" is a North Korean looking to scam US clients, based on my experience with shady actors.
max__dev|3 days ago
From: james@techglobal.website Brief note – Following up on your GitHub work
Hi ,
I came across your profile on GitHub. Given you're based in the US, I thought it might be relevant to reach out.
Profile:
I run a technical team (full-stack, cloud, DevOps) that delivers for clients. We're looking to work with an engineer based in the US on client-facing coordination—discovery, requirements, alignment—while we handle delivery. If that might be relevant, I'd be glad to set up a short call.
Best, James
vintagedave|3 days ago
bakugo|3 days ago
> I came across your GitHub profile and thought you might be interested in what my team and I are building. We're developing an open source SDK that runs LLMs directly on-device.
What's even more interesting is that both buildrunanywhere.org and runanywheresdk.com show a stock hostinger parking page when accessed in a browser. Something tells me they're intentionally registering these "alternate" domains specifically for spam, to avoid tanking the email reputation of their main runanywhere.ai domain.
I guess I shouldn't be surprised given YC is going all in on AI and most AI companies are no better than the crypto scammers of yesteryear, but still.
Imustaskforhelp|3 days ago
> Something tells me they're intentionally registering these "alternate" domains specifically for spam, to avoid tanking the email reputation of their main runanywhere.ai domain
This is a really bad look on them.
https://www.whatsmydns.net/domain-age?q=buildrunanywhere.org and https://www.whatsmydns.net/domain-age?q=runanywheresdk.com
Both these domain were registered only 36 days ago
Their main domain had been around for 6 month (216 days) tho:- https://www.whatsmydns.net/domain-age?q=runanywhere.ai
(I also couldn't see any post created by them on YC checking algolia from their website fwiw)
Seeing their star history on their product, I see some few interesting observations[0] Their star history was almost horizontal between december and february until it got vertical all of a sudden.
[0]:https://www.star-history.com/#runanywhere.ai/runanywhere.ai&...
I looked through their linkedin and found this website owned by them as well https://www.openclawpi.com/ and using the YC brand here as well. (registerered 26 days ago)
This website looks fairly AI generated to me as well and there are some bugs within the original website as well which I am now incredibly more unsure of if generated by AI or not given the similarities between the two websites UI/UX as well.
unknown|3 days ago
[deleted]
kristoff_it|3 days ago
AznHisoka|3 days ago
neya|3 days ago
neya|3 days ago
https://news.ycombinator.com/item?id=45357205
medi8r|3 days ago
unknown|3 days ago
[deleted]
ChrisMarshallNY|3 days ago
Hope they didn’t get too many folks.
nubinetwork|3 days ago
c16|3 days ago
efreak|3 days ago
dewey|3 days ago
progbits|3 days ago
tommoor|3 days ago
unknown|10 hours ago
[deleted]
callamdelaney|3 days ago
roryirvine|2 days ago
Honestly, I'd chalked it up as a one-off mistake by an inexperienced team (I had discussed my expected range in our first conversation, so found it odd that they they would waste their own time going through the hiring process only to make me such a stupidly low offer!)
So is this something that they're actually being advised to do? What's the intention behind it?
WhatsName|3 days ago
stevekemp|3 days ago
devmor|3 days ago
Goronmon|3 days ago
Regardless of any claims of having this, I would say this behavior aligns with what I have seen over the last couple decades. I'm more surprised that other people would expect anything different?
thinkingtoilet|3 days ago
buellerbueller|3 days ago
Rapzid|2 days ago
whalesalad|3 days ago
mbesto|3 days ago
Sorry but lol you must be new here.
ttul|3 days ago
dathinab|3 days ago
And them claiming "they didn't know" can be dismissed given that many dev on GH have location information set.
It also in general doesn't change anything. the law doesn't care if you know or didn't.
Startups starting out their journey by committing crime is always a grate sign for their trustability.
ChrisMarshallNY|3 days ago
Every day, I get deluged with hundreds of spam and scam emails, often because some knucklehead entered my email in a form (either accidentally, or as a throwaway red herring).
Maxious|3 days ago
> Some examples of ethical behavior we expect from founders are:
> - Not spamming members of the community
> To maintain our community, if we determine (in our sole discretion) that a founder has behaved unethically during or after YC, we will revoke their YC founder status. This includes access to all Y Combinator spaces, software, lists and events. All founders in a company may be held responsible for the unethical actions of a single co-founder or a company employee, depending on the circumstances.
mattpal21|2 days ago
""" Hi there!
I noticed you’re interested in on-device AI development and wanted to flag a new bounty program we just launched with Qualcomm. We’re looking for developers to build a local Android AI app (using the Nexa SDK). Since you're already exploring this space, I thought this might be an easy win for you.
The Bounty at a glance: - Prizes: $6,500 cash pool + Flagship Snapdragon devices. - Perks: Direct partnership & marketing spotlight from Qualcomm (huge for visibility) - The Ask: Build a working Android AI app that runs locally.
Registration is open now: https://sdk.nexa.ai/bounty
I want to help you win. Once you register, please reply to this email. I’d be happy to advise on your ideas to increase your chances of winning. Or, if you have an existing project, I can guide you on how to port it to NexaSDK for the submission
Best, Lynn @ Nexa AI """
mattpal21|2 days ago
""" Hi Matt,
I found your GitHub and thought you might like what we're building. We're developing an open source SDK that runs LLMs directly on-device.
We're getting about 45 tokens per second on iPhones, with support for Swift, Kotlin, React Native and Flutter. There's also a fully offline voice pipeline built in, so everything runs locally. We recently got into Y Combinator and are focused on expanding support to more edge devices and continuously improving performance.
If you're curious, here's the repo: github.com/RunanywhereAI/runanywhere-sdks
Feel free to reply to this email with any feedback or ideas you'd like to explore with on-device AI, or if you'd be interested in contributing. I'd love to hear your thoughts.
Best, Aditya """
Just to share the entire email, I think it's pretty well written, I went ahead and talked to the team, they were very curious and took my feedback regarding their flutter sdks very seriously, and they seem to be great people. Also, just an fyi, I tried their sdks, it's great! and I've been loving their apps as well.
I think their team is great, and I asked them for adding the rag implementation, they did it in less than week and it's pretty impressive. I think it's worth checking it out, It's easier to demean someone in public like that but might be worth checking.
coffeecoders|3 days ago
I don’t engage. I mark as spam, block the sender/domain, and move on.
theturtletalks|3 days ago
suyash|3 days ago
ergocoder|3 days ago
csense|3 days ago
I feel like if you don't want companies to cold-email you, you shouldn't make your email public. Github provides noreply email addresses for this purpose.
heikkilevanto|3 days ago
I find it interesting that some fucking spammers think that just because they found out my email somewhere, they should be allowed to waste my time and resources for their shit.
That is explicitly illegal here in EU. Unless I have clearly given you my consent, you are not allowed to spam me. Is informed consent really such a difficult concept to understand?
16B5775dTgZ|3 days ago
Scraping emails is also against the GitHub terms of service.
If you don't know what jurisdiction the owner of the email address resides in, it may also be illegal.
So whether it is scraping emails off a website or finding yourself on a private island with beautiful people "made available" to you, "consent" requires more than just having access.
scosman|3 days ago
And they are using a different domain for the emails so the spam markers don’t hit their primary domain.
pscanf|3 days ago
You mention GDPR, which also "applies" to me, though I wonder if what they're doing is actually illegal. I mean, after all, I'm putting my email on GitHub precisely to give people a way to contact me.
Of course, I do that naïvely, assuming good faith, not expecting _companies_ to use it to spam me. So definitely what they're doing is, at the very least, in poor taste.
notpushkin|3 days ago
They’re not only looking at the public email in your profile, they’re also looking at your committer email (git config user.email). You could argue that you’re not putting that out for people to contact you.
(I’ve used that trick a couple times to reach out to people, too, but never mass emailing.)
zvqcMMV6Zcr|3 days ago
victorbjorklund|3 days ago
EdNutting|3 days ago
I sometimes use different git/GitHub addresses depending on who I'm working for or specific projects so I can more accurately detect where data is being scraped from.
EdNutting|3 days ago
insane_dreamer|3 days ago
Side note but the trick I learned, at least with gMail is not to delete the email (which doesn't prevent you from getting new ones), or even reporting as spam (which may or may not work), but instead dragging it into the Promotions tab, into which all future emails from that email address will automatically go. Promotions tab then acts as your Trash.
The quickest way to get me to never do business with you is to send me spam.
oefrha|3 days ago
mustaphah|3 days ago
I immediately realize it's engagement farming + free labor. I said "No thanks."
Got this reply: "(...) I'm looking forward to reviewing your PRs. Feel free to share me any of your questions. (...)"
Apparently, no one read my reply - not even AI. They are automating this shit. It's sad that many fall for it (check their Github repo)
---
Company: Aden (W20)
Contact: Vincent Jiang, Founder
Github: https://github.com/aden-hive/hive
tock|2 days ago
lordgrenville|3 days ago
darknavi|3 days ago
edelbitter|3 days ago
Kernel guidelines now have a more verbose section about tagging: https://www.kernel.org/doc/html/latest/process/submitting-pa...
haute_cuisine|3 days ago
ktm5j|3 days ago
6thbit|3 days ago
They have this other thing where they reject pushes for the 'known' emails you've told them you have, but kinda seems there should be a setting to do that for any email that is not your noreply private one. is that a feasible thing to ask for?
cperciva|3 days ago
Of course, there's nothing stopping you from using a git-only email address (nospam-6thbit@yourdomain) and routing that to /dev/null. GitHub can't change email addresses, but you can.
arcfour|3 days ago
agcat|2 days ago
jazzpush2|3 days ago
I did YC and now work at a frontier lab.
I've received multiple spam-style emails from (mostly young) current founders tagging me and all other YC-alum at my place-of-work with the profiles of their friends for internship roles, referrals, etc.. Same girl has done it for like 5 different people.
b8|3 days ago
danbrooks|3 days ago
ttoinou|3 days ago
jonathanlydall|3 days ago
GitHub hides the emails on their web UI, but nothing stops people from pulling the repository with a Git client and looking at the emails in the commit log after doing so.
arcfour|3 days ago
LeoPanthera|3 days ago
sieep|2 days ago
ting0|3 days ago
buellerbueller|3 days ago
jedberg|3 days ago
pmdr|3 days ago
scirob|3 days ago
unknown|2 days ago
[deleted]
unknown|3 days ago
[deleted]
apparent|3 days ago
They're literally hurting their own brand, as well as YC's.
rlaabs|3 days ago
j16sdiz|3 days ago
This is not GitHub only, I have got a survey on how my experience interacting with folks on lkml
oldned|2 days ago
something GitHub can do: - offer an email address like [spam@github.com] so we can easily forward suspect TOS violations.
outloudvi|3 days ago
These providers are the only ones that care about their reputation and thus may take some action. Investors? Nope.
john_strinlai|3 days ago
in this example, the email came from buildrunanywhere.org, which is just a parked domain. the real domain is runanywhere.ai, which they arent using for spam.
so, once buildrunanywhere.org has their reputation burned from reports, they will simply register buildrunanywheres.org and start spamming again.
dariubs|2 days ago
malmeloo|3 days ago
They're getting more aggressive at it too. Just yesterday I received an email from Alignerr (not YC affiliated I think) saying that my sign-up was complete and cheerfully welcoming me to their platform. I had never even heard of them. An automated "job opportunity!" email didn't arrive until 3 hours later, but by then I had already directed some angry words towards their support email.
Other, even less respectable projects are also regularly enrolling my GitHub projects into their platforms, and I have to actively reach out to them to remove it.
I'm so tired of this man. Can someone go and take away these organizations' ability to send emails?
nektro|2 days ago
dukeofharen|2 days ago
jacquesm|3 days ago
unknown|2 days ago
[deleted]
vexorkai|22 hours ago
nprateem|3 days ago
ellieh|2 days ago
suprjami|3 days ago
If you're lonely just upload a few AI keywords to a repo. You'll get emails forever.
axegon_|3 days ago
dagi3d|3 days ago
tom_m|3 days ago
rodrigodlu|3 days ago
And I use a different email fromy priority email for GitHub commits since 4 years ago.
So just stop with marketing slop please.
Yes, I work with AI, and I'm becoming pretty good at it.
But this doesn't mean I'm comfortable pushing AI slop into potential users and customers.
I (and they) want to use AI to facilitate their processes, not to ingest slop content.
davidcollantes|2 days ago
koakuma-chan|3 days ago
lyu07282|3 days ago
Rapzid|2 days ago
hmokiguess|3 days ago
alexchantavy|3 days ago
HN is deeply skeptical, technical, cynical, sarcastic. It's a great place to learn new things and I've loved it since I found it in 2012.
The current startup climate (not just limited to YC) feels very AI bro YEAHHLETSFUCKINGGOO (and I say this as a founder myself having gone through YC recently in W25).
idoxer|3 days ago
JOHN34567|2 days ago
[deleted]
JOHN3456|2 days ago
[deleted]
NimrodKramer|2 days ago
[deleted]
mattpal21|2 days ago
[deleted]
atfzl|3 days ago
[deleted]
speedgoose|3 days ago
bilekas|3 days ago
RobotToaster|3 days ago
ValentineC|3 days ago
There are likely marketing email datasets floating around the internet that contain email addresses scraped from commit metadata.
I use a catchall with a specific Git client (not GitHub) email address, and found spam and phishing emails being sent there quite a few times.
input_sh|3 days ago