(no title)
boardwaalk | 3 years ago
As one just example, I searched for a unique error message in code that exists on GitHub, is in a fairly popular repo, and is not new and Google just could not find it. That seems like a very basic failure.
boardwaalk | 3 years ago
As one just example, I searched for a unique error message in code that exists on GitHub, is in a fairly popular repo, and is not new and Google just could not find it. That seems like a very basic failure.
bane|3 years ago
Every single returned link after the first 3 had the superstring version of the name and not the correct name. It turns out that this returns endless results for a fairly well known singer, not my friend.
So now did I not get the results I was looking for, I got tons of results that were objectively wrong.
Then suddenly, about 6 pages into those results, I started getting ones for the correct last name, but now the first name is a mess.
This happened on Google, DDG, Baidu, Sogou, Haosou, Dogpile, the current Yahoo search, Bing, and to some extent on Yandex. Naver was worse, Daum totally worthless with incorrect results.
Utterly worthless.
The thing is, my friend's name is surprisingly fairly unique, there's probably less than 20 people in the world with that specific name. It's like the search engine's desire to fill the screen with worthless garbage results has overpowered the need to supply the 2 or 3 that are actually correct, even if the quantity is a little disappointing.
giancarlostoro|3 years ago
asicsp|3 years ago
behringer|3 years ago
Denzel|3 years ago
To test your hypothesis, I did a basic search for exact matches on "we do not synchronize on the update of the broker node" and Google returned 2 search results in 240ms:
- https://github.com/a0x8o/kafka/blob/master/core/src/main/sca...
- https://jar-download.com/artifacts/org.apache.kafka/kafka_2....
Which contain exactly the source code from GitHub that I was looking for. You'll notice that the first result is actually a0x80's fork of apache/kafka. Google states that some entries very similar to the 2 already displayed were omitted, and I'm able to remove that filter. With that filter removed, I can see the same document indexed from apache/kafka on GitHub.
There's nothing I can do or promise directly, but I can assure you that Google takes the quality of our search results very seriously. If you believe we're not delivering quality results, I strongly encourage you to click that "Send Feedback" link at the bottom of your results so that our teams can act upon your feedback.
Disclosure: I work on Search at Google.
Disclaimer: The words, views, and opinions expressed in this post are my own. They are not representative nor do they represent my employer in any capacity.
jrvarela56|3 years ago
Congrats and hope Google takes advantage of HN, similar to how startups use this forum to engage with users - it is now a meme that Google Search is unusable so there must be something to learn from the audience.
I will use the send feedback button tomorrow as you suggest.
dpkirchner|3 years ago
I've definitely seen that sort of thing before but there is no such link there at the moment -- at least not when searching from my iPhone, whether or not I'm in desktop mode. I just see a large error box that says "It looks like there aren't many great matches for your search" followed by the link to the a0x80 fork.
By the way, the a0x80 result highlights a serious problem with search results: the GitHub URL is strangely modified. Instead of showing the full URL or even a prefix leading up to it Google is selecting parts of the URL, showing "https://github.com > src > transaction" on mobile and "https://github.com > kafka > coordinator > transaction" when I request the desktop site. In neither case is it obvious that the content isn't the canonical source from Apache. I've noticed this middle-out truncation for GH urls before but I'm not sure when it started.
sefrost|3 years ago
userbinator|3 years ago
Another extremely noticeable degradation is in finding part numbers, IC markings, service manuals (NOT the useless user manual), schematics, and the like. Anything that proponents of right-to-repair would be extremely interested in, to the extent that I wonder if there's been some sort of conscious effort being made by certain interests to eliminate or limit such information.
Then there's the niche-but-legal adult content. I won't go into too much detail about that, but suffice to say it used to be far easier to find.
It's been 5 years since this notorious item here, and I've only seen Google get worse: https://news.ycombinator.com/item?id=16153840
nus07|3 years ago
goldforever|3 years ago
[deleted]
saddd|3 years ago
simonw|3 years ago
https://github.com/robots.txt
That "/*/tree" rule means that search engine crawlers are allowed to hit the README file of a repo but effectively NONE of the other files in it.Which means that if you keep your project documentation on GitHub in a docs/ folder it won't be indexed!
You need to publish it to a separate site via GitHub Pages, or use https://readthedocs.org/
(Side note: I just noticed https://github.com/ekansa/Open-Context-Data is explicitly listed in the robots.txt for GitHub - the only repo that gets a mention like that. I'd love to know the story behind that!)
saurik|3 years ago
sebosp|3 years ago
Nuzzerino|3 years ago
papito|3 years ago
Pure speculation, but innovative companies at first, they started over-hiring and bloating, using questionable interviewing techniques (puzzles, Leetcode), taking on thousands of employees who were just there to game the system, coast, and collect the check.
It just looks like they stopped caring.
asddubs|3 years ago
fIREpOK|3 years ago
I have recently almost completely stopped using Google's search engine due to the fact that I am very often offered zero search results for simple queries (usually involving quotes though) .. It's so bad I can't even believe it.
Note: I've been a Google search since it started... Gmail since Beta, etc...
At one point, I thought that maybe they started punishing ad-block users excessively.
pjmlp|3 years ago
Now only we get tons of ads back as first results, Google keeps rewriting the queries for whatever "helpful" nonsense.
MuffinFlavored|3 years ago
I read this constantly here (echo chamber) and I can't help but feel it's a little biased/overdramatic.
dimator|3 years ago
MagicMoonlight|3 years ago
It’s time for competitors to start wiping them out.
pharmakom|3 years ago
karmakaze|3 years ago
bloodyplonker22|3 years ago
selimthegrim|3 years ago
trilbyglens|3 years ago
gonzo41|3 years ago