Sci-Bay: Google Scholar plus Sci-Hub

[+] bringtheaction|8 years ago|reply

Just tested it by searching for “spline”. This is great! Can someone elaborate on how it was made? Specifically, how it integrates with Google Scholar. Is that done client side or server side? If server side how come it hasn’t been blocked by Google seeing as Google don’t seem to like robots using their regular search function so my guess would be they wouldn’t like it for Scholar either. Perhaps it is proxying requests and would also pass on any CAPTCHAs presented? Still in that case I would expect all requests to get hit with a CAPTCHA. Perhaps it just hasn’t had enough traffic yet?

[+] samat|8 years ago|reply

This reminds me of Popcorn time so much.

Rightholders do not fear torrents as long as they are unusable for the general population.

The second they see something usable — they go berserk.

Gonna need some popcorn to watch this one.

[+] Vinnl|8 years ago|reply

Likewise, the traditional publishers often respond to demands by funders to make research available by e.g. allowing researchers to share their work elsewhere, and often only after a year or so after publication [0]. This makes the barrier to do so higher, and makes the research less findable. It's not odd to expect that when initiatives like Unpaywall [1] make that research more discoverable, things like embargo periods will get worse.

[0] https://medium.com/flockademic/how-open-can-open-access-be-c...

[1] https://unpaywall.org/

[+] lsh|8 years ago|reply

If sci-hub is going to scrape OA publishers, they could put in a bit more effort.

For example this (which sucks): https://sci-bay.org/article?link=https://www.ncbi.nlm.nih.go...

Versus the actual article: https://elifesciences.org/articles/24234

[+] n4r9|8 years ago|reply

I think the issue is with sci-bay rather than sci-hub. Searching sci-hub for the title of the article brings you to the second webpage you linked.

[+] unknown|8 years ago|reply

[deleted]

[+] agumonkey|8 years ago|reply

reminder that this went up not long ago https://whereisscihub.herokuapp.com

[+] Vinnl|8 years ago|reply

Thanks for sharing; note that I've moved that to a somewhat simpler URL: https://whereisscihub.now.sh/

I should also add that I am also working on a project to incentivising authors to make their work freely available: https://flockademic.com/

(More info here: https://medium.com/p/the-holy-grail-in-open-access-sharing-t... )

[+] bcaa7f3a8bbc|8 years ago|reply

> If sci-hub is going to scrape publishers, they could put in a bit more effort.

+1, especially for the Onion site. Onion service supposed to be a primary mean to host uncensored websites instead of having to look for the latest domain name everyday, unfortunately it seems nobody cares about it. Most of the time I access it from my browser, the front-end proxy was malfunctioning, or the back-end Tor daemon has dead... Tor network itself do have capacity problem, but they could do much better than a broken front-end proxy... e.g. with Onion Balance.

[+] jrochkind1|8 years ago|reply

Google Scholar definitely and intentionally offers no API.

I don't see this lasting long...

[+] gpm|8 years ago|reply

At a glance it looks like it's really just a proxy, that was limited to scholar.google.com and mutates the page slightly (adds a header, sci-hub links).

Does google generally block proxy servers?

[+] jrochkind1|8 years ago|reply

Home page currently says:

> See you later

> Too much attention is a bad thing, Sci-Bay decides to stop service for a while. Sorry.

Apparently I was not wrong.

This could be developed as a browser plugin that would be much harder or almost impossible for Google to prevent. Well, a Firefox browser plugin, a Chrome browser plugin presumably they wouldn't allow.

[+] matheusmoreira|8 years ago|reply

The page's HTML is the API. It's pretty easy to download a web page, parse the HTML and then extract specific bits of information from it. The browser does the same thing on the user's behalf, which is why it is called the user agent.

[+] danielecook|8 years ago|reply

Why is that? Seems like it would be really beneficial to the scientific community.

[+] mchannon|8 years ago|reply

google.com/scholar doesn’t work for you?

[+] moomin|8 years ago|reply

This is a clever mashup. Of course, if you want it to last a week, I'd be making some effort to distribute the source far and wide...

[+] PokemonNoGo|8 years ago|reply

I'm sorry but I don't see how it works?

>https://sci-bay.org/scholar?hl=en&as_sdt=0%2C5&q=entropy+sha...

-> Please show you're not a robot

[+] xstartup|8 years ago|reply

It works well, OT: Anyone knows how to remove the top header in this Scihub link:

https://sci-bay.org/article?link=https://pdfs.semanticschola...

[+] gpm|8 years ago|reply

Download the pdf, open it in your browser (or another pdf reader) directly.

[+] Myrmornis|8 years ago|reply

I'm 100% in favour of sci-hub.

However, note that they are very anarchic when it comes to commercial books, not just journal articles!

E.g. from the Sci-Bay search results, this is $131 on amazon.com, and quite possibly the authors do want the royalties.

[BOOK] Intelligent optimisation techniques: genetic algorithms, tabu search, simulated annealing and neural networks D Pham, D Karaboga - 2012 - books.google.com ... Cited by 916 Related articles All 3 versions [Download Book]

[+] gkya|8 years ago|reply

I believe people in academia are paid to write these books anyways, so they might as well not receive the royalties. As a prospective academic myself, I find it unethical.

[+] abhishekjha|8 years ago|reply

Is the website down? It just says "too much attention" caused it to shut down.

[+] petra|8 years ago|reply

How did you integrate scholar in sci-bay ? Does scholar have an API ?

And about the future, how do you see google responding?

[+] shakna|8 years ago|reply

I don't know how they are doing it, but Google Scholar does not have an API, and scraping is against their TOS.

> Don’t misuse our Services. For example, don’t interfere with our Services or try to access them using a method other than the interface and the instructions that we provide.

Despite this, there is scholar.py [0], which can extract files from Google Scholar, though it explicitly doesn't work around the rate limits.

[0] https://github.com/ckreibich/scholar.py

[+] s2th4d|8 years ago|reply

And it's down.

"See you later Too much attention is a bad thing, Sci-Bay decides to stop service for a while. Sorry. Anyone who knows how Sci-Bay works and wishes this tool benefits more academics, please contact: [email protected]"

[+] aysus|8 years ago|reply

BarOn, R. (1997). EQ-i Baron Emotional Quotient Inventory: A Measure of Emotional Intelligence : Technical Manual. Toronto, ON: MHS.

[+] jmnicholson|8 years ago|reply

How is this any better than using the sci-hub plugin?

[+] rkskejfj|8 years ago|reply

Beyond close vs. distant ties: Understanding post-service sharing of information with close, exchange, and hybrid ties

106 comments