jeffspies | 9 years ago | on: Ask HN: How do you get notified about newest research papers in your field?
jeffspies's comments
jeffspies | 10 years ago | on: Ask HN: Is it possible to start a successful tech non-profit?
jeffspies | 10 years ago | on: The Center for Open Science
jeffspies | 10 years ago | on: The Center for Open Science
The DublinCore schema is quite widely used--many of the institutional repositories that we harvest from use it.
I know of Omeka--I've seen it used quite a bit. We should look into some sort of add-on connection with it and the OSF.
perma.cc is new to me, but our partnerships lead has been evaluating preservation services that we could enlist in archiving OSF content. I'll make sure to pass it along in case he hasn't seen it. We are going to great lengths to ensure our users that we care about long-term sustainability and that we see no value in lock-in.
Collecting Genomics looks very neat; I'll have to reach out.
And thanks for the note about the about link! I'll get a fix submitted tonight unless someone beats me to it.
jeffspies | 10 years ago | on: The Center for Open Science
Some info: we're mostly a web development shop, and everything we do is 100% free and open source. Our flagship product is the Open Science Framework (http://osf.io, http://github.com/CenterForOpenScience/osf.io). We are also creating a free, open data set of scholarly (meta)data (http://www.share-research.org/; http://github.com/CenterForOpenScience/SHARE). You might have seen us in the news recently for the Reproducibility Project: Psychology (http://www.sciencemag.org/content/349/6251/aac4716; http://osf.io/ezcuj). We're always looking for mission-driven developers. Feel free to get in touch.
jeffspies | 11 years ago | on: Ask HN: Who is hiring? (June 2014)
Developers - FULLTIME, INTERN
Statisticians - FULLTIME
We're a well-funded, non-profit tech startup; everything we develop is free and open source. Openness is inclusivity, and it's a driving principle in our organization.
We're hiring developers that love open source and want to change science by creating and connecting web-based tools that make being transparent easy for scientists--allowing them to remain or become more efficient in their daily workflow.
We're also hiring statisticians that love open tools, value reproducibility, and enjoy teaching and consulting on both.
We strongly encourage applications from members of groups underrepresented in science and technology industries.
http://share-research.org
So, if you want to see a reddit for research, better news feeds, etc., it is the SHARE dataset that can provide that data. SHARE won't build all those things--we want to facilitate others in doing so. You can contribute at
https://github.com/CenterForOpenScience/share
The tooling is all free open source, and we're just finishing up work on v2. You can see an example search page http://osf.io/share, currently using v1. Some more info on the problem and our approach....
What is SHARE doing?
SHARE is harvesting, (legally) scraping, and accepting data to aggregate into a free, open dataset. This is metadata about activity across the research lifecycle: publications and citations, funding information, data, materials, etc. We are using both automatic and manual, crowd-sourced curation interfaces to clean and enhance what is usually highly variable and inconsistent data. This dataset will facilitate metascience (science of science) and innovation in technology that currently can't take place because the data does not exist. To help foster the use of this data, SHARE is creating example interfaces (e.g., search, curation, dashboards) to demonstrate how this data can be used.
Why is SHARING doing it?
The metadata that SHARE is interested in is typically locked behind paywalls, licensing fees, restrictive terms of service and licenses, or a lack of APIs. This is the metadata that powers sites like Google Scholar, Web of Science, and Scopus--literature search and discovery tools that are critical to the research process but that are incredibly closed (and often incredibly expensive to access). This means that innovation is exclusive to major publishers or groups like Google but is otherwise stifled for everyone else. We don't see theses, dissertations, or startups proposing novel algorithms or interfaces for search and discovery because the barrier of entry in acquiring the data is too high.