(no title)
Ninjaneered | 5 years ago
[1] https://meta.wikimedia.org/wiki/User:Denny
[2] https://en.wikipedia.org/wiki/Semantic_MediaWiki
Ninjaneered | 5 years ago
[1] https://meta.wikimedia.org/wiki/User:Denny
[2] https://en.wikipedia.org/wiki/Semantic_MediaWiki
xiler|5 years ago
https://research.google/people/vrandecic/
https://storage.googleapis.com/pub-tools-public-publication-...
O_H_E|5 years ago
And to all the other people doing awesome work but not on the top of HN.
gcbw3|5 years ago
Can the data be licensed as GPL-3 or similar?
9nGQluzmnq3M|5 years ago
Semantic Mediawiki (which I attempted to use at one point) is difficult to work with and far too complicated and abstract for the average Wiki editor. (See also Tim Berners-Lee and the failure of Semantic Web.)
WikiData is a seemingly genius concept -- turn all those boxes of data into a queryable database! -- kneecapped by academic but impractical technology choices (RDF/SPARQL). If they had just dumped the data into a relational database queryable by SQL, it would be far more accessible to developers and data scientists.
mmarx|5 years ago
Note that the internal data format used by Wikidata is _not_ RDF triples [0], and it's also highly non-relational, since every statement can be annotated by a set of property-value pairs; the full data set is available as a JSON dump. The RDF export (there's actually two, I'm referring to the full dump here) maps this to RDF by reifying statements as RDF nodes; if you wanted to end up with something queryable by SQL, you would also need to resort to reification – but then SPARQL is still the better choice of query language since it allows you to easily do path queries, whereas WITH RECURSIVE at the very least makes your SQL queries quite clunky.
[0] https://www.mediawiki.org/wiki/Wikibase/DataModel [1] https://www.mediawiki.org/wiki/Wikibase/Indexing/RDF_Dump_Fo...
zozbot234|5 years ago
LukeEF|5 years ago