top | item 43816820

(no title)

audiodude | 10 months ago

Volunteer for Kiwix here (https://kiwix.org), we do a lot of offline Wikipedia stuff. I've personally worked on MWOffliner (https://github.com/openzim/mwoffliner) which scrapes MediaWikis, primarily Wikipedia.

We have apps for basically every platform. Our PWA even supports IE 11!

You can use the WP1 tool which I'm the primary maintainer of (https://wp1.openzim.org/#/selections/user) to create "selections" which let you have your own custom version of Wikipedia, using categories that you define, WikiProjects, or even custom SPARQL queries.

discuss

order

strofocles|10 months ago

May I suggest somebody out of your company reviews the website. It is not clear to me what you do, what the apps do and so on. The copy is also kind of abstract "we make the world a better place" type of copy. From your comment I understand you do good work and would be a shame for people new to your products to struggle understanding what are you doing.

bcraven|10 months ago

I don't agree with your assessment. Did you find the 'About Us' page insufficient?

freedomben|10 months ago

Neat, thanks! I'm CTO of Ameelio (non-profit) and have been eyeing Kiwix for awhile. Getting content to incarcerated people is a unique challenge due to the exceptional security requirements, and an offline solution like kiwix might fit in well. Being able to narrow down categories is a huge capability for us. Thank you!

gehwartzen|10 months ago

Just wanted to comment on what a great mission Ameelio seems to have! Glad you guys are helping some of the most unseen in our society. Kudos!

benoitberaud|10 months ago

Feel free to reach to us (Kiwix), we've already helped NGO deploy our content to prisons for the exact reason you mention.

hoseja|10 months ago

I had an offline copy of wikipedia from like five years ago, just in case. When I recently needed it I opened the kiwix app and everything was broken by some godforsaken overhaul update. I don't have an offline copy of wikipedia on my new phone anymore.

ForOldHack|10 months ago

Does archive.org have a mirror of the iso?

yreg|10 months ago

Regarding mwoffliner: Why scrape Wikipedia when you can just download a dump?

detaro|10 months ago

If you want to test Mediawiki tooling, wikipedia is good test target, because it uses a lot of the features (unsurprisingly), compared to smaller wikis. (OTOH, the latter often have custom extensions, so it's not quite enough)

dal|10 months ago

I was thinking the same. It must take much less space in database form than all the html pages.

flipgimble|10 months ago

If I'm reading this right, the last full zim archive of all of english wikipedia is wikipedia_en_all_maxi_2024-01.zim which is now about 16 months old. Is that right, or is are another more recent sources?

The current US administration is actively trying to interfere with Wikipedia and censor public speech or information that is detrimental to their disinformation campaign. [1]

Do you know if there is an effort to publish more recent archives ? Or do you have any advice how outside developers could jump in to help with that project?

[1] - https://news.ycombinator.com/item?id=43799302

prepperdisk|10 months ago

Kiwix team is close on this, it’s even a partnership directly with Wikipedia to work on the newer APIs and function reliably.

themadturk|10 months ago

Is there concern for AI-produced slop in Wikipedia? I have the 2024-01 version which may be out of date, but may also have less slop.

avni5|10 months ago

[deleted]