top | item 33276920

polarix | 3 years ago

This has been available for a while, but it's great to see some acknowledgement, especially since the most recent dataset was stuck at 2019 for so long.

Here are the datasets: http://download.kiwix.org/zim/stack_exchange/

It's not clear to me why the dataset shrank from 134G to 75G between 2019/3 and 2022/6. Was something excluded, or did compression improve?

> stackoverflow.com_en_all_2019-02.zim 2019-03-12 19:53 134G

> stackoverflow.com_en_all_2022-05.zim 2022-06-17 12:36 75G

FinnLeSueur|3 years ago

The article states:

> ... to ensure that an up-to-date version of our dataset is easily available for those who need it, and will work to improve its readability and reduce its size so there is less friction for end users...