the files decompress fine using Linux bzip2: $ time bzip2 --decompress --verbose --keep "enwiki-20200920-pages-articles-multistream1.xml-p1p41242.bz2" enwiki-20200920-pages-articles-multistream1.xml-p1p41242.bz2: done
real 2m22.089s user 2m6.664s sys 0m7.184s $ time bzip2 --decompress --verbose --keep "enwiki-20200920-pages-articles-multistream4.xml-p311330p558391.bz2" enwiki-20200920-pages-articles-multistream4.xml-p311330p558391.bz2: done real 3m48.505s user 3m21.632s sys 0m11.884s $ time bzip2 --decompress --verbose --keep "enwiki-20200920-pages-articles-multistream5.xml-p558392p958045.bz2" enwiki-20200920-pages-articles-multistream5.xml-p558392p958045.bz2: done real 4m24.744s user 3m48.576s sys 0m14.524s $ time bzip2 --decompress --verbose --keep "enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661.bz2" enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661.bz2: done real 4m41.823s user 4m6.836s sys 0m15.004s $ $ _IFL="enwiki-20200920-pages-articles-multistream1.xml-p1p41242" $ ls -l "${_IFL}" -r--r--r-- 1 lbrtchx lbrtchx 885035378 Oct 14 00:05 enwiki-20200920-pages-articles-multistream1.xml-p1p41242 $ file --brief "${_IFL}" HTML document, UTF-8 Unicode text, with very long lines $ _IFL="enwiki-20200920-pages-articles-multistream4.xml-p311330p558391" $ ls -l "${_IFL}" -r--r--r-- 1 lbrtchx lbrtchx 1453887824 Oct 14 00:05 enwiki-20200920-pages-articles-multistream4.xml-p311330p558391 $ file --brief "${_IFL}" HTML document, UTF-8 Unicode text, with very long lines $ _IFL="enwiki-20200920-pages-articles-multistream5.xml-p558392p958045" $ ls -l "${_IFL}" -r--r--r-- 1 lbrtchx lbrtchx 1615967954 Oct 14 00:05 enwiki-20200920-pages-articles-multistream5.xml-p558392p958045 $ file --brief "${_IFL}" HTML document, UTF-8 Unicode text, with very long lines $ _IFL="enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661" $ ls -l "${_IFL}" -r--r--r-- 1 lbrtchx lbrtchx 1718742545 Oct 14 00:06 enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661 $ file --brief "${_IFL}" HTML document, UTF-8 Unicode text, with very long lines --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org For additional commands, e-mail: dev-h...@commons.apache.org