Created https://issues.apache.org/jira/browse/LUCENE-7546 where further
discussion can continue.  FYI you can grab the wikipedia dump at this URL
now:
http://home.apache.org/~dsmiley/data/enwiki-20070527-pages-articles.xml.bz2

On Tue, Nov 8, 2016 at 9:53 AM David Smiley <[email protected]>
wrote:

> (CC'ing Grant)
>
> I remember hearing something about people.apache.org getting migrated to
> home.apache.org:
>
> https://mail-archives.apache.org/mod_mbox/openoffice-dev/201511.mbox/%[email protected]%3E
> I've found it difficult to find public information on this; it's more
> indirect shares that individuals in specific projects (like OpenOffice
> here) did.  In essence, I learned that a lot of content simply moved to the
> new server but some of the largest files were not copied.  This one did
> not; I checked.
>
> I have a copy of this file -- 2.69GB.  I'll upload it to my account in
> home.apache.org.  It's not clear there is a better space for it; it's
> large for the intended use of home.apache.org.  I'll also update
> lucene/benchmark/build.xml to reference the new URL.
>
> ~ David
>
> On Mon, Nov 7, 2016 at 6:46 PM Timothy Rodriguez (BLOOMBERG/ 120 PARK) <
> [email protected]> wrote:
>
> Anyone else having problems retrieving the test wikipedia set used for
> benchmarks? It looks like the resource is no longer available. When I run
> ant enwiki I receive the following:
>
> [get] Error opening connection java.io.FileNotFoundException:
> http://people.apache.org/~gsingers/wikipedia/enwiki-
> 20070527-pages-articles.xml.bz2
>
> I've tried the link directly from a browser and it looks like it's moved.
> Is there a mirror someplace?
>
> -Tim
>
> --
> Lucene/Solr Search Committer, Consultant, Developer, Author, Speaker
> LinkedIn: http://linkedin.com/in/davidwsmiley | Book:
> http://www.solrenterprisesearchserver.com
>
-- 
Lucene/Solr Search Committer, Consultant, Developer, Author, Speaker
LinkedIn: http://linkedin.com/in/davidwsmiley | Book:
http://www.solrenterprisesearchserver.com

Reply via email to