Re: Using Lucene to index Wikipedia

2011-10-23 Thread Michael Sokolov
Daniel, since no one knowledgeable has answered I'll take a stab - there are a number of ant targets you can run, most of which incorporate some indexing step(s). Basically you can run: ant -Dtask.alg= it looks as if the ant build.xml is set up to run conf/micro-standard.alg by default, but

Using Lucene to index Wikipedia

2011-10-20 Thread Daniel Quach
How do I use the Lucene Benchmark to index a wikipedia dump? I want to be able to execute phrase queries on the latest english wikipedia page dump. I'm trying to look for example use cases but I haven't found any. I downloaded the latest english dump, named: enwiki-latest-pages-articles.xml.bz