I'm using contrib/benchmark to do some tests for my ApacheCon talk and have some questions.

1. In looking at micro-standard.alg, it seems like not all braces are closed. Is a line ending a separator too? 2. Is there anyway to dump out what params are supported by the various tasks? I am esp. uncertain on the Search related tasks. 3. Is there anyway to dump out the stats as a CSV file or something? Would I implement a Task for this? Ultimately, I want to be able to create a graph in Excel that shows tradeoffs between speed and memory. 4. Is there a way to set how many tabs occur between columns in the final report? They merge and buffer factors get hard to read for larger values. 5. Below is my "alg" file, any tips? What I am trying to do is show the tradeoffs of merge factor and max buffered and how it relates to memory and indexing time. I want to process all the documents in the Reuters benchmark collection, not the 2000 in the micro-standard. I don't want any pauses and for now I am happy doing things in serial. I think it is doing what I want, but am not 100% certain.

-----------  alg file --------

#last value is more than all the docs in reuters
merge.factor=mrg:10:100:1000:5000:10:10:10:10:100:1000
max.buffered=buf:10:10:10:10:100:1000:10000:21580:21580:21580
compound=true

analyzer=org.apache.lucene.analysis.standard.StandardAnalyzer
directory=FSDirectory
#directory=RamDirectory

doc.stored=true
doc.tokenized=true
doc.term.vector=false
doc.add.log.step=1000

docs.dir=reuters-out
#docs.dir=reuters-111

#doc.maker=org.apache.lucene.benchmark.byTask.feeds.SimpleDocMaker
doc.maker=org.apache.lucene.benchmark.byTask.feeds.ReutersDocMaker

#query.maker=org.apache.lucene.benchmark.byTask.feeds.SimpleQueryMaker
query.maker=org.apache.lucene.benchmark.byTask.feeds.ReutersQueryMaker

# task at this depth or less would print when they start
task.max.depth.log=2

log.queries=true
# ------------------------------------------------------------------------ -------------

{ "Rounds"

    ResetSystemErase

    { "Populate"
        CreateIndex
        { "MAddDocs" AddDoc > : 22000
        Optimize
        CloseIndex
    }

    OpenReader
    { "SearchSameRdr" Search > : 5000
    CloseReader

    { "WarmNewRdr" Warm > : 50

    { "SrchNewRdr" Search > : 500

    { "SrchTrvNewRdr" SearchTrav > : 300

    { "SrchTrvRetNewRdr" SearchTravRet > : 100

    NewRound

} : 10

RepSumByName
RepSumByPrefRound MAddDocs


Thanks,
Grant

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to