For Mahout 0.7, the format of the model files for LDA are just a
SequenceFile<IntWritable, VectorWritable>, with the row numbers being the
topicIds, and the entries being the (un-normalized) probabilities for each
termId.

bin/vectordump --dictionary <path to dictionary file> \
                         --dictioanryType <either text or sequencefile> \
                         --input <path to model files> \
                         --vectorSize <num entries per topic you want to
see> \
                         --sortVectors


On Wed, Oct 17, 2012 at 10:11 PM, vineeth <[email protected]> wrote:

> Hello,
>
> I am seeing from this website http://theglassicon.com/**
> computing/machine-learning/**running-lda-algorithm-mahout<http://theglassicon.com/computing/machine-learning/running-lda-algorithm-mahout>(Mahout
>  0.5). This website give the complete procedure to get probabilities
> of word and topics using LDA. However, these steps donot work on Mahout
> 0.7. Can some one give an updated website of the same steps?, or can some
> one provide me the alternative commands and parameters?
>
> Thank You
> Vineeth
>



-- 

  -jake

Reply via email to