Using Lucene as a recommendation engine.
On Sat, Oct 22, 2011 at 6:33 PM, Grant Ingersoll wrote:
>
> On Oct 22, 2011, at 6:03 PM, Sujit Pal wrote:
>
>> Hi Grant,
>>
>> Not sure if this qualifies as a "bet you didn't know", but one could use
>> Lucene term vectors to construct document vectors for
On Oct 22, 2011, at 6:03 PM, Sujit Pal wrote:
> Hi Grant,
>
> Not sure if this qualifies as a "bet you didn't know", but one could use
> Lucene term vectors to construct document vectors for similarity,
> clustering and classification tasks. I found this out recently (although
> I am probably no
On Oct 22, 2011, at 2:49 AM, Luca Rondanini wrote:
> I usually use Nutch for this but, just for fun, I tried to create a language
> identifier based on Lucene only.
Talking of which:
Google's Compact Language Detector
http://blog.mikemccandless.com/2011/10/language-detection-with-googles-compac
Hi Grant,
These are 2 cases into work i've done that I can think of:
-use Lucene to match products in a database with eBay auctions, the title
of the auction is used as the query to Lucene.
-use a servlet filter and Lucene to map well-formed URL's into a website
to it's individual (product) page
My use case is the following :
Given an n-dimensional vector ( only +ve quadrants/points ) find its
closest neighbours. I would like to try out with lucene's default
ranking. Here is how a typical document will look like :
( or same thing
)
doc1 = 1245:15 3490:20 8856:20 etc.
As reflected in th
Hi Grant,
Not sure if this qualifies as a "bet you didn't know", but one could use
Lucene term vectors to construct document vectors for similarity,
clustering and classification tasks. I found this out recently (although
I am probably not the first one), and I think this could be quite
useful.
-
Hmm, this is because as of 3.2.0 the default MergePolicy is now
TieredMergePolicy.
But: if you pass Version.LUCENE_31 when you create the
IndexWriterConfig you should get the old default (LogMergePolicy) and
then IW.setMergeFactor should work.
But it's better to use TieredMergePolicy (it's able t
Grant,
for years the ActiveMath learning environment has been using as storage engine.
At the time (~2004), it was by far the best storage engine ever doable in a
pure java-world.
Now it still is perfect in terms of performance.
We had an issue with the separate versions where the stored-fields w
Hi All,
I'm giving a talk at ApacheCon titled "Bet you didn't know Lucene can..."
(http://na11.apachecon.com/talks/18396). It's based on my observation, that
over the years, a number of us in the community have done some pretty cool
things using Lucene that don't fit under the core premise of
I think i'd try to use a bitset instead of a string for your
categories, is that possible? how many categories do you have roughly?
simon
On Sat, Oct 22, 2011 at 6:01 AM, Peyman Faratin wrote:
> Hi
>
> I have a field that is indexed as follows
>
> for(String c: article.getCategories()){
>
10 matches
Mail list logo