Text search lexer's handling of hyphens and negatives

2019-10-16 Thread Raymond Lu
We've been happily using pgsql to store user-generated documents for a while now. We also wanted to be able to search the documents so we tossed the document contents into a tsvector and did a pretty straightforward contents @@ phraseto_tsquery('simple', 'the query'). Our users have a lot of thing

Re: Text search lexer's handling of hyphens and negatives

2019-10-16 Thread Alan Hodgson
On Tue, 2019-10-15 at 20:34 -0700, raylu wrote: > On Tue, Oct 15, 2019 at 3:35 PM Alan Hodgson > wrote: > > My company has found the pg_trm extension to be more useful for > > partial text searches than the full text functions. I don't know > > specifically how it might help with your hyphens but

Re: Text search lexer's handling of hyphens and negatives

2019-10-16 Thread Daniel Verite
raylu wrote: > to_tsvector('simple', 'UVW-789-XYZ') is > 'uvw':1 '-789':2 'xyz':3 > because -789 is a negative integer. If we turn the query '789-XYZ' > into the tsquery as before, we get to_tsquery('simple', '789 <-> xyz') > which doesn't match it. > > Are we missing something here? Is t

Re: Text search lexer's handling of hyphens and negatives

2019-10-15 Thread raylu
On Tue, Oct 15, 2019 at 3:35 PM Alan Hodgson wrote: > My company has found the pg_trm extension to be more useful for partial text > searches than the full text functions. I don't know specifically how it might > help with your hyphens but it would be worth testing. The docs actually > suggest

Text search lexer's handling of hyphens and negatives

2019-10-15 Thread raylu
(I sent a similar message before subscribing to the list but it hasn't gone through yet, so sorry if you see a duplicate of this...) We've been happily using pgsql to store user-generated documents for a while now. We also wanted to be able to search the documents so we tossed the document content