Re: [GENERAL] Searching for "bare" letters

2011-10-03 Thread Andrew Sullivan
On Sun, Oct 02, 2011 at 05:45:48PM +0200, Reuven M. Lerner wrote: > quite grateful for that. (I really hadn't ever needed to deal with > such issues in the past, having worked mostly with English and > Hebrew, which don't have such accent marks.) That isn't quite true about English. We have word

Re: [GENERAL] Searching for "bare" letters

2011-10-02 Thread Eduardo Morras
At 01:25 02/10/2011, Reuven M. Lerner wrote: Hi, everyone. I'm working on a project on PostgreSQL 9.0 (soon to be upgraded to 9.1, given that we haven't yet launched). The project will involve numerous text fields containing English, Spanish, and Portuguese. Some of those text fields w

Re: [GENERAL] Searching for "bare" letters

2011-10-02 Thread Mike Christensen
>> I don't see the problem - you can have a dictionary, which does all work >> on recognizing bare letters and output several versions. Have you seen >> unaccent >> dictionary ? > > This seems to be the direction that everyone is suggesting, and I'm quite > grateful for that.  (I really hadn't ever

Re: [GENERAL] Searching for "bare" letters

2011-10-02 Thread Reuven M. Lerner
Hi, Oleg. You wrote: I don't see the problem - you can have a dictionary, which does all work on recognizing bare letters and output several versions. Have you seen unaccent dictionary ? This seems to be the direction that everyone is suggesting, and I'm quite grateful for that. (I really h

Re: [GENERAL] Searching for "bare" letters

2011-10-02 Thread Oleg Bartunov
I don't see the problem - you can have a dictionary, which does all work on recognizing bare letters and output several versions. Have you seen unaccent dictionary ? Oleg On Sun, 2 Oct 2011, Uwe Schroeder wrote: Hi, everyone. Uwe wrote: What kind of "client" are the users using? I assume yo

Re: [GENERAL] Searching for "bare" letters

2011-10-02 Thread Uwe Schroeder
> Hi, everyone. Uwe wrote: > > What kind of "client" are the users using? I assume you will have some > > kind of user interface. For me this is a typical job for a user > > interface. The number of letters with "equivalents" in different > > languages are extremely limited, so a simple matching

Re: [GENERAL] Searching for "bare" letters

2011-10-02 Thread hamann . w
Reuven M. Lerner wrote: >> Hi, everyone.  I'm working on a project on PostgreSQL 9.0 (soon >> to be upgraded to 9.1, given that we haven't yet launched).  The >> project will involve numerous text fields containing English, >> Spanish, and Portuguese.  Some of those text fiel

Re: [GENERAL] Searching for "bare" letters

2011-10-02 Thread Reuven M. Lerner
Hi, everyone. Uwe wrote: What kind of "client" are the users using? I assume you will have some kind of user interface. For me this is a typical job for a user interface. The number of letters with "equivalents" in different languages are extremely limited, so a simple matching routine in the

Re: [GENERAL] Searching for "bare" letters

2011-10-01 Thread Cody Caughlan
One approach would be to "normalize" all the text and search against that. That is, basically convert all non-ASCII characters to their equivalents. I've had to do this in Solr for searching for the exact reasons you've outlined: treat "ñ" as "n". Ditto for "ü" -> "u", "é" => "e", etc. This is

Re: [GENERAL] Searching for "bare" letters

2011-10-01 Thread Uwe Schroeder
> Hi, everyone. I'm working on a project on PostgreSQL 9.0 (soon to be > upgraded to 9.1, given that we haven't yet launched). The project will > involve numerous text fields containing English, Spanish, and Portuguese. > Some of those text fields will be searchable by the user. That's easy >

Re: [GENERAL] Searching for "bare" letters

2011-10-01 Thread planas
On Sun, 2011-10-02 at 01:25 +0200, Reuven M. Lerner wrote: > Hi, everyone. I'm working on a project on PostgreSQL 9.0 (soon to be > upgraded to 9.1, given that we haven't yet launched). The project > will involve numerous text fields containing English, Spanish, and > Portuguese. Some of those

[GENERAL] Searching for "bare" letters

2011-10-01 Thread Reuven M. Lerner
Hi, everyone.  I'm working on a project on PostgreSQL 9.0 (soon to be upgraded to 9.1, given that we haven't yet launched).  The project will involve numerous text fields containing English, Spanish, and Portuguese.  Some of those text fields will be searchable by