I suppose tricky plurals are handled well by the Hunspell stem filter:
https://solr.apache.org/guide/solr/latest/indexing-guide/filters.html#hunspell-stem-filter

Medical terminology is a whole industry in itself. Perhaps you will be lucky
enough to get something from
https://www.nlm.nih.gov/research/umls/licensedcontent/umlsknowledgesources.html
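
For illustration only, here is a rough Python sketch of what the two synonym lines in the quoted message below would do to a query term. It is not Solr code: parse_synonyms and expand_query are made-up helper names, and the sketch only handles comma-separated equivalence lines, skipping "=>" mappings. Solr's own synonym filter does the real work at analysis time.

```python
from collections import defaultdict

# Sample rules taken from the quoted message; a real synonyms.txt would be longer.
SYNONYMS_TXT = """\
# comma-separated terms on one line are treated as equivalent
baby, babies
headaches, cephalgia
"""


def parse_synonyms(text):
    """Map each term to the full set of terms listed on the same line."""
    table = defaultdict(set)
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if "=>" in line:
            continue  # explicit mappings are out of scope for this sketch
        group = {t.strip().lower() for t in line.split(",") if t.strip()}
        for term in group:
            table[term] |= group
    return dict(table)


def expand_query(terms, table):
    """Return the query terms plus every equivalent, sorted for stable output."""
    expanded = set()
    for term in terms:
        key = term.lower()
        # Terms without a rule pass through unchanged.
        expanded |= table.get(key, {key})
    return sorted(expanded)


table = parse_synonyms(SYNONYMS_TXT)
print(expand_query(["cephalgia"], table))
```

With these two rules, a search for "cephalgia" would also match documents indexed with "headaches", and a term with no rule is left alone.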

On Wed, Feb 22, 2023 at 9:57 PM Matthew Castrigno <castr...@slhs.org> wrote:

> Yes, I realize a really good synonym list would be very specific to the
> site.
>
> I was wanting to start with some basic English words or even plurals that
> are spelled differently. I realize these are not really synonyms but the
> synonym.txt file is an expedient way of expanding results.
> ie
> baby, babies
>
> Our application is for clients; however, clients may put in specific
> medical terms. Those will however generate results as our indexed data
> tends to have specific medical terminology indexed. Cephalgia would be
> indexed for a document but it may not have headaches.
>
> headaches, cephalgia
>
> *Matthew Castrigno*
> *IHT Developer II*
> *St. Luke’s Health System*
> ( 208-859-4276
> + castr...@slhs.org <xxxxx...@slhs.org>
> ------------------------------
> *From:* Walter Underwood <wun...@wunderwood.org>
> *Sent:* Wednesday, February 22, 2023 11:44 AM
> *To:* users@solr.apache.org <users@solr.apache.org>
> *Cc:* Matthew Castrigno <castr...@slhs.org>
> *Subject:* Re: synonym file
>
> Sets of synonyms are almost always domain-specific. I needed a demo
> synonym file for a product once and ended up with UK/US equivalents, like
> lift/elevator. But even that wasn’t always a good idea. A customer that
> made solid fuel rocket motors was a little confused when they searched for
> “rocket fuel” and it suggested “arugula”.
>
> A medical synonym lexicon could be huge. And even then, is this for
> medical professionals or clients? Will it use contusion or bruise?
>
> wunder
> Walter Underwood
> https://observer.wunderwood.org/
>
> On Feb 22, 2023, at 11:48 AM, Matthew Castrigno <castr...@slhs.org> wrote:
>
> Hello,
>
> Is there a source for a general english synonym file?
> One for medical terms?
>
> Thank you,
>
> ------------------------------
> "This message is intended for the use of the person or entity to which it
> is addressed and may contain information that is confidential or
> privileged, the disclosure of which is governed by applicable law. If the
> reader of this message is not the intended recipient, you are hereby
> notified that any dissemination, distribution, or copying of this
> information is strictly prohibited. If you have received this message by
> error, please notify us immediately and destroy the related message."

--
Sincerely yours
Mikhail Khludnev
https://t.me/MUST_SEARCH
A caveat: Cyrillic!