$ cat add_test.sh
DATA='
<add>
<doc>
<field name="docid">666</field>
<field name="titi_txt_fr">toto titi tata toto tutu titi</field>
</doc>
</add>
'
$ sh add_test.sh
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int
name="QTime">484</int></lst>
</response>
$ curl
'http://localhost:8983/solr/alian_test/terms?terms.fl=titi_txt_fr&terms.sort=index'
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int
name="QTime">0</int></lst><lst name="terms"><lst
name="titi_txt_fr"><int name="tata">1</int><int
name="titi">1</int><int name="toto">1</int><int
name="tutu">1</int></lst></lst>
</response>
So it's not only on Luke Side, it's come from Solr. Does it sound normal ?
2017-11-21 11:43 GMT+01:00 Barbet Alain <[email protected]>:
> Hi,
>
> I build a custom analyzer & setup it in solr, but doesn't work as I expect.
> I always get 1 as frequency for each word even if it's present
> multiple time in the text.
>
> So I try with default analyzer & find same behavior:
> My schema
>
> <fieldType name="text_ami" class="solr.TextField">
> <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
> </fieldType>
> <field name="docid" type="string" indexed="true" required="true"
> stored="true"/>
> <field name="test_text" type="nametext"/>
>
> alian@yoda:~/solr> cat add_test.sh
> DATA='
> <add>
> <doc>
> <field name="docid">666</field>
> <field name="test_text">toto titi tata toto tutu titi</field>
> </doc>
> </add>
> '
> curl -X POST -H 'Content-Type: text/xml'
> 'http://localhost:8983/solr/alian_test/update?commit=true'
> --data-binary "$DATA"
>
> When I test in solr interface / analyze, I find the right behavior
> (find titi & toto 2 times).
> But when I look in solr index with Luke or solr interface / schema,
> the top term always get 1 as frequency. Can someone give me the thing
> I forget ?
>
> (solr 6.5)
>
> Thank you !