IMO this is a problem in your code. Have a look at the tesseract code to see
how it handles spaces.
Here are the results for your image with the different OEM values:

> tesseract test_2020-11-03_122112048.png - --oem 0 -l fra
En votre aimable règlement,
Cordialement,

> tesseract test_2020-11-03_122112048.png - --oem 1 -l fra
En votre aimable règlement,
Cordialement,

> tesseract test_2020-11-03_122112048.png - --oem 2 -l fra
En votre aimable règlement,
Cordialement,
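
Since the command-line tool keeps the spaces in every OEM mode, the loss most likely happens where the calling code assembles the text. The original code is only available as screenshots, so the following is a sketch under an assumption: the output is built symbol by symbol and no separator is emitted at word or line boundaries. `Symbol` and `JoinSymbols` are hypothetical names that stand in for whatever the result iterator returns; the sketch does not call Tesseract itself. With the real API, the two flags would typically come from something like `IsAtBeginningOf(tesseract::RIL_WORD)` and `IsAtBeginningOf(tesseract::RIL_TEXTLINE)` on the iterator.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for one recognized symbol (not a Tesseract type).
struct Symbol {
    std::string text;   // UTF-8 text of the symbol
    bool starts_word;   // true if this is the first symbol of a word
    bool starts_line;   // true if this is the first symbol of a text line
};

// Concatenate symbols, inserting a newline at each new line and a space
// at each new word. Dropping these checks glues all words together.
std::string JoinSymbols(const std::vector<Symbol>& symbols) {
    std::string out;
    for (const Symbol& s : symbols) {
        if (!out.empty()) {
            if (s.starts_line) {
                out += '\n';
            } else if (s.starts_word) {
                out += ' ';
            }
        }
        out += s.text;
    }
    return out;
}
```

If the code instead asks the API for the whole page text in one call, this sketch does not apply and the cause must be elsewhere.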





Zdenko


On Tue, 3 Nov 2020 at 11:56, Enzo Merotto <louzo9...@gmail.com> wrote:

> We recently upgraded tesseract from 3.02 to 4.0 to improve performance
> and speed, but when we use the LSTM mode, we first get a warning about
> the dpi: "Invalid resolution 0 dpi. Using 70 instead". We know why this
> warning appears. I don't know whether the missing spaces are related to
> it.
> Look at this example, which is a French text:
> [image: CaptureText.PNG]
> The terminal shows the warning and the transcribed text without spaces.
> We expected:
> "En votre aimable règlement,
> Cordialement,"
>
> This is how we use tesseract:
> [image: CaptureCode1.PNG]
> [image: CaptureCode3.PNG][image: CaptureCode2.PNG]
> The image is a single-channel cv::Mat (CV_8UC1).
>
> Enzo Merotto
>
> On Tuesday, 3 November 2020 at 09:52:36 UTC+1, zdenop wrote:
>
>> Please provide a reproducible example: what you are doing, how you are
>> doing it, what the result is, and what result you want.
>>
>> Zdenko
>>
>>
>> On Tue, 3 Nov 2020 at 9:41, Enzo Merotto <louz...@gmail.com> wrote:
>>
>>> Hello,
>>> I have a problem with the LSTM mode: it does not detect spaces and
>>> merges all the words into one.
>>> Do you have any idea why it does not detect spaces?
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-oc...@googlegroups.com.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/41cb6003-55ad-43d3-b8da-699fae606625n%40googlegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/41cb6003-55ad-43d3-b8da-699fae606625n%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>

