Many thanks for the information, Johannes!

I have played with *textord_max_noise_size *and it turned out that noise
in my particular case is not removed even when I set
*textord_max_noise_size=45*. Above that value almost all other
characters have been considered as noise.

However *textord_heavy_nr=1* worked well for me. It looks like this very
setting works on its own and does not depend on values for other
settings mentioned.

On 30.05.2013 9:11, Johannes Richter wrote:
> The parameter i meant is  "*textord_max_noise_size*" and it defines
> the maximum size of noise in pixels. You could also try the one you
> have found in the list "*textord_heavy_nr*".
>
> "Opening and Closing Operators" are morphological operators. I
> searched Wikipedia fo a nice example, but the english version is only
> a stub.
> In your case the opening-operation is the way to go. Many image
> processing frameworks include morphological operations. If your
> software does not provide a opening operator look for *erosion* and
> *dilation*.(opening is just a erosion followed by dilation)
>
> I made a quick example in gimp.
> the picture "before.png" shows my object (the circle) with some noise
> i want to remove. I executed the erosion operation on this picture
> with a proper filter mask. The result is in picture "after
> erosion.png". The circle has changed in size (and shape). As last step
> i executed the dilation operation in gimp. The resulting image "after
> dilation.png" shows only the circle.
>
> Depending on your objects and noise you need to choose a proper filter
> mask for this operations. This operation will change the shape of your
> characters slightly.

-- 
With best regards,
Dmitry

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to