Hello Zdenko,

Thank you for that tip. Yes, I am extremely interested in using Leptonica functions directly, especially if they are expected to run faster. But I am almost illiterate in C.

I have the precompiled Leptonica DLLs; they are called *liblept-5.dll (7969 KB)* or *pvt.cppan.demo.danbloomberg.leptonica-1.74.4.dll (1681 KB)*. I tried finding the function entry points with InterOpSignatureToolkit, which can generate a .NET wrapper signature, but it fails to load those DLLs, saying there is no assembly manifest.

How can I use Leptonica from the command line, or call the DLLs from .NET?

Any tips much appreciated.

Hakan

On Thursday, January 11, 2018 at 5:16:15 PM UTC+3, zdenop wrote:
>
> If you need to detect just orientation it should be faster to use only
> leptonica functions. See
> https://tpgit.github.io/Leptonica/flipdetect_8c_source.html
> http://tpgit.github.io/Leptonica/skew_8c_source.html
>
> Zdenko
>
> On Thu, Jan 11, 2018 at 12:36 PM, Hakan Usakli <hakan....@gmail.com> wrote:
>
>> In case it helps someone:
>> Yes, there is a way to change the behaviour of 'minimum number of
>> characters'. I struggled with the same problem for a while as well.
>>
>> In this file,
>> https://github.com/tesseract-ocr/tesseract/blob/master/ccmain/osdetect.cpp
>> change the value of this constant to something like 5. Recompile and you
>> are done.
>>
>> const int kMinCharactersToTry = 50;
>>
>> I have asked the developers to make that internal constant a command
>> line setting. If/when they will do it, I don't know.
>> Enjoy
>> Hakan
>>
>> On Saturday, April 19, 2014 at 8:13:25 PM UTC+3, Chris Nevin wrote:
>>>
>>> Hello,
>>>
>>> I am having some trouble getting Tesseract to recognize individual
>>> characters. Whenever I think I have overcome actual errors, I get the line
>>> "Too few characters. Skipping this page"
>>>
>>> Because I am using Tess4J I have been struggling to find out exactly
>>> what all of the different options you can set for Tesseract actually are.
>>> Would anyone be able to tell me if there is a way to set it to not limit
>>> the minimum number of characters on a page?
>>>
>>> Also, I am trying to get Tesseract to recognise characters from chemical
>>> elements (example attached.) Will Tesseract be able to ignore the structure
>>> and just pick up on the characters?
>>>
>>> Basically any advice as to what would be a good way to go about this
>>> would be helpful! Even if I should look at training Tesseract or creating a
>>> word list with the chemical elements or something?
>>>
>>> Thanks a lot!
>>>
>>> Chris
>>>

-- 
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/12c74555-b011-40e3-9816-6e47db55671e%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.