Keith, Thank you so much for this. You are not alone. You have just clued me in. I am about ready to start my first training run. Then I saw this in my email box.
You may be a life saver for doing this. How are we supposed to know these things if the docs are not updated. After looking inside my own tesseract folder (I have tesseract 5.00 on Ubuntu 18.04), I don't even see the training subfolder that I expected to see. When you cloned the tesstrain repo, where did you place the tesstrain folder? Is it a subfolder inside of the ~/tesseract folder itself, or does it stand alone outside of the tesseract folder structure? Thanks again for doing this. I would have been going bonkers within a few days without the clue-in. Max Richey On Fri, Jan 8, 2021 at 8:35 AM Keith <kmo...@gmail.com> wrote: > Shree, > > Thank you for your reply. I should have gone to bed (it was like 2 AM my > time on a work night) instead of continuing to bang my head. > > When I saw your message this morning, I was thinking, "What tesstrain > folder? There's no tesstrain folder in the repo." Which was exactly when it > occurred to me that tesstrain is a separate repo and needs checked out > individually. > > All is well. It's working. > > The phrase "tesstrain" doesn't show up on any of the (4) Compiling and > Installation pages. There's lots of mention about installing the > dependencies to support training, but no mention about actually installing > it. > > Do you think that's worthy of filing an issue? > > I'm probably not the only bonehead out there. > > Thanks, > Keith > > On Fri, Jan 8, 2021 at 3:12 AM Shree Devi Kumar <shreesh...@gmail.com> > wrote: > >> >After placing the groundtruth files in a folder called >> *data/foo-ground-truth* inside the main *tesseract *repo folder, >> >> data/foo-ground-truth needs to be under the tesstrain folder not >> tesseract folder. >> >> You can use ground-truth in a different location, in that case you have >> to refer to it while calling make. >> >> On Fri, Jan 8, 2021 at 12:42 PM Keith M <kmo...@gmail.com> wrote: >> >>> I'm sure I'm making a beginner mistake here, but I'm struggling quite a >>> bit. >>> >>> I've built straight from source, both version 4.1.1 and 5.0.0 on Ubuntu >>> 18.04, and Ubuntu 20.04(fresh install, never used, but properly updated). >>> All exhibit the same behavior. I installed all the dependencies following >>> the build/installation guides. No error during the build that I can see. >>> >>> "make training" and "make training-install" both succeed when run >>> initially. Clearly it's building and finishing without error. >>> >>> At this point, all I'm trying to do is train using the example here: >>> >>> https://github.com/tesseract-ocr/tesstrain >>> >>> using groundtruth files. >>> >>> After placing the groundtruth files in a folder called >>> *data/foo-ground-truth* inside the main *tesseract *repo folder, I >>> unzip the .TIFs and .gt.txt's. >>> >>> When either "make training MODEL_NAME=foo" is run nothing happens. It >>> just returns almost instantly and does nothing. 4.1.1 goes through >>> directories and then says there's nothing that needs done. 5.0.0 reports >>> "make: Nothing to be done for 'training'." >>> >>> Also tried incanting as such " make training MODEL_NAME=<MODEL_NAME> >>> START_MODEL=eng PSM=7 TESSDATA=/usr/local/share/tessdata" >>> >>> Same result. >>> >>> I'm clearly doing something wrong here. I must not have the files in the >>> right directory. I've tried putting data/foo-ground-truth in the root, I >>> tried putting it in tessdata inside the root folder, I tried putting it in >>> /usr/local/share/tessdata. >>> >>> eng.trainneddata has been copied to the tessdata folder. >>> >>> There's something obvious I'm doing wrong, but heck if I can find it..... >>> >>> Help!@# >>> >>> Keith >>> >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to tesseract-ocr+unsubscr...@googlegroups.com. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/tesseract-ocr/6238fb08-4631-43f0-8e32-29ebb0c8c0f4n%40googlegroups.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/6238fb08-4631-43f0-8e32-29ebb0c8c0f4n%40googlegroups.com?utm_medium=email&utm_source=footer> >>> . >>> >> >> >> -- >> >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-ocr+unsubscr...@googlegroups.com. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXGkYsM-yr9%3Dbk_KBVrPZS%3DEUcdTHethmyF5Usc4BFnzw%40mail.gmail.com >> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXGkYsM-yr9%3Dbk_KBVrPZS%3DEUcdTHethmyF5Usc4BFnzw%40mail.gmail.com?utm_medium=email&utm_source=footer> >> . >> > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/CADEyXY-iXvvU6Nrwormy8oFBdp6a%3DZ%3DrAstCLdFuY%3DBqGt1XOw%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CADEyXY-iXvvU6Nrwormy8oFBdp6a%3DZ%3DrAstCLdFuY%3DBqGt1XOw%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAMHDbVoN8UaBMEcB7M86x0LsRzh8Ou-AJw7BNuGHJtJoO%3DAu%2BQ%40mail.gmail.com.