Depending on your skills:
a) You can analyze space between boxes to identify words (if you want to
use box file)
b) You can parse tesseract hocr output (if you have no clue what is
hocr, search in this forum)
c) You can use C++/C API of tesseract to create your own output - have a
look at hocr
Please help me if anybody know the solution !!!
THANK YOU ..
On Sunday, March 3, 2013 12:32:32 PM UTC+5:30, SUBHADIP SINHA wrote:
>
> Hi,ALL
>
> I finally got the .box file with all characters coordinate from .png
> file,Now i want to group the charecters from the .box file with words
> and n
2 matches
Mail list logo