[tesseract-ocr] “How do i extract specific fields text from different zoom in zoom out images. Previously i have cropped images by according specific region text using image coordinates. But problem i

2019-04-05 Thread Kailas Hambarde
“How do i extract specific fields text from different zoom in zoom out 
images. Previously i have cropped images by according specific region text 
using image coordinates. But problem is some images are zoom in or out and 
that changed the region coordinate. How do i handle this problem..? ”

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/6116f40e-6988-418f-92fa-c2a63d99cb78%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Not able to lift text from the attached file please help!!!

2019-04-05 Thread Shobhit Kapil
Hi, 

Thanks for the reply, i just want to elaborate what i am actually doing 
will give you a clean picture for a proper guidance!!!

I am using tesseract dll in windows and i am using the below code for 
engine and page segment mode

 private static TesseractEngine _engine;
private static TesseractEngine Engine
{
get
{
if (_engine == null || _engine.IsDisposed)
{
try
{
_engine = new 
TesseractEngine(Path.GetDirectoryName(System.Reflection.Assembly.GetExecutingAssembly().Location)
 
+ "\\tessdata", "eng", EngineMode.LstmOnly);
}
catch (Exception ex)
{

ex.Message.ToString();
}

}
return _engine;
}
}

then later i am initializing this engine for reading the text from image:

private string LiftText(Bitmap img)
{
string resultText = string.Empty;
try
{
TesseractEngine engine = new TesseractEngine("./tessdata", 
"eng", EngineMode.LstmOnly);
Tesseract.Page mypage = engine.Process(img, 
PageSegMode.SparseText);

resultText = mypage.GetText();
mypage.Dispose();
engine.Dispose();
}
catch (Exception ex)
{
Exceptioninfo("LiftText()", ex.StackTrace);
}

return resultText;
}


On Wednesday, April 3, 2019 at 7:58:14 PM UTC+5:30, Shobhit Kapil wrote:
>
> Hi Team,
>
> i am using Tesseract version 4 and using page segment mode is LSTM, so 
> with that i am not able to lift text properly from the attached file, 
> please let me know what extra things need to do for this sort of files.
> i have posted multiple concerns regarding Tesseract issues but none of 
> them answered hoping to have some answers this time.
>
> Team please help!!!
>
> Thanks,
> Shobhit 
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/29ac0d7e-5333-4f7d-83de-1adf917b4761%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] How to train tesseract with new script?

2019-04-05 Thread Soumik Ranjan Dasgupta
If you have a font of the said script alphabet, yes, I think it is
possible.

On Thu, Apr 4, 2019, 11:01 PM Moni  wrote:

> Hi all
> I am planning to train the ancient scripts for language translation. Is
> there any alternate rather than amazon mechanical turk to train the
> character? in stroke format. Or else have to  train manually??
>
> Thanks for taking time off your busy schedule...
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/CAGMnXaKf4D22zsN2S7yyPv%3DijgCBwhaqG3k3LofW_jAn9O06og%40mail.gmail.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAB_aDAerXzJbJP70ZtCfLCLuOr6ovuj6Dq65yJy9baZMfkjH6A%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Training Tesseract 4 from Scratch

2019-04-05 Thread Shobhit Kapil
Hi ,

Before starting this training process i would like to know a bit about the 
process

1. i have files which are not very clear and have different sort of noises 
will the training will be helpful in such scenarios.
2. Character are not reading correctly i.e. most of the time 5 is reading 
as S and Z is reading as 2 and i as !, so will this be covered in training.

Thanks,
Shobhit

On Wednesday, April 3, 2019 at 8:16:36 PM UTC+5:30, Shobhit Kapil wrote:
>
> Hi Team,
>
> I am not at all aware of training tesseract 4, is there any way that how 
> to learn train tesseract 4.
> By reading the document also i am not getting from where to start and what 
> to start.
>
> Thanks,
> Shobhit 
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/8b10d666-8e85-4bdb-83d3-740443cf1820%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Tesseract on VS

2019-04-05 Thread Shobhit Kapil
Hi All,

Is there anyone who is using Tesseract on windows using Visual Studio.

If yes i will be having few questions to ask.


Thanks,
Shobhit

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/0c0d74bc-3683-43b6-990b-202bbd585937%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.