Here is an example of the sample image. I believe we are using the legacy 
engine. Does this help?

On Saturday, October 31, 2020 at 11:15:46 PM UTC-4 shree wrote:

> >When we use tesseract on the images without the trained language we 
> receive outputs that are accurate about 50% of the time. 
>
> You haven't shared a sample image. Sometimes preprocessing the images, 
> using a whitelist in case of limited character set can be the solution 
> rather than training.
>
> On Sun, Nov 1, 2020, 03:29 Cailey McVay <cailey.m...@dartmouth.edu> wrote:
>
>> Hello!
>> I am working on a project that is trying to read borehole video depths. 
>> We trained a new language to read these numbers called NTS. When we use 
>> tesseract on the images without the trained language we receive outputs 
>> that are accurate about 50% of the time. However when we use the new 
>> language, we receive no output at all. Is it possible that we overtrained 
>> tesseract to not recognize any of the images? I will attach below our box 
>> file, unicharset file, box trained file, pffmtable file, and normproto 
>> file. Our shapetable file processes but then returns an empty file. Could 
>> something be wrong with our shapetable? And if so, how could we fix that?
>>
>> Box File for the first five images:
>> 0 3 1 14 19 0
>> 9 18 0 29 20 0
>> 3 33 1 46 19 0
>> . 50 1 56 19 0
>> 2 64 1 75 19 0
>> 5 76 1 93 19 0
>> 2 92 1 111 19 0
>> 0 4 1 15 19 1
>> 8 19 1 30 19 1
>> 3 34 1 46 19 1
>> . 54 1 57 5 1
>> 4 65 1 77 19 1
>> 1 82 1 91 19 1
>> 4 96 1 107 19 1
>> 0 3 1 15 19 2
>> 8 19 1 30 19 2
>> 6 34 1 46 19 2
>> . 53 1 57 5 2
>> 8 65 1 77 19 2
>> 3 80 1 91 19 2
>> 9 95 1 107 19 2
>> 0 4 1 15 19 3
>> 8 17 1 31 19 3
>> 8 32 1 46 19 3
>> . 52 2 58 8 3
>> 1 64 0 77 20 3
>> 8 80 1 91 19 3
>> 5 96 1 107 19 3
>> 0 3 1 15 19 4
>> 8 19 1 30 19 4
>> 7 34 1 47 19 4
>> . 53 1 58 9 4
>> 5 65 1 77 19 4
>> 6 80 1 92 19 4
>> 4 95 0 109 20 4
>> 0 4 1 15 19 5
>> 7 19 1 30 19 5
>> 5 34 1 46 19 5
>> . 53 1 57 5 5
>> 3 65 1 76 19 5
>> 1 82 1 90 19 5
>> 3 96 1 107 19 5
>>
>>
>> Unicharset:
>> 14
>> NULL 0 Common 0
>> Joined 7 0,255,0,255,0,0,0,0,0,0 Latin 1 0 1 Joined # Joined [4a 6f 69 6e 
>> 65 64 ]a
>> |Broken|0|1 21 0,255,0,255,0,0,0,0,0,0 Common 2 10 2 |Broken|0|1 # Broken
>> 0 8 0,255,0,255,0,0,0,0,0,0 Common 3 2 3 0 # 0 [30 ]0
>> 9 8 0,255,0,255,0,0,0,0,0,0 Common 4 2 4 9 # 9 [39 ]0
>> 3 8 0,255,0,255,0,0,0,0,0,0 Common 5 2 5 3 # 3 [33 ]0
>> . 22 0,255,0,255,0,0,0,0,0,0 Common 6 6 6 . # . [2e ]p
>> 2 8 0,255,0,255,0,0,0,0,0,0 Common 7 2 7 2 # 2 [32 ]0
>> 5 8 0,255,0,255,0,0,0,0,0,0 Common 8 2 8 5 # 5 [35 ]0
>> 8 8 0,255,0,255,0,0,0,0,0,0 Common 9 2 9 8 # 8 [38 ]0
>> 4 8 0,255,0,255,0,0,0,0,0,0 Common 10 2 10 4 # 4 [34 ]0
>> 1 8 0,255,0,255,0,0,0,0,0,0 Common 11 2 11 1 # 1 [31 ]0
>> 6 8 0,255,0,255,0,0,0,0,0,0 Common 12 2 12 6 # 6 [36 ]0
>> 7 8 0,255,0,255,0,0,0,0,0,0 Common 13 2 13 7 # 7 [37 ]0
>>
>>
>> NTS.font.exp0.tr file:
>> font 0 3 1 14 19 0
>>  4
>> mf 16
>>  -0.085041896 0.30783021 0.27617577 0 0 0
>>  -0.25234067 0.27376649 0.089746617 0.13718249 0 0
>>  -0.28155157 0.0045010448 0.47040343 0.25 0 0
>>  -0.25234067 -0.26476437 0.08974655 0.36281759 0 0
>>  -0.085041896 -0.29882804 0.27617577 0.5 0 0
>>  -0.031931162 -0.21447986 0.1730229 0.96998096 0 0
>>  -0.11690831 0.020721853 0.43796182 0.75 0 0
>>  -0.031931162 0.23970276 0.1699543 0.5 0 0
>>  0.24424461 0.072628468 0.47339222 0.76789355 0 0
>>  0.1353676 0.30783021 0.16464323 0 0 0
>>  0.10615671 0.18941826 0.14627755 0.37934926 0 0
>>  0.15926743 -0.011719763 0.30170703 0.25 0 0
>>  0.10615671 -0.19663697 0.12619166 0.090763755 0 0
>>  0.1353676 -0.29882804 0.16464323 0.5 0 0
>>  0.27079996 -0.26476437 0.12619169 0.59076369 0 0
>>  0.29735535 -0.19663697 0.086383387 0.85538673 0 0
>> cn 1
>>  0.36328125 0.35781249 0.2421875 0.1484375
>> if 73
>>  133 69 248
>>  119 72 248
>>  104 75 248
>>  97 82 192
>>  97 95 192
>>  97 107 192
>>  97 120 192
>>  97 132 192
>>  97 145 192
>>  97 157 192
>>  97 170 192
>>  97 182 192
>>  104 188 128
>>  119 188 128
>>  133 188 128
>>  135 206 0
>>  123 206 0
>>  111 206 0
>>  99 206 0
>>  88 206 0
>>  76 206 0
>>  66 201 35
>>  59 193 35
>>  55 182 64
>>  55 168 64
>>  55 155 64
>>  55 142 64
>>  55 128 64
>>  55 115 64
>>  55 101 64
>>  55 88 64
>>  55 75 64
>>  59 64 93
>>  66 55 93
>>  76 51 128
>>  88 51 128
>>  99 51 128
>>  111 51 128
>>  123 51 128
>>  135 51 128
>>  145 184 97
>>  154 175 97
>>  163 167 97
>>  168 156 64
>>  168 143 64
>>  168 130 64
>>  168 118 64
>>  168 105 64
>>  168 92 64
>>  163 82 23
>>  154 77 23
>>  145 71 23
>>  148 51 128
>>  162 51 128
>>  176 51 128
>>  187 53 151
>>  196 59 151
>>  205 65 151
>>  207 72 219
>>  200 81 219
>>  196 92 192
>>  196 105 192
>>  196 118 192
>>  196 130 192
>>  196 143 192
>>  196 156 192
>>  195 168 204
>>  191 179 204
>>  188 190 204
>>  184 200 204
>>  176 206 0
>>  162 206 0
>>  148 206 0
>> tb 1
>>  64 251 114
>>
>>
>> pffmtable:
>> NULL 0
>> Joined 0
>> |Broken|0|1 0
>> 0 0
>> 9 0
>> 3 0
>> . 0
>> 2 0
>> 5 0
>> 8 0
>> 4 0
>> 1 0
>> 6 0
>> 7 0
>>
>> NTS.normproto file:
>> linear   essential      -0.250000   0.750000
>> linear   non-essential   0.000000   1.000000
>> linear   essential       0.000000   1.000000
>> linear   essential       0.000000   1.000000
>>
>> 0 1
>> significant   elliptical    34
>>   0.364775  0.371404  0.241039  0.150391
>>   0.000400  0.000416  0.000400  0.000400
>>
>> 9 1
>> significant   elliptical    13
>>   0.372897  0.418750  0.241286  0.157752
>>   0.000400  0.004734  0.000400  0.001087
>>
>> 3 1
>> significant   elliptical    16
>>   0.365479  0.385596  0.247070  0.143799
>>   0.000400  0.003148  0.000400  0.000702
>>
>> . 1
>> significant   elliptical    27
>>   0.081019  0.055483  0.060619  0.050492
>>   0.000400  0.000400  0.000400  0.000400
>>
>> 2 1
>> significant   elliptical    10
>>   0.354297  0.359492  0.248828  0.138672
>>   0.000400  0.000400  0.000400  0.000400
>>
>> 5 1
>> significant   elliptical    10
>>   0.363672  0.350859  0.248047  0.144922
>>   0.000400  0.000400  0.000400  0.000400
>>
>> 8 1
>> significant   elliptical    19
>>   0.365543  0.378536  0.234786  0.141653
>>   0.000400  0.000400  0.000400  0.000400
>>
>> 4 1
>> significant   elliptical     9
>>   0.325521  0.274219  0.215278  0.128038
>>   0.000400  0.000400  0.000400  0.000400
>>
>> 1 1
>> significant   elliptical    11
>>   0.320312  0.217259  0.248580  0.091974
>>   0.000400  0.000400  0.000400  0.000400
>>
>> 6 1
>> significant   elliptical    20
>>   0.360156  0.370703  0.238281  0.143164
>>   0.000400  0.000400  0.000400  0.000400
>>
>> 7 1
>> significant   elliptical    20
>>   0.448633  0.243359  0.242969  0.113477
>>   0.000400  0.000400  0.000400  0.000400
>>
>> -- 
>>
> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/9e3a6851-0311-4148-af1f-b61999f38977n%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/9e3a6851-0311-4148-af1f-b61999f38977n%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/c09d4786-595e-4e49-b5c6-b7ded4bee47fn%40googlegroups.com.

Reply via email to