I would suggest you to update to more recent svn revision - there were a
lot of bugfixes (see [1])

Also I would suggest you to create new github repository WITHOUT language
data files (IMO it just consume space at your project and you still can
download specific version of language data file from tesseract svn project
e.g. with link like this:
https://code.google.com/p/tesseract-ocr/source/browse/trunk/tessdata/eng.traineddata?r=660
where
"r=660" indicate svn revision). Deleting language data from current
repository IMHO do not help, because they will stay in git history,
so cloning of your project will be painful (from point of data size).

Anyway - great job! And thanks for sharing you project as open source.

[1] https://code.google.com/p/tesseract-ocr/source/list

Zdenko


On Wed, May 15, 2013 at 4:18 PM, Renard Wellnitz
<[email protected]>wrote:

> Hi Nick,
>
> here is the console output:
>
> localhost:tesseract-ocr-3.02 renard$  svn log -r COMMITTED
> ------------------------------------------------------------------------
> r705 | [email protected] | 2012-03-15 22:05:12 +0100 (Thu, 15 Mar 2012) |
> 1 line
>
> fixed build in java directory; create documentation package with 'make
> doc-pack'
> ------------------------------------------------------------------------
>
>
> Cheers
> Renard
>
>
> Am Mittwoch, 15. Mai 2013 14:28:35 UTC+2 schrieb Nick White:
>>
>> I'm no expert with SVN, but I think this command will tell me what I
>> want to know:
>>
>>   svn log -r COMMITTED
>>
>> Thanks.
>>
>> On Wed, May 15, 2013 at 04:02:34AM -0700, Renard Wellnitz wrote:
>> > Hi Nick,
>> >
>> > i'm not really proficient with svn. Maybe this helps? If you want me to
>> run a
>> > specific svn command i'll gladly do it.
>> >
>> >
>> >     localhost:tesseract-ocr-3.02 renard$ svn ls "^/tags"
>> >     release-2.04/
>> >     release-3.00/
>> >     release-3.00.1/
>> >     release-3.01/
>> >     release-3.02.01/
>> >     release-3.02.02/
>> >     localhost:tesseract-ocr-3.02 renard$ svnversion .
>> >     705M
>> >     localhost:tesseract-ocr-3.02 renard$
>> >
>> >
>> > I do not remember the exact changes. But my main goals was the get
>> progress
>> > information during the ocr process so that my app could show the
>> bounding boxes
>> > of the currently processed word.
>> >
>> > Cheers
>> > Renard
>> >
>> >
>> > Am Mittwoch, 15. Mai 2013 11:37:26 UTC+2 schrieb Nick White:
>> >
>> >     Ah, I see it's pretty close to 3.02.01 (now only available as an
>> SVN
>> >     tag). Am I correct in thinking that's the release you used? Or was
>> >     it a SVN revision near it?
>> >
>> >     Thanks again,
>> >
>> >     Nick
>> >
>> >     On Wed, May 15, 2013 at 10:30:29AM +0100, Nick White wrote:
>> >     > Hi Renard,
>> >     >
>> >     > This is awesome, great job :)
>> >     >
>> >     > I was interested to see what changes you'd made to tesseract, so
>> ran
>> >     > 'diff -r' on the tesseract-ocr-3.02 directory in github, but a
>> quick
>> >     > look made it seem quite different to the
>> >     > tesseract-ocr-3.02.02.tar.gz currently available from Tesseract.
>> >     >
>> >     > Am I correct in thinking that? Is it based on a version from SVN?
>> If
>> >     > so, which? If not, I'll just have to spend more time with diff
>> ;-)
>> >     >
>> >     > I'd be keen to try and isolate and generalise any changes you
>> made
>> >     > and get them back into the core code, if I can.
>> >     >
>> >     > Thanks for all this lovely free code!
>> >     >
>> >     > Nick
>> >     >
>> >     > On Tue, May 14, 2013 at 01:51:15PM -0700, Renard Wellnitz wrote:
>> >     > > Hi Tom,
>> >     > >
>> >     > > i decided to publish the code of the app under the Apache 2
>> licence.
>> >     However
>> >     > > the c++ code that deals with image processing uses the stricter
>> GLP v3
>> >     since
>> >     > > that is the place where i put a lot of effort into.
>> >     > >
>> >     > > The project still needs a readme and instructions on how to
>> build the
>> >     binaries.
>> >     > > For someone with a bit of Android/NDK experience it should be
>> not a big
>> >     problem
>> >     > > however.
>> >     > > Readme and build instructions will follow in a couple of days.
>> >     > >
>> >     > > 
>> > https://github.com/renard314/**textfairy<https://github.com/renard314/textfairy>
>> >     > >
>> >     > > Cheers!
>> >     > > Renard
>> >
>> > --
>> > --
>> > You received this message because you are subscribed to the Google
>> > Groups "tesseract-ocr" group.
>> > To post to this group, send email to [email protected]
>> > To unsubscribe from this group, send email to
>> > tesseract-oc...@**googlegroups.com
>> > For more options, visit this group at
>> > http://groups.google.com/**group/tesseract-ocr?hl=en<http://groups.google.com/group/tesseract-ocr?hl=en>
>> >
>> > ---
>> > You received this message because you are subscribed to the Google
>> Groups
>> > "tesseract-ocr" group.
>> > To unsubscribe from this group and stop receiving emails from it, send
>> an email
>> > to tesseract-oc...@**googlegroups.com.
>> > For more options, visit 
>> > https://groups.google.com/**groups/opt_out<https://groups.google.com/groups/opt_out>.
>>
>> >
>> >
>>
>  --
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>
> ---
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/groups/opt_out.
>
>
>

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to