android - Tesseract TessData fonts used for training -


i using tesseract ocr in android app. focusing on chinese language need recognise few keywords thinking of creating .traineddata files using jtessboxeditor. wanted know fonts chinese traditional tessdata file use? https://github.com/tesseract-ocr/tessdata

alternatively, there way can edit chi_tra.traineddata file recognises few keywords? main reason doing because file size 63.4 mb , tesseract takes around 2 3 minutes before finishing. accuracy great slow.

the font_properties file of tesseract trained languages can found in github. may check traditional chinese specific fonts supported list.

from tesseract-ocr/langdata folder here in github, can check chi_tra.wordlist inside chi_tra folder find words used training.


Comments

Popular posts from this blog

inversion of control - Autofac named registration constructor injection -

verilog - Systemverilog dynamic casting issues -

ios - Change Storyboard View using Seague -