android - Tesseract TessData fonts used for training -
i using tesseract ocr in android app. focusing on chinese language need recognise few keywords thinking of creating .traineddata files using jtessboxeditor. wanted know fonts chinese traditional tessdata file use? https://github.com/tesseract-ocr/tessdata
alternatively, there way can edit chi_tra.traineddata file recognises few keywords? main reason doing because file size 63.4 mb , tesseract takes around 2 3 minutes before finishing. accuracy great slow.
the font_properties file of tesseract trained languages can found in github. may check traditional chinese specific fonts supported list.
from tesseract-ocr/langdata folder here in github, can check chi_tra.wordlist inside chi_tra folder find words used training.
Comments
Post a Comment