首页| JavaScript| HTML/CSS| Matlab| PHP| Python| Java| C/C++/VC++| C#| ASP| 其他|
购买积分 购买会员 激活码充值

您现在的位置是:虫虫源码 > 其他 > 这是Tesseract OCR引擎,支持印度脚本的一个港口。

这是Tesseract OCR引擎,支持印度脚本的一个港口。

资 源 简 介

NOTE The project code hosting has moved to git. Find it at https://github.com/debayan/Tesseract-Indic-OCR/ Project Description The aim of this project is to add Indic script support to the Tesseract OCR engine, which currently does not support connected script such as devnagri. This includes adding some routines to the existing code base, training the engine with sample images and then testing for accuracy for subsequent debugging and refinement in the algorithms. Hacker documentation for this project is maintained at http://hacking-tesseract.blogspot.com Mailing List -> http://groups.google.com/group/indic-ocr These instructions are for Bengali. Follow the same instructions for any other language. Simply replace the 3 letter code "ban" with your own language code. Currently available language support is for Hindi (hin) and Bangla (ban)

文 件 列 表

TesseractIndic-Trainer-GUI-0.1.3
tesseract_trainer
hin.alphabet
output
Hindi.images
trainer_gui.py
tesseract-gui-logo.png
lang_list.txt
__init__.py
CHANGELOG
temp
.trainer_gui.py.swp
pango.png
README
VIP VIP
0.189746s