首页| JavaScript| HTML/CSS| Matlab| PHP| Python| Java| C/C++/VC++| C#| ASP| 其他|
购买积分 购买会员 激活码充值

您现在的位置是:虫虫源码 > Java > 把莫迪(微软OCR文件)到HOCR文件

把莫迪(微软OCR文件)到HOCR文件

  • 资源大小:1,005.82 kB
  • 上传时间:2021-06-30
  • 下载次数:0次
  • 浏览次数:0次
  • 资源积分:1积分
  • 标      签: Academic office

资 源 简 介

Microsoft Office contains a decent OCR engine, yet it does not create PDF files with a text layer on it. This project contains a script that takes a tif file and converts it into HOCR format (HTML + OCR). This can be then processed with a simple Java program to get a PDF file. Batch scripts that do that are included.

文 件 列 表

README.txt
hocrtopdf-0.0.1.jar
iText.jar
modi2hocr.js
tif2pdf.bat
VIP VIP
0.175932s