Tesseract-OCR text recognition: tessdata eng.traineddata OCR training data files

Uploader: xiao899 | Upload time: 2024-05-17 17:27:03 | File size: 31.4MB | File type: ZIP
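1. Prepare the sample images.
2. Open jTessBoxEditor, choose Tools -> Merge TIFF, browse to the folder that holds the training samples, and select every sample image that should take part in training.
3. In the save dialog that appears, save to the same folder and name the file ty.cp.exp6.tif.
4. Generate the box file: tesseract ty.cp.exp6.tif ty.cp.exp6 -l ty batch.nochop makebox
5. Open jTessBoxEditor, click Box Editor -> Open, and open the ty.cp.exp6.tif created in steps 2-3. The editor automatically loads the matching ty.cp.exp6.box file. Correct any wrong characters or boxes and save.
6. Create the font properties file with echo: echo cp 0 0 0 0 0 >font_properties (keep the space before >; without it the shell reads the final 0 as a file descriptor number and drops it from the output). The file should contain the single line: cp 0 0 0 0 0
7. Use tesseract to generate the ty.cp.exp6.tr training file. Run in a terminal: tesseract ty.cp.exp6.tif ty.cp.exp6 nobatch box.train
8. Generate the character set file. Run in a terminal: unicharset_extractor ty.cp.exp6.box
9. Run clustering: mftraining -F font_properties -U unicharset -O ty.unicharset ty.cp.exp6.tr and then cntraining ty.cp.exp6.tr. Afterwards, manually rename the four files produced by clustering (inttemp, pffmtable, normproto, shapetable) to [lang].xxx. Here they become ty.inttemp, ty.pffmtable, ty.normproto, and ty.shapetable.
10. Merge the data files. Run in a terminal: combine_tessdata ty. (the trailing dot is part of the argument). Then test recognition with the new language: tesseract b01.jpg result -l ty --psm 7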


Resource details

(35 sub-files, 31.4MB) Tesseract-OCR text recognition: tessdata eng.traineddata OCR training data files
    tyA.traineddata  507.21KB
    osd.traineddata  10.07MB
    pdf.ttf  572B
    configs/
        logfile  25B
        linebox  70B
        api_config  26B
        inter  59B
        digits  37B
        lstm.train  328B
        tsv  46B
        box.train.stderr  355B
        rebox  65B
        pdf  46B
        bigram  129B
        hocr  64B
        box.train  355B
        makebox  26B
        kannada  101B
        unlv  46B
        quiet  21B
        ambigs.train  146B
        txt  166B
        strokewidth  377B
    chi_sim.traineddata  50.22MB
    tessconfigs/
        msdemo  402B
        batch.nochop  37B
        nobatch  1B
        matdemo  243B
        batch  50B
        segdemo  329B
    ty1.traineddata  164.26KB
    ty.traineddata  302.46KB
    eng.user-words  27B
    eng.user-patterns  33B
    ty180201.traineddata  498.11KB
