天池比赛作品整理。实现从PDF中提取出姓名,出生年月,性别,电话,最高学历,籍贯,落户市县,政治面貌,毕业院校,工作单位,工作内容,职务,项目名称,项目责任、学

上传者: 54707168 | 上传时间: 2021-07-04 17:03:33 | 文件大小: 50.71MB | 文件类型: ZIP
参加了天池的一个pdf简历信息提取的比赛,这里进行回顾、整理和分享 赛题要求从pdf简历中提取出信息,可能会让人觉得,籍贯等。这里搭建了一个BiLSTM-CRF模型,从PDF简历中提取出想要的信息。 模型的线上得分是0.727,排名 21/1200+

文件下载

资源详情

[{"title":"( 50 个子文件 50.71MB ) 天池比赛作品整理。实现从PDF中提取出姓名,出生年月,性别,电话,最高学历,籍贯,落户市县,政治面貌,毕业院校,工作单位,工作内容,职务,项目名称,项目责任、学","children":[{"title":"PDF-Resume-Information-Extraction-master","children":[{"title":"data_process.py <span style='color:#111;'> 13.43KB </span>","children":null,"spread":false},{"title":"util.py <span style='color:#111;'> 9.89KB </span>","children":null,"spread":false},{"title":"train.py <span style='color:#111;'> 7.85KB </span>","children":null,"spread":false},{"title":"supporting_document","children":[{"title":"wrong_pdf.txt <span style='color:#111;'> 47.44KB </span>","children":null,"spread":false},{"title":"log.txt <span style='color:#111;'> 361.00KB </span>","children":null,"spread":false},{"title":"train_word_to_tag_0223.json <span style='color:#111;'> 32.01KB </span>","children":null,"spread":false},{"title":"long_text_error.txt <span style='color:#111;'> 8.25KB </span>","children":null,"spread":false},{"title":"main.ipynb <span style='color:#111;'> 10.33KB </span>","children":null,"spread":false},{"title":"word_to_ix_0219_2.json <span style='color:#111;'> 32.07KB </span>","children":null,"spread":false},{"title":"word_to_ix_add_unk_0219.json <span style='color:#111;'> 32.07KB </span>","children":null,"spread":false}],"spread":true},{"title":"gen_json.py <span style='color:#111;'> 3.40KB </span>","children":null,"spread":false},{"title":"model","children":[{"title":"model_70emd_70hid_10ep_0220.pth <span style='color:#111;'> 722.54KB </span>","children":null,"spread":false},{"title":"model_0223.pth <span style='color:#111;'> 3.22MB </span>","children":null,"spread":false},{"title":"model_100emd_100hid_25ep_Adam_clip_0221.pth <span style='color:#111;'> 3.22MB </span>","children":null,"spread":false},{"title":"model_100_all_data_0226.pth <span style='color:#111;'> 1.08MB </span>","children":null,"spread":false},{"title":"model_70emd_10ep_0220.pth <span style='color:#111;'> 678.24KB </span>","children":null,"spread":false},{"title":"model_100emd_100hid_10ep_0220.pth <span style='color:#111;'> 1.07MB </span>","children":null,"spread":false},{"title":"latest_model.pth <span style='color:#111;'> 679.88KB </span>","children":null,"spread":false},{"title":"model_100emd_100hid_10ep_Adam_clip_0221.pth <span style='color:#111;'> 3.22MB </span>","children":null,"spread":false},{"title":"model_0222.pth <span style='color:#111;'> 3.22MB </span>","children":null,"spread":false},{"title":"model_100_all_data_0224.pth <span style='color:#111;'> 1.07MB </span>","children":null,"spread":false},{"title":"model_100emd_2ep_0219.pth <span style='color:#111;'> 953.16KB </span>","children":null,"spread":false},{"title":"model_100_best_0223.pth <span style='color:#111;'> 3.21MB </span>","children":null,"spread":false},{"title":"model_best_0223.pth <span style='color:#111;'> 3.21MB </span>","children":null,"spread":false},{"title":"model_latest_no_best_0223.pth <span style='color:#111;'> 3.21MB </span>","children":null,"spread":false},{"title":"model_100_all_data_0225.pth <span style='color:#111;'> 1.08MB </span>","children":null,"spread":false},{"title":"model_100_all_data_perfect_0226.pth <span style='color:#111;'> 1.08MB </span>","children":null,"spread":false},{"title":"model_100_all_data_0301.pth <span style='color:#111;'> 1.08MB </span>","children":null,"spread":false},{"title":"model_perfect_1_epoch_0227.pth <span style='color:#111;'> 1.08MB </span>","children":null,"spread":false},{"title":"model_100emd_100hid_12ep_Adam_clip_0221.pth <span style='color:#111;'> 3.22MB </span>","children":null,"spread":false},{"title":"model_150_latest_no_best_0223.pth <span style='color:#111;'> 5.32MB </span>","children":null,"spread":false},{"title":"model_2_epoch_0301.pth <span style='color:#111;'> 1.08MB </span>","children":null,"spread":false},{"title":"model_perfect_1_epoch_0226.pth <span style='color:#111;'> 1.08MB </span>","children":null,"spread":false},{"title":"model_150_best_0223.pth <span style='color:#111;'> 5.32MB </span>","children":null,"spread":false},{"title":"model_100_all_data_perfect_0227.pth <span style='color:#111;'> 1.08MB </span>","children":null,"spread":false},{"title":"model_100emd_100hid_from10ep_Adam_clip_0222.pth <span style='color:#111;'> 3.22MB </span>","children":null,"spread":false},{"title":"model_add_unk_2ep_0219.pth <span style='color:#111;'> 678.24KB </span>","children":null,"spread":false}],"spread":false},{"title":"requirement.txt <span style='color:#111;'> 81B </span>","children":null,"spread":false},{"title":"model.py <span style='color:#111;'> 5.02KB </span>","children":null,"spread":false},{"title":"eval.py <span style='color:#111;'> 4.82KB </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 1.98KB </span>","children":null,"spread":false},{"title":"push_dir","children":[{"title":"gen_json.py <span style='color:#111;'> 12.35KB </span>","children":null,"spread":false},{"title":"Dockerfile <span style='color:#111;'> 369B </span>","children":null,"spread":false},{"title":"run.sh <span style='color:#111;'> 19B </span>","children":null,"spread":false},{"title":"test_result.json <span style='color:#111;'> 80.08KB </span>","children":null,"spread":false},{"title":"word_to_ix_add_unk_0219.json <span style='color:#111;'> 32.07KB </span>","children":null,"spread":false}],"spread":true},{"title":"__pycache__","children":[{"title":"util.cpython-36.pyc <span style='color:#111;'> 7.87KB </span>","children":null,"spread":false},{"title":"model.cpython-36.pyc <span style='color:#111;'> 3.75KB </span>","children":null,"spread":false},{"title":"data_process.cpython-36.pyc <span style='color:#111;'> 9.02KB </span>","children":null,"spread":false}],"spread":true},{"title":"debug.py <span style='color:#111;'> 9.64KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明