py-pdf-parser:一个Python工具,可帮助从结构化PDF中提取信息

上传者: 42129005 | 上传时间: 2023-02-13 16:42:14 | 文件大小: 509KB | 文件类型: ZIP
py-pdf-parser Py PDF Parser是帮助从结构化PDF中提取信息的工具。 完整的详细信息和安装说明可以在以下位置找到: : 该项目基于Sam Whitehall(github.com/samwhitehall)的原始设计和原型。

文件下载

资源详情

[{"title":"( 92 个子文件 509KB ) py-pdf-parser:一个Python工具,可帮助从结构化PDF中提取信息","children":[{"title":"py-pdf-parser-master","children":[{"title":"MANIFEST.in <span style='color:#111;'> 34B </span>","children":null,"spread":false},{"title":"imagemagick_policy.xml <span style='color:#111;'> 3.10KB </span>","children":null,"spread":false},{"title":"mypy.ini <span style='color:#111;'> 37B </span>","children":null,"spread":false},{"title":".github","children":[{"title":"scripts","children":[{"title":"docs.sh <span style='color:#111;'> 918B </span>","children":null,"spread":false},{"title":"test.sh <span style='color:#111;'> 963B </span>","children":null,"spread":false},{"title":"lint.sh <span style='color:#111;'> 4.61KB </span>","children":null,"spread":false}],"spread":true},{"title":"dependabot.yml <span style='color:#111;'> 284B </span>","children":null,"spread":false},{"title":"ISSUE_TEMPLATE","children":[{"title":"bug_report.md <span style='color:#111;'> 818B </span>","children":null,"spread":false},{"title":"question.md <span style='color:#111;'> 657B </span>","children":null,"spread":false},{"title":"feature_request.md <span style='color:#111;'> 632B </span>","children":null,"spread":false}],"spread":true},{"title":"pull_request_template.md <span style='color:#111;'> 735B </span>","children":null,"spread":false},{"title":"workflows","children":[{"title":"continuous-integration.yml <span style='color:#111;'> 946B </span>","children":null,"spread":false},{"title":"codeql-analysis.yml <span style='color:#111;'> 1.64KB </span>","children":null,"spread":false},{"title":"release.yml <span style='color:#111;'> 1.01KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"py_pdf_parser","children":[{"title":"exceptions.py <span style='color:#111;'> 701B </span>","children":null,"spread":false},{"title":"sectioning.py <span style='color:#111;'> 6.03KB </span>","children":null,"spread":false},{"title":"tables.py <span style='color:#111;'> 18.72KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"loaders.py <span style='color:#111;'> 2.98KB </span>","children":null,"spread":false},{"title":"components.py <span style='color:#111;'> 17.69KB </span>","children":null,"spread":false},{"title":"visualise","children":[{"title":"sections.py <span style='color:#111;'> 11.75KB </span>","children":null,"spread":false},{"title":"main.py <span style='color:#111;'> 10.86KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 28B </span>","children":null,"spread":false},{"title":"info_figure.py <span style='color:#111;'> 2.66KB </span>","children":null,"spread":false},{"title":"background.py <span style='color:#111;'> 1.21KB </span>","children":null,"spread":false}],"spread":true},{"title":"filtering.py <span style='color:#111;'> 37.84KB </span>","children":null,"spread":false},{"title":"common.py <span style='color:#111;'> 1.94KB </span>","children":null,"spread":false}],"spread":true},{"title":"CONTRIBUTING.md <span style='color:#111;'> 2.10KB </span>","children":null,"spread":false},{"title":"LICENSE <span style='color:#111;'> 1.04KB </span>","children":null,"spread":false},{"title":"dockerfiles","children":[{"title":"Dockerfile <span style='color:#111;'> 609B </span>","children":null,"spread":false},{"title":"Dockerfile_dev <span style='color:#111;'> 685B </span>","children":null,"spread":false},{"title":"Dockerfile_tests <span style='color:#111;'> 797B </span>","children":null,"spread":false}],"spread":true},{"title":"setup.py <span style='color:#111;'> 1.81KB </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 681B </span>","children":null,"spread":false},{"title":"SECURITY.md <span style='color:#111;'> 327B </span>","children":null,"spread":false},{"title":"docs","children":[{"title":"source","children":[{"title":"reference","children":[{"title":"common.rst <span style='color:#111;'> 66B </span>","children":null,"spread":false},{"title":"filtering.rst <span style='color:#111;'> 134B </span>","children":null,"spread":false},{"title":"tables.rst <span style='color:#111;'> 66B </span>","children":null,"spread":false},{"title":"index.rst <span style='color:#111;'> 120B </span>","children":null,"spread":false},{"title":"loaders.rst <span style='color:#111;'> 69B </span>","children":null,"spread":false},{"title":"components.rst <span style='color:#111;'> 78B </span>","children":null,"spread":false},{"title":"sectioning.rst <span style='color:#111;'> 78B </span>","children":null,"spread":false},{"title":"visualise.rst <span style='color:#111;'> 78B </span>","children":null,"spread":false}],"spread":false},{"title":"conf.py <span style='color:#111;'> 1.98KB </span>","children":null,"spread":false},{"title":"overview.rst <span style='color:#111;'> 8.66KB </span>","children":null,"spread":false},{"title":"examples","children":[{"title":"simple_memo.rst <span style='color:#111;'> 8.38KB </span>","children":null,"spread":false},{"title":"index.rst <span style='color:#111;'> 813B </span>","children":null,"spread":false},{"title":"element_ordering.rst <span style='color:#111;'> 5.37KB </span>","children":null,"spread":false},{"title":"extracting_text_from_figures.rst <span style='color:#111;'> 1.40KB </span>","children":null,"spread":false},{"title":"order_summary.rst <span style='color:#111;'> 10.76KB </span>","children":null,"spread":false},{"title":"more_tables.rst <span style='color:#111;'> 11.60KB </span>","children":null,"spread":false}],"spread":false},{"title":"example_files","children":[{"title":"order_summary.pdf <span style='color:#111;'> 36.76KB </span>","children":null,"spread":false},{"title":"figure.pdf <span style='color:#111;'> 8.75KB </span>","children":null,"spread":false},{"title":"tables.pdf <span style='color:#111;'> 22.25KB </span>","children":null,"spread":false},{"title":"simple_memo.pdf <span style='color:#111;'> 20.67KB </span>","children":null,"spread":false},{"title":"columns.pdf <span style='color:#111;'> 7.18KB </span>","children":null,"spread":false},{"title":"grid.pdf <span style='color:#111;'> 12.47KB </span>","children":null,"spread":false}],"spread":false},{"title":"index.rst <span style='color:#111;'> 197B </span>","children":null,"spread":false},{"title":"screenshots","children":[{"title":"order_summary_example","children":[{"title":"showing_font_2.png <span style='color:#111;'> 53.19KB </span>","children":null,"spread":false},{"title":"initial.png <span style='color:#111;'> 49.22KB </span>","children":null,"spread":false},{"title":"zoomed.png <span style='color:#111;'> 29.50KB </span>","children":null,"spread":false},{"title":"showing_font_1.png <span style='color:#111;'> 51.63KB </span>","children":null,"spread":false},{"title":"sections.png <span style='color:#111;'> 52.90KB </span>","children":null,"spread":false}],"spread":false},{"title":"simple_memo_example","children":[{"title":"top.png <span style='color:#111;'> 22.33KB </span>","children":null,"spread":false},{"title":"visualise.png <span style='color:#111;'> 33.29KB </span>","children":null,"spread":false}],"spread":false}],"spread":false},{"title":"CHANGELOG.md <span style='color:#111;'> 18B </span>","children":null,"spread":false}],"spread":true},{"title":"make.bat <span style='color:#111;'> 799B </span>","children":null,"spread":false},{"title":"Makefile <span style='color:#111;'> 738B </span>","children":null,"spread":false}],"spread":true},{"title":"docker-compose.yml <span style='color:#111;'> 1.34KB </span>","children":null,"spread":false},{"title":"tests","children":[{"title":"base.py <span style='color:#111;'> 2.27KB </span>","children":null,"spread":false},{"title":"test_tables.py <span style='color:#111;'> 36.17KB </span>","children":null,"spread":false},{"title":"test_filtering.py <span style='color:#111;'> 49.93KB </span>","children":null,"spread":false},{"title":"utils.py <span style='color:#111;'> 4.47KB </span>","children":null,"spread":false},{"title":"test_loaders.py <span style='color:#111;'> 1.43KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"data","children":[{"title":"test.pdf <span style='color:#111;'> 38.05KB </span>","children":null,"spread":false},{"title":"image.pdf <span style='color:#111;'> 8.75KB </span>","children":null,"spread":false}],"spread":false},{"title":"test_doc_examples","children":[{"title":"test_tables.py <span style='color:#111;'> 6.73KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"test_simple_memo.py <span style='color:#111;'> 2.71KB </span>","children":null,"spread":false},{"title":"test_extracting_text_from_figures.py <span style='color:#111;'> 800B </span>","children":null,"spread":false},{"title":"test_element_ordering.py <span style='color:#111;'> 3.43KB </span>","children":null,"spread":false},{"title":"test_order_summary.py <span style='color:#111;'> 4.05KB </span>","children":null,"spread":false}],"spread":false},{"title":"test_common.py <span style='color:#111;'> 974B </span>","children":null,"spread":false},{"title":"test_components.py <span style='color:#111;'> 13.34KB </span>","children":null,"spread":false},{"title":"test_sectioning.py <span style='color:#111;'> 8.61KB </span>","children":null,"spread":false}],"spread":false},{"title":".readthedocs.yml <span style='color:#111;'> 630B </span>","children":null,"spread":false},{"title":"CODE_OF_CONDUCT.md <span style='color:#111;'> 3.27KB </span>","children":null,"spread":false},{"title":".gitignore <span style='color:#111;'> 1.20KB </span>","children":null,"spread":false},{"title":"pycodestyle.cfg <span style='color:#111;'> 274B </span>","children":null,"spread":false},{"title":"pyproject.toml <span style='color:#111;'> 31B </span>","children":null,"spread":false},{"title":"CHANGELOG.md <span style='color:#111;'> 4.44KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明