yake: Unsupervised Single-Document Keyword Extraction (Source Code)

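Uploader: 42164534 | Uploaded: 2021-08-30 11:04:41 | File size: 416KB | File type: ZIP
Yet Another Keyword Extractor (Yake): an unsupervised method for automatic keyword extraction using text features. YAKE! is a lightweight unsupervised automatic keyword extraction method that selects the most important keywords of a text based on statistical features extracted from a single document. The system does not need to be trained on a particular set of documents, nor does it depend on dictionaries, external corpora, text size, language, or domain. To demonstrate the merits and significance of the proposal, we compare it against ten state-of-the-art unsupervised approaches (TF.IDF, KP-Miner, RAKE, TextRank, SingleRank, ExpandRank, TopicRank, TopicalPageRank, PositionRank and MultipartiteRank) and one supervised method (KEA). Experimental results on twenty datasets (see the Benchmark section below) show that our method significantly outperforms state-of-the-art methods across many collections of different sizes, languages, or domains. In addition to the Python package described here, we also provide a ...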
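To make the idea of ranking keywords from the statistics of a single document concrete, here is a minimal sketch. It is not YAKE's actual scoring formula: the function name `score_keywords`, the tiny `STOPWORDS` set, and the heuristic itself (position of first occurrence divided by term frequency) are assumptions chosen only for illustration. It follows one convention commonly associated with YAKE, namely that a lower score means a more relevant keyword.

```python
import re
from collections import Counter

# Tiny illustrative stopword set (hypothetical; not one of the package's lists).
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "on", "for", "it", "with"}


def score_keywords(text, top=5):
    """Rank single words using only statistics taken from `text` itself.

    Lower score = more important. The score is a toy heuristic (relative
    position of first occurrence divided by frequency), not YAKE's formula.
    """
    tokens = re.findall(r"[a-z][a-z\-]+", text.lower())
    if not tokens:
        return []

    freq = Counter()
    first_pos = {}
    for i, tok in enumerate(tokens):
        if tok in STOPWORDS:
            continue
        freq[tok] += 1
        first_pos.setdefault(tok, i)  # remember where the word first appears

    scored = []
    for tok, count in freq.items():
        position = (first_pos[tok] + 1) / len(tokens)  # in (0, 1]; earlier is smaller
        scored.append((tok, position / count))         # frequent and early gives a low score

    # Sort by ascending score, breaking ties alphabetically for determinism.
    scored.sort(key=lambda pair: (pair[1], pair[0]))
    return scored[:top]


if __name__ == "__main__":
    sample = "Keyword extraction finds keywords. Keyword extraction needs no training corpus."
    for word, score in score_keywords(sample, top=3):
        print(f"{score:.3f}  {word}")
```

No training data, dictionary, or external corpus is involved: every quantity is computed from the input text alone, which is the property the description above emphasizes.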


Resource details

yake: Unsupervised Single-Document Keyword Extraction (Source Code) (74 files, 416KB)

yake-master/
    MANIFEST.in (286B)
    docker/
        build.sh (490B)
        push_images.sh (207B)
        test_image_calls.sh (4.23KB)
        Dockerfiles/
            yake/
                Dockerfile (335B)
            yake-server/
                yake-rest-api.py (5.81KB)
                Dockerfile (690B)
        constants.sh (111B)
    CONTRIBUTING.rst (3.10KB)
    strategy.ini (581B)
    microsoft.gpg (641B)
    requirements.txt (61B)
    AUTHORS.rst (343B)
    LICENSE (33.71KB)
    yake/
        highlight.py (6.89KB)
        StopwordsList/
            stopwords_no.txt (710B)
            stopwords_de.txt (4.30KB)
            stopwords_lt.txt (432B)
            stopwords_nl.txt (3.48KB)
            stopwords_es.txt (2.51KB)
            stopwords_sk.txt (534B)
            stopwords_ro.txt (1.87KB)
            stopwords_da.txt (387B)
            stopwords_hu.txt (7.22KB)
            stopwords_ar.txt (1.41KB)
            stopwords_br.txt (2.69KB)
            stopwords_fa.txt (3.20KB)
            stopwords_hy.txt (340B)
            stopwords_el.txt (725B)
            stopwords_sv.txt (2.80KB)
            stopwords_zh.txt (623B)
            stopwords_fi.txt (6.31KB)
            stopwords_hr.txt (1.03KB)
            stopwords_it.txt (2.71KB)
            stopwords_sl.txt (2.82KB)
            stopwords_noLang.txt (0B)
            stopwords_bg.txt (2.60KB)
            stopwords_pt.txt (2.71KB)
            stopwords_et.txt (185B)
            stopwords_uk.txt (4.41KB)
            stopwords_ja.txt (350B)
            stopwords_lv.txt (1.19KB)
            stopwords_en.txt (4.08KB)
            stopwords_hi.txt (2.02KB)
            stopwords_id.txt (3.08KB)
            stopwords_pl.txt (842B)
            stopwords_cz.txt (1.73KB)
            stopwords_fr.txt (3.24KB)
            stopwords_tr.txt (1.70KB)
            stopwords_ru.txt (4.84KB)
        __init__.py (187B)
        datarepresentation.py (15.86KB)
        cli.py (1.95KB)
        Levenshtein.py (1.19KB)
        yake.py (3.21KB)
    setup.cfg (441B)
    setup.py (4.52KB)
    README.md (20.18KB)
    pke/
        yake.py (17.58KB)
    docs/
        authors.rst (28B)
        make.bat (6.30KB)
        readme.rst (27B)
        contributing.rst (33B)
        conf.py (8.16KB)
        usage.rst (2.52KB)
        installation.rst (1.05KB)
        history.rst (28B)
        pyake.rst (722B)
        index.rst (290B)
        Makefile (6.60KB)
        YAKEvsBaselines.jpg (391.02KB)
    tests/
        __init__.py (59B)
        test_yake.py (16.78KB)
        doc1.txt (2.28KB)
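The StopwordsList directory in the listing above follows a `stopwords_<code>.txt` naming pattern and includes an empty `stopwords_noLang.txt`. The sketch below shows one way such files could be resolved and loaded by language code. The package's own loading logic is not shown on this page, so the function names `stopword_path` and `load_stopwords`, the fallback to `stopwords_noLang.txt`, the UTF-8 encoding, and the one-word-per-line layout are all assumptions.

```python
from pathlib import Path


def stopword_path(lang, base_dir="yake/StopwordsList"):
    """Return the stopword file for a language code.

    Assumption: unknown languages fall back to the (empty) stopwords_noLang.txt.
    """
    base = Path(base_dir)
    candidate = base / f"stopwords_{lang[:2].lower()}.txt"
    if candidate.is_file():
        return candidate
    return base / "stopwords_noLang.txt"


def load_stopwords(lang, base_dir="yake/StopwordsList"):
    """Load stopwords as a lowercase set; an absent file yields an empty set."""
    path = stopword_path(lang, base_dir)
    if not path.is_file():
        return set()
    # Assumption: files are UTF-8 text with one stopword per line.
    with open(path, encoding="utf-8") as fh:
        return {line.strip().lower() for line in fh if line.strip()}


if __name__ == "__main__":
    words = load_stopwords("en")
    print(f"{len(words)} English stopwords loaded")
```

Falling back to an empty list keeps the extractor usable for languages without a dedicated file, at the cost of letting function words through as candidates.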
