Top2Vec主题建模语义搜索算法.zip

上传者: 38747087 | 上传时间: 2021-03-10 16:19:58 | 文件大小: 6.26MB | 文件类型: ZIP
Topic2Vector是用于主题建模和语义搜索的算法。它自动检测文本中存在的主题,并生成联合嵌入的主题,文档和单词向量。op2Vec - Generate topic, document and word embeddings.' by Dimo Angelov
安装Top2Vec的简单方法是:pip install top2vec
用法
从 top2vec 导入 Top2Vec
型号= Top2Vec(文档)
参数:
documents:输入语料库,应为字符串列表。
speed:此参数将确定模型训练的速度。“快速学习”选项是最快的,将生成最低质量的向量。“学习”选项将学习更好的质量向量,但需要花费更长的时间进行训练。“深度学习”选项将学习最佳质量的向量,但将花费大量时间进行训练。
workers:用于训练模型的工作线程数量。较大的数量将导致更快的培训。
经过训练的模型可以保存和加载。
model.save(“ filename ”)
型号= Top2Vec.load(“ filename ”)

文件下载

资源详情

[{"title":"( 21 个子文件 6.26MB ) Top2Vec主题建模语义搜索算法.zip","children":[{"title":"Top2Vec-master","children":[{"title":"README.md <span style='color:#111;'> 11.28KB </span>","children":null,"spread":false},{"title":"docs","children":[{"title":"source","children":[{"title":"README.md <span style='color:#111;'> 15B </span>","children":null,"spread":false},{"title":"conf.py <span style='color:#111;'> 2.13KB </span>","children":null,"spread":false},{"title":"index.rst <span style='color:#111;'> 447B </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"LICENSE <span style='color:#111;'> 1.48KB </span>","children":null,"spread":false},{"title":"top2vec","children":[{"title":"tests","children":[{"title":".gitkeep <span style='color:#111;'> 1B </span>","children":null,"spread":false}],"spread":true},{"title":"Top2Vec.py <span style='color:#111;'> 19.48KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 37B </span>","children":null,"spread":false}],"spread":true},{"title":"requirements.txt <span style='color:#111;'> 765B </span>","children":null,"spread":false},{"title":"images","children":[{"title":"doc_word_embedding.svg <span style='color:#111;'> 281.14KB </span>","children":null,"spread":false},{"title":"topic29.png <span style='color:#111;'> 513.23KB </span>","children":null,"spread":false},{"title":"topic9.png <span style='color:#111;'> 512.06KB </span>","children":null,"spread":false},{"title":"topic21.png <span style='color:#111;'> 494.56KB </span>","children":null,"spread":false},{"title":"umap_docs.png <span style='color:#111;'> 1.75MB </span>","children":null,"spread":false},{"title":"topic_words.svg <span style='color:#111;'> 65.28KB </span>","children":null,"spread":false},{"title":"topic61.png <span style='color:#111;'> 540.33KB </span>","children":null,"spread":false},{"title":"hdbscan_docs.png <span style='color:#111;'> 2.11MB </span>","children":null,"spread":false},{"title":"topic_vector.svg <span style='color:#111;'> 35.55KB </span>","children":null,"spread":false},{"title":"topic48.png <span style='color:#111;'> 560.65KB </span>","children":null,"spread":false}],"spread":true},{"title":"notebooks","children":[{"title":"CORD-19_top2vec.ipynb <span style='color:#111;'> 13.67KB </span>","children":null,"spread":false}],"spread":true},{"title":"setup.py <span style='color:#111;'> 1.13KB </span>","children":null,"spread":false}],"spread":true}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明