Create easily interpretable topics with BERT and class-based TF-IDF. - Python development

Uploader: 42134054 | Upload time: 2021-08-03 10:35:53 | File size: 5.78MB | File type: ZIP
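BERTopic is a topic modeling technique that uses BERT embeddings and c-TF-IDF to create dense clusters, yielding topics that are easy to interpret while keeping the important words in the topic descriptions. A corresponding Medium post is also available.

Table of Contents
1. About the Project
2. Algorithm
2.1. Sentence Transformer
2.2. UMAP + HDBSCAN
2.3. c-TF-IDF
3. Getting Started
3.1. Installation
3.2. Basic Usage
3.3. Overview
4. Google Colaboratory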
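The c-TF-IDF step named above can be illustrated with a small, standard-library-only sketch. It assumes the cluster labels are already known (in BERTopic they would come from the sentence-transformer, UMAP and HDBSCAN steps) and uses the weighting tf · log(1 + A / f), where A is the average number of words per class and f is a term's frequency across all classes. The exact formula in the archived version may differ. The helper names `class_tfidf` and `top_terms` are hypothetical and not part of the bertopic package, and the whitespace tokenization is a simplification.

```python
import math
from collections import Counter


def class_tfidf(docs, labels):
    """Return {label: {term: score}} using a class-based TF-IDF weighting."""
    # Merge all documents that share a cluster label into one "class document".
    class_counts = {}
    for doc, label in zip(docs, labels):
        class_counts.setdefault(label, Counter()).update(doc.lower().split())

    # Frequency of each term summed over all classes.
    total_counts = Counter()
    for counts in class_counts.values():
        total_counts.update(counts)

    # Average number of words per class (A in the formula).
    avg_words = sum(total_counts.values()) / len(class_counts)

    scores = {}
    for label, counts in class_counts.items():
        n_words = sum(counts.values())
        scores[label] = {
            # Normalized in-class frequency times a class-level IDF term.
            term: (freq / n_words) * math.log(1 + avg_words / total_counts[term])
            for term, freq in counts.items()
        }
    return scores


def top_terms(scores, label, n=3):
    """Return the n highest-scoring terms of a class (ties broken alphabetically)."""
    ranked = sorted(scores[label].items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:n]]


# Toy corpus with hand-assigned cluster labels (assumed, for illustration only).
docs = [
    "cat chased mouse",
    "cat food and cat toys",
    "stock market fell",
    "stock price and stock market",
]
labels = [0, 0, 1, 1]

scores = class_tfidf(docs, labels)
print(top_terms(scores, 0, 1))
print(top_terms(scores, 1, 2))
```

In this toy corpus the word "and" occurs in both classes, so it scores lower than a word of equal in-class frequency that occurs in only one class, which is what keeps the topic descriptions focused on distinctive words.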


Resource details

(74 files, 5.78MB) Create easily interpretable topics with BERT and class-based TF-IDF. - Python development

BERTopic-master/
    .gitignore  (916B)
    README.md  (9.78KB)
    .github/
        workflows/
            testing.yml  (607B)
    tests/
        conftest.py  (231B)
        test_other.py  (1.07KB)
        test_models.py  (10.91KB)
        test_topic_representation.py  (4.68KB)
        test_utils.py  (1.26KB)
        __init__.py  (0B)
        test_bertopic.py  (2.54KB)
    bertopic/
        _ctfidf.py  (2.13KB)
        _bertopic.py  (69.65KB)
        _mmr.py  (2.16KB)
        __init__.py  (94B)
        backend/
            _spacy.py  (3.29KB)
            _base.py  (2.33KB)
            _tfidf.py  (0B)
            _flair.py  (2.91KB)
            _word_doc.py  (1.58KB)
            __init__.py  (175B)
            _utils.py  (3.82KB)
            _use.py  (1.69KB)
            _gensim.py  (2.34KB)
            _sentencetransformers.py  (2.25KB)
        _utils.py  (3.03KB)
    docs/
        faq.md  (6.00KB)
        api/
            ctfidf.md  (46B)
            bertopic.md  (46B)
            mmr.md  (54B)
        img/
            ctfidf.png  (14.86KB)
            icon1.png  (11.46KB)
            probabilities.html  (10.08KB)
            probabilities.png  (104.48KB)
            icon.png  (5.99KB)
            tuna.png  (63.59KB)
        changelog.md  (8.24KB)
        index.md  (4.41KB)
        tutorial/
            topicreduction/
                topicreduction.md  (3.29KB)
            topicsovertime/
                trump.html  (22.54KB)
                topicsovertime.md  (6.07KB)
            visualization/
                trump.html  (22.54KB)
                viz.html  (3.50MB)
                topics_per_class.html  (28.20KB)
                probabilities.html  (10.25KB)
                probabilities.png  (194.20KB)
                visualization.md  (5.00KB)
            search/
                search.md  (1.58KB)
            topicsperclass/
                topicsperclass.md  (2.46KB)
                topics_per_class.html  (27.61KB)
            supervised/
                supervised.md  (3.39KB)
            models/
                models.md  (1.33KB)
            embeddings/
                embeddings.md  (8.44KB)
            topicrepresentation/
                topicrepresentation.md  (2.64KB)
            algorithm/
                algorithm.png  (178.24KB)
                algorithm.md  (4.15KB)
            quickstart/
                viz.html  (3.50MB)
                quickstart.md  (2.87KB)
        style.css  (0B)
    LICENSE  (1.05KB)
    .gitattributes  (31B)
    mkdocs.yml  (1.66KB)
    images/
        logo.png  (17.51KB)
        icon_white.png  (5.99KB)
        ctfidf.png  (14.86KB)
        topic_visualization.gif  (308.94KB)
        probabilities.png  (194.20KB)
        clusters.png  (862.01KB)
        icon.png  (11.46KB)
        dtm.gif  (1.98MB)
    notebooks/
        BERTopic.ipynb  (148.38KB)
    setup.py  (2.51KB)
    Makefile  (370B)
    theme/
        logo.png  (8.96KB)
        style.css  (0B)
