项目实战-朴素贝叶斯算法实现新闻分类源码及数据集.zip

上传者: asialee_bird | 上传时间: 2022-04-17 16:08:07 | 文件大小: 185KB | 文件类型: ZIP
1、内容概要:本资源主要基朴素贝叶斯算法实现新闻分类,适用于初学者学习文本分类使用。 2、新闻分类源码实现过程:将数据集划分为训练集和测试集;使用jieba模块进行分词,词频统计,停用词过滤,文本特征提取,将文本数据向量化,使用朴素贝叶斯算法进行分类。 3、主要内容:搜狗新闻数据集SogouC,标签包括财经、IT、健康、体育、旅游、教育、招聘、文化和军事;停用词文件stopwords_cn.txt;Naive_Bay.py 朴素贝叶斯算法实现源码;News_NB.py新闻分类实现源码。

文件下载

资源详情

[{"title":"( 95 个子文件 185KB ) 项目实战-朴素贝叶斯算法实现新闻分类源码及数据集.zip","children":[{"title":"项目实战-朴素贝叶斯算法实现新闻分类源码及数据集","children":[{"title":"SogouC","children":[{"title":"Sample","children":[{"title":"C000013","children":[{"title":"19.txt <span style='color:#111;'> 2.43KB </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 9.04KB </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 1.36KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 1.49KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 7.53KB </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 968B </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 2.19KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 7.47KB </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 3.81KB </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 869B </span>","children":null,"spread":false}],"spread":true},{"title":"C000022","children":[{"title":"19.txt <span style='color:#111;'> 1.22KB </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 1.76KB </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 1.76KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 2.94KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 698B </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 4.90KB </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 1.76KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 522B </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 4.64KB </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 1.87KB </span>","children":null,"spread":false}],"spread":true},{"title":"C000023","children":[{"title":"19.txt <span style='color:#111;'> 1.43KB </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 1.92KB </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 1.09KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 1.41KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 3.05KB </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 3.27KB </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 3.25KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 3.20KB </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 1.75KB </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 2.08KB </span>","children":null,"spread":false}],"spread":true},{"title":"C000010","children":[{"title":"19.txt <span style='color:#111;'> 3.04KB </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 376B </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 1.31KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 1.11KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 442B </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 9.60KB </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 1.49KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 875B </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 1.44KB </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 3.40KB </span>","children":null,"spread":false}],"spread":true},{"title":"C000020","children":[{"title":"19.txt <span style='color:#111;'> 4.12KB </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 11.83KB </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 2.05KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 3.45KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 4.44KB </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 5.38KB </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 7.17KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 6.81KB </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 6.11KB </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 3.03KB </span>","children":null,"spread":false}],"spread":true},{"title":"C000024","children":[{"title":"19.txt <span style='color:#111;'> 936B </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 12.61KB </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 15.74KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 4.07KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 1.50KB </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 4.84KB </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 2.29KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 6.55KB </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 1.64KB </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 1.18KB </span>","children":null,"spread":false}],"spread":true},{"title":"C000016","children":[{"title":"19.txt <span style='color:#111;'> 1.31KB </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 3.36KB </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 1.53KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 9.40KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 3.91KB </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 5.08KB </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 1.22KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 537B </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 5.10KB </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 3.60KB </span>","children":null,"spread":false}],"spread":true},{"title":"C000008","children":[{"title":"19.txt <span style='color:#111;'> 736B </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 7.35KB </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 2.11KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 2.70KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 6.19KB </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 1.83KB </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 7.65KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 901B </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 725B </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 754B </span>","children":null,"spread":false}],"spread":true},{"title":"C000014","children":[{"title":"19.txt <span style='color:#111;'> 3.03KB </span>","children":null,"spread":false},{"title":"18.txt <span style='color:#111;'> 1.35KB </span>","children":null,"spread":false},{"title":"14.txt <span style='color:#111;'> 3.28KB </span>","children":null,"spread":false},{"title":"12.txt <span style='color:#111;'> 7.49KB </span>","children":null,"spread":false},{"title":"10.txt <span style='color:#111;'> 2.24KB </span>","children":null,"spread":false},{"title":"13.txt <span style='color:#111;'> 3.70KB </span>","children":null,"spread":false},{"title":"17.txt <span style='color:#111;'> 2.22KB </span>","children":null,"spread":false},{"title":"11.txt <span style='color:#111;'> 3.44KB </span>","children":null,"spread":false},{"title":"16.txt <span style='color:#111;'> 1.72KB </span>","children":null,"spread":false},{"title":"15.txt <span style='color:#111;'> 3.76KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"ClassList.txt <span style='color:#111;'> 131B </span>","children":null,"spread":false}],"spread":true},{"title":"stopwords_cn.txt <span style='color:#111;'> 2.67KB </span>","children":null,"spread":false},{"title":"News_NB.py <span style='color:#111;'> 6.77KB </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 172B </span>","children":null,"spread":false},{"title":"Naive_Bay.py <span style='color:#111;'> 5.73KB </span>","children":null,"spread":false}],"spread":true}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明