TextClassification: A Simple Text Classification Practice in Python

Uploader: 42113794 | Uploaded: 2023-03-12 19:06:52 | File size: 1.16MB | File type: ZIP
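Text classification: a simple practice of text classification in Python.

Files and contents:

Rocchio.py: Text classification using the Rocchio algorithm. Each document is represented in a vector space. In the training phase, the centroid of each class of documents is computed. In the testing phase, the distance from the test document to each centroid is computed, and the document is assigned to the class of the nearest centroid.

NaiveBayes.py: Text classification using the Naive Bayes algorithm. Each document is represented in a vector space. In the training phase, the class priors and the class-conditional probabilities of each dictionary term are learned. In the testing phase, the test document is assigned to the class with the maximum posterior probability given that document.

sklearn-text classification.ipynb: An IPython notebook showing a complete yet simple text classification pipeline built with the scikit-learn machine learning library. The pipeline starts with text cleaning and tokenization, then projects each document into a vector space. Tf-idf weighting is used to normalize the vectors. Several classifiers are then tested with their default parameters. Finally, 10-fold cross-validation over a brute-force parameter grid search is used to find, for some of the classifiers, the best…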
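The Rocchio description above (one centroid per class, then nearest-centroid assignment) can be sketched as follows. This is a minimal illustration and not the code in Rocchio.py: the whitespace tokenizer, the L2-normalised term counts, the Euclidean distance, the function names, and the toy documents are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict


def vectorize(text):
    """Bag-of-words term counts, L2-normalised (an assumed weighting scheme)."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {term: c / norm for term, c in counts.items()} if norm else {}


def train_centroids(docs, labels):
    """Training phase: average the document vectors of each class."""
    sums = defaultdict(lambda: defaultdict(float))
    sizes = Counter(labels)
    for text, label in zip(docs, labels):
        for term, weight in vectorize(text).items():
            sums[label][term] += weight
    return {
        label: {term: total / sizes[label] for term, total in terms.items()}
        for label, terms in sums.items()
    }


def euclidean(a, b):
    """Euclidean distance between two sparse vectors stored as dicts."""
    terms = set(a) | set(b)
    return math.sqrt(sum((a.get(t, 0.0) - b.get(t, 0.0)) ** 2 for t in terms))


def predict_rocchio(centroids, text):
    """Testing phase: pick the class whose centroid is nearest to the document."""
    vec = vectorize(text)
    return min(centroids, key=lambda label: euclidean(vec, centroids[label]))


# Toy corpus invented for illustration only.
train_docs = [
    "the team won the hockey game",
    "the goalie saved the puck",
    "the rocket reached orbit",
    "nasa launched a new satellite into orbit",
]
train_labels = ["sport", "sport", "space", "space"]

centroids = train_centroids(train_docs, train_labels)
print(predict_rocchio(centroids, "the satellite left orbit"))
```

Storing vectors as dictionaries keeps the sketch dependency-free; a real implementation would more likely use sparse matrices and Tf-idf weights.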

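The Naive Bayes description (learn class priors and per-term class-conditional probabilities, then pick the class with the maximum posterior) can be sketched in the same spirit. The multinomial event model, add-one (Laplace) smoothing, the handling of unseen terms, the function names, and the toy documents are assumptions for illustration; NaiveBayes.py may differ on each of these points.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(docs, labels):
    """Training phase: log class priors and smoothed log term likelihoods."""
    class_counts = Counter(labels)
    term_counts = defaultdict(Counter)
    for text, label in zip(docs, labels):
        term_counts[label].update(text.lower().split())
    vocab = {term for counts in term_counts.values() for term in counts}

    log_prior = {c: math.log(n / len(labels)) for c, n in class_counts.items()}
    log_likelihood = {}
    for c, counts in term_counts.items():
        # Add-one smoothing: every vocabulary term gets one pseudo-count.
        total = sum(counts.values()) + len(vocab)
        log_likelihood[c] = {t: math.log((counts[t] + 1) / total) for t in vocab}
    return log_prior, log_likelihood, vocab


def predict_naive_bayes(model, text):
    """Testing phase: return the class with the largest log posterior score."""
    log_prior, log_likelihood, vocab = model
    scores = {}
    for c in log_prior:
        score = log_prior[c]
        for term in text.lower().split():
            if term in vocab:  # terms never seen in training are skipped
                score += log_likelihood[c][term]
        scores[c] = score
    return max(scores, key=scores.get)


# Toy corpus invented for illustration only.
nb_docs = [
    "the team won the hockey game",
    "the goalie saved the puck",
    "the rocket reached orbit",
    "nasa launched a new satellite into orbit",
]
nb_labels = ["sport", "sport", "space", "space"]

model = train_naive_bayes(nb_docs, nb_labels)
print(predict_naive_bayes(model, "the satellite left orbit"))
```

Working in log space avoids numerical underflow when many small probabilities are multiplied, which matters once documents are longer than these toy sentences.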

Resource details

TextClassification: A Simple Text Classification Practice in Python (807 files, 1.16MB)

NaiveBayes.py (9.48KB)
.gitignore (774B)
sklearn-text classification.ipynb (110.23KB)
LICENSE (1.05KB)
Rocchio.py (9.61KB)

Numbered files:
52792 (2.73KB), 52827 (1011B), 52815 (1.13KB), 52751 (1.49KB), 53557 (2.60KB), 52754 (1.68KB), 53572 (1.25KB), 53576 (1.01KB), 52772 (679B), 52730 (1.60KB), 53556 (2.54KB), 52718 (900B), 52727 (1.33KB), 52733 (1.36KB), 52739 (4.67KB), 52765 (636B), 52810 (1.21KB), 52768 (728B), 53515 (1.45KB), 52806 (2.36KB), 53510 (1.49KB), 52767 (695B), 53529 (1.39KB), 52732 (1.19KB), 52740 (1005B), 52721 (804B), 52781 (1.47KB), 52756 (770B), 53534 (1.42KB), 53566 (2.14KB), 52824 (706B), 53520 (900B), 52745 (1.03KB), 53548 (1.69KB), 53560 (1.31KB), 52434 (2.30KB), 52790 (3.44KB), 52750 (766B), 52814 (1.18KB), 53582 (3.15KB), 52722 (2.02KB), 52717 (689B), 52811 (968B), 53550 (879B), 52752 (2.27KB), 53528 (1.14KB), 52829 (1.24KB), 53523 (2.24KB), 52748 (2.23KB), 53580 (1.35KB), 53527 (2.02KB), 53570 (1.71KB), 53533 (1.08KB), 52795 (2.57KB), 52755 (2.55KB), 52749 (875B), 53568 (2.60KB), 53518 (2.08KB), 52729 (2.24KB), 52789 (1.74KB), 53524 (1.62KB), 53547 (1.59KB), 52720 (1.20KB), 53563 (2.42KB), 53552 (1.97KB), 52794 (3.07KB), 53506 (1.80KB), 52809 (1.17KB), 52808 (3.04KB), 53521 (1.19KB), 52799 (1.71KB), 53525 (1.35KB), 52746 (1.61KB), 52759 (2.51KB), 53551 (794B), 52788 (573B), 53536 (1.33KB), 52822 (1.01KB), 53522 (1.00KB), 52778 (1.26KB), 53540 (1.16KB), 52719 (1.78KB), 52803 (1.46KB), 52769 (1.62KB), 53535 (817B), 53532 (2.11KB), 52800 (1.65KB), 52805 (1.98KB), 53549 (2.10KB), 52826 (780B), 52823 (1.90KB), 52801 (2.61KB), 53526 (994B), 53541 (1.63KB), 52784 (2.86KB), ......

Too many files; not all are shown.
