Chinese Word Segmentation and Keyword Extraction (NLPIR, Java version)

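Uploader: tiancaiywt | Uploaded: 2019-12-21 21:15:10 | File size: 3.8MB | File type: rar
The NLPIR Chinese word segmentation system (also known as ICTCLAS2013). Its main features are Chinese word segmentation, part-of-speech tagging, named entity recognition, and user dictionary support. It supports GBK, UTF8, and BIG5 encodings. Newly added features are microblog segmentation, new word discovery, and keyword extraction. This is the Java version.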
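As a rough illustration of how segmentation and part-of-speech tagging results might be consumed, here is a minimal sketch that splits a tagged string into word/tag pairs. It assumes the segmenter returns space-separated `word/tag` items, a convention common to ICTCLAS-family tools; the actual output of this package may differ. The class `TaggedOutputParser`, its `Token` type, and the sample sentence are hypothetical and are not part of the NLPIR package.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of NLPIR: parses "word/tag word/tag ..." strings.
class TaggedOutputParser {

    /** One segmented word with its part-of-speech tag (empty if untagged). */
    static final class Token {
        final String word;
        final String tag;

        Token(String word, String tag) {
            this.word = word;
            this.tag = tag;
        }

        @Override
        public String toString() {
            return word + " [" + tag + "]";
        }
    }

    /** Splits on whitespace, then on the LAST '/' so words containing '/' survive. */
    static List<Token> parse(String tagged) {
        List<Token> tokens = new ArrayList<>();
        String trimmed = tagged.trim();
        if (trimmed.isEmpty()) {
            return tokens;
        }
        for (String item : trimmed.split("\\s+")) {
            int slash = item.lastIndexOf('/');
            if (slash <= 0) {
                // No tag separator (or nothing before it): keep the item as an untagged word.
                tokens.add(new Token(item, ""));
            } else {
                tokens.add(new Token(item.substring(0, slash), item.substring(slash + 1)));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Assumed sample output; the tags are illustrative only.
        for (Token t : parse("我/rr 爱/v 北京/ns")) {
            System.out.println(t);
        }
    }
}
```

Splitting on the last slash rather than the first is a deliberate choice here, so that a token whose word itself is a slash is still separated from its tag.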

Resource details

(50 files, 3.8MB) Chinese Word Segmentation and Keyword Extraction (NLPIR, Java version)
Win-32bit-JNI-lib
    TestNLPIR.java 2.67KB
    test_result2.TXT 1.39KB
    .settings
        org.eclipse.core.resources.prefs 121B
    test_result1.TXT 1.39KB
    .project 393B
    test.TXT 819B
    .classpath 226B
    NLPIR_JNI.dll 1.60MB
    kevin
        zhang
            NLPIR.class 1.14KB
            NLPIR.java 4.76KB
    TestNLPIR.class 2.35KB
    Data
        GBKC.wordlist 163.07KB
        NLPIR_First.map 288B
        UserDict.pdat 5.58KB
        nr.role 1.68MB
        UTF2GBK.map 279.49KB
        FieldDict.pdat 256.09KB
        UTF8.pdat 544.21KB
        UTF8.wordlist 186.22KB
        FieldDict.pos 72B
        BIG5.pdat 457.48KB
        GBK.pdat 536.33KB
        PKU_First.map 288B
        GBKC.pdat 537.94KB
        GBK2BIG.map 279.49KB
        NLPIR.user 3.28KB
        BIG5.wordlist 154.98KB
        GBK2UTF.map 279.49KB
        nr.ctx 2.16KB
        ICTPOS.map 406B
        charset.type 64.00KB
        NLPIR.ctx 36.38KB
        nr.fsa 2.94KB
        BIG2GBK.map 279.49KB
        CoreDict.pos 1.70MB
        GBKA.wordlist 163.07KB
        GBKC2GBK.map 279.49KB
        GranDict.pos 1.70MB
        Configure.xml 856B
        GBKA2UTF.map 279.49KB
        NewWord.lst 126B
        BiWord.big 3.36MB
        GBK2GBKC.map 279.49KB
        GranDict.pdat 1.89MB
        GBK.wordlist 163.07KB
        PKU.map 307B
        GBKA.pdat 537.94KB
        UTF2GBKA.map 279.49KB
        CoreDict.pdat 1.62MB
        CoreDict.unig 466.96KB

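Comments

  • luoluol123l :
    Why do I get the feeling it isn't actually implemented?
    2016-10-20
  • ziyouren2008 :
    A bit of a hassle, but usable.
    2016-05-21
  • 西风_漂流 :
    It basically achieves Chinese word segmentation; the actual accuracy still needs to be verified.
    2015-11-05
  • mstang13 :
    Yes, it is the same as the download from the official site. I use it for Chinese word segmentation, though the segmentation still has some flaws.
    2015-05-27
  • bnbvbchen :
    A bit of a hassle, but usable.
    2014-12-22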
