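icwb2-data: This directory contains the training, test, and gold-standard data used in the Second International Chinese Word Segmentation Bakeoff. It also includes the script used to score the results submitted by the bakeoff participants and the simple segmenter used to generate the baseline and topline data (source code)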

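Uploader: 42138716 | Upload time: 2021-09-12 22:55:13 | File size: 50.24MB | File type: ZIP
icwb2-data Source: SIGHAN is the short name of the Chinese language processing group of the Association for Computational Linguistics (ACL). Its full English name is "Special Interest Group for Chinese Language Processing of the Association for Computational Linguistics", and it can also be read as "SIG汉" or "SIG漢". Bakeoff is the international Chinese language processing competition organized by SIGHAN. The first was held in Sapporo, Japan in 2003 (Bakeoff 2003) and the second on Jeju Island, Korea in 2005 (Bakeoff 2005). The third, held in Sydney in 2006 (Bakeoff 2006), added a Chinese named entity recognition evaluation to the tasks of the first two. The SIGHAN Bakeoff has been held six times so far. The data and results of Bakeoff 2005 are completely free and publicly available on its home page, but please take care in how you use them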

Resource details

( 37 sub-files, 50.24MB ) icwb2-data

icwb2-data-master/
    training/
        msr_training.txt            12.25MB
        as_training.utf8            38.86MB
        cityu_training.utf8         8.15MB
        msr_training.utf8           16.11MB
        pku_training.txt            5.63MB
        cityu_training.txt          5.94MB
        as_training.b5              26.36MB
        pku_training.utf8           7.37MB
    testing/
        cityu_test.txt              132.85KB
        as_test.utf8                603.51KB
        pku_test.txt                335.08KB
        msr_test.utf8               547.10KB
        as_test.txt                 412.37KB
        msr_test.txt                367.46KB
        pku_test.utf8               497.64KB
        cityu_test.utf8             196.63KB
    doc/
        result_instructions.txt     3.51KB
        instructions.txt            7.12KB
    README.md                       7.74KB
    scripts/
        mwseg.pl                    3.46KB
        score                       7.06KB
    gold/
        cityu_training_words.utf8   571.49KB
        cityu_test_gold.utf8        235.12KB
        msr_training_words.txt      723.16KB
        msr_test_gold.txt           569.18KB
        as_testing_gold.utf8        920.48KB
        as_training_words.utf8      1.33MB
        pku_training_words.utf8     478.73KB
        cityu_test_gold.txt         171.34KB
        pku_test_gold.utf8          701.50KB
        cityu_training_words.txt    411.87KB
        pku_test_gold.txt           538.93KB
        as_testing_gold.txt         623.69KB
        msr_training_words.utf8     1.02MB
        as_training_words.txt       951.07KB
        pku_training_words.txt      338.97KB
        msr_test_gold.utf8          748.81KB
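The archive ships a simple segmenter (scripts/mwseg.pl) and a scoring script (scripts/score), both of which are not Python. As a rough illustration of the idea only, the sketch below shows a greedy forward maximum-matching segmenter and a word-level precision/recall/F1 computation in Python. It is not a port of either script. The function names (max_match, spans, prf), the toy vocabulary, and the sample sentence are all hypothetical, and the sketch assumes that a segmented line has its words separated by whitespace.

```python
def max_match(text, vocab, max_len=None):
    """Greedy forward maximum matching (hypothetical helper, not mwseg.pl).

    At each position take the longest string found in vocab;
    fall back to a single character when nothing matches.
    """
    if max_len is None:
        max_len = max((len(w) for w in vocab), default=1)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                words.append(piece)
                i += size
                break
    return words


def spans(words):
    """Turn a word list into a set of (start, end) character offsets."""
    out, start = set(), 0
    for w in words:
        out.add((start, start + len(w)))
        start += len(w)
    return out


def prf(gold_words, pred_words):
    """Word-level precision, recall and F1 over matching character spans."""
    gold, pred = spans(gold_words), spans(pred_words)
    correct = len(gold & pred)
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if correct else 0.0
    return precision, recall, f1


# Toy data (assumption): one gold line with whitespace-separated words.
gold_words = "研究 生命 的 起源".split()
vocab = {"研究", "研究生", "生命", "命", "的", "起源"}

pred_words = max_match("".join(gold_words), vocab)
print(pred_words)                    # ['研究生', '命', '的', '起源']
print(prf(gold_words, pred_words))   # (0.5, 0.5, 0.5)
```

The toy sentence shows the typical failure of greedy matching: the longer entry "研究生" is preferred over "研究", so only two of the four predicted words line up with the gold segmentation.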
