LCSTS高质量中文短文本摘要数据集

nlp

文档中包含网盘的地址，数据共319M NLP方向文本摘要，文本分类，等方向可采纳！ The LCSTS dataset includes two parts: /DATA: 1. PART I: is the main contents of LCSTS that contains 2,400,591 (short text, summary) pairs. It can be used to train supervised learning models for summary generation. 2. PART II: contains 10,666 human labled (short text, summary) pairs which can be used to train classifier to filter the noises of the PART I. 3. PART III: contains 1,106 (short text, summary) pairs, this part is labled by 3 persons with the same labels. These pairs with score 3,4 and 5 can be used as test set for evaluating summary generation systems. /Result: 1.sumary.generated.char.context.txt: contains the summary generated by using RNN+context on the character based input. 2.sumary.generated.char.nocontext.txt: contains the summary generated by using RNN+nocontext on the character based input. 3.sumary.generated.word.context.txt: contains the summary generated by using RNN+context on the word based input. 4.sumary.generated.word.nocontext.txt: contains the summary generated by using RNN+nocontext on the word based input. 5.weibo.txt: contains the weibo of the test set. 6.sumary.human: contains the sumaries corresponding to 'weibo.txt' written by human. This part is the test set of the paper. 7. rouge.char_context.txt: the rouge metric on sumary.generated.char.context 8. rouge.char_nocontext.txt:the rouge metric on sumary.generated.char.nocontext 9. rouge.word_context.txt: the rouge metric on sumary.generated.word.context 10. rouge.word_nocontext.txt:the rouge metric on sumary.generated.word.nocontext

文件下载

评论信息

roadog2006 :

不错，谢谢
2020-03-13
roadog2006 :

不错，谢谢
2020-03-13
beijing2008chinese :

亲测真实有效
2019-12-25
beijing2008chinese :

亲测真实有效
2019-12-25
u010526186 :

所下载的文件中只包含一个百度网盘的下载链接和提取码。我本来就是为了不从百度网盘下载才到csdn打资源的，怪我自己没有仔细看。
2019-10-08
cppowboy :

所下载的文件中只包含一个百度网盘的下载链接和提取码。我本来就是为了不从百度网盘下载才到csdn打资源的，怪我自己没有仔细看。
2019-10-08

其他资源

免责申明

【只为小站】的资源来自网友分享，仅供学习研究，请务必在下载后24小时内给予删除，不得用于其他任何用途，否则后果自负。基于互联网的特殊性，【只为小站】无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查；无论【只为小站】经营者是否已进行审查，用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场，基于网友分享，根据中国法律《信息网络传播权保护条例》第二十二条之规定，若资源存在侵权或相关问题请联系本站客服人员，zhiweidada#qq.com，请把#换成@，本站将给予最大的支持与配合，做到及时反馈和处理。关于更多版权及免责申明参见版权及免责申明

LCSTS高质量中文短文本摘要数据集

文件下载

评论信息

其他资源

免责申明

个人信息

相关资源标签

热门下载

最新下载