搜索【LOB语料库】的结果

Brown语料库和LOB语料库

Brown语料库是世界上第一个计算机可读的语料库，它搜集的语料来自1961年美国英语出版物上的文本，共500篇，每篇大约2000个单词，合计100万单词。LOB语料库是模仿Brown语料库的比例建立起来的英国英语语料库，其预料搜集自1961年英国英语出版物上的文本，共500篇，每篇大约2000个单词，合计100万单词。Brown语料库带词性标记，LOB语料库不带词性标记。

2021-04-21 20:18:02 5.46MB Brown LOB 语料库

英语语料库LOB语料库

LOB语料库创建时间: 1970年代初创建单位:英国Lancaster大学和挪威Oslo大学以及Bergen大学规模层级: 100万词次基本情况:研究当代英国英语,与美国英语对比,使用了TAGIT系统,以统计方式建立换算几率矩阵,提高标注正确率。 The Lancaster-Oslo Bergen Corpus (LOB) was compiled by researchers in Lancaster, Oslo and Bergen. It consists of one million words of British En glish texts from 1961. The texts for the corpus were sampled from 15 different text categories. Each text is just over 2.000 words long (longer texts have b een cut at the first sentence boundary after 2.000 words) and the number of texts in each category varies (see table below). Further information about the t exts can be found in the LOB manual (external link). This corpus is the British counterpart of the Brown Corpus of American English. which contains texts printed in the same year so that comparison bet ween both varieties could be made

2019-12-21 19:33:16 94.94MB LOB语料库 英语语料库

个人信息

热门下载

最新下载

其他资源