word_list_tools:Python 和 Pandas 工具可对不同类型的单词列表进行各种分析

上传者: 42163404 | 上传时间: 2023-03-16 23:15:45 | 文件大小: 15.15MB | 文件类型: ZIP
word_list_tools Python 和 Pandas 工具可对不同类型的单词列表进行各种分析 注意:这个仓库在 2014 年 9 月 13 日彻底重组,我试图通过并确保所有路径都是有效的,但有可能被我忽略了。 使用的词库: COHA,来自杨百翰大学的美国历史英语语料库。 1-grams 需要许可证才能使用,所以这里不包括它们; .gitignore 有一个规则可以忽略 coha_1*.*。 此处包含元数据/摘要数据。 布朗语料库,python 的 NLTK 的一部分 Europarl: A Parallel Corpus for Statistical Machine Translation, Philipp Koehn, MT Summit 2005 ( ) [文件不包括,因为它们非常庞大] 使用的简单单词列表: 填字游戏单词的 Moby 列表(113,809 个

文件下载

资源详情

[{"title":"( 158 个子文件 15.15MB ) word_list_tools:Python 和 Pandas 工具可对不同类型的单词列表进行各种分析","children":[{"title":"accented_characters.csv <span style='color:#111;'> 1.70KB </span>","children":null,"spread":false},{"title":".gitattributes <span style='color:#111;'> 483B </span>","children":null,"spread":false},{"title":".gitignore <span style='color:#111;'> 2.94KB </span>","children":null,"spread":false},{"title":"letter_distributions_comparison.ipynb <span style='color:#111;'> 2.07MB </span>","children":null,"spread":false},{"title":"trendiness.ipynb <span style='color:#111;'> 591.58KB </span>","children":null,"spread":false},{"title":"letter_distributions.ipynb <span style='color:#111;'> 283.19KB </span>","children":null,"spread":false},{"title":"gadsby_letter_frequency_analysis.ipynb <span style='color:#111;'> 206.05KB </span>","children":null,"spread":false},{"title":"gadsby_word_frequency_analysis.ipynb <span style='color:#111;'> 126.52KB </span>","children":null,"spread":false},{"title":"single_letter_frequency.ipynb <span style='color:#111;'> 27.58KB </span>","children":null,"spread":false},{"title":"word list shenanigans.ipynb <span style='color:#111;'> 27.37KB </span>","children":null,"spread":false},{"title":"initial_data_munge.ipynb <span style='color:#111;'> 23.84KB </span>","children":null,"spread":false},{"title":"letter_proximity.ipynb <span style='color:#111;'> 22.68KB </span>","children":null,"spread":false},{"title":"top_decades.ipynb <span style='color:#111;'> 18.40KB </span>","children":null,"spread":false},{"title":"pattern_search.ipynb <span style='color:#111;'> 13.17KB </span>","children":null,"spread":false},{"title":"search_word_list.ipynb <span style='color:#111;'> 7.40KB </span>","children":null,"spread":false},{"title":"google_ngram_to_hdf5.ipynb <span style='color:#111;'> 2.98KB </span>","children":null,"spread":false},{"title":"coha_and_xword.json <span style='color:#111;'> 902.81KB </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 1.81KB </span>","children":null,"spread":false},{"title":"readme.md <span style='color:#111;'> 1.23KB </span>","children":null,"spread":false},{"title":"coha_words.pickle <span style='color:#111;'> 15.05MB </span>","children":null,"spread":false},{"title":"europarl_Italian.pickle <span style='color:#111;'> 5.03MB </span>","children":null,"spread":false},{"title":"europarl_Spanish.pickle <span style='color:#111;'> 4.92MB </span>","children":null,"spread":false},{"title":"europarl_Portuguese.pickle <span style='color:#111;'> 4.45MB </span>","children":null,"spread":false},{"title":"brown_df.pickle <span style='color:#111;'> 4.44MB </span>","children":null,"spread":false},{"title":"europarl_Polish.pickle <span style='color:#111;'> 4.21MB </span>","children":null,"spread":false},{"title":"europarl_French.pickle <span style='color:#111;'> 3.52MB </span>","children":null,"spread":false},{"title":"europarl_English.pickle <span style='color:#111;'> 2.71MB </span>","children":null,"spread":false},{"title":"moby_crossword.pickle <span style='color:#111;'> 2.17MB </span>","children":null,"spread":false},{"title":"brown_words.pickle <span style='color:#111;'> 1.49MB </span>","children":null,"spread":false},{"title":"coha_words_bigrams.pickle <span style='color:#111;'> 22.21KB </span>","children":null,"spread":false},{"title":"brown_words_bigrams.pickle <span style='color:#111;'> 19.46KB </span>","children":null,"spread":false},{"title":"letters_brown_words_51.pickle <span style='color:#111;'> 11.32KB </span>","children":null,"spread":false},{"title":"Finnish_letters10_stats.pickle <span style='color:#111;'> 5.32KB </span>","children":null,"spread":false},{"title":"Finnish_letters_stats.pickle <span style='color:#111;'> 5.32KB </span>","children":null,"spread":false},{"title":"German_letters_stats.pickle <span style='color:#111;'> 5.31KB </span>","children":null,"spread":false},{"title":"German_letters10_stats.pickle <span style='color:#111;'> 5.31KB </span>","children":null,"spread":false},{"title":"Polish_letters10_stats.pickle <span style='color:#111;'> 5.30KB </span>","children":null,"spread":false},{"title":"Polish_letters_stats.pickle <span style='color:#111;'> 5.30KB </span>","children":null,"spread":false},{"title":"English_letters10_stats.pickle <span style='color:#111;'> 5.28KB </span>","children":null,"spread":false},{"title":"English_letters_stats.pickle <span style='color:#111;'> 5.28KB </span>","children":null,"spread":false},{"title":"Portuguese_letters10_stats.pickle <span style='color:#111;'> 5.27KB </span>","children":null,"spread":false},{"title":"Portuguese_letters_stats.pickle <span style='color:#111;'> 5.27KB </span>","children":null,"spread":false},{"title":"French_letters10_stats.pickle <span style='color:#111;'> 5.26KB </span>","children":null,"spread":false},{"title":"French_letters_stats.pickle <span style='color:#111;'> 5.26KB </span>","children":null,"spread":false},{"title":"German_xletters_stats.pickle <span style='color:#111;'> 5.26KB </span>","children":null,"spread":false},{"title":"Italian_letters10_stats.pickle <span style='color:#111;'> 5.26KB </span>","children":null,"spread":false},{"title":"Italian_letters_stats.pickle <span style='color:#111;'> 5.26KB </span>","children":null,"spread":false},{"title":"Spanish_letters_stats.pickle <span style='color:#111;'> 5.23KB </span>","children":null,"spread":false},{"title":"Spanish_letters10_stats.pickle <span style='color:#111;'> 5.23KB </span>","children":null,"spread":false},{"title":"letters_brown_words_21.pickle <span style='color:#111;'> 4.99KB </span>","children":null,"spread":false},{"title":"letters_coha_words_21.pickle <span style='color:#111;'> 4.99KB </span>","children":null,"spread":false},{"title":"English_letters_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"English_letters10_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Polish_letters10_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Finnish_letters_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Finnish_letters10_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"French_letters10_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"German_xletters_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Finnish_letters.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Finnish_letters10_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Portuguese_letters10_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Portuguese_letters10_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Polish_letters.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Spanish_lettersless20.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"English_letters.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"French_letters_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Polish_letters_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Finnish_letters_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Portuguese_letters_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Italian_lettersless20.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"English_lettersless20.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"letters_coha_words_15.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Spanish_letters_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Polish_lettersless20.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Finnish_letters10_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Spanish_letters10_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"English_letters_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Italian_letters_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"French_letters_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"French_letters.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Spanish_letters.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Portuguese_letters_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"letters_brown_words_15.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"French_letters10_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"English_letters10_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Polish_letters_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Polish_letters10_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"German_letters_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"French_lettersless20.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Portuguese_letters10_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Italian_letters10_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"German_xletters_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Italian_letters10_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Spanish_letters10_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Finnish_letters_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Finnish_lettersless20.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"German_xletters_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Portuguese_letters_compromise.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Spanish_letters10_equal_area.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"Italian_letters10_norm.pickle <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"......","children":null,"spread":false},{"title":"<span style='color:steelblue;'>文件过多,未全部展示</span>","children":null,"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明