模糊匹配器:用于确定对象相似概率的Java库-源码

上传者: 42172204 | 上传时间: 2021-07-01 16:38:24 | 文件大小: 509KB | 文件类型: ZIP
模糊匹配器 介绍 一个基于Java的库,用于对文档集中的“相似”元素进行匹配和分组。 想象一下,在一个具有联系人集合的系统中工作,并且想要对具有相似名称,地址或其他属性的联系人进行匹配和分类。 模糊匹配匹配算法可以帮助您做到这一点。 模糊匹配算法甚至可以帮助您查找重复的联系人,或防止系统添加重复的联系人。 该库可以对任何域对象(如联系人)起作用,并可以找到各种用例的相似性。 它深入每个角色并找出两个或更多对象相似的可能性。 什么是模糊的 联系人"Steven Wilson" living at "45th Avenue 5th st." 和"Stephen Wilkson" living at "45th Ave 5th Street"可能看起来像是属于同一个人。 对于人类来说,很容易忽略名称拼写的微小差异,或者忽略地址中使用的缩写。 但是对于计算机程序,它们并不相同。 字符串Stev

文件下载

资源详情

[{"title":"( 43 个子文件 509KB ) 模糊匹配器:用于确定对象相似概率的Java库-源码","children":[{"title":"fuzzy-matcher-master","children":[{"title":".travis.yml <span style='color:#111;'> 168B </span>","children":null,"spread":false},{"title":"pom.xml <span style='color:#111;'> 9.68KB </span>","children":null,"spread":false},{"title":"LICENSE <span style='color:#111;'> 11.07KB </span>","children":null,"spread":false},{"title":"CONTRIBUTING.md <span style='color:#111;'> 1.98KB </span>","children":null,"spread":false},{"title":"src","children":[{"title":"test","children":[{"title":"resources","children":[{"title":"demo.csv <span style='color:#111;'> 86B </span>","children":null,"spread":false},{"title":"test-data.csv <span style='color:#111;'> 2.45KB </span>","children":null,"spread":false},{"title":"Sample-Big-Data.csv <span style='color:#111;'> 644.73KB </span>","children":null,"spread":false}],"spread":true},{"title":"java","children":[{"title":"com","children":[{"title":"intuit","children":[{"title":"fuzzymatcher","children":[{"title":"domain","children":[{"title":"DocumentTest.java <span style='color:#111;'> 6.20KB </span>","children":null,"spread":false},{"title":"ElementTest.java <span style='color:#111;'> 1.95KB </span>","children":null,"spread":false}],"spread":true},{"title":"component","children":[{"title":"ElementMatchTest.java <span style='color:#111;'> 4.57KB </span>","children":null,"spread":false},{"title":"MatchServicePerfTest.java <span style='color:#111;'> 5.70KB </span>","children":null,"spread":false},{"title":"MatchServiceTest.java <span style='color:#111;'> 43.03KB </span>","children":null,"spread":false},{"title":"TokenRepoTest.java <span style='color:#111;'> 8.77KB </span>","children":null,"spread":false}],"spread":true},{"title":"function","children":[{"title":"ScoringFunctionTest.java <span style='color:#111;'> 4.73KB </span>","children":null,"spread":false},{"title":"PreProcessFunctionTest.java <span style='color:#111;'> 3.44KB </span>","children":null,"spread":false},{"title":"TokenizerFunctionTest.java <span style='color:#111;'> 4.92KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}],"spread":true}],"spread":true}],"spread":true}],"spread":true},{"title":"main","children":[{"title":"resources","children":[{"title":"address-dictionary.txt <span style='color:#111;'> 4.85KB </span>","children":null,"spread":false},{"title":"name-dictionary.txt <span style='color:#111;'> 291B </span>","children":null,"spread":false}],"spread":true},{"title":"java","children":[{"title":"com","children":[{"title":"intuit","children":[{"title":"fuzzymatcher","children":[{"title":"exception","children":[{"title":"MatchException.java <span style='color:#111;'> 387B </span>","children":null,"spread":false}],"spread":true},{"title":"domain","children":[{"title":"Matchable.java <span style='color:#111;'> 457B </span>","children":null,"spread":false},{"title":"MatchType.java <span style='color:#111;'> 103B </span>","children":null,"spread":false},{"title":"Match.java <span style='color:#111;'> 2.62KB </span>","children":null,"spread":false},{"title":"ElementType.java <span style='color:#111;'> 2.04KB </span>","children":null,"spread":false},{"title":"Document.java <span style='color:#111;'> 5.60KB </span>","children":null,"spread":false},{"title":"ElementClassification.java <span style='color:#111;'> 1.29KB </span>","children":null,"spread":false},{"title":"Token.java <span style='color:#111;'> 1004B </span>","children":null,"spread":false},{"title":"Element.java <span style='color:#111;'> 8.71KB </span>","children":null,"spread":false},{"title":"Score.java <span style='color:#111;'> 706B </span>","children":null,"spread":false}],"spread":false},{"title":"component","children":[{"title":"TokenRepo.java <span style='color:#111;'> 5.44KB </span>","children":null,"spread":false},{"title":"ElementMatch.java <span style='color:#111;'> 1.94KB </span>","children":null,"spread":false},{"title":"Dictionary.java <span style='color:#111;'> 2.08KB </span>","children":null,"spread":false},{"title":"MatchService.java <span style='color:#111;'> 7.29KB </span>","children":null,"spread":false},{"title":"DocumentMatch.java <span style='color:#111;'> 2.61KB </span>","children":null,"spread":false}],"spread":false},{"title":"function","children":[{"title":"ScoringFunction.java <span style='color:#111;'> 6.35KB </span>","children":null,"spread":false},{"title":"PreProcessFunction.java <span style='color:#111;'> 4.60KB </span>","children":null,"spread":false},{"title":"TokenizerFunction.java <span style='color:#111;'> 2.35KB </span>","children":null,"spread":false}],"spread":false},{"title":"util","children":[{"title":"Utils.java <span style='color:#111;'> 2.00KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}],"spread":true}],"spread":true}],"spread":true}],"spread":true}],"spread":true},{"title":"fuzzy-match.png <span style='color:#111;'> 167.61KB </span>","children":null,"spread":false},{"title":"fuzzy-logo.svg <span style='color:#111;'> 5.83KB </span>","children":null,"spread":false},{"title":".gitignore <span style='color:#111;'> 373B </span>","children":null,"spread":false},{"title":"CHANGELOG.md <span style='color:#111;'> 5.40KB </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 14.32KB </span>","children":null,"spread":false},{"title":"perf.png <span style='color:#111;'> 31.27KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明