Hadoop-Based Search Engine: Offline Processing Program

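Uploader: long1657 | Upload time: 2020-01-03 11:25:41 | File size: 30.11MB | File type: zip
This project is the offline processing program of a Hadoop-based search engine. It consists of three parts: 1. filtering web page information; 2. generating the inverted index file; 3. generating the secondary index file.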
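The three stages described above can be illustrated with a minimal, single-machine sketch in plain Java. This is not the project's code: the class name `OfflineIndexSketch`, its method names, the whitespace tokenizer (a stand-in for a real word segmenter), the "skip empty pages" filter, and the `term<TAB>ids` line format are all assumptions made for illustration, and the real program presumably runs these steps as MapReduce jobs. The sketch treats the secondary index as a map from each term to the byte offset of its line in the inverted index file, so a lookup can jump straight to the right entry.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;
import java.util.TreeSet;

public class OfflineIndexSketch {

    /** Stages 1 and 2: skip empty pages, then map each term to the sorted set of page ids containing it. */
    static TreeMap<String, TreeSet<String>> buildInvertedIndex(Map<String, String> pages) {
        TreeMap<String, TreeSet<String>> index = new TreeMap<>();
        for (Map.Entry<String, String> page : pages.entrySet()) {
            String text = page.getValue() == null ? "" : page.getValue().trim();
            if (text.isEmpty()) {
                continue; // hypothetical filter rule: drop pages with no content
            }
            // Whitespace splitting is an assumption; Chinese text needs a real segmenter.
            for (String term : text.toLowerCase(Locale.ROOT).split("\\s+")) {
                index.computeIfAbsent(term, k -> new TreeSet<>()).add(page.getKey());
            }
        }
        return index;
    }

    /** Stage 3: write one "term<TAB>ids" line per term and record the byte offset where each line starts. */
    static byte[] writeIndex(TreeMap<String, TreeSet<String>> index, Map<String, Long> offsets) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (Map.Entry<String, TreeSet<String>> e : index.entrySet()) {
            offsets.put(e.getKey(), (long) out.size());
            byte[] line = (e.getKey() + "\t" + String.join(",", e.getValue()) + "\n")
                    .getBytes(StandardCharsets.UTF_8);
            out.write(line, 0, line.length);
        }
        return out.toByteArray();
    }

    /** Uses the secondary index to jump to a term's line; returns its page-id list, or null if absent. */
    static String lookup(byte[] indexFile, Map<String, Long> offsets, String term) {
        Long start = offsets.get(term);
        if (start == null) {
            return null;
        }
        int from = start.intValue();
        int end = from;
        while (end < indexFile.length && indexFile[end] != '\n') {
            end++;
        }
        String line = new String(indexFile, from, end - from, StandardCharsets.UTF_8);
        return line.substring(line.indexOf('\t') + 1);
    }

    public static void main(String[] args) {
        Map<String, String> pages = new LinkedHashMap<>();
        pages.put("p1", "Hadoop search engine");
        pages.put("p2", "hadoop index");
        pages.put("p3", "   "); // filtered out as empty

        Map<String, Long> offsets = new LinkedHashMap<>();
        byte[] indexFile = writeIndex(buildInvertedIndex(pages), offsets);
        System.out.println(lookup(indexFile, offsets, "hadoop"));
    }
}
```

Keeping the secondary index separate from the posting lists means a query only has to hold the small term-to-offset map in memory and can seek into the larger inverted index file on demand.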

Resource details

(55 sub-files, 30.11MB) Hadoop-Based Search Engine: Offline Processing Program

BBS/
  .project (379B)
  src/
    mydict.dic (8B)
    ext_stopword.dic (0B)
    com/
      sl/
        bbs/
          PositionInfo.java (1000B)
          mapred/
            TokenIndex.java (2.92KB)
            GetFileSeek.java (1.89KB)
            io/
              RecordWritable.java (3.05KB)
              SplitFilePartitioner.java (551B)
            RemoveRepeatMR.java (2.82KB)
            InvertedIndexer.java (5.97KB)
          BBS.java (1.00KB)
          RankPosition.java (883B)
          main/
            Main.java (1.00KB)
          hdfs/
            Util.java (6.63KB)
    IKAnalyzer.cfg.xml (421B)
    logback.xml (1.02KB)
  classes/
  lib/
    hbase-examples-0.98.0-hadoop2.jar (101.79KB)
    hadoop-common-2.2.0.jar (2.55MB)
    commons-configuration-1.6.jar (291.83KB)
    gson-2.2.4.jar (185.96KB)
    slf4j-api-1.6.4.jar (25.35KB)
    IKAnalyzer2012FF_u1.jar (1.11MB)
    slf4j-log4j12-1.6.4.jar (9.52KB)
    lucene-analyzers-common-4.3.0.jar (1.49MB)
    zookeeper-3.4.5.jar (1.25MB)
    quartz-2.2.1.jar (644.85KB)
    lucene-queryparser-4.3.0.jar (376.56KB)
    commons-logging-1.1.1.jar (59.26KB)
    log4j-1.2.17.jar (478.40KB)
    hadoop-mapreduce-client-core-2.2.0.jar (1.39MB)
    quartz-jobs-2.2.1.jar (33.18KB)
    hadoop-hdfs-2.2.0.jar (5.00MB)
    hbase-server-0.98.0-hadoop2.jar (3.17MB)
    guava-12.0.1.jar (1.71MB)
    hbase-hadoop2-compat-0.98.0-hadoop2.jar (71.32KB)
    findbugs-annotations-1.3.9-1.jar (14.96KB)
    hbase-client-0.98.0-hadoop2.jar (872.77KB)
    hbase-protocol-0.98.0-hadoop2.jar (3.14MB)
    lucene-core-4.3.0.jar (2.11MB)
    hbase-hadoop-compat-0.98.0-hadoop2.jar (31.54KB)
    mysql-connector-java-5.0.8.jar (528.18KB)
    hbase-common-0.98.0-hadoop2.jar (421.59KB)
    htrace-core-2.04.jar (30.79KB)
    commons-lang-2.6.jar (277.56KB)
    hadoop-annotations-2.2.0.jar (16.38KB)
  .settings/
    org.eclipse.jdt.core.prefs (598B)
    org.eclipse.core.resources.prefs (57B)
  dist/
    bbac.jar (5.28MB)
  build.xml (4.25KB)
  .classpath (2.24KB)
  bin/
    mydict.dic (8B)
    ext_stopword.dic (0B)
    IKAnalyzer.cfg.xml (421B)
    logback.xml (1.02KB)
  logback.xml (1.02KB)

Comments

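  • qq_43690962 :
    The main problem is that no data is included, so the program cannot be run.
    2020-01-05
  • scansoft :
    Very nice. Thanks to the original poster for the hard work.
    2018-01-09
  • m0_37937872 :
    Good, very detailed.
    2017-12-31
  • 小嘎子闯天涯 :
    Really good! Worth studying.
    2017-05-17
  • sinat_30814613 :
    Hadoop has been updated since then. Could the poster upload the Hadoop version this project depends on?
    2016-08-09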
