搜索【hadoop spark】的结果

spark-2.1.1-bin-hadoop2.7.tgz.7z

基于hadoop2.7.2，scala2.11的sparklinux软件包，解压到指定目录后即可使用，实测可行

2024-04-13 17:58:26 191.82MB spark

1

hadoop-eclipse-plugin-1.2.1.jar

hadoop-eclipse开发插件，此jar所支持的是hadoop-1.2.1，请下载该插件后放置于Eclipse\plugins目录下，然后重启eclipse即可。

2024-04-11 16:23:11 5.58MB eclipse插件 hadoop

1

hadoop 所用的jar包

该jar包是属于大数据hadoop使用的些jar包，可以在编写代码的时候导入工程中该jar包是属于大数据hadoop使用的些jar包，可以在编写代码的时候导入工程中

2024-04-08 13:27:09 26.84MB hadoop

1

基于hadoop开发分布式爬虫，后端django，前端vue.zip

人工智能-hadoop

2024-04-07 23:16:28 15.11MB 人工智能 hadoop 分布式文件系统

1

Hadoop之外卖订单数据分析系统

基于Hadoop大数据平台对某网站的外卖订单数据进行分析，分析结果进行可视化展示

2024-04-03 15:36:30 10.14MB hadoop 可视化

1

Hadoop.in.Practice.2nd.Edition

Title: Hadoop in Practice, 2nd Edition Author: Alex Holmes Length: 512 pages Edition: 2 Language: English Publisher: Manning Publications Publication Date: 2014-10-12 ISBN-10: 1617292222 ISBN-13: 9781617292224 Summary Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the Book It's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date coverage of Hadoop available. Readers need to know a programming language like Java and have basic familiarity with Hadoop. What's Inside Thoroughly updated for Hadoop 2 How to write YARN applications Integrate real-time technologies like Storm, Impala, and Spark Predictive analytics using Mahout and RR Readers need to know a programming language like Java and have basic familiarity with Hadoop. About the Author Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects. Table of Contents Part 1: Background and fundamentals Chapter 1: Hadoop in a heartbeat Chapter 2: Introduction to YARN Part 2: Data logistics Chapter 3: Data serialization— working with text and beyond Chapter 4: Organizing and optimizing data in HDFS Chapter 5: Moving data into and out of Hadoop Part 3: Big data patterns Chapter 6: Applying MapReduce patterns to big data Chapter 7: Utilizing data structures and algorithms at scale Chapter 8: Tuning, debugging, and testing Part 4: Beyond MapReduce Chapter 9: SQL on Hadoop Chapter 10: Writing a YARN application Appendix: Installing Hadoop and friends

2024-04-03 06:29:08 9.46MB Hadoop

1

Impala-JDBC连接jar包（ImpalaJDBC41-2.6.3）

jar包引入命令：mvn install:install-file -DgroupId=com.cloudera -DartifactId=ImpalaJDBC41 -Dversion=2.6.3 -Dpackaging=jar -Dfile=./ImpalaJDBC41-2.6.3.jar DgroupId: pom.xml配置中groupId的值 DartifactId: pom.xml配置中artifactId的值 Dversion: 版本号 Dpackaging: 文件类型 Dfile: 文件路径

2024-03-30 18:02:49 13.04MB impala jdbc spark

1

计算机毕业设计Python+Spark游戏推荐系统.zip

计算机毕业设计Python+Spark游戏推荐系统游戏可视化游戏爬虫游戏用户画像系统游戏大屏可视化游戏数据分析游戏情感分析神经网络混合CF推荐算法大数据毕业设计大数据毕设

2024-03-26 21:53:58 20.9MB

1

PDM :基于Hadoop的并行数据分析系统 (2012年)

提出了一款基于Hadoop的并行数据分析系统―――PDM.该系统拥有大量以MapReduce为计算框架的并行数据分析算法,不仅包括传统的ETL、数据挖掘、数据统计和文本分析算法,还引入了基于图理论的SNA(社会网络分析)算法.详细阐述了并行多元线性回归算法和“多源最短路径”算法的原理和实现,其中,提出的“消息传递模型”能有效解决MapReduce难以处理邻接矩阵的问题;介绍了基于电信数据的典型应用,如采用并行k均值和决策树算法实现的“套餐推荐”,利用并行PageRank算法实现的“营销关键点发现”等;最后

2024-03-25 13:56:36 894KB 自然科学 论文

1

Spark的共享单车数据存储管理系统-基于Web的Spark的共享单车数据存储系统设计与实现-Spark的共享单车数据存储管理系

Spark的共享单车数据存储-Spark的共享单车数据存储系统-Spark的共享单车数据存储系统源码-Spark的共享单车数据存储管理系统-Spark的共享单车数据存储管理系统java代码-Spark的共享单车数据存储系统设计与实现-基于springboot的Spark的共享单车数据存储系统-基于Web的Spark的共享单车数据存储系统设计与实现-Spark的共享单车数据存储网站-Spark的共享单车数据存储网站代码-Spark的共享单车数据存储平台-Spark的共享单车数据存储平台代码-Spark的共享单车数据存储项目-Spark的共享单车数据存储项目代码-Spark的共享单车数据存储代码 1、技术栈：java,springboot,vue，ajax，maven，mysql，MyBatisPlus等开发语言：Java 框架：SpringBoot JDK版本：JDK1.8 数据库：mysql 5.7 数据库工具：SQLyog/Navicat 开发软件：eclipse/myeclipse/idea Maven包：Maven 浏览器：谷歌浏览器 2、系统的实现用户信息图片素材视频

2024-02-26 14:44:11 11MB spark 代码 springboot Java

1

个人信息

热门下载

最新下载

其他资源