spark3读hive1,配置spark.sql.hive.metastore.jars
2021-10-18 15:07:23 82.99MB spark spark3 hive
1
emp.json员工信息
2021-10-18 15:07:15 2KB json spark
1
spark源码在hadoop-cdh5.7.0编译生成,用于学习hadoop和spark课程
2021-10-17 23:23:21 182.9MB spark hadoop cdh5.7.0
1
Today's Web-enabled deluge of electronic data calls for automated methods of data analysis. Machine learning provides these, developing methods that can automatically detect patterns in data and then use the uncovered patterns to predict future data. This textbook offers a comprehensive and self-contained introduction to the field of machine learning, a unified, probabilistic approach. The coverage combines breadth and depth, offering necessary background material on such topics as probability, optimization, and linear algebra as well as discussion of recent developments in the field, including conditional random fields, L1 regularization, and deep learning. The book is written in an informal, accessible style, complete with pseudo-code for the most important algorithms. All topics are copiously illustrated with color images and worked examples drawn from such application domains as biology, text processing, computer vision, and robotics. Rather than providing a cookbook of different heuristic methods, the book stresses a principled model-based approach, often using the language of graphical models to specify models in a concise and intuitive way. Almost all the models described have been implemented in a MATLAB software package--PMTK (probabilistic modeling toolkit)--that is freely available online. The book is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students. 优点:新,全! 由于成书时间较晚,所以涵盖了更多最近几年的hot topic,比如Dirichlet Process 。 更重要的,是全,基本上ML领域的专有名词,你都可以在书后的index找到。说道这里,不得不佩服本书的作者Kevin Murphy,剑桥的本科,UCB的博士,MIT的博后,得到过多位大牛的真传 。 还有一个非常重要的,就是这本书配备了详尽的matlab code,你几乎可以尝试书中的每一个例子。 单从以上这几点,绝对应该把他排在所有ML教材的首位!
2021-10-17 14:59:04 25.08MB spark,ml
1
在开发spark2.5.8的时候用到,希望大家也能找到
2021-10-15 11:09:49 450KB synthetica netbeans spark
1
项目实战:Java一站式解决Hive内用Spark取数,新建ES索引,灌入数据,并且采用ES别名机制,实现ES数据更新的无缝更新,底层采用Spark计算框架,数据较快。
2021-10-15 11:00:26 167.34MB elasticsearch spark hive
1
本文是通过拟牛顿算法去解决逻辑回归与线性支持向量机的!
2021-10-14 18:08:07 216KB Spark
1
spark案例之------------------------------------高铁需求,恰同学少年,风华正茂,挥斥方遒
2021-10-14 16:49:42 124KB spark
1
大数据面试题,大数据成神之路开启...Flink/Spark/Hadoop/Hbase/Hive... 已经更新100+篇~ 关注公众号~ 大数据成神之路目录 大数据开发基础篇 :skis: Java基础 :memo: NIO :open_book: 并发 :guitar: JVM :dollar_banknote: 分布式 :floppy_disk: Zookeeper :oncoming_fist: RPC :artist_palette: Netty :laptop: Linux Java基础 NIO 并发容器 JVM 分布式 zookeeper RPC Netty Linux 大数据框架学习篇 Hadoop Hive Spark Flink HBase Kafka Zookeeper Flume Sqoop Azkaban 大数据开发实战进阶篇 这里的文章主要是我平时发表在公众号,博客等的文章,精心挑选,以飨读者。 Flink实战进阶 Sp
2021-10-14 14:32:11 81.1MB Python Learning Tutorial
1
自己整理的笔记,278章节
2021-10-14 14:08:14 10.62MB 大数据 Spark