[电子书]Fast Data Processing with Spark 2, 3rd Edition下载

　　本书是《Spark快速数据处理》第三版，全书基于Spark 2.0.0编写。本书适合Spark入门者，作者Krishna Sankar，由Packt出版社于2016年10月出版，全书共274页。通过本书你将学到以下知识：
　　（1）、安装和设置你的Spark集群；
　　（2）、使用Spark交互式Shell来实现简单的分布式应用程序；
　　（3）、使用新的DataFrame API操作数据；
　　（4）、了解和Spark RDD交互的各种不同方法；
　　（5）、通过类SQL语法来操作Spark的数据；
　　（6）、了解Spark是如何处理海量数据；
　　（7）、使用高度可扩展的算法来实现机器学习系统；
　　（8）、在Spark中使用R来操作数据；
　　（9）、使用GraphX学习有趣的图形算法和图形处理。

如果想及时了解Spark、Hadoop或者Hbase相关的文章，欢迎关注微信公共帐号：iteblog_hadoop

本书的章节

Chapter 1: Installing Spark and Setting Up Your Cluster
Chapter 2: Using the Spark Shell
Chapter 3: Building and Running a Spark Application
Chapter 4: Creating a SparkSession Object
Chapter 5: Loading and Saving Data in Spark
Chapter 6: Manipulating Your RDD
Chapter 7: Spark 2.0 Concepts
Chapter 8: Spark SQL
Chapter 9: Foundations of Datasets/DataFrames – The Proverbial Workhorse for DataScientists
Chapter 10: Spark with Big Data
Chapter 11: Machine Learning with Spark ML Pipelines
Chapter 12: GraphX

下载地址

关注本微信公众号iteblog_hadoop并回复 fast_spark 获取本书的下载地址。

点击进入下载

本博客文章除特别声明，全部都是原创！
原创文章版权归过往记忆大数据（过往记忆）所有，未经许可不得转载。
本文链接: 【[电子书]Fast Data Processing with Spark 2, 3rd Edition下载】（https://www.iteblog.com/archives/1915.html）