本书于2017-07由Packt Publishing出版,作者Sourav Gulati, Sumit Kumar,全书662页。
通过本书你将学到以下知识
- Process data using different file formats such as XML, JSON, CSV, and plain and delimited text, using the Spark core Library.
- Perform analytics on data from various data sources such as Kafka, and Flume using Spark Streaming Library
- Learn SQL schema creation and the analysis of structured data using various SQL functions including Windowing functions in the Spark SQL Library
- Explore Spark Mlib APIs while implementing Machine Learning techniques to solve real-world problems
- Get to know Spark GraphX so you understand various graph-based analytics that can be performed with Spark
本书的章节
- INTRODUCTION TO SPARK
- REVISITING JAVA
- LET US SPARK
- UNDERSTANDING THE SPARK PROGRAMMING MODEL
- WORKING WITH DATA AND STORAGE
- SPARK ON CLUSTER
- SPARK PROGRAMMING MODEL - ADVANCED
- WORKING WITH SPARK SQL
下载地址
提供了PDF、azw3 以及 epub 三种格式的下载。
本博客文章除特别声明,全部都是原创!原创文章版权归过往记忆大数据(过往记忆)所有,未经许可不得转载。
本文链接: 【[电子书]Apache Spark 2.x for Java Developers PDF下载】(https://www.iteblog.com/archives/2244.html)