本书于2017-03由Packt Publishing出版,作者Muhammad Asif Abbasi,全书356页。
通过本书你将学到以下知识:
- Get an overview of big data analytics and its importance for organizations and data professionals
- Delve into Spark to see how it is different from existing processing platforms
- Understand the intricacies of various file formats, and how to process them with Apache Spark.
- Realize how to deploy Spark with YARN, MESOS or a Stand-alone cluster manager.
- Learn the concepts of Spark SQL, SchemaRDD, Caching and working with Hive and Parquet file formats
- Understand the architecture of Spark MLLib while discussing some of the off-the-shelf algorithms that come with Spark.
- Introduce yourself to the deployment and usage of SparkR.
- Walk through the importance of Graph computation and the graph processing systems available in the market
- Check the real world example of Spark by building a recommendation engine with Spark using ALS.
- Use a Telco data set, to predict customer churn using Random Forests.
本书的章节
- Architecture and Installation
- Transformations and Actions with Spark RDDs
- ETL with Spark
- Spark SQL
- Spark Streaming
- Machine Learning with Spark
- GraphX
- Operating in Clustered Mode
- Building a Recommendation System
- Customer Churn Prediction
- There's More with Spark
下载地址
提供了PDF、azw3 以及 epub 二种格式的下载。
本博客文章除特别声明,全部都是原创!原创文章版权归过往记忆大数据(过往记忆)所有,未经许可不得转载。
本文链接: 【[电子书]Learning Apache Spark 2 PDF下载】(https://www.iteblog.com/archives/2214.html)