This book was published by Packt Publishing in July 2017. It was written by Md. Rezaul Karim and Sridhar Alla and runs to 1587 pages.
What you will learn from this book
- Understand object-oriented & functional programming concepts of Scala
- Gain an in-depth understanding of the Scala collection APIs
- Work with RDD and DataFrame to learn Spark’s core abstractions
- Analyse structured and unstructured data using Spark SQL and GraphX
- Develop scalable and fault-tolerant streaming applications using Spark Structured Streaming
- Learn machine-learning best practices for classification, regression, dimensionality reduction, and recommendation systems to build predictive models with widely used algorithms in Spark MLlib & ML
- Build clustering models to group large volumes of data
- Understand tuning, debugging, and monitoring Spark applications
- Deploy Spark applications on real clusters in Standalone, Mesos, and YARN
Chapters in this book
- INTRODUCTION TO SCALA
- OBJECT-ORIENTED SCALA
- FUNCTIONAL PROGRAMMING CONCEPTS
- COLLECTION APIS
- TACKLE BIG DATA – SPARK COMES TO THE PARTY
- START WORKING WITH SPARK – REPL AND RDDS
- SPECIAL RDD OPERATIONS
- INTRODUCE A LITTLE STRUCTURE - SPARK SQL
- STREAM ME UP, SCOTTY - SPARK STREAMING
- EVERYTHING IS CONNECTED - GRAPHX
- LEARNING MACHINE LEARNING - SPARK MLLIB AND SPARK ML
- ADVANCED MACHINE LEARNING BEST PRACTICES
- MY NAME IS BAYES, NAIVE BAYES
- TIME TO PUT SOME ORDER - CLUSTER YOUR DATA WITH SPARK MLLIB
- TEXT ANALYTICS USING SPARK ML
- SPARK TUNING
- TIME TO GO TO CLUSTERLAND - DEPLOYING SPARK ON A CLUSTER
- TESTING AND DEBUGGING SPARK
- PYSPARK AND SPARKR
- ACCELERATING SPARK WITH ALLUXIO
- INTERACTIVE DATA ANALYTICS WITH APACHE ZEPPELIN
Download
The book is available for download in three formats: PDF, azw3, and epub.
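Unless otherwise stated, all articles on this blog are original. Copyright in the original articles belongs to 过往记忆大数据 (过往记忆); reproduction without permission is prohibited.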
Link to this article: 【[E-book] Scala and Spark for Big Data Analytics PDF Download】(https://www.iteblog.com/archives/2241.html)