经过这段时间的整理以及格式调整,以及纠正其中的一些错误修改,整理出PDF下载。下载地址:
CSDN免积分下载
完整版可以到这里下载Learning Spark完整版下载
附录:Learning Spark目录
Chapter 1 Introduction to Data Analysis with Spark
What Is Apache Spark?
A Unified Stack
Who Uses Spark, and for What?
A Brief History of Spark
Spark Versions and Releases
Storage Layers for Spark
Chapter 2 Downloading Spark and Getting Started
Downloading Spark
Introduction to Spark’s Python and Scala Shells
Introduction to Core Spark Concepts
Standalone Applications
Conclusion
Chapter 3 Programming with RDDs
RDD Basics
Creating RDDs
RDD Operations
Passing Functions to Spark
Common Transformations and Actions
Persistence (Caching)
Conclusion
Chapter 4 Working with Key/Value Pairs
Motivation
Creating Pair RDDs
Transformations on Pair RDDs
Actions Available on Pair RDDs
Data Partitioning (Advanced)
Conclusion
Chapter 5 Loading and Saving Your Data
Motivation
File Formats
Filesystems
Structured Data with Spark SQL
Databases
Conclusion
Chapter 6 Advanced Spark Programming
Introduction
Accumulators
Broadcast Variables
Working on a Per-Partition Basis
Piping to External Programs
Numeric RDD Operations
Conclusion
Chapter 7 Running on a Cluster
Introduction
Spark Runtime Architecture
Deploying Applications with spark-submit
Packaging Your Code and Dependencies
Scheduling Within and Between Spark Applications
Cluster Managers
Which Cluster Manager to Use?
Conclusion
Chapter 8 Tuning and Debugging Spark
Configuring Spark with SparkConf
Components of Execution: Jobs, Tasks, and Stages
Finding Information
Key Performance Considerations
Conclusion
Chapter 9 Spark SQL
Linking with Spark SQL
Using Spark SQL in Applications
Loading and Saving Data
JDBC/ODBC Server
User-Defined Functions
Spark SQL Performance
Conclusion
Chapter 10 Spark Streaming
A Simple Example
Architecture and Abstraction
Transformations
Output Operations
Input Sources
24/7 Operation
Streaming UI
Performance Considerations
Conclusion
Chapter 11 Machine Learning with MLlib
Overview
System Requirements
Machine Learning Basics
Data Types
Algorithms
Tips and Performance Considerations
Pipeline API
Conclusion
本博客文章除特别声明,全部都是原创!What Is Apache Spark?
A Unified Stack
Who Uses Spark, and for What?
A Brief History of Spark
Spark Versions and Releases
Storage Layers for Spark
Chapter 2 Downloading Spark and Getting Started
Downloading Spark
Introduction to Spark’s Python and Scala Shells
Introduction to Core Spark Concepts
Standalone Applications
Conclusion
Chapter 3 Programming with RDDs
RDD Basics
Creating RDDs
RDD Operations
Passing Functions to Spark
Common Transformations and Actions
Persistence (Caching)
Conclusion
Chapter 4 Working with Key/Value Pairs
Motivation
Creating Pair RDDs
Transformations on Pair RDDs
Actions Available on Pair RDDs
Data Partitioning (Advanced)
Conclusion
Chapter 5 Loading and Saving Your Data
Motivation
File Formats
Filesystems
Structured Data with Spark SQL
Databases
Conclusion
Chapter 6 Advanced Spark Programming
Introduction
Accumulators
Broadcast Variables
Working on a Per-Partition Basis
Piping to External Programs
Numeric RDD Operations
Conclusion
Chapter 7 Running on a Cluster
Introduction
Spark Runtime Architecture
Deploying Applications with spark-submit
Packaging Your Code and Dependencies
Scheduling Within and Between Spark Applications
Cluster Managers
Which Cluster Manager to Use?
Conclusion
Chapter 8 Tuning and Debugging Spark
Configuring Spark with SparkConf
Components of Execution: Jobs, Tasks, and Stages
Finding Information
Key Performance Considerations
Conclusion
Chapter 9 Spark SQL
Linking with Spark SQL
Using Spark SQL in Applications
Loading and Saving Data
JDBC/ODBC Server
User-Defined Functions
Spark SQL Performance
Conclusion
Chapter 10 Spark Streaming
A Simple Example
Architecture and Abstraction
Transformations
Output Operations
Input Sources
24/7 Operation
Streaming UI
Performance Considerations
Conclusion
Chapter 11 Machine Learning with MLlib
Overview
System Requirements
Machine Learning Basics
Data Types
Algorithms
Tips and Performance Considerations
Pipeline API
Conclusion
原创文章版权归过往记忆大数据(过往记忆)所有,未经许可不得转载。
本文链接: 【Learning Spark pdf下载】(https://www.iteblog.com/archives/1249.html)
看看好不好哦
写得很不错
是的,这书的确很不错。
求下载
很好的资源!
非常好 正需要的学习资料