Abstract: This book was published by Packt Publishing in July 2017. It was written by Sourav Gulati and Sumit Kumar and runs to 662 pages.

What you will learn from this book:
Process data in different file formats, such as XML, JSON, CSV, and plain and delimited text, using the Spark core library.
Perform analytics on data from various sources, such as Kafka and Flume, using the Spark Streaming library.
Learn SQL schema creation and the analysis of structured data using various SQL …