Deep Dive into Spark SQL with Advanced Performance Tuning


Uploaded 2018-06-11 09:46:22 | PDF, 4.43 MB | 45 pages

Spark SQL is a highly scalable and efficient relational processing engine with easy-to-use APIs and mid-query fault tolerance. It is a core module of Apache Spark. Spark SQL can process, integrate, and analyze data from diverse data sources (e.g., Hive, Cassandra, Kafka, and Oracle) and file formats (e.g., Parquet, ORC, CSV, and JSON). This talk dives into the technical details of Spark SQL across the entire lifecycle of a query execution. The audience will gain a deeper understanding of Spark SQL and learn how to tune its performance.
