コース概要
Introduction
- Apache Spark vs Hadoop MapReduce
Overview of Apache Spark Features and Architecture
Choosing a Programming Language
Setting up Apache Spark
Creating a Sample Application
Choosing the Data Set
Running Data Analysis on the Data
Processing of Structured Data with Spark SQL
Processing Streaming Data with Spark Streaming
Integrating Apache Spark with 3rd Part Machine Learning Tools
Using Apache Spark for Graph Processing
Optimizing Apache Spark
Troubleshooting
Summary and Conclusion
要求
- Experience with the Linux command line
- A general understanding of data processing
- Programming experience with Java, Scala, Python, or R
Audience
- Developers
お客様の声 (5)
多くの実用的な例、同じ問題へのさまざまなアプローチ方法、そして現在のソリューションを改善するためのあまり知られていないトリックなど
Rafal - Nordea
コース - Apache Spark MLlib
Machine Translated
The live examples
Ahmet Bolat - Accenture Industrial SS
コース - Python, Spark, and Hadoop for Big Data
very interactive...
Richard Langford
コース - SMACK Stack for Data Science
Sufficient hands on, trainer is knowledgable
Chris Tan
コース - A Practical Introduction to Stream Processing
Get to learn spark streaming , databricks and aws redshift