Course Outline
Introduction
- Overview of Spark and Hadoop features and architecture
- Understanding big data
- Basics of Python programming
Getting Started
- Setting up Python, Spark, and Hadoop
- Understanding data structures in Python
- Understanding the PySpark API
- Understanding HDFS and MapReduce
Integrating Spark and Hadoop with Python
- Implementing Spark RDDs in Python
- Processing data using MapReduce
- Creating distributed datasets in HDFS
Machine Learning with Spark MLlib
Processing Big Data with Spark Streaming
Working with Recommender Systems
Working with Kafka, Sqoop, and Flume
Apache Mahout with Spark and Hadoop
Troubleshooting
Summary and Next Steps
Requirements
- Experience with Spark and Hadoop
- Python programming experience
Audience
- Data scientists
- Developers
Testimonials (3)
The fact that we were able to take with us most of the information/course/presentation/exercises done, so that we can look over them and perhaps redo what we didn't understand the first time or improve what we already did.
Raul Mihail Rat - Accenture Industrial SS
Course - Python, Spark, and Hadoop for Big Data
I liked that it managed to lay the foundations of the topic and go to some quite advanced exercises. Also provided easy ways to write/test the code.
Ionut Goga - Accenture Industrial SS
Course - Python, Spark, and Hadoop for Big Data
The live examples