Apache Spark Training Courses

Apache Spark Training

Apache Spark - an engine for big data processing training

Testi...Client Testimonials

Spark for Developers

Richard is very calm and methodical, with an analytical insight - exactly the qualities needed to present this sort of course

Kieran Mac Kenna - BAE Systems Applied Intelligence

Spark for Developers

We know know a lot more about the whole environment

John Kidd - Cardano Risk Management

Spark for Developers

The trainer made the class interesting and entertaining which helps quite a bit with all day trainings

Ryan Speelman -

Spark for Developers

I think the trainer had an excellent style of combining humor and real life stories to make the subjects at hand very approachable. I would highly recommend this professor in the future.

Spark for Developers

I think the trainer had an excellent style of combining humor and real life stories to make the subjects at hand very approachable. I would highly recommend this professor in the future.

Apache Spark Course Outlines

Code Name Duration Overview
68780 Apache Spark 14 hours Why Spark? Problems with Traditional Large-Scale Systems Introducing Spark Spark Basics What is Apache Spark? Using the Spark Shell Resilient Distributed Datasets (RDDs) Functional Programming with Spark Working with RDDs RDD Operations Key-Value Pair RDDs MapReduce and Pair RDD Operations The Hadoop Distributed File System Why HDFS? HDFS Architecture Using HDFS Running Spark on a Cluster Overview A Spark Standalone Cluster The Spark Standalone Web UI Parallel Programming with Spark RDD Partitions and HDFS Data Locality Working With Partitions Executing Parallel Operations Caching and Persistence RDD Lineage Caching Overview Distributed Persistence Writing Spark Applications Spark Applications vs. Spark Shell Creating the SparkContext Configuring Spark Properties Building and Running a Spark Application Logging Spark, Hadoop, and the Enterprise Data Center Overview Spark and the Hadoop Ecosystem Spark and MapReduce Spark Streaming Spark Streaming Overview Example: Streaming Word Count Other Streaming Operations Sliding Window Operations Developing Spark Streaming Applications Common Spark Algorithms Iterative Algorithms Graph Analysis Machine Learning Improving Spark Performance Shared Variables: Broadcast Variables Shared Variables: Accumulators Common Performance Issues
sparkdev Spark for Developers 21 hours OBJECTIVE: This course will introduce Apache Spark. The students will learn how  Spark fits  into the Big Data ecosystem, and how to use Spark for data analysis.  The course covers Spark shell for interactive data analysis, Spark internals, Spark APIs, Spark SQL, Spark streaming, and machine learning and graphX. AUDIENCE : Developers / Data Analysts Scala primer A quick introduction to Scala Labs : Getting know Scala Spark Basics Background and history Spark and Hadoop Spark concepts and architecture Spark eco system (core, spark sql, mlib, streaming) Labs : Installing and running Spark First Look at Spark Running Spark in local mode Spark web UI Spark shell Analyzing dataset – part 1 Inspecting RDDs Labs: Spark shell exploration RDDs RDDs concepts Partitions RDD Operations / transformations RDD types Key-Value pair RDDs MapReduce on RDD Caching and persistence Labs : creating & inspecting RDDs;   Caching RDDs Spark API programming Introduction to Spark API / RDD API Submitting the first program to Spark Debugging / logging Configuration properties Labs : Programming in Spark API, Submitting jobs Spark SQL SQL support in Spark Dataframes Defining tables and importing datasets Querying data frames using SQL Storage formats : JSON / Parquet Labs : Creating and querying data frames; evaluating data formats MLlib MLlib intro MLlib algorithms Labs : Writing MLib applications GraphX GraphX library overview GraphX APIs Labs : Processing graph data using Spark Spark Streaming Streaming overview Evaluating Streaming platforms Streaming operations Sliding window operations Labs : Writing spark streaming applications Spark and Hadoop Hadoop Intro (HDFS / YARN) Hadoop + Spark architecture Running Spark on Hadoop YARN Processing HDFS files using Spark Spark Performance and Tuning Broadcast variables Accumulators Memory management & caching Spark Operations Deploying Spark in production Sample deployment templates Configurations Monitoring Troubleshooting
hdp Hortonworks Data Platform (HDP) for administrators 21 hours Hortonworks Data Platform is an open-source Apache Hadoop support platform that provides a stable foundation for developing big data solutions on the Apache Hadoop ecosystem. This instructor-led live training introduces Hortonworks and walks participants through the deployment of Spark + Hadoop solution. By the end of this training, participants will be able to: Use Hortonworks to reliably run Hadoop at a large scale Unify Hadoop's security, governance, and operations capabilities with Spark's agile analytic workflows. Use Hortonworks to investigate, validate, certify and support each of the components in a Spark project Process different types of data, including structured, unstructured, in-motion, and at-rest. Audience Hadoop administrators Format of the course Part lecture, part discussion, exercises and heavy hands-on practice To request a customized course outline for this training, please contact us.  
magellan Magellan: Geospatial Analytics with on Spark 14 hours Magellan is an open-source distributed execution engine for geospatial analytics on big data. Implemented on top of Apache Spark, it extends Spark SQL and provides a relational abstraction for geospatial analytics. This instructor-led, live training introduces the concepts and approaches for implementing geospacial analytics and walks participants through the creation of a predictive analysis application using Magellan on Spark. By the end of this training, participants will be able to: Efficiently query, parse and join geospatial datasets at scale Implement geospatial data in business intelligence and predictive analytics applications Use spatial context to extend the capabilities of mobile devices, sensors, logs, and wearables Audience Application developers Format of the course Part lecture, part discussion, exercises and heavy hands-on practice To request a customized course outline for this training, please contact us.  
alluxio Alluxio: Unifying disparate storage systems 7 hours Alexio is an open-source virtual distributed storage system that unifies disparate storage systems and enables applications to interact with data at memory speed. It is used by companies such as Intel, Baidu and Alibaba. In this instructor-led, live training, participants will learn how to use Alexio to bridge different computation frameworks with storage systems and efficiently manage multi-petabyte scale data as they step through the creation of an application with Alluxio. By the end of this training, participants will be able to: Develop an application with Alluxio Connect big data systems and applications while preserving one namespace Efficiently extract value from big data in any storage format Improve workload performance Deploy and manage Alluxio standalone or clustered Audience Data scientist Developer System administrator Format of the course Part lecture, part discussion, exercises and heavy hands-on practice To request a customized course outline for this training, please contact us.  

Upcoming Courses

CourseCourse DateCourse Price [Remote / Classroom]
Spark for Developers - Kielce, ul. Warszawska 19Mon, 2017-11-06 09:0012000PLN / 4600PLN
Apache Spark - Zakopane ul. Jagiellońska 30 Mon, 2017-11-06 09:0018260PLN / 6233PLN

Other regions

Weekend Apache Spark courses, Evening Apache Spark training, Apache Spark boot camp, Apache Spark instructor-led , Apache Spark trainer , Apache Spark classes, Apache Spark coaching, Apache Spark training courses, Apache Spark private courses, Apache Spark on-site,Weekend Apache Spark training, Apache Spark instructor, Apache Spark one on one training

Course Discounts

Course Venue Course Date Course Price [Remote / Classroom]
Adobe InDesign Wrocław, ul.Ludwika Rydygiera 2a/22 Mon, 2017-10-23 09:00 1881PLN / 1027PLN
Adobe Premiere Pro Gdynia, ul. Ejsmonda 2 Mon, 2017-10-23 09:00 3960PLN / 2480PLN
Administration of Linux System Gdynia, ul. Ejsmonda 2 Tue, 2017-10-24 09:00 4950PLN / 3225PLN
Adobe Photoshop Elements Lublin, ul. Spadochroniarzy 9 Wed, 2017-10-25 09:00 1881PLN / 1127PLN
Business Analysis, BABOK V3.0 and IIBA Certification Preparation Kraków, ul. Rzemieślnicza 1 Wed, 2017-10-25 09:00 9405PLN / 5903PLN
Zaawansowana administracja MySQL Poznan, Garbary 100/63 Thu, 2017-10-26 09:00 3416PLN / 2108PLN
Effective working with spreadsheet in Excel Warszawa, ul. Złota 3/11 Thu, 2017-10-26 09:00 2475PLN / 1225PLN
Node.js Olsztyn, ul. Kajki 3/1 Thu, 2017-10-26 09:00 3861PLN / 2431PLN
Advisory & Leadership Skills Wrocław, ul.Ludwika Rydygiera 2a/22 Mon, 2017-10-30 09:00 8524PLN / 2983PLN
Excel For Statistical Data Analysis Warszawa, ul. Złota 3/11 Thu, 2017-11-02 09:00 2673PLN / 1291PLN
SQL Advanced in MySQL Warszawa, ul. Złota 3/11 Thu, 2017-11-02 09:00 1881PLN / 1141PLN
Projektowanie stron na urządzenia mobilne Bielsko-Biała, Al. Armii Krajowej 220 Thu, 2017-11-02 09:00 2624PLN / 1605PLN
SQL language in MSSQL Wrocław, ul.Ludwika Rydygiera 2a/22 Thu, 2017-11-02 09:00 2970PLN / 1143PLN
Oracle 11g - SQL language for developers - Workshop Bielsko-Biała, Al. Armii Krajowej 220 Mon, 2017-11-06 09:00 6930PLN / 4140PLN
Programowanie Aplikacji Webowych z Java EE 6 / 7 Zielona Góra, ul. Reja 6 Mon, 2017-11-06 09:00 7722PLN / 3340PLN
Android - The Basics Wrocław, ul.Ludwika Rydygiera 2a/22 Mon, 2017-11-06 09:00 9801PLN / 4180PLN
Java Spring Wrocław, ul.Ludwika Rydygiera 2a/22 Mon, 2017-11-06 09:00 14414PLN / 5970PLN
Test Automation with Selenium Łódź, ul. Tatrzańska 11 Mon, 2017-11-06 09:00 7722PLN / 3474PLN
Symfony 3 Kraków, ul. Rzemieślnicza 1 Mon, 2017-11-06 09:00 6930PLN / 3300PLN
ORACLE SQL Fundamentals Warszawa, ul. Złota 3/11 Tue, 2017-11-07 09:00 3465PLN / 1755PLN
Visual Basic for Applications (VBA) in Excel - Introduction to programming Warszawa, ul. Złota 3/11 Tue, 2017-11-07 09:00 3564PLN / 1691PLN
Programming for Biologists Warszawa, ul. Złota 3/11 Tue, 2017-11-07 09:00 11781PLN / 3745PLN
Quality Assurance and Continuous Integration Wrocław, ul.Ludwika Rydygiera 2a/22 Tue, 2017-11-07 09:00 2673PLN / 1737PLN
Oracle Service Bus 11g - Design and Integration Gdańsk, ul. Powstańców Warszawskich 45 Tue, 2017-11-07 09:00 15315PLN / 5391PLN
UML in Enterprise Architect (workshops) Warszawa, ul. Złota 3/11 Wed, 2017-11-08 09:00 5940PLN / 3570PLN
Tableau Advanced Gdynia, ul. Ejsmonda 2 Wed, 2017-11-08 09:00 7425PLN / 2975PLN
Managing Configuration with Ansible Warszawa, ul. Złota 3/11 Wed, 2017-11-08 09:00 16612PLN / 6137PLN
JMeter Fundamentals Warszawa, ul. Złota 3/11 Wed, 2017-11-08 09:00 1871PLN / 824PLN
Financial analysis in Excel Warszawa, ul. Złota 3/11 Thu, 2017-11-09 09:00 2079PLN / 1093PLN
Adobe Premiere Pro Gdańsk, ul. Powstańców Warszawskich 45 Thu, 2017-11-09 09:00 3960PLN / 2480PLN
DTP (InDesign, Photoshop, Illustrator, Acrobat) Bielsko-Biała, Al. Armii Krajowej 220 Mon, 2017-11-13 09:00 5940PLN / 3730PLN
Data Analysis with Oracle 11g - workshop Gdynia, ul. Ejsmonda 2 Mon, 2017-11-13 09:00 9900PLN / 4664PLN
MongoDB for Developers Warszawa, ul. Złota 3/11 Mon, 2017-11-13 09:00 4700PLN / 1967PLN
Nagios Core Gdańsk, ul. Powstańców Warszawskich 45 Mon, 2017-11-13 09:00 13919PLN / 4968PLN
Visual Basic for Applications (VBA) in Excel - Advanced Gdańsk, ul. Powstańców Warszawskich 45 Mon, 2017-11-13 09:00 3069PLN / 1773PLN
Website Development in PHP Szczecin, ul. Sienna 9 Mon, 2017-11-13 09:00 2970PLN / 1344PLN
ADO.NET 4.0 Development Warszawa, ul. Złota 3/11 Tue, 2017-11-14 09:00 20176PLN / 6914PLN
Adobe Illustrator Lublin, ul. Spadochroniarzy 9 Tue, 2017-11-14 09:00 2871PLN / 1648PLN
Linux Fundamentals Kraków, ul. Rzemieślnicza 1 Tue, 2017-11-14 09:00 10128PLN / 3869PLN
Visual Basic for Applications (VBA) in Excel - Advanced Warszawa, ul. Złota 3/11 Wed, 2017-11-15 09:00 3069PLN / 1623PLN
Access Basics Szczecin, ul. Sienna 9 Mon, 2017-11-20 09:00 3465PLN / 1550PLN
OCEB2 OMG Certified Expert in BPM - Fundamental Exam Preparation Warszawa, ul. Złota 3/11 Mon, 2017-11-20 09:00 11880PLN / 4760PLN
Introduction to Selenium Poznan, Garbary 100/63 Wed, 2017-12-20 09:00 1871PLN / 824PLN
Adobe Photoshop Warszawa, ul. Złota 3/11 Wed, 2017-12-20 09:00 1881PLN / 1152PLN

Course Discounts Newsletter

We respect the privacy of your email address. We will not pass on or sell your address to others.
You can always change your preferences or unsubscribe completely.

Some of our clients