Course Outline

    Basics of Hadoop. Introduction to Pig. Basic data analysis using the Pig tool. Processing complex data with Pig. Operations on multiple datasets using Pig. Pig Troubleshooting and Optimization. Introduction to Hive, Impala, ELK. Executing queries in Hive, Impala, ELK. Hive data management. Data storage and performance. Analyzes using Hive and Impala. Working with tool Impala and ELK. Analysis of text and complex data types. Hive Optimization, Pig, Impala, ELK. Interoperability and workflow. Questions, tasks, certification.

Requirements

This course is suggested for all data scientists, business analysts, developers and administrators who have experience with SQL and/or scripting languages. No knowledge of Apache Hadoop is required prior to this training.

  28 Hours
 

Number of participants


Starts

Ends


Dates are subject to availability and take place between 09:00 and 16:00.
Open Training Courses require 5+ participants.

Testimonials (1)

Related Courses

Related Categories