Course Outline
- Big data fundamentals
- Big Data and its role in the corporate world
- The phases of development of a Big Data strategy within a corporation
- The rationale underlying a holistic approach to Big Data
- Components needed in a Big Data Platform
- Big data storage solution
- Limits of Traditional Technologies
- Overview of database types
- The four dimensions of Big Data
- Big data impact on business
- Business importance of Big Data
- Challenges of extracting useful data
- Integrating Big data with traditional data
- Big data storage technologies
- Overview of big data technologies
- Data storage models
- Hadoop
- Hive
- Cassandra
- MongoDB
- Choosing the right big data technology
- Overview of big data technologies
- Processing big data
- Connecting to databases and extracting data
- Transforming and preparing data for processing
- Using Hadoop MapReduce for processing distributed data
- Monitoring and executing Hadoop MapReduce jobs
- Hadoop distributed file system building blocks
- MapReduce and YARN
- Handling streaming data with Spark
- Big data analysis tools and technologies
- Programming Hadoop with Pig Latin language
- Querying big data with Hive
- Mining data with Mahout
- Visualizing and reporting tools
- Big data in business
- Managing and establishing Big Data needs
- Business importance of Big Data
- Selecting the right big data tools for the problem
Data Warehousing Concepts
- What is a Data Warehouse?
- Difference between OLTP and Data Warehousing
- Data Acquisition
- Data Extraction
- Data Transformation
- Data Loading
- Data Marts
- Dependent vs Independent Data Marts
- Database design
ETL Testing Concepts
- Introduction
- Software development life cycle
- Testing methodologies
- ETL Testing Work Flow Process
- ETL Testing Responsibilities in DataStage
Big Data Fundamentals
- Big Data and its role in the corporate world
- The phases of development of a Big Data strategy within a corporation
- The rationale underlying a holistic approach to Big Data
- Components needed in a Big Data Platform
- Big data storage solution
- Limits of Traditional Technologies
- Overview of database types
NoSQL Databases
Hadoop
MapReduce
Apache Spark
Requirements
Delegates should have an awareness of, and some experience with, storage tools, as well as an awareness of handling large data sets.
Testimonials (5)
The training was conducted in an interesting and professional manner, which allowed for the systematization and expansion of knowledge in the subject area. The instructor demonstrated extensive experience and skill in conveying information. The training was very practical and tailored to our needs. I highly recommend it.
Dominik Kozlowski - Shell Polska
Course - Big Data - Data Science
Machine Translated
The start of day 3 was the best.
- Shell Polska
Course - Big Data - Data Science
Machine Translated
An exercise of the type 'who will create the best model'
Wojtek - Shell Polska
Course - Big Data - Data Science
Machine Translated
trainer's knowledge
Fatma Badi - Dubai Electricity & Water Authority
Course - Big Data - Data Science
The way knowledge was conveyed was very clear for me. Good communication with the instructor allowed the group to complete the training without any issues. After finishing the training, I was left with a feeling of wanting more, which made me desire even more of this training.
Mateusz Gorniak
Course - Big Data - Data Science
Machine Translated