A Practical Introduction to Data Analysis and Big Data Training Course
Participants who complete this instructor-led, live training will gain a practical, real-world understanding of Big Data and its related technologies, methodologies and tools.
Participants will have the opportunity to put this knowledge into practice through hands-on exercises. Group interaction and instructor feedback make up an important component of the class.
The course starts with an introduction to elemental concepts of Big Data, then progresses into the programming languages and methodologies used to perform Data Analysis. Finally, we discuss the tools and infrastructure that enable Big Data storage, Distributed Processing, and Scalability.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction to Data Analysis and Big Data
- What Makes Big Data "Big"?
- Velocity, Volume, Variety, Veracity (VVVV)
- Limits to Traditional Data Processing
- Distributed Processing
- Statistical Analysis
- Types of Machine Learning Analysis
- Data Visualization
Big Data Roles and Responsibilities
- Administrators
- Developers
- Data Analysts
Languages Used for Data Analysis
- R Language
- Why R for Data Analysis?
- Data manipulation, calculation and graphical display
- Python
- Why Python for Data Analysis?
- Manipulating, processing, cleaning, and crunching data
Approaches to Data Analysis
- Statistical Analysis
- Time Series analysis
- Forecasting with Correlation and Regression models
- Inferential Statistics (estimating)
- Descriptive Statistics in Big Data sets (e.g. calculating mean)
- Machine Learning
- Supervised vs unsupervised learning
- Classification and clustering
- Estimating cost of specific methods
- Filtering
- Natural Language Processing
- Processing text
- Understaing meaning of the text
- Automatic text generation
- Sentiment analysis / topic analysis
- Computer Vision
- Acquiring, processing, analyzing, and understanding images
- Reconstructing, interpreting and understanding 3D scenes
- Using image data to make decisions
Big Data Infrastructure
- Data Storage
- Relational databases (SQL)
- MySQL
- Postgres
- Oracle
- Non-relational databases (NoSQL)
- Cassandra
- MongoDB
- Neo4js
- Understanding the nuances
- Hierarchical databases
- Object-oriented databases
- Document-oriented databases
- Graph-oriented databases
- Other
- Relational databases (SQL)
- Distributed Processing
- Hadoop
- HDFS as a distributed filesystem
- MapReduce for distributed processing
- Spark
- All-in-one in-memory cluster computing framework for large-scale data processing
- Structured streaming
- Spark SQL
- Machine Learning libraries: MLlib
- Graph processing with GraphX
- Hadoop
- Scalability
- Public cloud
- AWS, Google, Aliyun, etc.
- Private cloud
- OpenStack, Cloud Foundry, etc.
- Auto-scalability
- Public cloud
Choosing the Right Solution for the Problem
The Future of Big Data
Summary and Next Steps
Requirements
- A general understanding of math
- A general understanding of programming
- A general understanding of databases
Audience
- Developers / programmers
- IT consultants
Open Training Courses require 5+ participants.
A Practical Introduction to Data Analysis and Big Data Training Course - Booking
A Practical Introduction to Data Analysis and Big Data Training Course - Enquiry
A Practical Introduction to Data Analysis and Big Data - Consultancy Enquiry
Consultancy Enquiry
Testimonials (7)
How big data work, data programs, greater knowledge of how our current world works using data
Ozayr Hussain - Vodacom
Course - A Practical Introduction to Data Analysis and Big Data
The practical side of the training.
Patrick - Vodacom PTy Ltd
Course - A Practical Introduction to Data Analysis and Big Data
Interactive topics and the style used by the lecture to simplified the topics for the students
Miran Saeed - Sulaymaniyah Asayish Agency
Course - A Practical Introduction to Data Analysis and Big Data
the trainer and his ability to lecture
ibrahim hamakarim - Sulaymaniyah Asayish Agency
Course - A Practical Introduction to Data Analysis and Big Data
Practical exercises
JOEL CHIGADA - University of the Western Cape
Course - A Practical Introduction to Data Analysis and Big Data
R programming
Osden Jokonya - University of the Western Cape
Course - A Practical Introduction to Data Analysis and Big Data
Overall the Content was good.
Sameer Rohadia
Course - A practical introduction to Data Analysis and Big Data
Provisional Courses
Related Courses
Advanced Data Analysis with TIBCO Spotfire
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at business analysts who wish to learn advanced Spotfire Analyst techniques for analyzing data.
By the end of this training, participants will be able to:
- Share visualizations among different team members.
- Secure access to software based on roles and access controls.
- Create visualizations such as map charts.
- Integrate statistical computing languages such as R with Spotfire.
ArcGIS from Basic to Advanced
35 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at beginner-level to advanced-level GIS professionals and analysts who wish to learn how to effectively use ArcGIS for data visualization, spatial analysis, and geospatial project management.
By the end of this training, participants will be able to:
- Navigate and utilize ArcGIS tools for geospatial data management.
- Create and customize maps with layers and attributes.
- Perform advanced spatial analysis and geoprocessing tasks.
- Automate workflows using ModelBuilder and Python.
ArcGIS Enterprise for Technical Support
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at beginner-level IT support personnel who wish to provide robust support for ArcGIS Enterprise, addressing any anomalies or failures effectively.
By the end of this training, participants will be able to:
- Understand the architecture and components of ArcGIS Enterprise.
- Learn to install, configure, and manage ArcGIS Enterprise.
- Gain skills in troubleshooting and resolving common issues.
- Develop proficiency in monitoring and maintaining ArcGIS Enterprise environments.
- Master the techniques for backup, recovery, and performance optimization.
ArcGIS Fundamentals
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at beginner-level professionals who wish to learn the fundamental concepts and tools of ArcGIS.
By the end of this training, participants will be able to:
- Understand the basic concepts of GIS and spatial data.
- Navigate the ArcGIS interface.
- Create and manage spatial data.
- Perform basic spatial analysis.
- Create maps and visualizations.
Advanced ArcGIS Pro for Spatial Analysis
35 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at advanced-level GIS professionals who wish to use ArcGIS Pro to enhance their spatial analysis capabilities, conduct comprehensive geostatistical analysis, and apply advanced 3D modeling techniques for more effective decision-making and problem-solving in real-world scenarios.
By the end of this training, participants will be able to:
- Develop advanced skills in spatial analysis techniques using ArcGIS Pro.
- Utilize Python scripting for automation and complex data processing.
- Apply spatial modeling for problem-solving in real-world scenarios.
- Conduct geostatistical analysis for advanced data interpretation.
- Integrate external data sources and leverage 3D spatial data analysis.
Automated Monitoring with Zabbix
14 HoursThis instructor-led, live training in Poland (online or onsite) covers the installation, planning and configuration of Zabbix, and focuses on practical implementation and tooling.
By the end of this training, participants will be able to:
- Install and configure Zabbix for monitoring IT infrastructure.
- Set up and manage hosts, items, triggers, and actions within Zabbix.
- Utilize Zabbix's features for data collection, alerting, and reporting.
- Integrate Zabbix with other tools and platforms for enhanced monitoring and automation.
Insurtech: A Practical Introduction for Managers
14 HoursInsurtech (a.k.a Digital Insurance) refers to the convergence of insurance + new technologies. In the field of Insurtech "digital insurers" apply technology innovations to their business and operating models in order to reduce costs, improve the customer experience and enhance the agility of their operations.
In this instructor-led training, participants will gain an understanding of the technologies, methods and mindset needed to bring about a digital transformation within their organizations and in the industry at large. The training is aimed at managers who need to gain a big picture understanding, break down the hype and jargon, and take the first steps in establishing an Insurtech strategy.
By the end of this training, participants will be able to:
- Discuss Insurtech and all its component parts intelligently and systematically
- Identify and demystify the role of each key technology within Insurtech.
- Draft a general strategy for implementing Insurtech within their organization
Audience
- Insurers
- Technologists within the insurance industry
- Insurance stakeholders
- Consultants and business analysts
Format of the course
- Part lecture, part discussion, exercises and case study group activities
Fluentd for Log Data Unification
14 HoursThis instructor-led, live training (online or onsite) is aimed at engineers who wish to set up an architecture where everything is logged.
By the end of this training, participants will be able to:
- Install and configure Fluentd.
- Collect logs from large numbers of disparate servers.
- Unify the logging layer within an organization.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- This training covers logging approaches for various types of systems. Please contact us to arrange coverage for specific systems (syslog, Apache, Nginx, IoT, ElasticSearch, MongoDB, Hadoop, etc.).
- To learn more about Fluentd, please visit: https://www.fluentd.org/
Nagios
35 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at intermediate-level IT professionals who wish to implement and manage infrastructure monitoring using Nagios.
By the end of this training, participants will be able to:
- Install and configure Nagios Core and relevant plugins.
- Monitor servers, network devices, services, and applications.
- Configure alerts and performance thresholds.
- Integrate Nagios with databases and third-party tools.
- Set up distributed monitoring and high availability environments.
- Visualize monitoring data using tools such as NagVis and BPI.
Nagios Core
21 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at intermediate-level IT professionals who wish to implement, configure, and maintain Nagios Core for real-time infrastructure monitoring.
By the end of this training, participants will be able to:
- Install and configure Nagios Core and its components.
- Monitor hosts, services, and network resources.
- Configure secure user access and alerting systems.
- Create custom checks and extend Nagios monitoring capabilities.
- Utilize plugins and graphing tools for reporting and analysis.
Nagios XI Administration
21 HoursNagios XI is enterprise server and network monitoring software.
In this instructor-led, live training, participants will learn how to set up and operate Nagios XI as they step through process of managing Linux and Windows servers in a series of hands-on live-lab exercises.
By the end of this training, participants will be able to:
- Install and configure Nagios XI
- Monitor Windows and Linux machines
- Monitor network devices
- Perform administrating tasks, including backing up, restoring, and scheduling downtime of Nagios XI
Audience
- System administrators
Format of the Course
- Part lecture, part discussion, exercises and heavy hands-on practice
Note
- To request a customized training for this course, please contact us to arrange.
QGIS for Geographic Information System
21 HoursA geographic information system (GIS) is a system designed to capture, store, manipulate, analyze, manage, and present spatial or geographic data. The acronym GIS is sometimes used for geographic information science (GIScience) to refer to the academic discipline that studies geographic information systems and is a large domain within the broader academic discipline of geoinformatics.
QGIS functions as geographic information system (GIS) software, allowing users to analyze and edit spatial information, in addition to composing and exporting graphical maps. QGIS supports both raster and vector layers; vector data is stored as either point, line, or polygon features. Multiple formats of raster images are supported, and the software can georeference images. To summarize it allows the users to Create, edit, visualise, analyse and publish geospatial information on Windows, Mac, Linux, BSD.
This program, in its first phase, introduces the QGIS interface for general usage. In the second phase, we introduce PyQGIS - the python libraries of QGIS that allows the integration of GIS functionalities in your python code or your python application, so that you may even create your own Python Plugin around a particular GIS functionality.
Introduction to Spotfire
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at business analysts and data analysts who wish to learn basic Spotfire Analyst techniques for analyzing data.
By the end of this training, participants will be able to:
- Install and configure TIBCO Spotfire.
- Combine data from different databases.
- Visualize large datasets.
- Create and share complex dashboards.
AI-Driven Data Analysis with TIBCO Spotfire X
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at business analysts and data analysts who wish to use TIBCO Spotfire X with its artificial intelligence capabilities to visualize, transform, and analyze data.
By the end of this training, participants will be able to:
- Install and configure TIBCO Spotfire X.
- Understand the features and architecture of TIBCO Spotfire X.
- Understand the concepts behind augmented and predictive analytics.
- Learn how to load, process, and visualize data using Spotfire X.
- Create interactive and enhanced data visualizations.
Data Analysis with SQL, Python and Spotfire
14 HoursIn this instructor-led, live training in Poland, participants will learn three different approaches for accessing, analyzing and visualizing data. We start with an introduction to RDMS databases; the focus will be on accessing and querying an Oracle database using the SQL language. Then we look at strategies for accessing an RDMS database programmatically using the Python language. Finally, we look at how to visualize and present data graphically using TIBCO Spotfire.
Format of the Course
Interactive lecture and discussion.
Lots of exercises and practice.
Hands-on implementation in a live-lab environment.