Scaling Data Analysis with Python and Dask Training Course
Dask is a flexible and high-performance Python library for parallel computing. It scales and accelerates big data processing with other Python-based data science libraries, such as Pandas, Numpy, and Scikit-Learn.
This instructor-led, live training (online or onsite) is aimed at data scientists and software engineers who wish to use Dask with the Python ecosystem to build, scale, and analyze large datasets.
By the end of this training, participants will be able to:
- Set up the environment to start building big data processing with Dask and Python.
- Explore the features, libraries, tools, and APIs available in Dask.
- Understand how Dask accelerates parallel computing in Python.
- Learn how to scale the Python ecosystem (Numpy, SciPy, and Pandas) using Dask.
- Optimize the Dask environment to maintain high performance in handling large datasets.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction
- Overview of Dask features and advantages
- Parallel computing in Python
Getting Started
- Installing Dask
- Dask libraries, components, and APIs
- Best practices and tips
Scaling NumPy, SciPy, and Pandas
- Dask arrays examples and use cases
- Chunks and blocked algorithms
- Overlapping computations
- SciPy stats and LinearOperator
- Numpy slicing and assignment
- DataFrames and Pandas
Dask Internals and Graphical UI
- Supported interfaces
- Scheduler and diagnostics
- Analyzing performance
- Graph computation
Optimizing and Deploying Dask
- Setting up adaptive deployments
- Connecting to remote data
- Debugging parallel programs
- Deploying Dask clusters
- Working with GPUs
- Deploying Dask on cloud environments
Troubleshooting
Summary and Next Steps
Requirements
- Experience with data analysis
- Python programming experience
Audience
- Data scientists
- Software engineers
Open Training Courses require 5+ participants.
Scaling Data Analysis with Python and Dask Training Course - Booking
Scaling Data Analysis with Python and Dask Training Course - Enquiry
Scaling Data Analysis with Python and Dask - Consultancy Enquiry
Testimonials (2)
Examples/exercices perfectly adapted to our domain
Luc - CS Group
Course - Scaling Data Analysis with Python and Dask
The fact of having more practical exercises using more similar data to what we use in our projects (satellite images in raster format)
Matthieu - CS Group
Course - Scaling Data Analysis with Python and Dask
Provisional Courses
Related Courses
Advanced Alerting and Automation with Grafana and Prometheus
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at advanced-level DevOps and SRE professionals who wish to enhance their alerting and automation skills with Grafana and Prometheus.
By the end of this training, participants will be able to:
- Create and manage advanced alerting rules in Prometheus.
- Integrate Prometheus Alertmanager with external tools using webhooks.
- Automate responses to alerts for faster issue resolution.
- Use Grafana to visualize and manage alerts effectively.
ArcGIS from Basic to Advanced
35 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at beginner-level to advanced-level GIS professionals and analysts who wish to learn how to effectively use ArcGIS for data visualization, spatial analysis, and geospatial project management.
By the end of this training, participants will be able to:
- Navigate and utilize ArcGIS tools for geospatial data management.
- Create and customize maps with layers and attributes.
- Perform advanced spatial analysis and geoprocessing tasks.
- Automate workflows using ModelBuilder and Python.
ArcGIS Enterprise for Technical Support
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at beginner-level IT support personnel who wish to provide robust support for ArcGIS Enterprise, addressing any anomalies or failures effectively.
By the end of this training, participants will be able to:
- Understand the architecture and components of ArcGIS Enterprise.
- Learn to install, configure, and manage ArcGIS Enterprise.
- Gain skills in troubleshooting and resolving common issues.
- Develop proficiency in monitoring and maintaining ArcGIS Enterprise environments.
- Master the techniques for backup, recovery, and performance optimization.
ArcGIS Fundamentals
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at beginner-level professionals who wish to learn the fundamental concepts and tools of ArcGIS.
By the end of this training, participants will be able to:
- Understand the basic concepts of GIS and spatial data.
- Navigate the ArcGIS interface.
- Create and manage spatial data.
- Perform basic spatial analysis.
- Create maps and visualizations.
ArcGIS Professional Plus: Advanced GIS Data Management and Analysis
14 HoursArcGIS Professional Plus is an advanced version of ArcGIS Pro, offering extended features for geospatial data analysis, 3D modeling, automation, and enterprise collaboration.
This instructor-led, live training (online or onsite) is aimed at intermediate-level GIS professionals who wish to deepen their skills in spatial data analysis, automation, and sharing using ArcGIS Professional Plus tools.
By the end of this training, participants will be able to:
- Leverage ArcGIS Pro Plus tools for data visualization and analysis.
- Create 2D and 3D maps with advanced symbology and geoprocessing techniques.
- Automate workflows using ModelBuilder and Python scripting.
- Integrate ArcGIS with external data services and enterprise systems.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Advanced ArcGIS Pro for Spatial Analysis
35 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at advanced-level GIS professionals who wish to use ArcGIS Pro to enhance their spatial analysis capabilities, conduct comprehensive geostatistical analysis, and apply advanced 3D modeling techniques for more effective decision-making and problem-solving in real-world scenarios.
By the end of this training, participants will be able to:
- Develop advanced skills in spatial analysis techniques using ArcGIS Pro.
- Utilize Python scripting for automation and complex data processing.
- Apply spatial modeling for problem-solving in real-world scenarios.
- Conduct geostatistical analysis for advanced data interpretation.
- Integrate external data sources and leverage 3D spatial data analysis.
Building Effective Dashboards with Grafana and Prometheus
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at intermediate-level DevOps and SRE professionals who wish to create impactful dashboards and optimize their monitoring practices using Grafana and Prometheus.
By the end of this training, participants will be able to:
- Understand best practices for designing effective dashboards.
- Create and configure advanced Grafana dashboard elements.
- Leverage Grafana templating for dynamic and reusable dashboards.
- Implement alerting mechanisms to enhance operational awareness.
Custom Metrics and Instrumentation with Prometheus and Grafana
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at intermediate-level developers and DevOps engineers who wish to instrument their applications to export and monitor custom metrics effectively using Prometheus and Grafana.
By the end of this training, participants will be able to:
- Understand the importance of custom metrics for application monitoring.
- Instrument applications to export custom metrics for Prometheus.
- Create and configure dashboards in Grafana to visualize custom metrics.
- Apply best practices for integrating monitoring into the development lifecycle.
Advanced Geographic Information Systems (GIS)
21 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at intermediate-level geographers who wish to deepen their expertise in spatial analysis, data management, and GIS applications.
By the end of this training, participants will be able to:
- Apply advanced spatial analysis techniques to solve complex geographical problems.
- Manage large spatial databases and perform data quality control.
- Create dynamic and interactive maps and visualizations for various applications.
- Utilize programming and automation to streamline GIS workflows.
Google Maps API for Developers
14 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at developers who wish to enhance their website or web application by adding geo-location functionality through Google Maps.
By the end of this training, participants will be able to:
- Embed a map into a new or existing website or web application.
- Display live data that synchs with current location.
- Create a mashup that includes Google Maps.
- Apply good CSS and JavaScript coding practices
- Create a custom map for an application.
Monitoring with Grafana
14 HoursThis instructor-led, live training in Poland (onsite or remote) is aimed at data analysts who wish to use Grafana for monitoring and visualizing data.
By the end of this training, participants will be able to:
- Set up the necessary development environment to start creating visualizations in Grafana.
- Set up Grafana for high availability.
- Customize panels and dashboards with data.
- Configure a reverse proxy for fast loading speeds.
Python for Geographic Information System (GIS)
21 HoursA geographic information system (GIS) is a system designed to capture, store, manipulate, analyze, manage, and present spatial or geographic data. The acronym GIS is sometimes used for geographic information science (GIScience) to refer to the academic discipline that studies geographic information systems and is a large domain within the broader academic discipline of geoinformatics.
The use of Python with GIS has substantially increased over the last two decades, particularly with the introduction of Python 2.0 series in 2000, which included many new programming features that made the language much easier to deploy. Since that time, Python has not only been utilized within commercial GIS such as products by Esri but also open source platforms, including as part of QGIS and GRASS. In fact, Python today is by far the most widely used language by GIS users and programmers.
This program covers the usage of Python and its advance libraries like geopandas, pysal, bokeh and osmnx to implement your own GIS features. The program also covers introductory modules around ArcGIS API, and QGIS toolboox.
Python for ArcGIS and QGIS for Earth Sciences and Engineering Professionals
35 HoursThis instructor-led, live training in Poland (online or onsite) is aimed at beginner-level earth sciences and engineering professionals who wish to use Python for geospatial analysis in both ArcGIS and QGIS environments.
By the end of this training, participants will be able to:
- Learn Python syntax and control structures for executing geospatial tasks efficiently.
- Use Pandas, Numpy, and Matplotlib for data analysis and visualization in GIS.
- Manipulate and analyze vector data with Geopandas, Arcpy, and PyQGIS libraries.
- Automate geospatial processes and workflows using Python scripting in ArcGIS and QGIS.
- Develop custom Python-based geoprocessing tools for ArcGIS and QGIS to streamline tasks.
QGIS for Geographic Information System
21 HoursA geographic information system (GIS) is a system designed to capture, store, manipulate, analyze, manage, and present spatial or geographic data. The acronym GIS is sometimes used for geographic information science (GIScience) to refer to the academic discipline that studies geographic information systems and is a large domain within the broader academic discipline of geoinformatics.
QGIS functions as geographic information system (GIS) software, allowing users to analyze and edit spatial information, in addition to composing and exporting graphical maps. QGIS supports both raster and vector layers; vector data is stored as either point, line, or polygon features. Multiple formats of raster images are supported, and the software can georeference images. To summarize it allows the users to Create, edit, visualise, analyse and publish geospatial information on Windows, Mac, Linux, BSD.
This program, in its first phase, introduces the QGIS interface for general usage. In the second phase, we introduce PyQGIS - the python libraries of QGIS that allows the integration of GIS functionalities in your python code or your python application, so that you may even create your own Python Plugin around a particular GIS functionality.