Course Outline

Introduction

  • The Data Science Process
  • Roles and responsibilities of a Data Scientist

Preparing the Development Environment

  • Libraries, frameworks, languages and tools
  • Local development
  • Collaborative web-based development

Data Collection

  • Different Types of Data
    • Structured 
      • Local databases
      • Database connectors
      • Common formats: XLSX, XML, JSON, CSV, ...
    • Unstructured
      • Clicks, sensors, smartphones
      • APIs
      • Internet of Things (IoT)
      • Documents, pictures, videos, sounds
  • Case study: Collecting large amounts of unstructured data continuously

Data Storage

  • Relational databases
  • Non-relational databases
  • Hadoop: Distributed File System (HDFS)
  • Spark: Resilient Distributed Dataset (RDD)
  • Cloud storage

Data Preparation

  • Ingestion, selection, cleansing, and transformation
  • Ensuring data quality - correctness, meaningfulness, and security
  • Exception reports

Languages used for Preparation, Processing and Analysis

  • R language
    • Introduction to R
    • Data manipulation, calculation and graphical display
  • Python
    • Introduction to Python
    • Manipulating, processing, cleaning, and crunching data

Data Analytics

  • Exploratory analysis
    • Basic statistics
    • Draft visualizations
    • Understanding the data
  • Causality
  • Features and transformations
  • Machine Learning
    • Supervised vs. unsupervised
    • When to use what model
  • Natural Language Processing (NLP)

Data Visualization

  • Best Practices
  • Selecting the right chart for the right data
  • Color palettes
  • Taking it to the next level
    • Dashboards
    • Interactive Visualizations
  • Storytelling with data

Summary and Conclusion

Requirements

  • A general understanding of database concepts
  • A basic understanding of statistics

Duration: 35 hours
