Plan Szkolenia

Introduction

  • Overview of Spark and Hadoop features and architecture
  • Understanding big data
  • Python programming basics

Getting Started

  • Setting up Python, Spark, and Hadoop
  • Understanding data structures in Python
  • Understanding PySpark API
  • Understanding HDFS and MapReduce

Integrating Spark and Hadoop with Python

  • Implementing Spark RDD in Python
  • Processing data using MapReduce
  • Creating distributed datasets in HDFS

Machine Learning with Spark MLlib

Processing Big Data with Spark Streaming

Working with Recommender Systems

Working with Kafka, Sqoop, Kafka, and Flume

Apache Mahout with Spark and Hadoop

Troubleshooting

Summary and Next Steps

Wymagania

  • Experience with Spark and Hadoop
  • Python programming experience

Audience

  • Data scientists
  • Developers
 21 godzin

Liczba uczestników



Cena za uczestnika

Opinie uczestników (2)

Szkolenia Powiązane

Python and Spark for Big Data (PySpark)

21 godzin

Introduction to Graph Computing

28 godzin

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

21 godzin

Apache Spark MLlib

35 godzin

Data Analysis with Python, Pandas and Numpy

14 godzin

Accelerating Python Pandas Workflows with Modin

14 godzin

Machine Learning with Python and Pandas

14 godzin

Scaling Data Analysis with Python and Dask

14 godzin

FARM (FastAPI, React, and MongoDB) Full Stack Development

14 godzin

Developing APIs with Python and FastAPI

14 godzin

Scientific Computing with Python SciPy

7 godzin

Game Development with PyGame

7 godzin

Web application development with Flask

14 godzin

Advanced Flask

14 godzin

Build REST APIs with Python and Flask

14 godzin

Powiązane Kategorie