Course Outline

Introduction

  • Overview of Dask features and advantages
  • Parallel computing in Python

Getting Started

  • Installing Dask
  • Dask libraries, components, and APIs
  • Best practices and tips

Scaling NumPy, SciPy, and Pandas

  • Dask arrays examples and use cases
  • Chunks and blocked algorithms
  • Overlapping computations
  • SciPy stats and LinearOperator
  • Numpy slicing and assignment
  • DataFrames and Pandas

Dask Internals and Graphical UI

  • Supported interfaces
  • Scheduler and diagnostics
  • Analyzing performance
  • Graph computation

Optimizing and Deploying Dask

  • Setting up adaptive deployments
  • Connecting to remote data
  • Debugging parallel programs
  • Deploying Dask clusters
  • Working with GPUs
  • Deploying Dask on cloud environments

Troubleshooting

Summary and Next Steps

Requirements

  • Experience with data analysis
  • Python programming experience

Audience

  • Data scientists
  • Software engineers
  14 Hours
 

Number of participants


Starts

Ends


Dates are subject to availability and take place between 09:30 and 16:30.
Open Training Courses require 5+ participants.

Testimonials (3)

Related Courses

Related Categories