Course Outline

Introduction

  • Overview of Databricks and Apache Spark
  • Understanding the Databricks architecture

Getting Started

  • Setting up the Environment
  • Setting up and configuring Databricks
  • Navigating the Databricks user interface
  • Creating a Databricks workspace

Working with Data in Databricks

  • Connecting to an Apache Spark data source
  • Understanding the basics columns and datatypes
  • Managing file system into Notebooks

Managing Jobs and Clusters

  • Creating and configuring clusters
  • Creating jobs using Notebook
  • Running jobs
  • Viewing jobs and job details

Using Delta Lake in Databricks

  • Loading data into Delta Lake
  • Managing data in Delta Lake

Securing Databricks

  • Managing Databricks security
  • Managing backup and recovery

Troubleshooting

Summary and Next Steps

Requirements

  • Basic understanding of data analytics
  • Knowledge of Apache Spark

Audience

  • Data Engineers
  • Data Scientists
  • Developers
  14 Hours
 

Number of participants


Starts

Ends


Dates are subject to availability and take place between 09:30 and 16:30.
Open Training Courses require 5+ participants.

Testimonials (2)

Related Courses

Data Analytics With R

  21 Hours

Related Categories