Sale!

Big-Data Batch processing pipeline for Beginners | End to End | Spark + Scala

549.001,299.00

In this course you will get an end to end flow of a Big-Data Batch processing pipeline from Data ingestion to Business reporting, using Apache Spark, Hadoop Hortonworks cluster, Apache airflow for scheduling, and Power BI reporting.

Please check the details in the Description section and choose the Project Variant that suits you!

Clear
SKU: GKBP0002 Category:

Description

Course Introduction

Course Variants

Starter variant

  • 5+ Hours of video sessions with detailed explanation on
    • Project Overview.
    • Understanding Business use cases.
    • Basic project and tools setup.
    • Basics of SCALA & Linux.
    • Detailed code development in Spark with SCALA.
    • SQL DB Integration.
    • Power BI, reporting, and visualization.
    • Code deployment on Hortonworks Cluster.
    • Pipeline orchestration using Apache Airflow.
  • Entire Code-Base
  • Basic Data-sets

Extended variant

  • 5+ Hours of video sessions with detailed explanation on
    • Project Overview.
    • Understanding Business use cases.
    • Basic project and tools setup.
    • Basics of SCALA & Linux.
    • Detailed code development in Spark with SCALA.
    • SQL DB Integration.
    • Power BI, reporting, and visualization.
    • Code deployment on Hortonworks Cluster.
    • Pipeline orchestration using Apache Airflow.
  • Entire Code-Base
  • Basic Data-sets
  • Six Months access to GKCodelabs premium series, with regular updates on Big Data technology-stack including:
    • Cloud deployments on GCP, EMR, Azure.
    • Integration with HBase, MongoDB.
    • Advanced Datawarehouse modeling.
    • Spark optimization techniques.
    • Spark Streaming.

 

ONLY FOR NON-INDIAN Payments


Project Variant
Email



Additional information

Project Variant

Starter, Extended, Starter-Upgrade