Apache Spark
Original price was: $450.00.$430.00Current price is: $430.00.
Description
What is Apache Spark?
Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, Spark ML for machine learning, GraphX for graph data processing, and Spark Streaming for live data stream processing. With Spark running on Apache Hadoop YARN, developers can create applications to derive actionable insights within a single, shared dataset in Hadoop.
Course Overview
This training course will teach you how to solve Big Data problems using Apache Spark framework. The training will cover a wide range of Big Data use cases such as ETL, DWH, data virtualization, streaming, graph data structure, machine learning. It will also demonstrate how Spark integrates with other well established Hadoop ecosystem products. You will learn the course curriculum through theory lectures, live demonstrations and lab exercises. This course will be taught in primarily Scala programming language.
Duration: 3 Days or 24 Hours
Mode of Delivery: Virtual Training
Prerequisites
Following are the pre-requisites for the course.
- Basic Knowledge of big data use-cases
- Basic knowedge of Hadoop, HDFS and Hive
- Basic knowledge of databases, OLAP/OTLP use cases, SQL
- Programming knowledhe in python
Reviews
There are no reviews yet.