Skip to content

Apache Spark Fundamentals


EduCBA

Summary

Price
£149 inc VAT
Or £49.67/mo. for 3 months...
Study method
Online
Duration
2 hours · Self-paced
Qualification
No formal qualification

Overview

Apache Spark™ is a fast and general engine for large-scale data processing. It is supposed to run programs at a high speed as compared to Hadoop and Mapreduce. It is quite easy to use and can write applications quickly in Java, Scala, Python, R. In terms of performance it can combine SQL, streaming, and complex analytics. It is quite versatile and can runs on Hadoop, Mesos, standalone, or in the cloud. It can access diverse data sources including HDFS, Cassandra, HBase, and S3.

Description

Through this overview course on Apache Spark you shall understand the fundamental mechanisms and basic internals of the framework and understand the need to use Spark. This course is basically intended for users who are interested to learn about Apache Spark and are just starting to learn about what is does. The course will give you an overview of Apache Spark, why use Spark and Spark Core.

Course Objective:

  • To understand the need for Apache Spark
  • Overview of Core Apache Spark

Curriculum

1. Introduction
Introduction to Apache Spark Spark
Spark Context
Spark Components

2. Spark RDD Basics
Introduction to Spark RDD Basics
Use of Filter Function
RDD Transformations in Spark
RDD Transformations in Spark Continues
RDD Persistence in Spark
Group Sort and Actions on Pair RDDs
Spark File Formats
Spark File Formats Continues

Questions and answers

Currently there are no Q&As for this course. Be the first to ask a question.

Reviews

Currently there are no reviews for this course. Be the first to leave a review.

FAQs

Study method describes the format in which the course will be delivered. At Reed Courses, courses are delivered in a number of ways, including online courses, where the course content can be accessed online remotely, and classroom courses, where courses are delivered in person at a classroom venue.

CPD stands for Continuing Professional Development. If you work in certain professions or for certain companies, your employer may require you to complete a number of CPD hours or points, per year. You can find a range of CPD courses on Reed Courses, many of which can be completed online.

A regulated qualification is delivered by a learning institution which is regulated by a government body. In England, the government body which regulates courses is Ofqual. Ofqual regulated qualifications sit on the Regulated Qualifications Framework (RQF), which can help students understand how different qualifications in different fields compare to each other. The framework also helps students to understand what qualifications they need to progress towards a higher learning goal, such as a university degree or equivalent higher education award.

An endorsed course is a skills based course which has been checked over and approved by an independent awarding body. Endorsed courses are not regulated so do not result in a qualification - however, the student can usually purchase a certificate showing the awarding body's logo if they wish. Certain awarding bodies - such as Quality Licence Scheme and TQUK - have developed endorsement schemes as a way to help students select the best skills based courses for them.