Apache Spark and Scala

Simpliv LLC


£20.24 inc VAT
Study method
Online, self-paced
6 Hours
No formal qualification
Additional info
  • Certificate of completion available and is included in the price

Add to basket or enquire


About this Course
This course on Apache Spark and Scala aims at providing an advanced expertise in big data Hadoop ecosystem. This course will provide a standard skillset which helps one become a specialist on the top of Big data Hadoop developer.

The course starts with a detailed description on limitations of mapreduce and how Spark can help overcome them. Further it covers a deeper dive into the Scala programming language.

Moving on it covers Spark as a standalone cluster and an understanding of Resiliient Distributed Datasets.

The course also covers concepts of Spark SQL using SQL queries through SQL context and Hive Queries through Hive context.

This course certainly provides material required for building a career path from Big data Hadoop developer to BIg data Hadoop architect.

Basic knowledge
Prior knowledge of Apache Hadoop will be an added advantage, but not compulsory.
Fundamental understanding of any programming language


What you will learn
Understand the limitations of Hadoop mapreduce and how Spark overcomes these limitations
Gain expertise in Scala programming language and its characteristics
Able to work with RDDs' and create applications in Spark
A thorough understanding about Spark SQL by using SQL queries in Spark


  • Module-1 Introduction to Big data, Hadoop and Spark
  • Module 2: Introduction to Scala Programming Language
  • Module 3: Advanced Scala Programming
  • Apache Spark RDDs
  • Module 2: Apache Spark RDDs II
  • Module 6: Working with Key-Value Pairs
  • Advanced Spark Programming
  • Running Spark Jobs on Cluster
  • Module 9: Spark SQL
  • Module 10: Spark Streaming

Who is this course for?

Students who aspire to gain a deep understanding of Apache Spark
Professionals looking for a career in real time big data analytics
Big Data and Hadoop Developers who want to analyze data faster


Currently there are no reviews for this course. Be the first to leave a review.


What does study method mean?

Study method describes the format in which the course will be delivered. At courses are delivered in a number of ways, including online courses, where the course content can be accessed online remotely, and classroom courses, where courses are delivered in person at a classroom venue.

What are CPD hours/points?

CPD stands for Continuing Professional Development. If you work in certain professions or for certain companies, your employer may require you to complete a number of CPD hours or points, per year. You can find a range of CPD courses on, many of which can be completed online.

What is a ‘regulated qualification’?

A regulated qualification is delivered by a learning institution which is regulated by a government body. In England, the government body which regulates courses is Ofqual. Ofqual regulated qualifications sit on the Regulated Qualifications Framework (RQF), which can help students understand how different qualifications in different fields compare to each other. The framework also helps students to understand what qualifications they need to progress towards a higher learning goal, such as a university degree or equivalent higher education award.

What is an ‘endorsed’ course?

An endorsed course is a skills based course which has been checked over and approved by an independent awarding body. Endorsed courses are not regulated so do not result in a qualification - however, the student can usually purchase a certificate showing the awarding body’s logo if they wish. Certain awarding bodies - such as ABC Awards and TQUK - have developed endorsement schemes as a way to help students select the best skills based courses for them.


Mobile Navigation