Skip to content

Web Scraping In Python: Master The Fundamentals


Stone River eLearning

Summary

Price
£300 inc VAT
Or £50.00/mo. for 6 months...
Study method
Online
Duration
4 hours · Self-paced
Qualification
No formal qualification

Overview

Web scraping is the art of picking out data from a website by looking at the HTML code and identifying patterns that can be used to identify your data. This data can then be gathered and later used for your own analysis.

In this course we will go over the basic of web scraping, learning all about how we can extract data from websites, and all of this is guided along by a work example.

At the end of the course you should be able to go off on your own, and pick out most common

What Will I Learn?

  • Tackle new challenges by understand the underlying method/approach to take
  • Scrape static webpages
  • Be able to scrape websites that use Javascript
  • Extract all sorts of data from websites
  • Know what to look for and how to approach parsing a website
  • Gather data from all over the internet
  • Use recursion algorithms to search through website content

Description


Class Curriculum

Prerequisite knowledge

Start

Introduction (3:20)

Preview

APIs (2:00)

Start

Prerequisite Libraries (3:00)

Start

Introduction to The Modulus Operation (5:01)

Start

Introduction to Simple Error Handling (4:25)

Start

Introduction to Pandas (6:41)

Static Data Extraction/Web Scraping

Start

Response Status Codes From a HTTP Request (7:18)

Start

Reading The Response Text From Our Request (11:40)

Start

First Approach at Parsing The Data (13:18)

Preview

Understanding The Exception Cases (6:39)

Start

Parsing Out All Data for One Company (9:33)

Start

Determining Where We Can Get More Ticker Symbols (15:46)

Start

Extracting Company Ticker Symbols Part 1 (16:32)

Start

Extracting Company Ticker Symbols Part 2 (10:41)

Start

Getting Data For All Parsed Companies (8:11)

Start

Final Data For All Parsed Companies (5:13)

Start

Final Result (1:40)

Scraping Websites That Load Data With Javascript

Start

Prerequisite Libraries (5:02)

Preview

Short review: Recursive Functions (7:43)

Start

Getting started with Selenium (8:47)

Start

View The Page Source (9:14)

Start

Website Elements and XPath (8:11)

Start

Navigating Deeper Into The Page Source (14:37)

Start

Identifying The Path To Our Data (19:28)

Start

Using The XPath To Our Data (9:50)

Preview

Parsing Out Our Data (8:42)

Start

Getting Our Final Data (14:56)

Start

Final Results (4:13)

APIs overview

Start

Introduction To APIs (10:33)

Who is this course for?

  • Anyone interested in analyzing data
  • Anyone who doesn't know how to start gathering data
  • Anyone who wants to develop their ability to scrap data
  • Anyone interested in starting with web scraping
  • Anyone who is interested in expanding their Python knowledge
  • Anyone who wants to gather a wide array of data to play with

Requirements

  • Basic Python knowledge
  • A Python 3 Environment to Code i

Career path

Python developers

Questions and answers

Currently there are no Q&As for this course. Be the first to ask a question.

Reviews

Currently there are no reviews for this course. Be the first to leave a review.

FAQs

Study method describes the format in which the course will be delivered. At Reed Courses, courses are delivered in a number of ways, including online courses, where the course content can be accessed online remotely, and classroom courses, where courses are delivered in person at a classroom venue.

CPD stands for Continuing Professional Development. If you work in certain professions or for certain companies, your employer may require you to complete a number of CPD hours or points, per year. You can find a range of CPD courses on Reed Courses, many of which can be completed online.

A regulated qualification is delivered by a learning institution which is regulated by a government body. In England, the government body which regulates courses is Ofqual. Ofqual regulated qualifications sit on the Regulated Qualifications Framework (RQF), which can help students understand how different qualifications in different fields compare to each other. The framework also helps students to understand what qualifications they need to progress towards a higher learning goal, such as a university degree or equivalent higher education award.

An endorsed course is a skills based course which has been checked over and approved by an independent awarding body. Endorsed courses are not regulated so do not result in a qualification - however, the student can usually purchase a certificate showing the awarding body's logo if they wish. Certain awarding bodies - such as Quality Licence Scheme and TQUK - have developed endorsement schemes as a way to help students select the best skills based courses for them.