Waqar Ahmed
Learn how to scrape websites using Python with Beautiful Soup and Selenium. Extract data efficiently and organize it with practical projects!
This course is all about making web scraping super easy and fun. You'll find out how to grab data from websites like a pro using Python tools like Beautiful Soup and Selenium. We're gonna keep it light and simple, just like chatting with a buddy!
Web Scraping and Crawling with Python: Beautiful Soup, Requests & Selenium
Web scraping (also called screen scraping, web data extraction, or web harvesting) is a technique for extracting large amounts of data from websites and saving the extracted data to a local file or a database.
In this course, you will learn how to perform web scraping using Python 3 and Beautiful Soup, a free, open-source Python library for parsing HTML.
We will use lxml, a fast and extensive library for parsing XML and HTML documents that can even handle malformed tags. We will also use the Requests module instead of the built-in urllib module (urllib2 in Python 2) because of its improvements in speed and readability.
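As a rough illustration of the "save" step described above, the sketch below writes already-extracted records to a CSV file and to a SQLite database using only the Python standard library. The `rows` data, the `title` and `price` fields, and the `books` table are hypothetical; they stand in for whatever a scraper actually collects.

```python
import csv
import sqlite3

# Hypothetical records standing in for data a scraper has already extracted.
rows = [
    {"title": "Example Book A", "price": 12.50},
    {"title": "Example Book B", "price": 8.99},
]


def save_to_csv(records, path):
    """Write a list of dicts to a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["title", "price"])
        writer.writeheader()
        writer.writerows(records)


def save_to_sqlite(records, path):
    """Insert the records into a SQLite table, creating it if needed."""
    conn = sqlite3.connect(path)
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "CREATE TABLE IF NOT EXISTS books (title TEXT, price REAL)"
            )
            conn.executemany(
                "INSERT INTO books (title, price) VALUES (:title, :price)",
                records,
            )
    finally:
        conn.close()
```

Calling `save_to_csv(rows, "books.csv")` or `save_to_sqlite(rows, "books.db")` would write the records to the given (assumed) file name; a CSV file suits small one-off exports, while a database makes repeated runs easier to query.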
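To make the roles of these libraries concrete, here is a minimal sketch rather than the course's own code. `fetch_html` downloads a page with Requests, and `make_soup` hands the HTML to Beautiful Soup using the lxml parser, falling back to Python's built-in parser if lxml is not installed. Both function names and the sample markup are assumptions for illustration. The exact tree built from broken markup can differ between parsers.

```python
import requests
from bs4 import BeautifulSoup, FeatureNotFound


def fetch_html(url, timeout=10):
    """Download a page with Requests and return its HTML text."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return response.text


def make_soup(html):
    """Parse HTML with lxml if it is installed, else the built-in parser."""
    try:
        return BeautifulSoup(html, "lxml")
    except FeatureNotFound:
        return BeautifulSoup(html, "html.parser")


# Deliberately sloppy markup: the <li> and <p> tags are never closed.
messy_html = "<ul><li>First<li>Second</ul><p>Unclosed paragraph"

soup = make_soup(messy_html)
items = soup.find_all("li")
print(len(items), "list items found")
print(soup.find("p").get_text())
```

For a live page, the same parsing step would be `make_soup(fetch_html(url))` with a URL of your choice.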
Finally, we will use Selenium alongside Beautiful Soup to crawl AJAX & JavaScript driven pages.
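One common way to combine the two tools, sketched here under assumptions, is to let Selenium load the page in a real browser so its JavaScript runs, then pass the rendered HTML to Beautiful Soup. The function names, the headless Chrome setup, and the selectors are illustrative choices, not something the course description specifies; running `fetch_rendered_html` requires Chrome and the `selenium` package.

```python
from bs4 import BeautifulSoup


def fetch_rendered_html(url, wait_css, timeout=10):
    """Load a page in headless Chrome and return the HTML after JS has run.

    Selenium is imported inside the function so that the parsing helper
    below can be used on machines without a browser installed.
    """
    from selenium import webdriver
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    options = webdriver.ChromeOptions()
    options.add_argument("--headless=new")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        # Block until the JavaScript-rendered element exists in the DOM.
        WebDriverWait(driver, timeout).until(
            EC.presence_of_element_located((By.CSS_SELECTOR, wait_css))
        )
        return driver.page_source
    finally:
        driver.quit()  # always close the browser, even on errors


def extract_texts(html, css):
    """Parse rendered HTML with Beautiful Soup and collect matching text."""
    soup = BeautifulSoup(html, "html.parser")
    return [el.get_text(strip=True) for el in soup.select(css)]
```

A typical call would be `extract_texts(fetch_rendered_html(url, "div.post"), "div.post h2")`, where the URL and the `div.post` selector are placeholders for the page being scraped.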
The course covers the following topics: accessing web pages programmatically; scraping web pages to extract the required data, using Beautiful Soup to parse them; interacting with web pages programmatically; and using Selenium for web scraping, including when it is needed.
By the end of this course, you will understand how websites and servers function, a range of data extraction techniques, and methods for handling and organizing data.
This Web Scraping course covers the following topics:
Review of data structures (Lists, Dictionaries, Tuples, File Handling)
How websites are hosted on servers
Calls to the server (GET, POST methods)
Review of HTML and CSS
Requests Module and BeautifulSoup Module overview
Parsing HTML using BeautifulSoup
Filtering elements using BeautifulSoup and navigating the Parse Tree
JavaScript and AJAX overview
Selenium and the need for it
Selecting elements using Selenium
CSS selectors
XPath selectors
Navigating pages using Selenium
Practical Projects
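Several of the topics above (filtering elements, CSS selectors, navigating the parse tree) can be previewed in a few lines. The sketch below runs Beautiful Soup on a small inline HTML snippet invented for this example; the tag names, classes, and links are hypothetical.

```python
from bs4 import BeautifulSoup

# Hypothetical markup used only for this illustration.
html = """
<div id="catalog">
  <h2>Courses</h2>
  <ul>
    <li class="course"><a href="/python">Python</a> <span class="price">10</span></li>
    <li class="course featured"><a href="/scraping">Scraping</a> <span class="price">15</span></li>
  </ul>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# Filtering: find_all matches on tag name and attributes.
# class_="course" also matches elements that carry additional classes.
courses = soup.find_all("li", class_="course")
links = [a["href"] for a in soup.find_all("a")]

# CSS selectors: select_one returns the first match, select returns all.
featured_price = soup.select_one("li.featured > span.price").get_text()

# Navigating the parse tree: move up to a parent, then across to siblings.
first_link = soup.find("a")
parent_li = first_link.parent
next_course = parent_li.find_next_sibling("li")
heading = soup.find("ul").find_previous_sibling("h2")

print(len(courses), links, featured_price)
print(next_course.a.get_text(), heading.get_text())
```

`find_all` and `select` overlap in what they can do; CSS selectors tend to be more compact for nested conditions, while the navigation attributes help when the target is easiest to describe relative to a known element.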
Python Refresher: Review of Data Structures, Conditionals, File Handling
How Websites are Hosted on Servers; Basic Calls to Server (GET, POST Methods)
Web Scraping with Python Beautiful Soup and Requests
GoTrained is an e-learning academy that creates useful content in different languages, concentrating on technology and management. We take a special approach to selecting the content we provide: we mainly focus on skills that are frequently requested by clients and employers but are covered by only a few videos. We also try to build video series to...
Perfect
It went beyond what I expected. Although it is a basic course, it went into a depth not normally seen in other courses at the same level.
Yes, I learned, and I supplemented it with information from the web. The Driver Manager part needs updating.
very nice
Everything is quite understandable and at a decent pace, not too slow or too fast. Manages to keep my attention quite well.
Hard listening with the Indian accent and the hesitant presentation.
e.g. scrapping rather than scraping, scrap rather than scrape ...
Took me a while to understand that correctors are characters etc ...
An excellent course with a nice project at the end.
I really liked the Instagram project.
The instructor could have explained XPath in a little more detail.
Had a lot of great info, examples, relatively up to date on frequently-changing libraries. Good material for both beginners and those with more experience in Python already.
A wonderful intro and detailed explanation of the world of Web Scraping, and the way Mr. Ahmed patiently gives his perspective is phenomenal. I think this would really help scrapers who are new to the data world and NLP to learn and implement easily.
Hynek K.
The course is well-structured and easy to follow, making it accessible even for beginners. However, the last update was in December 2018, and some websites used for scraping have since changed. This made completing the final project (scraping pictures from Instagram) impossible due to Instagram's suspicious login alerts and the need for email confirmation codes. Despite this, I'm giving the course 4.5 out of 5 stars because the 8 hours of content provided a clear understanding of the concepts and allowed me to learn a lot.