Brief Summary
This course is all about becoming a pro at web scraping using JavaScript, CasperJS, and PhantomJS. You’ll tackle 5 projects to learn how to extract data and manipulate web pages like a boss. Perfect for anyone wanting to up their web game!
Key Points
- Learn to scrape data from web pages using CasperJS.
- Work through 5 hands-on projects to solidify concepts.
- Understand advanced web scraping techniques.
- Use PhantomJS and CasperJS for testing and UI interaction.
- Capture, download, and save website data effectively.
Learning Outcomes
- Gain confidence in web scraping with practical projects.
- Master the use of CasperJS and PhantomJS for various tasks.
- Be able to scrape dynamically loaded content.
- Learn how to handle AJAX-loaded data.
- Understand data management by saving results as tables or JSON.
About This Course
Become a better JavaScript developer and learn front-end testing. We'll use JavaScript, lodash, and jQuery for scraping.
In this course you will learn how to scrape data from web pages using CasperJS.
The course consists of 5 example projects that help you fully understand the power of a headless browser driven through the CasperJS API.
What You Will Learn
You will gain a thorough understanding of advanced web scraping concepts, along with insight into how to use CasperJS for testing, DOM manipulation, and UI interaction.
What to Expect
The Projects Will Cover
- How to scrape websites that are rendered with JavaScript instead of static HTML
- How to wait for AJAX-loaded data to appear before scraping elements
- How to submit forms, both for authorization and when making searches
- How to define navigation steps, like logging into a site, clicking a button, and following links
- How to write specified data into tables and save the output as an .html file or as JSON
- How to take screenshots of both full web pages and specific containers
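The topics above can be sketched as one short CasperJS script. This is only an illustration and is not taken from the course projects: the URL, the selectors (form#search, .result, #results), the field names, and the output file names are all assumptions, and rowsToHtmlTable and escapeHtml are hypothetical helpers written for this example.

```javascript
// Sketch only: URL, selectors, field names and file names are hypothetical.

// Escape the characters that would break the generated markup.
function escapeHtml(text) {
  return String(text)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;');
}

// Turn an array of {title, price} objects into an HTML table string.
function rowsToHtmlTable(rows) {
  var body = rows.map(function (row) {
    return '<tr><td>' + escapeHtml(row.title) + '</td><td>' +
      escapeHtml(row.price) + '</td></tr>';
  }).join('\n');
  return '<table>\n<tr><th>Title</th><th>Price</th></tr>\n' + body + '\n</table>';
}

// The navigation steps only run under CasperJS/PhantomJS, where `phantom` exists.
if (typeof phantom !== 'undefined') {
  var casper = require('casper').create();
  var fs = require('fs'); // PhantomJS file system module
  var rows = [];

  // Step 1: open the page and submit a search form.
  casper.start('http://example.com/search', function () {
    this.fill('form#search', { q: 'laptops' }, true); // true = submit the form
  });

  // Step 2: wait until the AJAX-loaded results exist in the DOM, then scrape.
  casper.waitForSelector('.result', function () {
    // evaluate() runs inside the page, so only DOM APIs are available here.
    rows = this.evaluate(function () {
      var nodes = document.querySelectorAll('.result');
      return Array.prototype.map.call(nodes, function (node) {
        return {
          title: node.querySelector('.title').textContent,
          price: node.querySelector('.price').textContent
        };
      });
    });
  });

  // Step 3: save the data as JSON and as an HTML table, then take screenshots.
  casper.then(function () {
    fs.write('results.json', JSON.stringify(rows, null, 2), 'w');
    fs.write('results.html', rowsToHtmlTable(rows), 'w');
    this.capture('page.png');                        // full page
    this.captureSelector('results.png', '#results'); // one container
  });

  casper.run();
}
```

Saved as, say, scrape.js, a script like this would be started with casperjs scrape.js. The table helper is kept as a plain function so it can be checked without a browser.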
What is PhantomJS?
PhantomJS is a headless WebKit browser with a full web stack that you control through a JavaScript API. Phantom gives us the power to perform many interesting actions on a web page, such as manipulating the page, simulating user interaction, and dynamically capturing and saving website data.
What is CasperJS?
CasperJS is a stand-alone framework built on top of PhantomJS and is compatible with most operating systems. The focus of this course will be on the Casper API, and we'll be using this API to write all our web scraping scripts.
What You Should Know
You should already know JavaScript basics, including what a callback function is. It will help if you know some jQuery. We use lodash in our examples, but only as a replacement for the built-in map method that's part of the native JavaScript API.
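As a quick self-check on the prerequisites, here is a minimal sketch of a callback passed to the native map method. The links array and the toLabel function are made up for illustration and do not come from the course.

```javascript
// Hypothetical scraped data, invented for this example.
var links = [
  { text: ' Home ', href: '/home' },
  { text: 'About', href: '/about' }
];

// A callback is a function handed to another function, which calls it later.
function toLabel(link) {
  return link.text.trim() + ' -> ' + link.href;
}

// The native Array.prototype.map calls toLabel once per element
// and collects the return values into a new array.
var labels = links.map(toLabel);

// With lodash loaded as `_`, the equivalent call would be: _.map(links, toLabel)
console.log(labels.join('\n'));
```

One practical difference worth knowing: lodash's map also accepts array-like collections, whereas the native method is only available directly on real arrays.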
Know how to use JavaScript for Data Mining
Be able to Capture, Download and Save Website Data
Understand how to use CasperJS and PhantomJS
Mohammed A.
Course for beginners
One big issue I found: it doesn't use Windows.