Getting Started Main Features Examples

ScrapeStorm Tutorial : How to scrape data to get the current employment trends and hot industries in UK from Glassdoor

2018-10-15 19:38:44
302 views

Abstract:This tutorial explains in detail how to extract data from employment websites.

ScrapeStorm’s “Smart Mode” function automatically extracts list data and identifies the page. We can use ScrapeStorm to extract the information we need on employment websites, such as jobs, salary, company address and so on. Through the analysis of these data, we can get the current employment trends and hot industries in the UK.

ScrapeStorm for Windows, MacOS and Linux Download:

http://www.scrapestorm.com/

 

Step 1.Creating a task.

This step can refer to the previous lesson tutorial:

ScrapeStorm Tutorial 1: Use “Smart Mode” to extract listing data from website

Product list link:

https://www.glassdoor.com/Salaries/uk-salary-SRCH_IL.0,2_IN2.htm

Delete unwanted data and increase the required data.

Step 2. Scraping into the the product listing page.

Select the title link column and click “Scrape Into”.

Click “Pre-login”, sign in on the popup page, then close the page. Wait a while, the extraction page will be refreshed.

On detail page click “Add Field” button and then select the element in web page to extract its related text.

Select “Modify Data” from the drop down box, and click”Extract Number”.

Step 3. Starting to extract.

Click “Start”, check “Block Ads” in the pop-up box to prevent the extraction of ads and change the request time to 5s. Then you can find that ScrapeStorm has extracted data.

Click “Export” to download your data.

After the extraction is completed, you can export the data to a local file (including excel, html, csv, etc.) and a database.

P.S. The data of the list page and the detail page will be merged during the extraction.

The following image is a screenshot of the file exported to excel2007:

If you are still confused about the process, please watch the tutorial video as below: