Examples Tutorials Videos

ScrapeStorm Tutorial: Use "Smart Mode" to extract data of multiple URLs from an e-commerce website

2018-08-02 17:46:41

Abstract:This tutorial explains how to scrape data from multiple product pages simultaneously on an e-commerce website.

Step 1.Creating a task,enter multiple URLs

Open ScrapeStorm, select “Smart Mode”, click “Start”,enter the above URLs.

P.S. When entering multiple URLs, only one can be entered in one line

 

For the extraction of multiple URLs, there is a second method, which is more suitable for a large number of URLs.

First create a new doucument and enter the URLs into this document.

Click “From files” and select the newly created document, then import it.

 

Step 2.Extracting data

The software will automatically load the first URL,then it will identify the data to be extracted.

Click “title_link” and “Scrape into”, you will enter to the detail page from the list page.

On detail page click “Add Field” button and then select the element in web page to extract its related text.

 Rename the fields to make them more intuitive.

 

Step 3.Starting to extract.

Click “Start”, then you can find that ScrapeStorm has extracted more data.

Choose your own needs, you can block ads and images, and you can extract data regularly.

Click “Export” to download your data.

After the extraction is completed, you can export the data to a local file (including excel, html, csv, etc.) and a database.

P.S. The data of the list page and the detail page will be merged during the extraction.

The following image is a screenshot of the file exported to excel2007:

If you are still confused about the process, please watch the tutorial video as below: