Getting Started Main Features Examples

How to scrape product data from Amazon

2018-10-15 19:41:47
320 views

Abstract:This tutorial explains in detail how to scrape data from Amazon via ScrapeStorm's smart mode.

In this article, we will tell you how to scrape product data from Amazon by using ScrapeStorm’s “Smart mode”.

Introduction to the scraping tool

ScrapeStorm is a new generation of web scraping tool based on artificial intelligence technology. It is the first scraper to support both Windows, Mac and Linux operating systems.

Introduction of scraping objects

Amazon is the largest online e-commerce company in the United States, located in Seattle, Washington. It is one of the first companies to start e-commerce on the Internet. It has become the world’s largest online retailer and the second largest Internet company in the world.

Official website: https://www.amazon.com/

Scraping fields

title, title_link, Image, Price, Review, Stars

Function point directory

How to download images

Preview of the scraped result

Export to Excel2007:

Export images to local:

 

1. Download and install ScrapeStorm, then register and log in

2. Create a task

(1) Copy the URL of Amazon

(2) Create a new smart mode task

You can create a new scraping task directly on the software, or you can create a task by importing rules.

Click here to learn how to import and export scraping rule.

3. Configure the scraping rules

(1) Set the fields

ⅰ. Add and Delete fields

ⅱ. Rename the fields

Right click on the data and select “Rename” to modify the field name.

(2) Modify data

Right click on the data and select “Modify Data” to do some processing on the data. Here we choose “Extract Number”.

Click here to learn how to how to configure the extracted field.

4. Set up and start the scraping task

(1) Running and Anti-block settings

Click “Setting”, set waiting time based on web page open speed. You can check “Block Images” and “Block Ads”. The anti-block settings follow the system default settings. Then click “Save”.

P.S. “Block Images” will reduce the load time and speed up the scraping process. And this operation does not affect the scraping and downloading of images.

Click here to learn more about how to configure the scraping task.

(2) Start scraping data

Premium Plan and above users can use “Scheduled job” and “Sync to Database”. If you want to download images, you can check “Download images while running”. Then click “Start”.

Click here to learn about scheduled job.

Click here to learn about sync to database.

Click here to learn about download images.

(3) Wait a moment, you will see the data being scraped.

5. Export and view data

(1) Click “Export” to download your data.

(2) Choose the format to export according to your needs.

Click here to learn more about how to view the extraction results and clear the extracted data.

Click here to learn more about how to export the result of extraction.