Download and Sign Up
Get a $5 Coupon For Free
Getting Started Main Features

【Flowchart Mode】Basic operational procedures | Web Scraping Tool | ScrapeStorm

2021-07-16 10:37:29
13383 views

Abstract:This tutorial demonstrates the basic operational procedures of Flowchart Mode. ScrapeStormFree Download

1. Enter the correct URL

Flowchart mode is not as strict as smart mode, but you still need to enter the correct URL format as required.

ScrapeStorm supports single and multiple URL scraping, and supports Manual Input, File Import and URLs Generator.

For more details, please refer to the following tutorial:

How to create a flowchart mode task

 

2. Pre Login

When you encounter a webpage that requires login, you can click this button to use the Pre Login function.

For more details, please refer to the following tutorial:

How to scrape web pages that need to be logged in to view

 

3. Solve Captcha

When you encounter captcha while editing a task, you can click this button to use the Solve Captcha function.

 

4. Open Proxy

When you encounter captcha or other anti-climbing on the home page, you can use the switch proxy function in addition to the solve captcha function.

Click here to learn more about Open Proxy.

 

5. Web Security Option

You can try this feature when you encounter a web page exception, but be aware that opening this option may cause some content on the page to not be scraped (such as content in an iframe).

 

6. Advanced Settings

You can monitor pushStates and block URLs in the advanced settings.

 

7. Switch Browser

Some webpages display different content on the computer and on the mobile phone. The software generally scraps the webpage of the computer version by default. If the user wants to scrape the webpage of the mobile version, it can be scraped by switching the browser mode.

For more details, please refer to the following tutorial:

What is the role of switching browser mode

 

8. How to use components

Flowchart mode supports visual web page click operation. You can click on elements according to the prompts provided by the software. The software will generally automatically detect page breaks and follow the software prompts to generate rules.

For more details, please refer to the following tutorials:

Introduction to flowchart components

How to set up paging

The operation flow is shown in the following figure:

 

9. Set Extract component

In flowchart mode, we set the fields by using the Extract component. You can click the prompt tips to generate the component automatically or drag the component from the component window.

For more details, please refer to the following tutorial:

How to set the fields

 

10. Scrape In

If you need to scrape the data on the detail page, you can use the Scrape In function to scrape.

For more details, please refer to the following tutorial:

How to Scrape In

 

11. Run settings

Before starting the task, we need to configure the task, including Schedule, IP Rotation & Delay, automatic export, Download Files, Speed Boost, Data Deduplication and Developer.

For more details, please refer to the following tutorial:

How to configure the scraping task

 

12. Run interface

After starting the task, it will jump to the data running interface. On this interface, the user can see the scraping of the data.

 

13. View scraping results and export data

After the task is over, the user can view the scraping results and export the data.

For more details, please refer to the following tutorial:

How to export the scraping results

 

 

Keyword extraction from web content python crawler Match emails with Regex Download web page as word python download file Automatically organize data into excel Data scraping with python Generate URLs in batches php crawler Download images in batches
关闭