Download and Sign Up
Get a $5 Coupon For Free
Getting Started Main Features

【Smart Mode】【Flowchart Mode】How to use URLs Generator | Web Scraping Tool | ScrapeStorm

2020-01-17 16:52:27
22991 views

Abstract:This tutorial will show you how to use URLs generator of ScrapeStorm. No Programming Needed. Visual Operation. ScrapeStormFree Download

ScrapeStorm supports URLs Generator, which is to automatically batch generation of URLs you need by setting certain rules.

The settings of Smart Mode and Flowchart Mode are the same, as shown in the following figure:

URLs Generator is generally used for multi-URLs scraping scenarios, which can be divided into two cases:

1. There is no next-page button on the page.

In this case you cannot scrape it by clicking the next-page button. The URLs of each page of this kind of website generally only have some different parameters, such as page number. So you can use the URLs Generator to generate URLs for all pages at once, and then scrape them by multiple URLs.

2. When you need to scrape multiple URLs that are compliant with certain rules, then can use the URLs Generator to generate URLs without having to manually fill them out.

In general, the basic structure of the URL is:

fixed URL + change parameters + fixed URL

Here we use Wikipedia as an example, the URL example is as follows:

https://en.wikipedia.org/wiki/100_(number)

https://en.wikipedia.org/wiki/101_(number)

https://en.wikipedia.org/wiki/102_(number)

……

https://en.wikipedia.org/wiki/110_(number)

We can see that the URLs are the same except for the red part numbers.

……

Now let’s introduce how to use URLs Generator.

Step 1: Enter the first part of the fixed URL.

https://en.wikipedia.org/wiki/

Step 2: Click Add Param

Step 3: Set the first param and set the parameter type to number. Then configure parameters such as start value,end value, and step size.

Step 4: Click Add Param, add the second parameter, set the parameter type to customize, and then paste the second half of the fixed URL into it, that is: _(number)

We can see the resulting URLs in the URLs preview.

P.S. This article only describes the parameters of digital type as an example. The parameters supported by ScrapeStorm also include letters and time. You can refer to this article for setting.

 

Data scraping with python Download web page as word python crawler Download videos in batches Automatically organize data into excel php crawler Generate URLs in batches Keyword extraction from web content Match emails with Regex python download file
关闭