
Web Scraping Feature Study | How to Scrape Data by Searching Multiple Keywords on a Website

Wednesday, April 27, 2016 8:05 AM


 

"Enter text" is an action that simulates typing text into a field on a web page.

For example, we can enter login credentials on a page to scrape the data behind a login. If we have a list of keywords to search on a website, we can also use this feature to enter each keyword into the search box. In this tutorial, we'll learn how to use the "Enter text" feature to input text on a web page.

 

1. Input a single keyword into the text box

2. Input multiple keywords into a search box

 

1. Input a single keyword into the text box

Entering text or a keyword in Octoparse is easy. With the built-in browser, you can interact with the web page by pointing and clicking, just as you would in any normal browser.

  •  Click the login box on the page and select "Enter text" in the "Tips" panel

         


 

  •  Type the text into the box in the "Tips" panel and then click "Confirm"

 


An "Enter text" step now appears in the workflow.

 

 

2. Input multiple keywords into a search box

If you have a predefined list of text values, you can add them to the "Text list" to create a looped search action. Octoparse will automatically enter every item in the list into the search box, one at a time.
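To make the idea of a text list concrete, here is a minimal Python sketch of how a block of keywords (one per line) becomes a list that is processed one entry at a time. It is not Octoparse code and does not call any Octoparse API. The function name `parse_text_list`, the sample keywords, and the choice to skip blank lines are all assumptions made for illustration.

```python
def parse_text_list(raw: str) -> list[str]:
    """Split pasted text into keywords, one per line.

    Assumption for this sketch: surrounding whitespace is trimmed
    and blank lines are skipped.
    """
    return [line.strip() for line in raw.splitlines() if line.strip()]


# Hypothetical keywords, written the way they would be typed into a
# one-keyword-per-line box.
raw_keywords = """laptop
wireless mouse

usb hub
"""

keywords = parse_text_list(raw_keywords)

# A loop over the list handles one entry per iteration, in list order.
for position, keyword in enumerate(keywords, start=1):
    print(f"iteration {position}: search for {keyword!r}")
```

Each iteration stands for one pass of the loop: the current keyword is the only value that changes between passes.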

  •  Hover over the workflow, click the "+" button, and select "Loop"

 


 

  •  Click "Loop Item" and switch the Loop Mode to "Text List"
  •  Click the icon that opens the text list, then enter the keywords (one keyword per line) in the pop-up box
  •  Click "Confirm" and then "Apply"

 


  • Click on the search box on the page in the built-in browser and select "Enter text" 

 


  •  Leave the text field empty and click "Confirm" to create an "Enter text" step

 


  • Select "Use text in the loop to enter the text box" then click "Apply" to save the settings.

 


  •  Click the search button on the page and choose "Click button"

 


Go through the workflow step by step. If Octoparse reaches the page you need automatically, your configuration is set up correctly.
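The order of the steps in this workflow (Loop, then Enter text, then Click button) can be modeled with a short Python simulation. This is only a sketch of the logic, not how Octoparse is implemented. `FakeSearchPage`, `run_workflow`, and the sample keywords are hypothetical names invented for this example, and no real web page is contacted.

```python
from dataclasses import dataclass, field


@dataclass
class FakeSearchPage:
    """Hypothetical stand-in for a page with a search box and a search button."""
    box_text: str = ""
    searches: list[str] = field(default_factory=list)

    def enter_text(self, text: str) -> None:
        # Replace whatever is in the search box, like typing a new keyword.
        self.box_text = text

    def click_search(self) -> None:
        # Record the keyword that was in the box when the button was clicked.
        self.searches.append(self.box_text)


def run_workflow(page: FakeSearchPage, text_list: list[str]) -> list[str]:
    """Model of the workflow: Loop -> Enter text -> Click button."""
    for keyword in text_list:      # Loop step in "Text List" mode
        page.enter_text(keyword)   # Enter text step, using the loop's text
        page.click_search()        # Click button step
    return page.searches


page = FakeSearchPage()
result = run_workflow(page, ["laptop", "wireless mouse", "usb hub"])
print(result)
```

The Enter text step receives its value from the loop rather than from a fixed string. That is why the step is created with an empty text field and then switched to use the text in the loop.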

 

If you need any assistance with your data project, please feel free to submit a request here to contact us.

 

Happy Data Hunting!

Author: The Octoparse Team


 
