Data fetched to the incorrect data fields
Friday, September 21, 2018
When you run a completed task locally or in the cloud, data may be extracted to the wrong "columns" or not extracted at all. The likely cause is an incorrect XPath that fails to locate the intended data.
Here’s an example of data fetched to the incorrect data fields.
This is the data we need:
And this is the data output. Notice that none of the highlighted data is extracted correctly.
How to fix it?
To ensure that data is always extracted to the corresponding data fields, write a new XPath that pinpoints the right data on every page. To do this, refer to our step-by-step tutorial on how to locate the element with XPath. Alternatively, if you are extracting name-value pairs, check out how to associate data with nearby text.
Step 1. Select the data field that needs to be fixed and click the icon for "Customize data field"
Step 2. Click "Customize XPath"
Step 3. Change the XPath to the new one
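
To illustrate what a more reliable "new one" might look like, here is a minimal sketch in Python of why a position-based XPath can land on the wrong field while a label-anchored XPath does not. The page markup, the field names, and both XPath expressions are hypothetical, made up for this illustration; they are not what Octoparse generates. The sketch uses Python's built-in xml.etree.ElementTree, which supports only a subset of XPath, so treat it as a way to reason about the problem rather than a replacement for testing the XPath inside Octoparse.

```python
import xml.etree.ElementTree as ET

# Two hypothetical product pages. Page B has no "Brand" row,
# so every row after it shifts up by one position.
PAGE_A = """
<table>
  <tr><th>Brand</th><td>Acme</td></tr>
  <tr><th>Color</th><td>Red</td></tr>
  <tr><th>Weight</th><td>2 kg</td></tr>
</table>
"""
PAGE_B = """
<table>
  <tr><th>Color</th><td>Blue</td></tr>
  <tr><th>Weight</th><td>3 kg</td></tr>
</table>
"""

# Position-based: "the value in the second row", whatever that row is.
POSITIONAL = "./tr[2]/td"
# Label-anchored: "the value in the row whose header reads Color".
BY_LABEL = "./tr[th='Color']/td"


def extract(page, xpath):
    """Return the text of the first node matching xpath, or None."""
    root = ET.fromstring(page.strip())
    node = root.find(xpath)
    return node.text if node is not None else None


if __name__ == "__main__":
    for name, page in (("A", PAGE_A), ("B", PAGE_B)):
        print(name, "positional:", extract(page, POSITIONAL))
        print(name, "by label:  ", extract(page, BY_LABEL))
```

On page A both expressions return the color. On page B the positional XPath returns the weight instead, which is the kind of "wrong column" output described above, while the label-anchored XPath still returns the color. This mirrors the idea of associating data with nearby text when extracting name-value pairs.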