How to Use
- Step 1: Click Try it!
- Step 2: Enter Start URLs - the list of URLs of web pages the scraper should start.
- Step 3: Set Max link depth - how deep this scraper will descend from the web pages specified in the start URLs. If zero, the scraper will exclusively crawl the Start URLs without venturing into any subpages.
- Step 4: Set Max number of pages - a limit to the total number of webpages to scrape per URL.
- Step 5: Set whether or not to Stay within the domain - if Yes, the scraper will only follow links on the same domain as the referring page. For example, when the scraper finds https://www.domain-b.com/some-page on https://domain-a.com/some-page, it will not crawl the page because it is on a different domain.
- Step 6: Click Start to run the task in your preferred mode.
Data Preview
Start_URL | Domain | Depth | Referrer_URL | Current_URL | Emails | Phones | Uncertain_Phones | Twitter | YouTube | Facebook | LinkedIn | Instagram | Tiktok |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
https://www.octoparse.com | www.octoparse.com | 0 | https://www.octoparse.com | https://www.octoparse.com | https://twitter.com/Octoparse | https://www.youtube.com/channel/UCweDWm1QY2G67SDAKX7nreg | https://www.linkedin.com/company/octopus-data-inc. | ||||||
https://www.octoparse.com | www.octoparse.com | 1 | https://www.octoparse.com | https://www.octoparse.com/privacy-policy | support@octoparse.com;isabel@octoparse.com$3;u003eisabel@octoparse.com | https://twitter.com/Octoparse | https://www.youtube.com/channel/UCweDWm1QY2G67SDAKX7nreg | https://www.linkedin.com/company/octopus-data-inc. | |||||
https://www.octoparse.com | www.octoparse.com | 1 | https://www.octoparse.com | https://www.octoparse.com/black-friday-sale-2023?utm_source=sitebanner&utm_medium=opsite&utm_campaign=23bf | https://twitter.com/intent/tweet?text=Get%20your%2030%25%20OFF%20offer%20in%20Octoparse%20Black%20Friday%20Sale,%20November%2015-30,%202023%20(EST)%20time-limited.&url=https://www.octoparse.com/black-friday-sale-2023?refid=711 | https://www.linkedin.com/cws/share?url=https://www.octoparse.com/black-friday-sale-2023?refid=711 | |||||||
https://www.octoparse.com | www.octoparse.com | 1 | https://www.octoparse.com | https://www.octoparse.com/ | https://twitter.com/Octoparse | https://www.youtube.com/channel/UCweDWm1QY2G67SDAKX7nreg | https://www.linkedin.com/company/octopus-data-inc. | ||||||
https://www.octoparse.com | www.octoparse.com | 1 | https://www.octoparse.com | https://www.octoparse.com/download | someone@example.com | https://twitter.com/Octoparse | https://www.youtube.com/channel/UCweDWm1QY2G67SDAKX7nreg | https://www.linkedin.com/company/octopus-data-inc. | |||||
https://www.octoparse.com | www.octoparse.com | 1 | https://www.octoparse.com | https://www.octoparse.com/pricing | https://twitter.com/Octoparse | https://www.youtube.com/channel/UCweDWm1QY2G67SDAKX7nreg | https://www.linkedin.com/company/octopus-data-inc. | ||||||
https://www.octoparse.com | www.octoparse.com | 1 | https://www.octoparse.com | https://www.octoparse.com/customer-stories | https://twitter.com/Octoparse | https://www.youtube.com/channel/UCweDWm1QY2G67SDAKX7nreg | https://www.linkedin.com/company/octopus-data-inc. | ||||||
https://www.octoparse.com | www.octoparse.com | 1 | https://www.octoparse.com | https://www.octoparse.com/blog | https://twitter.com/Octoparse | https://www.youtube.com/channel/UCweDWm1QY2G67SDAKX7nreg | https://www.linkedin.com/company/octopus-data-inc. | ||||||
https://www.octoparse.com | www.octoparse.com | 1 | https://www.octoparse.com | https://www.octoparse.com/terms-and-conditions | support@octoparse.com;u003esupport@octoparse.com | (800) 952-5210;(916) 445-1254 | https://twitter.com/Octoparse | https://www.youtube.com/channel/UCweDWm1QY2G67SDAKX7nreg | https://www.linkedin.com/company/octopus-data-inc. |
Notes
- To get leads from a specific website, consider using a website-specific template first.
- This scraper is an enhanced version of Email & Social Media Scraper focusing on email and social media links, but now it can delve deeper into subpages.
- The following contact information is extracted: emails, phone numbers, uncertain phone numbers, YouTube, Tiktok, LinkedIn, Twitter, Facebook, and Instagram profiles.
- Social media profiles are extracted from links in the HTML.
- Due to website restrictions, the Start URLs must not contain any Facebook or Instagram links.
- This template CANNOT scrape contact details that are not shown in the source HTML. For social media profiles, it only detects clickable links.
Is Scraping Contact Details Legal?
Web scraping is generally legal if you scrape publicly available non-personal data. What you do with the data is another question. Documentation, help articles, or blogs are typically protected by copyright, so you can't republish the content without the owner's permission. Learn more about the legality of web scraping in this article. If you're not sure, please seek professional legal advice.