Scraping Data from Any Website Easily
Octoparse is an extraordinary web scraping tool for data analysis, SEO, marketing, e-commerce, IT, real estate, hospitality and more. We know how hard it is for us to build our own database. It is a headache to write the code using python to conduct web scraping for most of us. Octoparse is the ultimate tool for data extraction (web crawling, data crawling and data scraping). With a precise database at hand, you will be able to conduct data analysis, marketing strategy, sentiment analysis, ad campaigns, lead generation and more.
In order to achieve automatic web scraping in a real sense, the Octoparse team has never slowed down its pace in making data more accessible and ready for everybody. It’s rooted in our belief that in the era of big data, anyone should be blessed with the capability to collect data so as to harness the power of big data.
Today, we are extremely excited to introduce the release of our most stunning feature — Web Scraping Template.
What is Web Scraping Template?
Web Scraping Template is a set of pre-formatted tasks ready for everyone without configuring any scraping rules or writing code.
What makes Template Mode so special?
Have you ever wondered about the level of technical proficiency required to build a web scraper? The answer is “None” with the newly launched Web Scraping Template. With the traditional web scraping technique, you have to learn Python in order to complete one task template. However, Python has a stiff learning curve. Think of writing Python like editing photos using Adobe Photoshop. Compared with photography filter apps like Meitu, Adobe Photoshop is way more complicated with sets of parameters. Octoparse Web Scraping Templates are the solution for people who have a hard time laying a hand on web scraping. All you need to do is enter the URLs of the websites, and Octoparse will take care of you from there.
Who is this for?
Anyone! Yes, for anyone that wants to get data fast and easy. If we already have the template you need, that would be great and carry on! If not, let us know through the contact form.
What Web Scraping Templates Does Octoparse Offer?
Note: We are constantly updating the templates. Details may be subject to change. Download Octoparse and open the “Task Template” mode to try it out yourself!
1. Amazon: It is a multinational technology company that focuses on e-commerce. Its gigantic data pool includes infinitive numbers of product information. With Octoparse web scraping template, you would be able to:
Scrape basic product information: product name, price, ASIN, images, descriptions, categories, shipping, delivery, customers reviewed products, ratings, number of reviews, Amazon bestseller lists and page URLs
2. Tokopedia: It is the No.1 Indonesian’s Most Visited E-Marketplace. Indonesia is today one of the fastest-growing e-commerce industries in the world. How can you miss this golden place for your business? With Octoparse web scraping template, you will be able to:
Scraping basic product information: product name, sellers, prices, installment, product weight (Berat), insurance (Asuransi), buy numbers (Beli) and conditions (Kondisi).
3. Walmart: Being the No.1 Fortune 500 Company for 6 straight years. There is a reason why people like to spend money on Walmart. Octoparse can help you find out how Sam Walton “help customers, cut costs and share profits”. With Octoparse web scraping template, you will be able to:
Scrape basic product information including product name, brand, price, shipping, arrival date, free pick update, and item number, ratings, number of reviews, product page URLs
4. Rakuten: an internet services business giant from Japan. It engages in internet advertising, sales in the internet shopping mall, e-commerce sites, hotel reservation sites, banking, credit card related services, money wire business and other segments like telecommunication service. Such a company is a great opportunity to dive in and generate your sales leads. With Octoparse web scraping template, you will be able to:
Scrape basic product information including Store name, product, pricing, members, product rankings, credits, ratings, number of reviews and product page URLs
5. Yahoo shopping: It is one of the biggest online stores besides Rakuten in Japan. With the Octoparse web scraping template, you will be able to:
Scrape basic product information including image URLs, product description, shipping, pricing, store name, store URLs, ratings and number of reviews
6. Houzz Products: It is the third-largest website and online community for architecture, interior design, home and improvement in the United States. With Octoparse web scraping template, you will be able to:
Scrape basic product information including product names, pricing, shipping, and page URLs.
7. Canadian Tire: The company operates its business from three segments of retail, CT REIT and financial services. Together it covers every aspect of our daily life including entertainment, fixing, automotive, gardening, sports and etc. With Octoparse web scraping template, you will be able to:
Scrape basic product information including product name, pricing, stock, item number, ratings and number of reviews
8. Bestbuy: Barron’s has named Best Buy No. 1 on its list of the 100 Most Sustainable Companies for 2019. The various products include software, video games, digital cameras, car stereos, mobile phones and etc. Octoparse is a great tool for price monitoring on Bestbuy. With Octoparse web scraping template, you will be able to:
Scrape basic product information including name, model number, pricing, SKU, open box, product URL, image URLs, page number, extraction time, current list page, page title, page URL, product review number and product review URL
9. Sam’s Club: The company is a membership-based retail warehouse club owned by Walmart. With 599 membership warehouse clubs in 44 U.S. states. As they are growing their business towards online retailing in order to better compete with Amazon, it would be a chance for you to grow the business. With Octoparse web scraping template, you will be able to:
Scrape basic product information including product name, item number, brand, pricing, product URLs, product number of reviews.
10. Bukalapak: It is an e-commerce company that sells various products. The company aims to connect and empower millions of users from Indonesia. With millions of shoppers and sellers already getting connected via its website, Bukalapak offers great opportunities and environments for the e-commerce business to grow or generate leads from. With Octoparse web scraping template, you will be able to:
Scrape basic product information including product name, pricing, sellers, location, sending time, number of customers, order received, image URLs, page URLs
11. 1688.com (Alibaba.com): This is the Chinese portal of Alibaba.com which handles domestic business in China. It is a great place for sellers to sell items at wholesale prices. It is very hard to evaluate how lucrative one business can build out of it. We all know Alibaba which is the world’s largest online B2B trading platform. Don’t waste the opportunity to use the website to build your business. With Octoparse web scraping template, you will be able to:
Scrape basic product information including product name, pricing, image URLs, sales of days, product URL, rate of returns, location, store URLs, membership, business model, keyword, current page, back up keyword, current URLs.
12. JD.COM: One of the biggest online shopping websites in China with over 300 million annual active customers order things from fresh food, apparel, electronics, cosmetics and more. The best strategy to grow your e-commerce business is to dig up sales leads and potential from this global giant. With Octoparse web scraping template, you will be able to:
Scrape basic product information including product name, pricing, product URLs, stores, pricing, number of comments and extraction time.
13. Mercari: It is a very successful online flea market in Japan. With over 10,000 updated items each day, and over one million listings, it is heaven both for digging great goods and business opportunities. With Octoparse web scraping template, you will be able to:
Scrape basic product information including image URLs, price, shipping, delivery, shipping price, product description, brand, product category, seller, seller links.
14. Zozotown: It is the largest online fashion mall founded by Yusaku Maezawa. Being Japan’s leading online fashion retailer with over 6000 popular brands, there are infinite opportunities for e-commerce and foreign trading: With Octoparse web scraping template, you would be able to:
Scrape product, price, brand, image URLs, product link, description, seller and seller phone number, ZOZO customer service, packaging, shipping, delivery, material, size, sex, color, and credits.
15. Taobao：This platform is owned by Alibaba. As one of the most populous e-commerce platforms, Taobao offers foreign companies endless potential. You can sell anything on the platform from food, cosmetics, electronics, and even social media accounts. With Octoparse web scraping template, you will be able to:
Scrape basic product information including product name, product ID, product meta description, product page URL, pricing, property, image URLs, store name and address, product number of reviews and favorites.
16. eBay: It is an online shopping platform with over 170million buyers. The site is best known for its auction and C2C sales. It is also available in many different countries. If you want to kickstart a business on eBay, Octoparse is a must-have tool to monitor prices, generate leads, product rankings, etc. With Octoparse web scraping template, you will be able to:
Scrape product name, item number, product URLs, condition, inventory, price, inventory, seller name, link, product number of positive feedback
17. ヤフオク ( Yahoo! Auctions): With over 50 million listings of products in Japan portal, it is the most popular Japanese auction site. It has a proxy bidding service that allows customers around the globe to bid on Yahoo! Auction items safely. With Octoparse web scraping template, you will be able to：
Scrape the item information including Item name, item ID, image URL, item link, time remaining, condition, catalog, bidding price, return, bidding time, inventory, item description, delivery, and sender.
18. Yahoo! Shopping: One of the biggest e-commerce websites in Japan, in 2014 it reached 134,000 shopkeepers with 100 million products. With Octoparse web scraping template, you will be able to：
Scrape the item information including product name, description, image URL, shipping cost, tax-included price, number of reviews, earning points, rating, seller and seller URL.
1. Booking.com: it is a travel information aggregator website. With almost 30million listings in over 150 thousand destinations across 228 countries and territories, it is a giant data source for business market research and surveys. With Octoparse web scraping template, you will be able to:
Scrape hotel information including hotel name, address, stars, amenities, breakfast information, number of reviews, average score, number of rooms, image URLs, and the page URL.
2. Airbnb: It is an American online marketplace and hospitality service company. It enables people to list properties. Using the web scraping technique, it is possible to gather information including demographics, population, and housing. It is crucial for real estate and travel agents to gather such information in a timely manner. With Octoparse web scraping template, you will be able to:
Scrape hotel information including: title, location, property, page URLs, number of guests, number of bedrooms, number of beds, number of bathrooms, price, rating, number of reviews, amenities, sleeping arrangements, host, Joined time, language, response rate, response time, current time, and image URLs
3. TripAdvisor: With more than 570 million reviews and opinions covering 1.2 million hospitality businesses, TripAdvisor processes a lot of data. In the hospitality industry, it is important for a business to know how to optimize price and advertise wisely. With Octoparse web scraping template, you will be able to:
Scrape hotel information including hotel name, location, number of reviews, ranking, web page URL, phone number, amenities, room features, ratings, location rating, cleanliness rating, service rating, value rating, great for walkers, number of restaurants, number of attractions, image URLs.
1. Instagram: Photo-sharing on this platform has reached up to 1 billion monthly active users. Web scraping is definitely the solution to extract information and keep up-to-date with social trends. With Octoparse web scraping template, you will be able to:
Scrape basic post information including post content, post date, number of likes, location, image URL.
2. Twitter: 500 million tweets sent per day by 326 million users. It is a gold mine for data including entertainment, sports, celebrities, news, financial and etc. It is a popular site to research and analyze the economy, society, and politics. With Octoparse web scraping template, you will be able to:
Scrape basic post information including Twitter username, user ID, tweet content, publishing date, comments, number of retweets, number of likes, image URL, Tweet URL, Video URL.
3. Youtube: The world’s most popular video website. How do you leverage this giant source pool to create your own information index with valuable data? what are the most trendy videos? what do people perceive as a certain type of video and more? Web scraping can help you deal with these questions. With Octoparse web scraping template, you will be able to:
Scrape basic post information including video title, video description, video link, publishing date, total views, channel link, and name.
4. Weibo: with over 400 million users, Weibo is the gold mine for marketers and businesses. It encompasses the features of Twitter, Pinterest, Instagram, Reddit, and YouTube. With Octoparse web scraping template, you will be able to:
Scrape basic post information including user name, post content, number of favorites, publishing time, source and current URL.
5. Bilibili: It represents online entertainment for younger generations in China. With abundant videos including anime, comics, games and another wide array of genres. It is the land of creative and inspirational content. With Octoparse web scraping template, you will be able to:
Scrape basic post information including video title, labels, channel URL, video URL, number of upvotes, description, number of favorites, video length, publishing time, number of views, number of bullet screens, number of coins and number of saved.
6. Facebook: The world’s largest social media platform. businesses compete against each other to gain traffic from it. A smart strategy is to find the target audience and market with the right promotion strategy. With Octoparse web scraping template, you will be able to:
Scrape basic post information including Facebook user name post content, post URL, post content addition, number of likes, number of comments, number of shares, timestamp and extraction time.
There are millions of web pages and content are uploaded every day. Even if search engines can help to refine the searches faster, you still need to manually click through each result to filter out your desired one. To optimize the process, Octoparse can pull out the target information and export it into a structured format. What would be better than having a resource tailor machine which saves your valuable time?
1. Bing: As the third-largest search engine. Bing shared the part resources with Google, yet the search results are different. With Octoparse web scraping template, you will be able to:
Scrape search result information including title, URL and meta description.
2. Google Search: The biggest search engine, the information is overwhelming. To concur with the situation of getting lost, web scraping can help to create our own database of all sites. With Octoparse web scraping template, you will be able to:
Scrape search result information including title, URL and meta description.
1. CrunchBase: is a leading platform to discover talents. with over 50 million professionals including investors, market researchers, sales, entrepreneurs and more. For HR, web scraping is incredible to help you extract the right candidates. With Octoparse web scraping template, you will be able to:
Scrape company information including company names, introductions, categories, found dates, operating status, number of employees, IPO status, company type, website URL, Facebook URL, LinkedIn URL, Twitter URL, and email.
2. Yellowpages: It has been a well-known service provider and business directory for years. Instead of the old-fashioned phone book, now Yellowpages focus on digital marketing. By posting an ad campaign, expanding your brand, and engaging with your potential business partners, web scraping can help you build the data pool, with Octoparse web scraping template, you will be able to:
Scrape business information including name, website, regular hours, address, open hours, phone number, email, rating, categories, neighborhoods, price range, payment methods, and other information.
3. Yelp: millions of people are searching for a business for all kinds of purposes. The community possesses rich data of photos, reviews, and business information. This is where you need to explore and get to know your business and your competitors. With Octoparse web scraping template, you will be able to:
Scraping business information including the name, star rating, number of reviews, tags, phone number, address, website URL and business hours.
4. 食べログ [Eat Log]: A ranking and review website for gourmet restaurants in the whole country. There are up to 900,000 gourmet restaurants with photos, reviews, and rankings. You will be able to find food in various genres. It is the Japanese version of American Yelp. With Octoparse web scraping template, you will be able to:
scrape restaurant names, ratings, categories, review numbers, addresses, reservations, homepages, business hours, dishes, services, reviewers, occasions, phone numbers, spaces and facilities, parking, smoking/non-smoking, private and private dining rooms, number of seats, payment and budget.
5. Iタウンページ [ I TOWN PAGE ]: It is Internet telephone directory provided by NTT Town Page. You can search for phone numbers, maps and directions for shops and businesses all over the country. With Octoparse web scraping template, you will be able to:
scrape information including business name, website URL, business description, address, phone number, and email.
1. Phoenix New Media (Ifeng.com): It is a television network based in HongKong. It features a variety of topics from politics, affairs, entertainment, foreign news and more. It is not difficult to have a news aggregator with Octoparse. Octoparse is able to capture news articles, articles and video links, comments, and reading trends. With Octoparse web scraping template, you will be able to:
Scrape news article information including: article title, category, publishing time, extraction time and current URL.
1. BestBuy Review: if you are an electronics retailer, keep an eye on BestBuy. Besides analyzing price changes, what product is the most popular and what do customers think of it? It is easy to conduct a product sentiment analysis with Octoparse. With Octoparse web scraping template, you will be able to:
Scrape reviews including: product name, model number, SKU, ratings, number of reviews, recommendation rate, account, rating, brief comments, post time, whether to recommend, helpful upvotes, unhelpful rates, page URL, description, and review content.
2. Google Play: It is known as Android Market. According to Statista, there are over 2.6 million applications in the Google Play Store. For app developers, knowing how to create a top-notch app is essential. To do so, we need to know the common features of the top apps. It’s easy to have a database of Top Selling Apps, Top Grossing Apps, Top Games, Top Selling Games, Top Grossing games. With Octoparse web scraping template, you will be able to:
Scrape app reviews including: app name, company name, category, user name, review post time, comments, review star rating, product URL, category URL.
Google Maps have at least 1 billion monthly users. It is intuitive to navigate the business location using Google Map. It is intuitive to use google map for marketing purposes. Octoparse can help you extract the information and create a business index in a certain area.
1. Google Hotel Information: With Octoparse web scraping template, you will be able to
Scrape hotel reviews, including hotel names, addresses, reviews, ratings, websites, phone numbers and business hours.
2. Google Restaurant data: With Octoparse web scraping template, you will be able to
Scrap hotel reviews, including restaurant names, reviews, ratings, addresses, business websites, phone numbers and business hours.
3. Googleマップ: It is Google Maps in Japan, with Octoparse web scraping template, you would be able to:
Scrape business names, business hours, phone number, website URL, address, review numbers, ratings, and descriptions.
1. Houzz Professional: it is an online platform for home design businesses and projects With nearly more than 35 million users on Houzz. It connects homeowners with contractors, art designers, and other professionals. With Octoparse web scraping template, you will be able to:
Scrape professional referral information including the general contractor, number of reviews, rating star, contact information, business website, page URL, business description and job costs.
2. マイナビ転職 : Mynavi Co. Ltd is the largest human resource advertising company in Japan. Its main business is to provide business opportunities, career changes, and employment. With Octoparse web scraping template, you will be able to:
Scrap employment information including company name, address, company description, email address, phone number, website URL, job description, job requirements, work location, working hours, salary, benefits, compensation, vacation, expected hire number, information update date, and job listing URL.
3. リクナビNext: Rikunabi is an employment site provided by the Recruit Group in Japan. Many newly graduated college students rely on this website for job hunting. With Octoparse web scraping template, you will be able to:
Scrape job information including company name, company homepage, job post period, job description, job requirements, work location, salary, working hours, vacation, benefits and compensation and job listing URL.
1. Gumtree: it is the UK’s largest website for the local community with 14.8 million monthly unique visitors. It connects Australians, New Zealanders and South Africans to relocate their homes. For real estate agents, home investors, flippers, home buyers and sellers, this is the land for you to dive into. With Octoparse web scraping template, you will be able to:
Scrape property information including property type, ad ID, title, price, address, description, release date, edit date, number of bedrooms, dwelling type, pet-friendly, bathrooms, parking, furnishing, smoking, availability, owner, page URL, image URL.
2. Kijiji: it is an online advertising service available for more than 300 cities in Canada, Italy, HongKong, and Taiwan. As one of the top ten websites in Canada, it is the leading commercial real estate marketplace for tenants, landlords, and brokers. With Octoparse web scraping template, you will be able to:
Scrape property information including listing ID, title, property type, price, release time, address, furnishings, pet-friendly, seller tag, average reply rate, seller status, and page URL.
3. SUUMO: It is one of the largest real estate aggregator websites in Japan. It provides information including property buying, selling, rental and remodeling of various property types. With Octoparse web scraping template, would you be able to
Scrape property information including property name, property type, property location, price, build date, traffic, land area, coverage ratio and volume ratio, release date, reviews, contacts, management, property image URLs.
1. Yahoo! Finance: is a media website that provides financial news and data including stock quotes, press releases, and financial reports. For people interested in Bitcoin, Ethereum, Litecoin and Octoparse, you can provide your cryptocurrency trading information on time. With the Octoparse web scraping template, you will be able to:
Scrap cryptocurrency information including symbol, URLs, name, intraday price, change, change percentage, market cap, volume in currency, and circulating supply.
It is a freely accessible index for scholarly literature. It is one of the most powerful academic databases. For researchers, professionals, and students, there is no need to spend time collecting papers and sources. With Octoparse web scraping template, would you be able to
Scrape article search results including titles, article links, version numbers, cited numbers, meta descriptions, and authors.