Competitor product extraction

Competitor site crawling and product data extraction

Need to extract all product data from competitor sites in one go?
Need a detailed overview of your competitors’ assortment, along with full pricing, product description, category, and brand information?
The Price2Spy team has mastered a website crawl and extraction process that can help your business gather this data in bulk.
Just send us the list of competitor sites from which you want product data extracted, and we’ll be happy to give you a quote for the task.

Some of the key technical strengths that distinguish Price2Spy Crawler from other similar tools are the following:

  • Ability to crawl and extract data from very complex websites
    • having a very complex page navigation structure
    • having complex JavaScript menu and/or paging implementation
    • having strong anti-bot protection (sites that do not want to be crawled) – for example, Amazon
    • capturing multiple product variations shown on the same product page
  • Crawl size / location
    • we can crawl websites in any language / any country
    • crawling an entire website, or only specific product categories/brands (for example, it doesn’t make much sense to crawl the whole of Amazon, but crawling several specific product categories on Amazon can be done)
    • crawling websites which are location-sensitive (showing different results depending on the visitor’s IP / ZIP code). For example, Amazon.com will show different results depending on whether you are an international or US visitor. The same applies to other Amazon websites.
    • websites of any size, from small (fewer than 1,000 product pages) to big (more than 1,000,000 product pages)
  • Extraction results
    • We can capture data fields which are not shown on the product page itself (for example, fields shown on the category page, before the product page is reached)
    • Extraction results can be delivered as a list (Excel, CSV, XML). If you need a custom format, let us know
    • Extraction results can be run through automated translation services (so you get results in your preferred language)
  • One-off or recurring
    • Price2Spy’s Product Extraction service can be used as a one-off data source, or as a recurring process
    • If you go for a recurring crawl/extraction, you will be able to determine the recrawl frequency
  • Product matching
    • Products that have been extracted can be matched (automatically or manually) to your own products
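
As an illustration of the automatic matching mentioned above, here is a minimal sketch that pairs an extracted product name with the closest name in your own catalogue. It is not Price2Spy’s actual matching method, which this page does not describe; the function name `best_match`, the 0.8 threshold, and the sample product names are all assumptions made for the example, and it relies only on Python’s standard `difflib` module.

```python
from difflib import SequenceMatcher


def best_match(extracted_name, own_names, threshold=0.8):
    """Return (name, score) for the most similar own product name.

    Hypothetical helper: returns (None, best_score) when no candidate
    reaches the similarity threshold, so the product can be left for
    manual matching.
    """
    best_name, best_score = None, 0.0
    for name in own_names:
        # ratio() gives a similarity between 0.0 and 1.0
        score = SequenceMatcher(None, extracted_name.lower(), name.lower()).ratio()
        if score > best_score:
            best_name, best_score = name, score
    if best_score >= threshold:
        return best_name, best_score
    return None, best_score


# Illustrative data only
own_catalogue = ["Acme Kettle 1.7 L", "Acme Toaster 2-slice"]
print(best_match("Acme Kettle 1.7L", own_catalogue))
```

Name similarity alone is a weak signal; in practice, identifiers such as EAN or manufacturer part number, where available, would usually be compared first, with anything below the threshold left for manual review.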

 

Results of Price2Spy crawls/extractions can later be used as a feed for a regular Price2Spy account, so that newly extracted products are continuously price-monitored. If needed, a continuous crawl process can be part of your Price2Spy Enterprise package.

Use cases

We have performed crawl/extraction operations for many of our clients, and we have noticed that they can be roughly grouped into the following use cases:

  • For online retailers
    • Extracting a competitor’s complete assortment (to be used as a data source for adding new products to your own store)
    • Extracting deltas in a competitor’s assortment
      • Knowing which products have been added to a competitor’s website
      • Knowing which products have been discontinued by your competitor
      • (of course, this requires a periodic recrawl)
  • For Brands / Distributors
    • Extracting product reviews from retail websites, in order to determine consumer sentiment towards the product (this service is sometimes combined with automated review translation)
    • Extracting newly released products from competitor brands
    • Data science – providing product (or user review) data for in-house data science projects. Data can be provided in the original language, or machine-translated
  • For Marketing Agencies
    • Extracting products from eCommerce sites (online stores) on behalf of their clients
    • Extracting products and their reviews from review websites (for example, capterra.com, g2crowd.com)
    • Data science – providing product (or user review) data for data science projects performed on behalf of agency clients. Data can be provided in the original language, or machine-translated
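
The assortment deltas listed under online retailers come down to comparing two crawl snapshots. Below is a minimal sketch of that comparison. It assumes each snapshot is a dictionary keyed by a stable product identifier such as a URL or SKU; the function name `assortment_delta` and the sample data are hypothetical and not part of any Price2Spy interface.

```python
def assortment_delta(previous, current):
    """Compare two crawl snapshots keyed by product identifier.

    Hypothetical helper: 'previous' and 'current' map an identifier
    (for example a product URL or SKU) to any product record.
    """
    prev_ids = set(previous)
    curr_ids = set(current)
    return {
        # present now, absent in the earlier crawl
        "added": sorted(curr_ids - prev_ids),
        # present in the earlier crawl, absent now
        "discontinued": sorted(prev_ids - curr_ids),
    }


# Illustrative data only
previous_crawl = {"sku-1": "Red mug", "sku-2": "Blue mug"}
current_crawl = {"sku-2": "Blue mug", "sku-3": "Green mug"}
print(assortment_delta(previous_crawl, current_crawl))
```

A product missing from a single crawl may only be temporarily unavailable, so it can be worth confirming a discontinuation over more than one recrawl before acting on it.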

Assortment monitor

Assortment Monitor is a feature designed to suggest products from any website of your choice – products that you do not monitor yet, but could consider monitoring.
Assortment Monitor puts basic product data (product name, Brand, URL) at your disposal completely free of charge. The data comes from various sources, such as proactive site crawls (performed on Price2Spy’s initiative), public data feeds, and price comparison websites.

However, if you need a more detailed product crawl (with all product fields, such as Description, ImageURL, NumberOfReviews, AverageRating, etc.), please contact us and we will be happy to give you a quote.