Back to Blog

Proxies for Scraping Reviews from Amazon, Google Reviews, and Yelp: Complete Guide Without Blocks

Want to automatically collect reviews from Amazon, Google Reviews, and Yelp, but keep getting blocked? This article discusses which proxies to choose and how to set up data collection without bans.

📅May 16, 2026

You start automatic review collection from Amazon or Google — and after just 10-20 requests, you get a CAPTCHA or an IP ban. Sound familiar? Marketplaces and review platforms actively protect themselves from scraping: they detect bots, block ranges of data center IP addresses, and require CAPTCHA solutions. But with the right proxies, this problem can be solved once and for all.

In this guide, we will discuss which type of proxy is suitable for each platform, how to set up IP rotation, which tools to use without coding — and how to ultimately collect thousands of reviews daily without getting blocked.

Why Amazon, Google, and Yelp Block Review Scraping

Before choosing proxies, it is important to understand: why do blocks occur at all? It’s not just that platforms “don’t want to share data.” They have specific technical protection mechanisms that need to be bypassed skillfully.

Too many requests from one IP. When an average person scrolls through reviews on Amazon, they make 2-5 requests per minute. A scraper makes hundreds. The system detects abnormal activity and blocks the IP. This is the most common reason for bans when collecting data.

Data center IP addresses on blacklists. Amazon, Google, and Yelp have long blacklisted ranges of IPs from major cloud providers: AWS, Google Cloud, DigitalOcean, Hetzner. If you use cheap data center proxies with “exposed” addresses — you will be blocked even before the first request.

Browser fingerprint analysis and headers. Modern protection systems (Cloudflare, PerimeterX, DataDome) analyze not only IPs but also HTTP headers, User-Agent, mouse behavior, and the sequence of requests. If the headers indicate a bot — blocking is inevitable.

Geolocation restrictions. Some reviews on Amazon are only available to users from certain countries. For example, reviews on amazon.de are visible differently from Germany than from Russia. For correct data collection, proxies with the required geolocation are needed.

CAPTCHA and JS challenges. Google especially actively uses reCAPTCHA. Yelp employs JS checks that simple HTTP requests cannot pass. These mechanisms require either the use of browser tools or special services to solve CAPTCHAs.

Key takeaway:

Blocks are not a coincidence, but a system. You can only bypass it comprehensively: the right type of proxy + IP rotation + proper request headers + suitable scraping tool.

What Types of Proxies Exist and What is Suitable for Review Scraping

Not all proxies are equally useful for collecting reviews. Let’s discuss three main types and their applicability to the task.

Datacenter Proxies

These are IP addresses owned by server companies. They are fast, cheap, and well-suited for tasks where speed is more important than anonymity. However, they perform poorly for scraping reviews on Amazon or Google: most of these IPs are already blacklisted. You may be able to collect a few pages, but you will quickly receive a block or CAPTCHA.

Datacenter proxies are justified only for testing the scraper or for platforms with minimal protection — for example, small regional review sites.

Residential Proxies

These are IP addresses of real home users. From the perspective of Amazon or Google — this is an ordinary person with home internet. Such proxies rarely end up on blacklists because their IPs change constantly and belong to real devices.

Residential proxies are the optimal choice for scraping reviews on Amazon, Yelp, and most platforms with moderate protection. They allow requests with the required geolocation (country, city), which is critical for obtaining local reviews.

Mobile Proxies

IP addresses from mobile operators (4G/5G). This is the most “trusted” type of traffic for any platform: mobile IPs are rarely blocked because a single IP may be shared by hundreds of real users (NAT of mobile operators). Google is especially lenient towards mobile addresses.

Mobile proxies are indispensable for scraping Google Reviews and Yelp, where bot protection is particularly aggressive. They are more expensive than residential proxies but provide the highest percentage of successful requests without CAPTCHA.

Parsing Amazon Reviews: Features and Proxy Setup

Amazon is one of the most challenging sites to scrape. The company uses several layers of protection simultaneously: behavior analysis, header verification, geolocation, and an aggressive CAPTCHA system. Nevertheless, thousands of marketers and analysts collect reviews from Amazon daily — they just do it correctly.

What You Need for Successful Amazon Reviews Parsing

Here is the minimum set of conditions under which scraping will work reliably:

  • Residential or mobile proxies with the geolocation of the required country (US for amazon.com, DE for amazon.de)
  • IP rotation — at least every 10-30 requests
  • Correct User-Agent — emulating a real browser (Chrome, Firefox)
  • Delays between requests — 2-5 seconds, to avoid looking like a bot
  • Cookies session — Amazon responds better to requests with saved cookies

Step-by-Step Setup for Amazon

Step 1. Choose residential proxies with the geolocation of the required country. For amazon.com — the USA, for amazon.co.uk — the UK. This is important: Amazon shows different reviews to users from different countries.

Step 2. Set up rotation. If you are using a ready-made scraper (for example, Octoparse or ParseHub), specify the proxies in the connection settings. Most of these tools support a proxy list with automatic rotation.

Step 3. Set delays between requests. In Octoparse, this is done in the “Delay Settings” section — set a random interval from 2 to 6 seconds.

Step 4. Run a test on 50-100 pages. If CAPTCHA appears more than 5% of the time — increase the delay or change the proxy pool.

Step 5. Scale up. After a successful test, you can start collecting thousands of reviews. A good pool of residential proxies allows you to collect 5000-10000 reviews a day without getting blocked.

Important about Amazon:

Amazon regularly updates its protection algorithms. If your scraper worked a month ago but is now starting to get blocks — it’s likely that the verification algorithm has changed. Solution: update the User-Agent to the latest version of Chrome and check if your proxies have been blacklisted.

Collecting Google Reviews: What You Need to Know

Google Reviews — reviews on Google Maps and Google Business Profile — are a valuable source of data for marketers, SEO specialists, and analysts. But Google protects its data particularly aggressively: reCAPTCHA v3, behavior analysis, browser fingerprint verification.

The main difficulty: Google Reviews do not load through a regular HTTP request. Reviews are dynamically loaded via JavaScript. This means that a simple HTTP scraper will not work — you need a tool that can render JavaScript (a browser scraper).

How to Properly Collect Google Reviews

Option 1: Ready-made services. Tools like Outscraper, Apify (Google Maps Scraper actor), or PhantomBuster can collect Google Reviews through a browser engine. You only need to specify the URL or name of the establishment and connect the proxies.

Option 2: Octoparse with browser mode. Octoparse supports browser rendering mode. In the settings, specify residential or mobile proxies — and the tool will collect reviews like a real user.

Option 3: Google Places API. The official way is to use the Google Places API. It provides up to 5 reviews per establishment for free, but for larger volumes, payment is required. However, there are no blocks, and proxies are not needed.

Why Mobile Proxies are Needed for Google

Google is the creator of reCAPTCHA and one of the leaders in bot detection. Residential proxies work, but mobile IPs yield significantly better results. The reason is simple: Google itself is a mobile platform and trusts mobile traffic. Requests from mobile IPs trigger CAPTCHA less frequently and are less likely to fall under behavioral analysis.

For large-scale collection of Google Reviews (thousands of establishments per day), it is recommended to use rotating mobile proxies with the geolocation of the required city or region. This way, the reviews will be maximally relevant to local searches.

Parsing Yelp Without Bans: Step-by-Step Instructions

Yelp is the largest review platform in the USA. For marketers working with the American market, it is an essential source of data about competitors, customer sentiments, and trends in the niche. Scraping Yelp is more complicated than it seems at first glance: the platform uses Cloudflare and its own bot protection system.

Yelp's Protection Features

  • Cloudflare Bot Management — analyzes behavior and fingerprint
  • Request limitation: more than 30-50 requests per minute from one IP — ban
  • JavaScript checks on first visit
  • Some reviews are hidden (“filtered reviews”) and are only available to authorized users

Step-by-Step Instructions for Yelp

Step 1. Choose a tool that supports Cloudflare. Regular scrapers will not pass Cloudflare. Use Apify (Yelp Scraper actor), Bright Data Scraping Browser, or PhantomBuster — they can bypass JS checks.

Step 2. Connect residential proxies with geolocation in the USA. Yelp is focused on the American market. Proxies with American IPs will provide maximum access to data and minimal blocking.

Step 3. Set up rotation every 5-10 requests. Yelp is very sensitive to request frequency. IP rotation every 5-10 pages is a mandatory condition for stable operation.

Step 4. Add delays of 3-8 seconds between requests. Random delays mimic real user behavior and significantly reduce the risk of blocking.

Step 5. Use the Yelp Fusion API for some data. Yelp provides an official API with access to basic business and review data. For small volumes, this is the simplest way without the risk of blocks.

Tools for Collecting Reviews Without Coding

The good news: you don’t need to be a programmer to scrape reviews. There are ready-made tools with a visual interface that support proxy connections and automatic IP rotation. Let’s discuss the most popular ones.

Tool Platforms Proxy Support Difficulty
Octoparse Amazon, Yelp, any websites ✅ Proxy list + rotation Low (visual mode)
Apify Amazon, Google Maps, Yelp ✅ Built-in + own proxies Low (ready-made actors)
PhantomBuster Google Maps, Yelp ✅ Residential proxies Low (templates)
ParseHub Amazon, Yelp, any websites ✅ Proxy list Medium
Outscraper Google Reviews, Maps ✅ Built-in Very low (SaaS)

How to Connect Proxies in Octoparse (Step-by-Step)

Octoparse is one of the most popular tools among marketers without a technical background. Here’s how to connect proxies:

  1. Open Octoparse → go to Settings → Proxy Settings
  2. Select “Use custom proxy”
  3. Paste the proxy list in the format ip:port:login:password
  4. Enable “Auto rotate proxy” — the tool will automatically change IPs
  5. Click “Test proxy” — make sure all proxies are working
  6. Run the task in normal mode

How to Connect Proxies in Apify

  1. Go to apify.com → select the desired actor (for example, “Amazon Reviews Scraper”)
  2. In the “Input” section, find the “Proxy configuration” field
  3. Select “Custom proxies” and paste your proxy data
  4. Specify the URLs of the review pages or ASIN codes of the products (for Amazon)
  5. Click “Start” — results will be available in JSON or Excel format

Proxy Rotation and Safe Review Scraping Rules

Even the highest quality proxies won’t help if safe scraping rules are not followed. Platforms analyze behavior patterns, and one incorrect request pattern can lead to the blocking of the entire IP pool.

Rule 1: Rotate IPs Regularly

For Amazon, it is recommended to change IPs every 10-20 requests. For Google — every 5-10. For Yelp — every 5-8. If you are using session proxies (sticky sessions), ensure that the session does not exceed 10-15 minutes — after that, request a new IP.

Rule 2: Mimic Real Behavior

A real user does not make requests with perfect intervals of 2 seconds. Add random delays: from 1 to 8 seconds. Occasionally take breaks of 30-60 seconds — as if a person is reading the page. This significantly reduces the likelihood of detection.

Rule 3: Use Current User-Agents

The User-Agent must correspond to a real browser. Outdated versions of Chrome or Firefox immediately raise suspicions. Update the User-Agent at least once a month. Good scrapers (Octoparse, Apify) do this automatically.

Rule 4: Don’t Scrape at Night (according to the platform's local time)

Protection systems are more sensitive to suspicious traffic at night. If you are scraping Amazon US, it’s better to do it during the daytime according to American time (EST/PST) — when there are many real users on the platform and your requests “sink” in the overall traffic.

Rule 5: Monitor the Percentage of Successful Requests

A good indicator is more than 95% successful requests without CAPTCHA. If the percentage drops below 85% — something is going wrong. Check: have the proxies become outdated, has the protection algorithm changed, do you need to update the User-Agent or increase delays?

Safe Review Scraping Checklist:

  • ✅ Residential or mobile proxies with the required geolocation
  • ✅ IP rotation every 5-20 requests (depending on the platform)
  • ✅ Random delays of 1-8 seconds between requests
  • ✅ Current User-Agent (latest version of Chrome)
  • ✅ Scraping during daytime according to the platform's local time
  • ✅ Monitoring the percentage of successful requests (goal: >95%)
  • ✅ Updating the proxy pool every 1-2 months

Comparison of Proxy Types for Different Review Platforms

Below is a summary table that will help you quickly choose the right type of proxy for a specific task. Use it as a reference when setting up scraping.

Platform Datacenter Residential Mobile Recommendation
Amazon Reviews ❌ Blocked ✅ Good ✅ Excellent Residential (US)
Google Reviews ❌ Blocked ⚠️ Average ✅ Excellent Mobile
Yelp ❌ Blocked ✅ Good ✅ Excellent Residential (US)
Trustpilot ⚠️ Partially ✅ Good ✅ Excellent Residential
Wildberries / Ozon ⚠️ Work ✅ Excellent ✅ Excellent Residential (RU)
TripAdvisor ❌ Blocked ✅ Good ✅ Excellent Residential

Why Collect Reviews at All: Use Cases

Automatic review collection solves several business tasks at once:

  • Competitor analysis. Collect reviews on competitors' products on Amazon → find weaknesses → use them in your marketing
  • Reputation monitoring. Automatically track new reviews about your brand on all platforms
  • Audience research. Analyze thousands of reviews → identify patterns → improve the product
  • SEO and content. Collect reviews for semantic analysis, identify keywords used by real buyers
  • Review aggregators. Create services that aggregate reviews from multiple platforms for businesses

Parsing Reviews on Russian Marketplaces

If your business is focused on the Russian market, you should also consider collecting reviews from Wildberries, Ozon, and Yandex.Market. These platforms have softer protection compared to Amazon but still block mass scraping. For them, residential proxies with Russian IPs are sufficient — they allow stable data collection without CAPTCHA and bans.

Popular tools for scraping Russian marketplaces include Screaming Frog (with proxies), Octoparse, as well as specialized services like Moneyplace or MPStats, which already include proxies in their infrastructure.

Conclusion: How to Start Collecting Reviews Without Blocks Right Now

Automatic review collection from Amazon, Google Reviews, and Yelp is a powerful tool for marketers, analysts, and e-commerce specialists. The main obstacle is blocks. And the main solution is the right type of proxy combined with proper rotation and delay settings.

In short: for Amazon and Yelp, use residential proxies with the geolocation of the required country. For Google Reviews — mobile proxies, which provide the highest percentage of successful requests. Datacenter proxies for these tasks are practically useless — their IPs have long been blacklisted.

Among no-code tools, Octoparse and Apify are the best options — both support external proxy connections and automatic rotation. For Google Reviews, also consider Outscraper — it is a specialized service already optimized for collecting data from Google Maps.

If you plan to regularly collect reviews from multiple platforms, we recommend starting with residential proxies — they provide an optimal balance between speed, anonymity, and cost, and are suitable for most review monitoring tasks. For working with Google Reviews, where protection is particularly aggressive, consider mobile proxies — they offer the highest percentage of successful requests without CAPTCHA.