How to block Yandex using IPTABLES or APF

Re: Yandex IP range, Yandex subnets, Block Yandex Robots

Across our server range we are finding that Yandex continues to ignore robots.txt files and crawls some sites constantly, so how do you stop such an abuse of your network resources?

If you use IPTABLES or APF (you should!) then you can block all Yandex spiders using the following IP ranges: # # # # # # # # # # # # # # # # # # # # # # #  # # # # # # # # # # #

Simply restart APF and Yandex will no longer be a problem (until they extend their network!).

How do I block the bot from crawling my site?

Re: How to stop Yandex, Blocking Yandex.RU

Yandex is the most popular search engine in Russia. Your bandwidth can go through the roof if this bot targets your website.

Unfortunately for many, the robots.txt file is ignored so blocking Yandex using the official method is not an option.

If you have a busy forum or website with hundreds of pages you may find that the Yandex Bot is starting to take up more of your site resources by indexing up to 90 pages every 15 minutes often leaving connections open or failing to close them properly.

You can easily disable the Bot by placing the following in your .htaccess file:

SetEnvIfNoCase User-Agent "^Yandex*" bad_bot
Order Deny,Allow
Deny from env=bad_bot

Using this method saves you the trouble of having to find blocks of Yandex IP addresses and block them individually which would only work for a limited time.

Learn more about the Yandex bot

What is the bot?

The bot, also known as YandexBot, is a web crawler and indexing bot used by Yandex, a leading Russian search engine. It scans websites, indexes web pages, and provides data for Yandex’s search results.

How can I check if the bot has visited my website?

You can check if the bot has visited your website by examining your website’s server logs or by using tools like Yandex.Webmaster, which provides detailed information about bot activity.

Is it necessary to optimise my website for the bot?

Yes, if you want your website to appear in Yandex search results, it’s advisable to optimize it for the bot. This includes following Yandex’s webmaster guidelines for indexing and ranking.

Are there specific rules or guidelines for bot optimisation?

Yandex provides webmaster guidelines that include recommendations for optimising your website for the bot. This includes creating a sitemap, using proper meta tags, and ensuring mobile-friendliness.

How can I block or allow the bot from accessing my website?

You can control bot access through your website’s robots.txt file. To allow bot, include “User-agent: Yandex” in your robots.txt. To block it, use “Disallow: /” for all Yandex user-agents.