What is semrushbot-ft?

semrushbot-ft is the robots.txt token for Semrush's Plagiarism Checker crawler. It collects web page content and links to power the Plagiarism Checker and other Semrush products like Backlink Analytics and Site Audit.

Does semrushbot-ft respect robots.txt?

Yes. It respects Disallow, Allow, Sitemap, and Crawl-delay directives. Use the token SemrushBot-FT in your robots.txt rules. Be aware that changes may take up to one hour to take effect.

Does Semrush use crawled content for AI training?

This is unclear. Semrush's privacy policy mentions data analysis and product improvement but does not explicitly state whether crawled content trains AI models. The primary use is powering Semrush's SEO and content tools.

Can I block semrushbot-ft by IP address?

Semrush discourages IP-based blocking and does not publish consecutive IP ranges. Use robots.txt with the SemrushBot-FT token instead, or contact bot@semrush.com for assistance.

How do I set a crawl rate limit for semrushbot-ft?

Add a Crawl-delay directive under User-agent: SemrushBot-FT in your robots.txt. Values up to 10 seconds are honored. Anything above 10 seconds is automatically reduced to 10.

Will blocking semrushbot-ft remove my site from Semrush tools?

Blocking via robots.txt will prevent the crawler from indexing new content from your site. You can also request removal from Plagiarism Checker results through the Semrush UI or by emailing bot@semrush.com.

Agent Directory SEMrushSemrush Plagiarism Checker

Semrush Plagiarism Checker

SEO

Plagiarism checking crawler.

What does Semrush Plagiarism Checker do?

The Semrush Plagiarism Checker crawler (semrushbot-ft) discovers and collects web pages and hyperlinks to build a crawl frontier for Semrush's Plagiarism Checker tool. Collected data also feeds other Semrush products like Backlink Analytics, Site Audit, and Content Toolkit. When your content appears in Plagiarism Checker results, source URLs are displayed and clickable, which can drive referral traffic back to your site.

Should I allow and optimize for Semrush Plagiarism Checker to drive organic growth?

Allowing semrushbot-ft supports your visibility across multiple Semrush products used by millions of SEO professionals and content marketers. The Plagiarism Checker surfaces source URLs in its results, creating a direct referral path. Your site also appears in Backlink Analytics and Site Audit reports, which influences how SEO practitioners discover and recommend content. Blocking this crawler removes your site from these indexes, reducing your presence in a widely used SEO ecosystem.

Here's how to optimize for Semrush Plagiarism Checker:

Allow SemrushBot-FT in your robots.txt to ensure your pages appear in Plagiarism Checker results
Use a Crawl-delay value of 10 seconds or less (Semrush truncates higher values)
Include a Sitemap directive in robots.txt to help the crawler discover your pages efficiently
Ensure your robots.txt returns HTTP 200 (5xx responses will prevent crawling entirely)
Add canonical tags to consolidate duplicate content signals
Keep your site responsive under load, as the crawler adapts frequency based on server performance

Data Usage & Training

It is unclear whether content crawled by semrushbot-ft is used to train AI models. Semrush's privacy policy describes internal uses such as data analysis, product improvement, and research, but does not explicitly confirm or deny AI training. The primary purpose of this crawler is to power Semrush's product indexes and reports.

How Semrush Plagiarism Checker Accesses Content

Here's how Semrush Plagiarism Checker accesses your site and understands your content:

Fetches HTML via standard HTTP requests
Follows hyperlinks to discover new pages and build a crawl frontier
Respects robots.txt Disallow, Allow, Sitemap, and Crawl-delay directives
Adjusts request frequency based on server load and robots.txt rules
JavaScript rendering capability is unknown

Continuous and adaptive. Semrush maintains a crawl frontier and revisits pages according to internal policies, adjusting request frequency based on server load and robots.txt directives. Changes to robots.txt may take up to one hour or approximately 100 requests to be detected.

How to Block or Control Semrush Plagiarism Checker

To block the Semrush Plagiarism Checker crawler via robots.txt: User-agent: SemrushBot-FT Disallow: / You can also request domain exclusion from Plagiarism Checker results through the Semrush product UI, or contact [email protected] for support. IP-based blocking is discouraged and unreliable because Semrush does not use consecutive IP blocks and does not publish IP ranges. Place your robots.txt at the site root and ensure it returns HTTP 200. A 4xx response is treated as if no robots.txt exists (crawling proceeds), while a 5xx response halts crawling entirely.

Common Issues & Troubleshooting

Watch out for these common problems when working with Semrush Plagiarism Checker:

Robots.txt changes can take up to one hour or ~100 requests before the crawler detects them
Crawl-delay values above 10 seconds are silently truncated to 10 seconds
A robots.txt returning 4xx is treated as missing, meaning the crawler will proceed without restrictions
A robots.txt returning 5xx prevents all crawling, which may not be your intent
IP-based blocking is ineffective because Semrush does not use consecutive IP ranges
If the crawler appears to ignore your rules, Semrush recommends sending logs to [email protected] for investigation

Quick Reference

Platform

SEMrush

Agent Category

SEO

Growth Value

Official Documentation

semrush.com/bot/

User Agent String

semrushbot-ft

robots.txt Entry

User-agent: semrushbot-ft
Disallow: /

See which agents visit your site

Monitor real-time AI agent and bot activity on your site for free with Siteline Agent Analytics

Get started free

Frequently Asked Questions

Similar Agents & Bots

AhrefsBot

Ahrefs crawler used for backlink and SEO analysis.

AhrefsSiteAudit

Ahrefs site auditing bot that checks technical SEO issues.

Barkrowler

Qwant's SEO crawler for site analysis and indexing.

ClarityBot

Microsoft Clarity bot for SEO and verification checks.

Learn More

Agentic Web Tech

The AI Crawler Optimization Guide to Increase Site Visibility

Related Resources

AI Readiness Audit

Check if AI agents and bots can easily discover your content

AI Agent Directory

Continuously updated directory of AI agents, bots & crawlers

Case Studies

Real stories of driving organic growth from AI search

Blog

Research, guides, feature updates and more

💥 Get started

Ready to track Semrush Plagiarism Checker on your site?

Start monitoring agent traffic, understand how AI discovers your content, and optimize for the next generation of search.