firewall/crawlers.txt
Your Name cc33ea0480 fix
2024-09-22 22:20:21 -06:00

facebookexternalhit
YandexBot
PetalBot
GPTBot
Amazonbot
SemrushBot
AhrefsBot
slurp
openai
ahrefsbot
majestic
applebot
duckduckbot
baidu
lemmy
FediDB
facebook
bingbot
MJ12bot
lemmy-stats-crawler
Bytespider
CDSCbot
Googlebot
ClaudeBot
Podverse
Expanse
oii-research
DotBot
ZoominfoBot
LivelapBot
rss-is-dead.lol
FreshRSS
Barkrowler
DataForSeoBot
PixelFedBot
SerendeputyBot
2ip bot
GNUsocialBot
BacklinksExtendedBot
ws-bot-v1
ImagesiftBot
WellKnownBot
FediIndex
FriendlyCrawler
gotosocial
Synapse
impendoom-bot
Mitra
kbinBot
BitSightBot
FediFetcher
MbinBot
Discordbot
YisouSpider
LinkPreview
SurdotlyBot
AwarioSmartBot
msnbot-media
msnbot
ev-crawler
BLEXBot
YandexImages
Chodes
hstspreload-bot
Twitterbot
TelegramBot
Slack-ImgProxy
GenomeCrawlerd
search-engine-indexer
SemanticScholarBot
yacybot
BW/1.2
Twingly
IonCrawl
vmcrawl
SeoCherryBot
coccocbot-web
FAST-WebCrawler
YandexImageResizer
serpstatbot
YandexRenderResourcesBot
CCBot
wpbot
LinkedInBot
SeznamBot
Mail.RU_Bot
RedekenBot
SiteCheckerBotCrawler
AwarioBot
intelx.io_bot
YandexWebmaster
Qwant
PagePeeker
Sogou Push
KOCMOHABT
ldspider
robots.txt
bots.retroverse.social
archive.org_bot
Facebot
Exabot
MisskeyBot
ISSCyberRiskCrawler
AportCatalogRobot
RSS Discovery Engine
AdsBot-Google
CyberFind Crawler
t3versionsBot
CyberFindCrawler
KixxActivityPubCrawler
x22Xpanse-bot
Arquivo-web-crawler
YandexUserproxy
Ai2Bot-Dolma
PerplexityBot
VirusTotalBot
Gabanzabot
Horrid Chodes For Everyone
TurnitinBot
YandexFavicon
Trident
meta-externalagent
Googlebot-Image
CensysInspect
Go-http-client
Friendica
HeadlessChrome