firewall/crawlers.txt

impendoom-bot
msnbot-media
LinkPreview
SerendeputyBot
Arquivo-web-crawler
Expanse
x22Xpanse-bot
KixxActivityPubCrawler
Gabanzabot
hstspreload-bot
YisouSpider
gotosocial
FAST-WebCrawler
Facebot
MisskeyBot
search-engine-indexer
PixelFedBot
lemmy-stats-crawler
Discovery
Twingly
Friendica
Podverse
WellKnownBot
TelegramBot
Crawler
SemrushBot
bingbot
CyberFindCrawler
AportCatalogRobot
LivelapBot
duckduckbot
YandexBot
intelx.io_bot
FediIndex
Sogou
YandexImageResizer
Slack-ImgProxy
ISSCyberRiskCrawler
Mitra
YandexRenderResourcesBot
FriendlyCrawler
MbinBot
YandexImages
Exabot
SemanticScholarBot
Twitterbot
SeznamBot
oii-research
Horrid
Ai2Bot-Dolma
ZoominfoBot
CCBot
serpstatbot
YandexUserproxy
SeoCherryBot
Amazonbot
DotBot
VirusTotalBot
AwarioBot
wpbot
ws-bot-v1
AhrefsBot
ldspider
Googlebot-Image
ImagesiftBot
Bytespider
BW/1.2
AwarioSmartBot
vmcrawl
GenomeCrawlerd
Chodes
facebook
Barkrowler
FediDB
ev-crawler
FediFetcher
CDSCbot
PerplexityBot
BitSightBot
facebookexternalhit
DataForSeoBot
baidu
RedekenBot
coccocbot-web
GNUsocialBot
PagePeeker
bots.retroverse.social
2ip
CensysInspect
BLEXBot
Googlebot
archive.org_bot
majestic
applebot
Mail.RU_Bot
KOCMOHABT
openai
Discordbot
lemmy
TurnitinBot
BacklinksExtendedBot
meta-externalagent
ahrefsbot
Synapse
PetalBot
kbinBot
robots.txt
IonCrawl
SiteCheckerBotCrawler
RSS
yacybot
FreshRSS
YandexWebmaster
LinkedInBot
HeadlessChrome
t3versionsBot
ClaudeBot
Qwant
msnbot
Trident
rss-is-dead.lol
SurdotlyBot
MJ12bot
YandexFavicon
AdsBot-Google
GPTBot
CyberFind
slurp