firewall/crawlers.txt

2024-09-21 21:01:23 -06:00
facebookexternalhit
2024-07-04 10:05:00 -06:00
YandexBot
PetalBot
2024-09-22 21:45:07 -06:00
GPTBot
2024-07-04 10:05:00 -06:00
Amazonbot
SemrushBot
AhrefsBot
slurp
openai
ahrefsbot
majestic
applebot
duckduckbot
baidu
lemmy
FediDB
facebook
bingbot
2024-07-22 22:40:33 -06:00
MJ12bot
2024-07-04 10:05:00 -06:00
lemmy-stats-crawler
Bytespider
CDSCbot
Googlebot
ClaudeBot
Podverse
Expanse
oii-research
DotBot
ZoominfoBot
LivelapBot
rss-is-dead.lol
FreshRSS
Barkrowler
DataForSeoBot
PixelFedBot
SerendeputyBot
2ip bot
GNUsocialBot
BacklinksExtendedBot
ws-bot-v1
ImagesiftBot
WellKnownBot
FediIndex
FriendlyCrawler
gotosocial
Synapse
impendoom-bot
Mitra
kbinBot
BitSightBot
FediFetcher
MbinBot
Discordbot
YisouSpider
LinkPreview
SurdotlyBot
AwarioSmartBot
msnbot-media
2024-07-24 21:23:15 -06:00
msnbot
2024-07-04 10:05:00 -06:00
ev-crawler
BLEXBot
YandexImages
2024-09-05 13:44:40 -06:00
Chodes
2024-07-04 10:05:00 -06:00
hstspreload-bot
Twitterbot
TelegramBot
Slack-ImgProxy
GenomeCrawlerd
search-engine-indexer
SemanticScholarBot
yacybot
BW/1.2
2024-07-06 14:16:30 -06:00
Twingly
IonCrawl
2024-07-07 21:44:44 -06:00
vmcrawl
SeoCherryBot
2024-07-08 15:06:07 -06:00
coccocbot-web
2024-07-08 22:51:13 -06:00
FAST-WebCrawler
2024-07-08 23:57:48 -06:00
YandexImageResizer
2024-07-10 23:24:05 -06:00
serpstatbot
YandexRenderResourcesBot
CCBot
wpbot
2024-07-14 00:10:59 -06:00
LinkedInBot
SeznamBot
Mail.RU_Bot
RedekenBot
2024-07-14 20:33:58 -06:00
SiteCheckerBotCrawler
2024-07-16 23:36:21 -06:00
AwarioBot
intelx.io_bot
YandexWebmaster
2024-07-18 15:12:26 -06:00
Qwant
2024-07-19 00:12:35 -06:00
PagePeeker
Sogou Push
2024-07-19 19:35:59 -06:00
KOCMOHABT
2024-07-22 22:40:33 -06:00
ldspider
2024-07-22 22:41:00 -06:00
robots.txt
2024-07-22 22:47:26 -06:00
bots.retroverse.social
2024-07-23 15:17:22 -06:00
archive.org_bot
2024-07-24 23:51:44 -06:00
Facebot
Exabot
2024-07-27 00:50:09 -06:00
MisskeyBot
2024-07-29 15:34:47 -06:00
ISSCyberRiskCrawler
2024-07-29 22:59:08 -06:00
AportCatalogRobot
2024-08-01 13:06:04 -06:00
RSS Discovery Engine
AdsBot-Google
2024-08-03 11:52:40 -06:00
CyberFind Crawler
t3versionsBot
2024-08-06 20:08:09 -06:00
CyberFindCrawler
2024-08-09 20:45:05 -06:00
KixxActivityPubCrawler
x22Xpanse-bot
2024-08-10 12:09:17 -06:00
Arquivo-web-crawler
2024-08-15 19:35:37 -06:00
YandexUserproxy
2024-08-25 15:15:45 -06:00
Ai2Bot-Dolma
2024-09-01 22:35:47 -06:00
PerplexityBot
VirusTotalBot
2024-09-05 09:53:01 -06:00
Gabanzabot
2024-09-12 11:36:03 -06:00
Horrid Chodes For Everyone
2024-09-05 13:44:40 -06:00
TurnitinBot
YandexFavicon
Trident
2024-09-09 14:14:23 -06:00
meta-externalagent
2024-09-09 22:32:56 -06:00
Googlebot-Image
2024-09-10 09:15:01 -06:00
CensysInspect
2024-09-10 11:35:44 -06:00
Go-http-client
2024-09-12 12:50:55 -06:00
Friendica
2024-09-12 14:57:43 -06:00
HeadlessChrome