ABCDEFGHIJKLMNOPQRSTUVWXYZAAABACADAEAFAGAHAIAJAKALAMANAOAPAQARASATAUAVAWAXAYAZBABBBCBDBEBFBGBHBIBJBKBLBMBNBOBPBQBRBSBTBUBVBWBXBYBZCACBCCCDCECFCGCHCICJCKCLCMCNCOCPCQCRCSCTCUCV
1
Made by OneScales.com - Article at https://onescales.com/blogs/main/the-bot-blocklist
2
WebsiteWebsite CategoryEnglish SiteBlocked Bots
3
https://americanexpress.com/robots.txtBank/Financialyes
4
https://bankofamerica.com/robots.txtBank/FinancialyesOmniExplorer_Bot
5
https://capitalone.com/robots.txtBank/Financialyes
6
https://chase.com/robots.txtBank/Financialyes
7
https://citi.com/robots.txtBank/Financialyes
8
https://discovercard.com/robots.txtBank/Financialyes
9
https://hdfcbank.com/robots.txtBank/Financialyes
10
https://icicibank.com/robots.txtBank/FinancialyesAlexibotAqua_Productsasteriasb2w/0.1
BackDoorBot/1.0
BecomeBotBlowFish/1.0
Bookmark search tool
BotALotBuiltBotToughBullseye/1.0BunnySlippersCheeseBotCherryPicker
CherryPickerElite/1.0
CherryPickerSE/1.0
CopernicCopyRightCheckcosmosCrescent
Crescent Internet ToolPak HTTP OLE Control v.1.0
DittoSpyderdumbotEmailCollectorEmailSiphonEmailWolf
Enterprise_Search
Enterprise_Search/1.0
EroCrawleresExtractorProFairAd Client
Flaming AttackBot
FoobotFreeFindGaisbotGetRight/4.2grubgrub-clientHarvest/1.5Hatena AntennahloaderhttplibhumanlinksInfoNaviRobotIron33/1.0.2JennyBotJetbotJetbot/1.0Kenjin Spider
Keyword Density/0.9
larbinLexiBotlibWeb/clsHTTPLinkextractorPro
LinkScan/8.1a Unix
LinkWalkerLNSpiderguylwp-triviallwp-trivial/1.34Mata Hari
Microsoft URL Control
Microsoft URL Control - 5.01.4511
Microsoft URL Control - 6.00.8169
MIIxpcMIIxpc/4.2Mister PiXmogetmoget/2.1MSIECrawlernaverNetAntsNetMechanicNICErsPRONutchOffline Explorer
OmniExplorer_Bot
OpenbotOpenfind
Openfind data gathere
Oracle Ultra Search
PerMan
ProPowerBot/2.14
ProWebWalkerpsbotPython-urllib
QueryN Metasearch
Radiation Retriever 1.1
RepoMonkey
RepoMonkey Bait & Tackle/v1.01
RMAsearchpreviewSiteSnaggersootleSpankBotspannerStanford
11
https://paypal.com/robots.txtBank/Financialyes
12
https://robinhood.com/robots.txtBank/Financialyes
13
https://us.etrade.com/robots.txtBank/Financialyes
14
https://wellsfargo.com/robots.txtBank/Financialyes
15
https://www.schwab.com/robots.txtBank/Financialyes
16
https://6pm.com/robots.txtCommerce/Products/Servicesyes
17
https://alibaba.com/robots.txtCommerce/Products/Servicesyes
18
https://aliexpress.com/robots.txtCommerce/Products/Servicesyes
19
https://amazon.com/robots.txtCommerce/Products/ServicesyesGPTBotEtaoSpider
20
https://ancestry.com/robots.txtCommerce/Products/Servicesyes
21
https://apple.com/robots.txtCommerce/Products/Servicesyes
22
https://asos.com/robots.txtCommerce/Products/Servicesyes
23
https://att.com/robots.txtCommerce/Products/ServicesyesDotBotdotbot
24
https://bestbuy.com/robots.txtCommerce/Products/ServicesyesGetIntentCrawler
25
https://costco.com/robots.txtCommerce/Products/Servicesyes
26
https://dell.com/robots.txtCommerce/Products/Servicesyes
27
https://ebay.com/robots.txtCommerce/Products/Servicesyes
28
https://etsy.com/robots.txtCommerce/Products/ServicesyesSpinn3r
29
https://fedex.com/robots.txtCommerce/Products/Servicesyes
30
https://flipkart.com/robots.txtCommerce/Products/Servicesyes
31
https://gap.com/robots.txtCommerce/Products/Servicesyes
32
https://gmarket.co.kr/robots.txtCommerce/Products/Servicesyes
33
https://groupon.com/robots.txtCommerce/Products/ServicesyesUptimebotia_archiver
34
https://hp.com/robots.txtCommerce/Products/Servicesyes
35
https://ikea.com/robots.txtCommerce/Products/ServicesyesGPTBot
36
https://kakaku.com/robots.txtCommerce/Products/Servicesno
37
https://kickstarter.com/robots.txtCommerce/Products/Servicesyes
38
https://kohls.com/robots.txtCommerce/Products/ServicesyesBaiduspider
39
https://lenovo.com/robots.txtCommerce/Products/Servicesyes
40
https://lowes.com/robots.txtCommerce/Products/Servicesyes
41
https://macys.com/robots.txtCommerce/Products/ServicesyesTwitterbot
42
https://mobile.de/robots.txtCommerce/Products/Servicesno
43
https://newegg.com/robots.txtCommerce/Products/ServicesyesChangeDetection008Nutch
44
https://nike.com/robots.txtCommerce/Products/Servicesyes
45
https://nordstrom.com/robots.txtCommerce/Products/Servicesyes
46
https://orange.fr/robots.txtCommerce/Products/Servicesno
47
https://overstock.com/robots.txtCommerce/Products/Servicesyes
48
https://playstation.com/robots.txtCommerce/Products/Servicesyes
49
https://rakuten.co.jp/robots.txtCommerce/Products/Servicesno
50
https://samsung.com/robots.txtCommerce/Products/Servicesyes
51
https://shutterstock.com/robots.txtCommerce/Products/ServicesyesCCBotGPTBot
52
https://snapdeal.com/robots.txtCommerce/Products/ServicesyesEtaoSpiderMJ12BotDotBotMauiBot
53
https://steamcommunity.com/robots.txtCommerce/Products/Servicesyes
54
https://taobao.com/robots.txtCommerce/Products/ServicesnoBaiduspiderbaiduspider
55
https://target.com/robots.txtCommerce/Products/Servicesyes
56
https://ups.com/robots.txtCommerce/Products/Servicesyesia_archiver
57
https://usps.com/robots.txtCommerce/Products/Servicesyes
58
https://verizon.com/robots.txtCommerce/Products/Servicesyes
59
https://verizonwireless.com/robots.txtCommerce/Products/Servicesyesdotbot
60
https://walmart.com/robots.txtCommerce/Products/Servicesyes
61
https://www.gamespot.com/robots.txtCommerce/Products/ServicesyesCliqzbotAhrefsBot
62
https://xe.com/robots.txtCommerce/Products/Servicesyes
63
https://xfinity.com/robots.txtCommerce/Products/Servicesyes
64
https://data.gov/robots.txtGovernmentyes
65
https://nih.gov/robots.txtGovernmentyes
66
https://www.archives.gov/robots.txtGovernmentyes
67
https://www.australia.gov.au/robots.txtGovernmentyes
68
https://www.benefits.gov/robots.txtGovernmentyes
69
https://www.canada.ca/robots.txtGovernmentyes
70
https://www.gov.uk/robots.txtGovernmentyesdeepcrawl
MS Search 6.0 Robot
71
https://www.nasa.gov/robots.txtGovernmentyes
72
https://9gag.com/robots.txtInformational Sites (Media, Forums, Ads)yes
73
https://accuweather.com/robots.txtInformational Sites (Media, Forums, Ads)yes
74
https://adcash.com/robots.txtInformational Sites (Media, Forums, Ads)yes
75
https://allrecipes.com/robots.txtInformational Sites (Media, Forums, Ads)yes
76
https://answers.com/robots.txtInformational Sites (Media, Forums, Ads)yes
77
https://archive.org/robots.txtInformational Sites (Media, Forums, Ads)yes
78
https://ask.fm/robots.txtInformational Sites (Media, Forums, Ads)yesBaiduspider
79
https://azlyrics.com/robots.txtInformational Sites (Media, Forums, Ads)yes008
80
https://baike.com/robots.txtInformational Sites (Media, Forums, Ads)noUbiCrawlerZao
sitecheck.internetseer.com
ZealbotMSIECrawlerSiteSnaggerWebStripperWebCopierFetchOffline ExplorerTeleportTeleportProWebZIPlinkoHTTrack
Microsoft.URL.Control
XenularbinlibwwwZyBORGDownload Ninjawgetgrub-clientk2spiderNPBotWebReaperBaiduspiderbaiduspider
81
https://bleacherreport.com/robots.txtInformational Sites (Media, Forums, Ads)yes
82
https://blogger.com/robots.txtInformational Sites (Media, Forums, Ads)yes
83
https://blogspot.com/robots.txtInformational Sites (Media, Forums, Ads)yes
84
https://businessinsider.com/robots.txtInformational Sites (Media, Forums, Ads)yesGPTBot
85
https://buzzfeed.com/robots.txtInformational Sites (Media, Forums, Ads)yesdiscobotdotbotyacybot
86
https://cnet.com/robots.txtInformational Sites (Media, Forums, Ads)yes
87
https://community.fandom.com/robots.txtInformational Sites (Media, Forums, Ads)yesSemrushBotserpstatbotGPTBot
88
https://corriere.it/robots.txtInformational Sites (Media, Forums, Ads)noPetalBotYandex
89
https://craigslist.org/robots.txtInformational Sites (Media, Forums, Ads)yes
90
https://detik.com/robots.txtInformational Sites (Media, Forums, Ads)noChatGPT-UserOpenAICCBot
91
https://deviantart.com/robots.txtInformational Sites (Media, Forums, Ads)yesPinterestbot
92
https://diply.com/robots.txtInformational Sites (Media, Forums, Ads)yes
93
https://disqus.com/robots.txtInformational Sites (Media, Forums, Ads)yes
94
https://engadget.com/robots.txtInformational Sites (Media, Forums, Ads)yes
95
https://espn.go.com/robots.txtInformational Sites (Media, Forums, Ads)yesclaritybot
96
https://forbes.com/robots.txtInformational Sites (Media, Forums, Ads)yes
97
https://foxnews.com/robots.txtInformational Sites (Media, Forums, Ads)yes
98
https://gamer.com.tw/robots.txtInformational Sites (Media, Forums, Ads)no
99
https://gfycat.com/robots.txtInformational Sites (Media, Forums, Ads)yes
100
https://gizmodo.com/robots.txtInformational Sites (Media, Forums, Ads)yes