- -DIE-KRAEHE- META-SEARCH-ENGINE/1.1 http://www.die-kraehe.de
- :robot/1.0
- :robot/1.0 (linux) ( admin e-mail: undefined http://www.neofonie.de/loesungen/search/robot.html )
- !Susie (http://www.sync2it.com/susie)
- ( Robots.txt Validator http://www.searchengineworld.com/cgi-bin/robotcheck.cgi )
- (DreamPassport/3.0; isao/MyDiGiRabi)
- (Privoxy/1.0)
- */Nutch-0.9-dev
- +SitiDi.net/SitiDiBot/1.0 (+Have Good Day)
- 123spider-Bot (Version: 1.02, powered by www.123spider.de
- 192.comAgent
- 1st ZipCommander (Net) - http://www.zipcommander.com/
- 2Bone_LinkChecker/1.0 libwww-perl/5.64
- 403 Server Code
- 404 Server Code
- 4anything.com
- 4anything.com LinkChecker v2.0
- 8484 Boston Project v 1.0
- A-Online Search
- A1 Keyword Research/1.0.2 (+http://www.micro-sys.dk/products/keyword-research/) miggibot/2007.03.27
- A1 Sitemap Generator/1.0 (+http://www.micro-sys.dk/products/sitemap-generator/) miggibot/2006.01.24
- aardvark-crawler
- AbachoBOT
- AbachoBOT (Mozilla compatible)
- ABCdatos
- ABCdatos BotLink/5.xx.xxx#BBL
- Aberja Checkomat
- abot/0.1
- abot/0.1 (abot; http://www.abot.com; abot@abot.com)
- About
- About/0.1libwww-perl/5.47
- Accelatech RSSCrawler/0.4
- accoona
- Accoona-AI-Agent/1.1.1 (crawler at accoona dot com)
- Accoona-AI-Agent/1.1.2 (aicrawler at accoonabot dot com)
- Ace Explorer
- Ack
- Ack (http://www.ackerm.com/)
- AcoiRobot
- Acoon
- Acoon Robot v1.50.001
- Acoon Robot v1.52 (http://www.acoon.de)
- Acoon-Robot 4.0.x.[xx] (http://www.acoon.de)
- Acoon-Robot v3.xx (http://www.acoon.de and http://www.acoon.com)
- Acorn
- Acorn/Nutch-0.9 (Non-Profit Search Engine; acorn.isara.org; acorn at isara dot org)
- ActiveBookmark 1.x
- Activeworlds
- ActiveWorlds/3.xx (xxx)
- Ad Muncher v4.xx.x
- Ad Muncher v4x Build xxxxx
- Ad Title
- Adaxas Spider (http://www.adaxas.net/)
- Advanced Browser (http://www.avantbrowser.com)
- AESOP
- AESOP_com_SpiderMan
- Agadine
- agadine/1.x.x (+http://www.agada.de)
- Agent-SharewarePlazaFileCheckBot/2.0+(+http://www.SharewarePlaza.com)
- AgentName/0.1 libwww-perl/5.48
- agentname/Nutch
- AIBOT
- AIBOT/2.1 By +(www.21seek.com A Real artificial intelligence search engine China)
- AideRSS/1.0 (aiderss.com)
- aipbot
- aipbot/1.0 (aipbot; http://www.aipbot.com; aipbot@aipbot.com)
- aipbot/2-beta (aipbot dev; http://aipbot.com; aipbot@aipbot.com)
- Akregator/1.2.9; librss/remnants
- Aladin
- Aladin/3.324
- Alcatel-BG3/1.0 UP.Browser/5.0.3.1.2
- Aleksika
- Aleksika Spider/1.0 (+http://www.aleksika.com/)
- AlertInfo 2.0 (Powered by Newsbrain)
- Alexa (www.alexa.com)
- AlkalineBOT
- AlkalineBOT/1.3
- AlkalineBOT/1.4 (1.4.0326.0 RTM)
- Allesklar
- Allesklar/0.1 libwww-perl/5.46
- Alligator 1.31 (www.nearsoftware.com)
- Allrati/1.1 (+)
- AltaVista
- AltaVista Intranet V2.0 AVS EVAL search@freeit.com
- AltaVista Intranet V2.0 Compaq Altavista Eval sveand@altavista.net
- AltaVista Intranet V2.0 evreka.com crawler@evreka.com
- AltaVista V2.0B crawler@evreka.com
- amaya/x.xx libwww/x.x.x
- Amfibi
- AmfibiBOT
- Amfibibot/0.06 (Amfibi Web Search; http://www.amfibi.com; agent@amfibi.com)
- Amfibibot/0.07 (Amfibi Robot; http://www.amfibi.com; agent@amfibi.com)
- amibot
- Amiga-AWeb/3.4.167SE
- AmigaVoyager/3.4.4 (MorphOS/PPC native)
- AmiTCP Miami (AmigaOS 2.04)
- Amoi 8512/R21.0 NF-Browser/3.3
- amzn_assoc
- AnnoMille
- AnnoMille spider 0.1 alpha - http://www.annomille.it
- annotate_google; http://ponderer.org/download/annotate_google.user.js
- Anonymized by ProxyOS: http://www.megaproxy.com
- Anonymizer/1.1
- AnswerBus
- AnswerBus (http://www.answerbus.com/)
- AnswerChase
- AnswerChase PROve x.0
- AnswerChase x.0
- ANTFresco/x.xx
- antibot-V1.1.5/i586-linux-2.2
- AnzwersCrawl
- AnzwersCrawl/2.0 (anzwerscrawl@anzwers.com.au;Engine)
- Apache-HttpClient
- Apexoo
- Apexoo Spider 1.x
- Aplix HTTP/1.0.1
- Aplix_SANYO_browser/1.x (Japanese)
- Aplix_SEGASATURN_browser/1.x (Japanese)
- Aport
- Appie
- appie 1.1 (www.walhello.com)
- Apple iPhone v1.1.4 CoreMedia v1.0.0.4A102
- Apple-PubSub/65.1.1
- ArabyBot (compatible; Mozilla/5.0; GoogleBot; FAST Crawler 6.4; http://www.araby.com;)
- ArachBot
- Arachnoidea
- Arachnoidea (arachnoidea@euroseek.com)
- aranhabot
- ArchitextSpider
- archive.org
- archive.org_bot
- Argus/1.1 (Nutch; http://www.simpy.com/bot.html; feedback at simpy dot com)
- Arikus_Spider
- Arquivo-web-crawler (compatible; heritrix/1.12.1 +http://arquivo-web.fccn.pt)
- ASAHA Search Engine Turkey V.001 (http://www.asaha.com/)
- Asahina
- Asahina-Antenna/1.x
- Asahina-Antenna/1.x (libhina.pl/x.x ; libtime.pl/x.x)
- ask.24x.info
- AskAboutOil
- AskAboutOil/0.06-rcp (Nutch; http://www.nutch.org/docs/en/bot.html; nutch-agent@askaboutoil.com)
- Asked
- asked/Nutch-0.8 (web crawler; http://asked.jp; epicurus at gmail dot com)
- ASPseek
- ASPSeek/1.2.5
- ASPseek/1.2.9d
- ASPSeek/1.2.x
- ASPSeek/1.2.xa
- ASPseek/1.2.xx
- ASPSeek/1.2.xxpre
- ASSORT/0.10
- asterias
- asterias/2.0
- Atlocal
- AtlocalBot/1.1 +(http://www.atlocal.com/local-web-site-owner.html)
- Atomic_Email_Hunter/4.0
- Atomz
- Atomz/1.0
- atraxbot
- atSpider/1.0
- Attentio/Nutch-0.9-dev (Attentio's beta blog crawler; www.attentio.com; info@attentio.com)
- AU-MIC/2.0 MMP/2.0
- AUDIOVOX-SMT5600
- augurfind
- augurnfind V-1.x
- autoemailspider
- autohttp
- autowebdir 1.1 (www.autowebdir.com)
- AV Fetch 1.0
- Avant Browser (http://www.avantbrowser.com)
- AVSearch-1.0(peter.turney@nrc.ca)
- AVSearch-2.0-fusionIdx-14-CompetitorWebSites
- AVSearch-3.0(AltaVista/AVC)
- AWeb
- Axadine
- axadine/ (Axadine Crawler; http://www.axada.de/; )
- AxmoRobot
- AxmoRobot - Crawling your site for better indexing on www.axmo.com search engine.
- Azureus 2.x.x.x
- BabalooSpider/1.3 (BabalooSpider; http://www.babaloo.si; spider@babaloo.si)
- BaboomBot
- BaboomBot/1.x.x (+http://www.baboom.us)
- BackDoorBot/1.0
- BackStreet Browser
- BackStreet Browser 3.x
- BaiduImagespider+(+http://www.baidu.jp/search/s308.html)
- BaiDuSpider
- Baiduspider+(+http://help.baidu.jp/system/05.html)
- Baiduspider+(+http://www.baidu.com/search/spider_jp.html)
- Baiduspider+(+http://www.baidu.com/search/spider.htm)
- Balihoo/Nutch-1.0-dev (Crawler for Balihoo.com search engine - obeys robots.txt and robots meta tags ; http://balihoo.com/index.aspx; robot at balihoo dot com)
- BanBots/1.2 (spider@banbots.com)
- Barca/2.0.xxxx
- BarcaPro/1.4.xxxx
- BarraHomeCrawler (albertof@barrahome.org)
- bCentral Billing Post-Process
- Bdcindexer
- bdcindexer_2.6.2 (research@bdc)
- BDFetch
- BDNcentral Crawler v2.3 [en] (http://www.bdncentral.com/robot.html) (X11; I; Linux 2.0.44 i686)
- BeamMachine/0.5 (dead link remover of www.beammachine.net)
- Beautybot
- beautybot/1.0 (+http://www.uchoose.de/crawler/beautybot/)
- BebopBot
- BebopBot/2.5.1 ( crawler http://www.apassion4jazz.net/bebopbot.html )
- BecomeBot
- BeebwareDirectory/v0.01
- Behavioral Targeting
- Big Brother (http://pauillac.inria.fr/~fpottier/)
- Big Fish v1.0
- BigBrother/1.6e
- BigCliqueBOT
- BigCliqueBOT/1.03-dev (bigclicbot; http://www.bigclique.com; bot@bigclique.com)
- BIGLOTRON
- BIGLOTRON (Beta 2;GNU/Linux)
- Bigsearch.ca
- Bigsearch.ca/Nutch-x.x-dev (Bigsearch.ca Internet Spider; http://www.bigsearch.ca/; info@enhancededge.com)
- Bilbo/2.3b-UNIX
- Bilgi
- BilgiBetaBot
- BilgiBetaBot/0.8-dev (bilgi.com (Beta) ; http://lucene.apache.org/nutch/bot.html; nutch-agent@lucene.apache.org)
- BilgiBot/1.0(beta) (http://www.bilgi.com/; bilgi at bilgi dot com)
- billbot wjj@cs.cmu.edu
- Bitacle
- Bitacle bot/1.1
- Bitacle Robot (V:1.0;) (http://www.bitacle.com)
- Biyubi/x.x (Sistema Fenix; G11; Familia Toledo; es-mx)
- Black Hole
- BlackBerry7520/4.0.0 Profile/MIDP-2.0 Configuration/CLDC-1.1 UP.Browser/5.0.3.3 UP.Link/5.1.2.12 (Google WAP Proxy/1.0)
- BlackWidow
- Blaiz-Bee
- Blaiz-Bee/1.0 (+http://www.blaiz.net)
- Blaiz-Bee/2.00.8222 (BE Internet Search Engine http://www.rawgrunt.com)
- Blaiz-Bee/2.00.xxxx (+http://www.blaiz.net)
- BlitzBOT
- BlitzBOT@tricus.net
- BlitzBOT@tricus.net (Mozilla compatible)
- BlockNote.Net
- BlogBot
- BlogBot/1.x
- BlogBridge 2.13 (http://www.blogbridge.com/)
- Bloglines
- Bloglines Title Fetch/1.0 (http://www.bloglines.com)
- Bloglines-Images/0.1 (http://www.bloglines.com)
- Bloglines/3.1 (http://www.bloglines.com)
- BlogMap (http://www.feedmap.net)
- Blogpulse
- Blogpulse (info@blogpulse.com)
- BlogPulseLive (support@blogpulse.com)
- BlogSearch
- BlogSearch/1.x +http://www.icerocket.com/
- blogsearchbot-pumpkin-3
- BlogsNowBot
- BlogsNowBot, V 2.01 (+http://www.blogsnow.com/)
- BlogVibeBot-v1.1 (spider@blogvibe.nl)
- blogWatcher_Spider
- blogWatcher_Spider/0.1 (http://www.lr.pi.titech.ac.jp/blogWatcher/)
- BlogzIce
- BlogzIce/1.0 (+http://icerocket.com; rhodes@icerocket.com)
- BlogzIce/1.0 +http://www.icerocket.com/
- BloobyBot
- Bloodhound/Nutch-0.9 (Testing Crawler for Research - obeys robots.txt and robots meta tags ; http://balihoo.com/index.aspx; robot at balihoo dot com)
- BlowFish
- bluefish 0.6 HTML editor
- BMCLIENT
- BMLAUNCHER
- Bobby/4.0.x RPT-HTTPClient/0.3-3E
- boitho.com
- boitho.com-dc/0.xx (http://www.boitho.com/dcbot.html)
- boitho.com-robot/1.x
- boitho.com-robot/1.x (http://www.boitho.com/bot.html)
- Bookdog/x.x
- Bookmark Buddy bookmark checker (http://www.bookmarkbuddy.net/)
- Bookmark Renewal Check Agent [http://www.bookmark.ne.jp/]
- Bookmark Renewal Check Agent [http://www.bookmark.ne.jp/] (Version 2.0beta)
- BookmarkBase(2/;http://bookmarkbase.com)
- Bot mailto:craftbot@yahoo.com
- BotALot
- BPImageWalker/2.0 (www.bdbrandprotect.com)
- Brand and Branding
- BravoBrian
- BravoBrian bstop.bravobrian.it
- BravoBrian SpiderEngine MarcoPolo
- BrightCrawler (http://www.brightcloud.com/brightcrawler.asp)
- BruinBot
- BruinBot (+http://webarchive.cs.ucla.edu/bruinbot.html)
- BSDSeek/1.0
- BStop.BravoBrian.it Agent Detector
- BTbot
- BTbot/0.x (+http://www.btbot.com/btbot.html)
- BTWebClient/180B(9704)
- BuildCMS
- BuildCMS crawler (http://www.buildcms.com/crawler)
- BuiltBotTough
- Bulkfeeds/r1752 (http://bulkfeeds.net/)
- BullsEye
- bumblebee@relevare.com
- BunnySlippers
- BurstFindCrawler
- BurstFindCrawler/1.1 (crawler.burstfind.com; http://crawler.burstfind.com; crawler@burstfind.com)
- Buscaplus
- Buscaplus Robi/1.0 (http://www.buscaplus.com/robi/)
- BW-C-2.0
- bwh3_user_agent
- Cabot/Nutch-0.9 (Amfibi's web-crawling robot; http://www.amfibi.com/cabot/; agent@amfibi.com)
- Cabot/Nutch-1.0-dev (Amfibi's web-crawling robot; http://www.amfibi.com/cabot/; agent@amfibi.com)
- CamelHttpStream/1.0
- Cancer Information and Support International;
- Carleson
- carleson/1.0
- Carnegie_Mellon_University
- Carnegie_Mellon_University_Research_WebBOT--
- Carnegie_Mellon_University_Research_WebBOT-->PLEASE READ-->http://www.andrew.cmu.edu/~brgordon/webbot/index.html http://www.andrew.cmu.edu/~brgordon/webbot/index.html
- Carnegie_Mellon_University_WebCrawler http://www.andrew.cmu.edu/~brgordon/webbot/index.html
- Catall Spider
- CazoodleBot/CazoodleBot-0.1 (CazoodleBot Crawler; http://www.cazoodle.com/cazoodlebot; cazoodlebot@cazoodle.com)
- CCBot/1.0 (+http://www.commoncrawl.org/bot.html)
- Ccubee
- ccubee/x.x
- CDR/1.7.1 Simulator/0.7(+http://timewe.net) Profile/MIDP-1.0 Configuration/CLDC-1.0
- CE-Preload
- Cegbfeieh
- CentiverseBot
- CentiverseBot - investigator
- CentiverseBot/3.0 (http://www.centiverse-project.net)
- Ceramic Tile Installation Guide (http://www.floorstransformed.com)
- CERN-LineMode/2.15
- cfetch/1.0
- CFNetwork
- CFNetwork/x.x
- cg-eye interactive
- Charlotte
- Charon/1.x (Amiga)
- Chat Catcher/1.0
- Checkbot/1.xx LWP/5.xx
- CheckLinks/1.x.x
- CheckUrl
- CheckWeb
- CheeseBot
- CherryPicker
- Chilkat/1.0.0 (+http://www.chilkatsoft.com/ChilkatHttpUA.asp)
- China Local Browse 2.6
- Chitika ContentHit 1.0
- ChristCRAWLER 2.0
- CHttpClient by Open Text Corporation
- CipinetBot (http://www.cipinet.com/bot.html)
- Cityreview Robot (+http://www.cityreview.org/crawler/)
- CJ Spider/
- CJB.NET Proxy
- ClariaBot/1.0
- Claymont.com
- CloakDetect
- CloakDetect/0.9 (+http://fulltext.seznam.cz/)
- Clushbot
- Clushbot/2.x (+http://www.clush.com/bot.html)
- Clushbot/3.x-BinaryFury (+http://www.clush.com/bot.html)
- Clushbot/3.xx-Ajax (+http://www.clush.com/bot.html)
- Clushbot/3.xx-Hector (+http://www.clush.com/bot.html)
- Clushbot/3.xx-Peleus (+http://www.clush.com/bot.html)
- COAST WebMaster Pro/4.x.x.xx (Windows NT)
- CoBITSProbe
- Cocoal.icio.us/1.0 (v36) (Mac OS X; http://www.scifihifi.com/cocoalicious)
- Cogentbot/1.X (+http://www.cogentsoftwaresolutions.com/bot.html)
- ColdFusion
- ColdFusion (BookmarkTracker.com)
- collage.cgi
- collage.cgi/1.xx
- combine/0.0
- Combine/2.0 http://combine.it.lth.se/
- Combine/3 http://combine.it.lth.se/
- Combine/x.0
- cometrics-bot
- cometrics-bot, http://www.cometrics.de
- Commerce Browser Center
- complex_network_group/Nutch-0.9-dev (discovering the structure of the world-wide-web; http://cantor.ee.ucla.edu/~networks/crawl; nimakhaj@gmail.com)
- Computer_and_Automation_Research
- Computer_and_Automation_Research_Institute_Crawler crawler@ilab.sztaki.hu
- Comrite
- Comrite/0.7.1 (Nutch; http://lucene.apache.org/nutch/bot.html; nutch-agent@lucene.apache.org)
- Contact
- ContactBot/0.2
- ContentSmartz
- contype
- Convera
- Convera Internet Spider V6.x
- ConveraCrawler/0.2
- ConveraCrawler/0.9d (+http://www.authoritativeweb.com/crawl)
- ConveraMultiMediaCrawler/0.1 (+http://www.authoritativeweb.com/crawl)
- Conversion Analytics
- CoolBot
- Cooliris/1.5 CFNetwork/459 Darwin/10.0.0d3
- CopyRightCheck
- CoralWebPrx
- CoralWebPrx/0.1.1x (See http://coralcdn.org/)
- cosmos
- cosmos/0.8_(robot@xyleme.com)
- cosmos/0.9_(robot@xyleme.com)
- CoteoNutchCrawler/Nutch-0.9 (info [at] coteo [dot] com)
- CougarSearch
- CougarSearch/0.x (+http://www.cougarsearch.com/faq.shtml)
- Covac TexAs Arachbot
- CoverScout%203/3.0.1 CFNetwork/339.5 Darwin/9.5.0 (i386) (iMac5,1)
- Cowbot
- Cowbot-0.1 (NHN Corp. / +82-2-3011-1954 / nhnbot@naver.com)
- Cowbot-0.1.x (NHN Corp. / +82-2-3011-1954 / nhnbot@naver.com)
- CrawlConvera
- CrawlConvera0.1 (CrawlConvera@yahoo.com)
- Crawler
- Crawler (cometsearch@cometsystems.com)
- Crawler admin@crawler.de
- Crawler V 0.2.x admin@crawler.de
- Crawler V 0.2.x admin@crawler.de Crawler.de / Abac
- crawler@alexa.com
- CrawlerBoy Pinpoint.com
- Crawllybot
- Crawllybot/0.1 (Crawllybot; +http://www.crawlly.com; crawler@crawlly.com)
- CreativeCommons
- CreativeCommons/0.06-dev (Nutch; http://www.nutch.org/docs/en/bot.html; nutch-agent@lists.sourceforge.net)
- Crescent
- Cricket-A100/1.0 UP.Browser/6.3.0.7 (GUI) MMP/2.0
- CrocCrawler
- CrocCrawler vx.3 [en] (http://www.croccrawler.com) (X11; I; Linux 2.0.44 i686)
- csci_b659/0.13
- CSE HTML Validator Professional (http://www.htmlvalidator.com/)
- Cuam Ver0.050bx
- Cuasarbot/0.9b http://www.cuasar.com/spider_beta/
- curl/7.10.x (i386-redhat-linux-gnu) libcurl/7.10.x OpenSSL/0.9.7a ipv6 zlib/1.1.4
- curl/7.7.x (i386--freebsd4.3) libcurl 7.7.x (SSL 0.9.6) (ipv6 enabled)
- curl/7.8 (i686-pc-linux-gnu) libcurl 7.8 (OpenSSL 0.9.6)
- curl/7.9.x (win32) libcurl 7.9.x
- CurryGuide SiteScan 1.1
- Custo x.x (www.netwu.com)
- Custom Spider www.bisnisseek.com
- Custom Spider www.bisnisseek.com /1.0
- Cyberdog/2.0 (Macintosh; 68k)
- CyberPatrol SiteCat Webbot (http://www.cyberpatrol.com/cyberpatrolcrawler.asp)
- CyberSpyder Link Test/2.1.12 (admin@mspennyworth.com)
- CydralSpider
- CydralSpider/1.x (Cydral Web Image Search; http://www.cydral.com)
- CydralSpider/3.0 (Cydral Image Search; http://www.cydral.com)
- Czego nie mogą zawierać reklamy
- DA 3.5 (www.lidan.com)
- DA 4.0
- DA 4.0 (www.downloadaccelerator.com)
- DA 5.0
- DA 7.0
- DAP x.x
- Dart Communications PowerTCP
- DataCha0s/2.0
- DataFountains
- DataFountains/DMOZ Downloader
- DataFountains/Dmoz Downloader (http://ivia.ucr.edu/useragents.shtml)
- DataFountains/DMOZ Feature Vector Corpus Creator (http://ivia.ucr.edu/useragents.shtml)
- DataparkSearch
- DataparkSearch/4.47 (+http://dataparksearch.org/bot)
- DataparkSearch/4.xx (http://www.dataparksearch.org/)
- DataSpear
- DataSpear/1.0 (Spider; http://www.dataspear.com/spider.html; spider@dataspear.com)
- DataSpearSpiderBot/0.2 (DataSpear Spider Bot; http://dssb.dataspear.com/bot.html; dssb@dataspear.com)
- DatenBot( http://www.sicher-durchs-netz.de/bot.html)
- DaviesBot
- DaviesBot/1.7 (www.wholeweb.net)
- daypopbot/0.x
- dbDig
- dbDig(http://www.prairielandconsulting.com)
- DBrowse 1.4b
- DBrowse 1.4d
- DC-Sakura/x.xx
- dCSbot/1.1
- DDD
- dds explorer v1.0 beta
- de.searchengine.comBot
- de.searchengine.comBot 1.2 (http://de.searchengine.com/spider)
- DeadLinkCheck/0.4.0 libwww-perl/5.xx
- Deep Link Calculator v1.0
- deepak-USC/ISI
- DeepIndex
- DeepIndex ( http://www.zetbot.com )
- DeepIndex (www.en.deepindex.com)
- DeepIndexer.ca
- del.icio.us-thumbnails/1.0 Mozilla/5.0 (compatible; Konqueror/3.4; FreeBSD) KHTML/3.4.2 (like Gecko)
- DeleGate/9.0.5-fix1
- Demo Bot DOT 16b
- Demo Bot Z 16b
- Denmex websearch (http://search.denmex.com)
- DepSpid
- Der große BilderSauger 2.00u
- dev-spider2.searchpsider.com
- dev-spider2.searchpsider.com/1.3b
- DevComponents.com HtmlDocument Object
- DiaGem/1.1 (http://www.skyrocket.gr.jp/diagem.html)
- Diamond
- Diamond/x.0
- DiamondBot
- Digger
- Digger/1.0 JDK/1.3.0rc3
- DigOut4U
- DIIbot/1.2
- Dillo/0.8.5-i18n-misc
- Dillo/0.x.x
- disastrous/1.0.5 (running with Python 2.5.1; http://www.bortzmeyer.org/disastrous.html; archangel77@del.icio.us)
- DISCo Pump x.x
- disco/Nutch-0.9 (experimental crawler; www.discoveryengine.com; disco-crawl@discoveryengine.com)
- disco/Nutch-1.0-dev (experimental crawler; www.discoveryengine.com; disco-crawl@discoveryengine.com)
- discobot
- Display ads
- DittoSpyder
- dlman
- dloader(NaverRobot)/1.0
- DNSRight.com WebBot Link Ckeck Tool. Report abuse to: dnsr@dnsright.com
- DoCoMo
- DoCoMo/1.0/Nxxxi/c10
- DoCoMo/1.0/Nxxxi/c10/TB
- DoCoMo/1.0/P502i/c10 (Google CHTML Proxy/1.0)
- DoCoMo/2.0 P900iV(c100;TB;W24H11)
- DoCoMo/2.0 SH901iS(c100;TB;W24H12),gzip(gfe) (via translate.google.com)
- DoCoMo/2.0 SH902i (compatible; Y!J-SRD/1.0; http://help.yahoo.co.jp/help/jp/search/indexing/indexing-27.html)
- DoCoMo/2.0/SO502i (compatible; Y!J-SRD/1.0; http://help.yahoo.co.jp/help/jp/search/indexing/indexing-27.html)
- DocZilla/1.0 (Windows; U; WinNT4.0; en-US; rv:1.0.0) Gecko/20020804
- dodgebot/experimental
- DonutP; Windows98SE
- dotbot
- Doubanbot/1.0 (bot@douban.com http://www.douban.com)
- Download Demon/3.x.x.x
- Download Druid 2.x
- Download Express 1.0
- Download Master
- Download Ninja 3.0
- Download Wonder
- Download-Tipp Linkcheck (http://download-tipp.de/)
- Download.exe(1.1) (+http://www.sql-und-xml.de/freeware-tools/)
- DownloadDirect.1.0
- Dr.Web (R) online scanner: http://online.drweb.com/
- Dragonfly File Reader
- DreamCatcher
- Drecombot
- Drecombot/1.0 (http://career.drecom.jp/bot.html)
- Drupal (+http://drupal.org/)
- DSurf15a 01
- DSurf15a 71
- DSurf15a 81
- DSurf15a VA
- DTAAgent
- dtSearchSpider
- Dual Proxy
- DuckDuckBot/1.0; (+http://duckduckgo.com/duckduckbot.html)
- Dumbot
- Dumbot(version 0.1 beta - dumbfind.com)
- Dumbot(version 0.1 beta - http://www.dumbfind.com/dumbot.html)
- Dumbot(version 0.1 beta)
- e-sense 1.0 ea(www.vigiltech.com/esensedisclaim.html)
- e-SocietyRobot
- e-SocietyRobot(http://www.yama.info.waseda.ac.jp/~yamana/es/)
- eApolloBot/2.0 (compatible; heritrix/2.0.0-SNAPSHOT-20071024.170148 +http://www.eapollo-opto.com)
- EARTHCOM
- EARTHCOM.info/1.x [www.earthcom.info]
- EARTHCOM.info/1.xbeta [www.earthcom.info]
- EasyDL
- EasyDL/3.xx
- EasyDL/3.xx http://keywen.com/Encyclopedia/Bot
- eBot
- EBrowse 1.4b
- eCatch/3.0
- EchO!
- EchO!/2.0
- Educate Search VxB
- egothor/3.0a (+http://www.xdefine.org/robot.html)
- EgotoBot/4.8 (+http://www.egoto.com/about.htm)
- ejupiter
- ejupiter.com
- EldoS TimelyWeb/3.x
- elfbot
- elfbot/1.0 (+http://www.uchoose.de/crawler/elfbot/)
- ELI/20070402:2.0 (DAUM RSS Robot, Daum Communications Corp.; +http://ws.daum.net/aboutkr.html)
- ELinks (0.x.x; Linux 2.4.20 i586; 132x60)
- ELinks/0.x.x (textmode; NetBSD 1.6.2 sparc; 132x43)
- EmailCollector
- EmailSiphon
- EmailSpider
- EmailWolf
- EmailWolf 1.00
- EmeraldShield.com WebBot
- EmeraldShield.com WebBot (http://www.emeraldshield.com/webbot.aspx)
- EMPAS
- EMPAS_ROBOT
- EnaBot/1.x (http://www.enaball.com/crawler.html)
- endo/1.0 (Mac OS X; ppc i386; http://kula.jp/endo)
- Enfish Tracker
- Enterprise_Search
- Enterprise_Search/1.0
- Enterprise_Search/1.0.xxx
- Enterprise_Search/1.00.xxx;MSSQL (http://www.innerprise.net/es-spider.asp)
- Envolk
- envolk[ITS]spider/1.6(+http://www.envolk.com/envolkspider.html)
- envolk/1.7 (+http://www.envolk.com/envolkspiderinfo.php)
- EroCrawler
- ES.NET_Crawler
- ES.NET_Crawler/2.0 (http://search.innerprise.net/)
- eseek-larbin_2.6.2
- eseek-larbin_2.6.2 (crawler@exactseek.com)
- ESISmartSpider
- eStyleSearch 4
- eStyleSearch 4 (compatible; MSIE 6.0; Windows NT 5.0)
- ESurf15a 15
- EuripBot
- EuripBot/0.x (+http://www.eurip.com) GetFile
- EuripBot/0.x (+http://www.eurip.com) GetRobots
- EuripBot/0.x (+http://www.eurip.com) PreCheck
- Eurobot/1.0 (http://www.ayell.eu)
- EvaalSE
- EvaalSE - bot@evaal.com
- Eventax
- eventax/1.3 (eventax; http://www.eventax.de/; info@eventax.de)
- Everest-Vulcan
- Everest-Vulcan Inc./0.1 (R&D project; host=e-1-24; http://everest.vulcan.com/crawlerhelp)
- Everest-Vulcan Inc./0.1 (R&D project; http://everest.vulcan.com/crawlerhelp)
- Exabot
- Exabot-Images/1.0
- Exabot-Test/1.0
- Exabot/2.0
- Exabot/3.0
- ExactSearch
- ExactSeek
- ExactSeek Crawler/0.1
- exactseek-crawler-2.63 (crawler@exactseek.com)
- exactseek-pagereaper-2.63 (crawler@exactseek.com)
- exactseek.com
- Exalead
- Exalead NG/MimeLive Client
- Exalead NG/MimeLive Client (convert/http/0.120)
- Excalibur Internet Spider V6.5.4
- Execrawl
- Execrawl/1.0 (Execrawl; http://www.execrawl.com/; bot@execrawl.com)
- exooba crawler/exooba crawler (crawler for exooba.com; http://www.exooba.com/; info at exooba dot com)
- exooba/exooba crawler (exooba; exooba)
- ExperimentalHenrytheMiragoRobot
- Expired Domain Sleuth
- Express WebPictures (www.express-soft.com)
- ExtractorPro
- Extreme Picture Finder
- EyeCatcher
- EyeCatcher (Download-tipp.de)/1.0
- Factbot
- factbot : http://www.factbites.com/robots
- Factbot 1.09 (see http://www.factbites.com/webmasters.php)
- FaEdit/2.0.x
- FairAd Client
- FANGCrawl/0.01
- FARK.com link verifier
- Fast Crawler Gold Edition
- FAST Enterprise Crawler
- FAST Enterprise Crawler 6 (Experimental)
- FAST Enterprise Crawler 6 / Scirus scirus-crawler@fast.no; http://www.scirus.com/srsapp/contactus/
- FAST Enterprise Crawler 6 used by Cobra Development (admin@fastsearch.com)
- FAST Enterprise Crawler 6 used by Comperio AS (sts@comperio.no)
- FAST Enterprise Crawler 6 used by FAST (FAST)
- FAST Enterprise Crawler 6 used by Pages Jaunes (pvincent@pagesjaunes.fr)
- FAST Enterprise Crawler 6 used by Sensis.com.au Web Crawler (search_commentsatsensisdotcomdotau)
- FAST Enterprise Crawler 6 used by Singapore Press Holdings (crawler@sphsearch.sg)
- FAST Enterprise Crawler 6 used by WWU (wardi@uni-muenster.de)
- FAST Enterprise Crawler/6 (www.fastsearch.com)
- FAST Enterprise Crawler/6.4 (helpdesk at fast.no)
- FAST FirstPage retriever
- FAST FirstPage retriever (compatible; MSIE 5.5; Mozilla/4.0)
- FAST MetaWeb Crawler
- FAST MetaWeb Crawler (helpdesk at fastsearch dot com)
- Fast PartnerSite Crawler
- FAST-WebCrawler
- FAST-WebCrawler/2.2.10 (Multimedia Search) (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)
- FAST-WebCrawler/2.2.6 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)
- FAST-WebCrawler/2.2.7 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)http://www.fast.no
- FAST-WebCrawler/2.2.8 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)http://www.fast.no
- FAST-WebCrawler/3.2 test
- FAST-WebCrawler/3.3 (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
- FAST-WebCrawler/3.4/Nirvana (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
- FAST-WebCrawler/3.4/PartnerSite (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
- FAST-WebCrawler/3.5 (atw-crawler at fast dot no; http://fast.no/support.php?c=faqs/crawler)
- FAST-WebCrawler/3.6 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
- FAST-WebCrawler/3.6/FirstPage (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
- FAST-WebCrawler/3.7 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
- FAST-WebCrawler/3.7/FirstPage (atw-crawler at fast dot no;http://fast.no/support/crawler.asp)
- FAST-WebCrawler/3.8 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
- FAST-WebCrawler/3.8/Fresh (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
- FAST-WebCrawler/3.x Multimedia
- FAST-WebCrawler/3.x Multimedia (mm dash crawler at fast dot no)
- Fastbot
- fastbot crawler beta 2.0 (+http://www.fastbot.de)
- FastBug http://www.ay-up.com
- FastCrawler
- FastCrawler 3.0.1 (crawler@1klik.dk)
- Fasterfox
- FastSearch Web Crawler for Verizon SuperPages (kevin.watters@fastsearch.com)
- FastSearch-AllTheWeb.com
- Favcollector/2.0 (info@favcollector.com http://www.favcollector.com/)
- FavIconizer
- favo.eu
- favo.eu crawler/0.6 (http://www.favo.eu)
- FavOrg
- Favorites Checking (http://campulka.net)
- Favorites Sweeper v.2.03
- Faxobot
- Faxobot/1.0
- FDM 1.x
- FDM 2.x
- Feed (XML Feed)
- Feed Seeker Bot
- Feed Seeker Bot (RSS Feed Seeker http://www.MyNewFavoriteThing.com/fsb.php)
- Feed::Find/0.0x
- Feed24.com
- Feedable/0.1 (compatible; MSIE 6.0; Windows NT 5.1)
- FeedChecker/0.01
- FeedDemon/2.7 (http://www.newsgator.com/; Microsoft Windows XP)
- Feedfetcher-Google
- Feedfetcher-Google-iGoogleGadgets; (+http://www.google.com/feedfetcher.html)
- Feedfetcher-Google; (+http://www.google.com/feedfetcher.html)
- FeedForAll rss2html.php v2
- FeedHub FeedDiscovery/1.0 (http://www.feedhub.com)
- FeedHub MetaDataFetcher/1.0 (http://www.feedhub.com)
- Feedjit Favicon Crawler 1.0
- Feedreader 3.xx (Powered by Newsbrain)
- Feedshow/x.0 (http://www.feedshow.com; 1 subscriber)
- FeedshowOnline (http://www.feedshow.com)
- Feedster Crawler
- Feedster Crawler/3.0; Feedster, Inc.
- FeedZcollector v1.x (Platinum) http://www.feeds4all.com/feedzcollector
- Felix - Mixcat Crawler
- Felix - Mixcat Crawler (+http://mixcat.com)
- fetch libfetch/2.0
- FFC Trap Door Spider
- Filangy
- Filangy/0.01-beta (Filangy; http://www.nutch.org/docs/en/bot.html; filangy-agent@filangy.com)
- Filangy/1.0x (Filangy; http://www.filangy.com/filangyinfo.jsp?inc=robots.jsp; filangy-agent@filangy.com)
- Filangy/1.0x (Filangy; http://www.nutch.org/docs/en/bot.html; filangy-agent@filangy.com)
- fileboost.net/1.0 (+http://www.fileboost.net)
- FileHound x.x
- Filtrbox/1.0
- FindAnISP.com
- FindAnISP.com_ISP_Finder_v99a
- Findexa
- Findexa Crawler (http://www.findexa.no/gulesider/article26548.ece)
- findfiles.org
- findlinks/x.xxx (+http://wortschatz.uni-leipzig.de/findlinks/)
- FineBot
- Finjan-prefetch
- Firefly
- Firefly/1.0
- Firefly/1.0 (compatible; Mozilla 4.0; MSIE 5.5)
- Firefox (kastaneta03@hotmail.com)
- Firefox_1.0.6 (kasparek@naparek.cz)
- FirstGov.gov
- FirstGov.gov Search - POC:firstgov.webmasters@gsa.gov
- firstsbot
- Flapbot
- Flapbot/0.7.2 (Flaptor Crawler; http://www.flaptor.com; crawler at flaptor period com)
- FlashGet
- FLATARTS_FAVICO
- Flexum spider
- Flexum/2.0
- FlickBot 2.0 RPT-HTTPClient/0.3-3
- flunky
- fly/6.01 libwww/4.0D
- flyindex.net 1.0/http://www.flyindex.net
- FnooleBot/2.5.2 (+http://www.fnoole.com/addurl.html)
- FocusedSampler/1.0
- Folkd.com Spider/0.1 beta 1 (www.folkd.com)
- FollowSite Bot ( http://www.followsite.com/bot.html )
- FollowSite.com ( http://www.followsite.com/b.html )
- Foobot
- Fooky.com
- Fooky.com/ScorpionBot/ScoutOut; http://www.fooky.com/scorpionbots
- Francis
- Francis/1.0 (francis@neomo.de http://www.neomo.de/)
- Franklin Locator 1.8
- free-downloads.net download-link validator /0.1
- FreeFind
- FreeFind.com-SiteSearchEngine/1.0 (http://freefind.com; spiderinfo@freefind.com)
- Freemium
- Frelicbot/1.0 +http://www.frelic.com/
- FreshDownload/x.xx
- FreshNotes
- FreshNotes crawler
- FSurf15a 01
- FTB-Bot http://www.findthebest.co.uk/
- Full Web Bot 0416B
- Full Web Bot 0516B
- Full Web Bot 2816B
- FuseBulb
- FuseBulb.Com
- FyberSpider
- FyberSpider (+http://www.fybersearch.com/fyberspider.php)
- g2Crawler
- Gagglebot
- GAIS Robot
- GAIS Robot/1.0B2
- Gaisbot
- Gaisbot/3.0 (indexer@gais.cs.ccu.edu.tw; http://gais.cs.ccu.edu.tw/robot.php)
- Gaisbot/3.0+(robot06@gais.cs.ccu.edu.tw;+http://gais.cs.ccu.edu.tw/robot.php)
- GalaxyBot
- GalaxyBot/1.0 (http://www.galaxy.com/galaxybot.html)
- Gallent Search Spider
- Gallent Search Spider v1.4 Robot 2 (http://robot.GallentSearch.com)
- Gamekitbot
- gamekitbot/1.0 (+http://www.uchoose.de/crawler/gamekitbot/)
- Gamespy_Arcade
- GammaSpider
- GammaSpider/1.0
- gazz/x.x (gazz@nttrd.com)
- geckobot
- Generic Mobile Phone (compatible; Googlebot-Mobile/2.1; +http://www.google.com/bot.html)
- generic_crawler/01.0217/
- GenesisBrowser (HTTP 1.1; 0.9; XP SP2; .NET CLR 2.0.50727)
- geniebot
- genieBot (http://64.5.245.11/faq/faq.html)
- geniebot wgao@genieknows.com
- GeoBot/1.0
- GeonaBot
- GeonaBot 1.x; http://www.geona.com/
- geourl/2.0b2
- GeoURLBot 1.0 (http://geourl.org)
- GetBot
- GetRight/3.x.x
- GetRight/4.5xx
- GetRight/4.x
- GetRight/4.x[a-e]
- GetRight/6.1 (Pro)
- GetRightPro/6.0beta2
- GetWeb/0.1 libwww-perl/5.16
- GhostRouteHunter/20021130 (https://www.sixxs.net/tools/grh/; info@sixxs.net)
- gigabaz/3.1x (baz@gigabaz.com; http://gigabaz.com/gigabaz/)
- Gigabot
- Gigabot/2.0 (gigablast.com)
- Gigabot/2.0; http://www.gigablast.com/spider.html
- Gigabot/2.0/gigablast.com/spider.html
- Gigabot/2.0att
- Gigabot/3.0 (http://www.gigablast.com/spider.html)
- Gigabot/x.0
- GigabotSiteSearch/2.0 (sitesearch.gigablast.com)
- GNODSPIDER
- GNODSPIDER (www.gnod.net)
- Go-Ahead-Got-It/1.1
- Go!Zilla 3.x (www.gozilla.com)
- Go!Zilla/4.x.x.xx
- Goblin
- Goblin/0.9 (http://www.goguides.org/)
- Goblin/0.9.x (http://www.goguides.org/goblin-info.html)
- GoForIt
- GoForIt.com
- GOFORITBOT ( http://www.goforit.com/about/ )
- GoGuides.Org Link Check
- GoldenFeed Spider 1.0 (http://www.goldenfeed.com)
- Goldfire Server
- gonzo1
- gonzo1[P] +http://www.suchen.de/popups/faq.jsp
- gonzo2[P] +http://www.suchen.de/faq.html
- Goofer/0.2
- Google AdSense
- Google Talk
- Googlebot
- googlebot (larbin2.6.0@unspecified.mail)
- Googlebot-Image/1.0
- Googlebot-Image/1.0 ( http://www.googlebot.com/bot.html)
- Googlebot/2.1 ( http://www.google.com/bot.html)
- Googlebot/2.1 ( http://www.googlebot.com/bot.html)
- Googlebot/Test ( http://www.googlebot.com/bot.html)
- Gordon's Spider/Nutch-0.9 (http://www.sharethis.com; gordon@sharethis.com)
- GrapeFX/0.3 libwww/5.4.0
- great-plains-web-spider/flatlandbot (Flatland Industries Web Spider; http://www.flatlandindustries.com/flatlandbot.php; jason@flatlandindustries.com)
- GreatNews/1.0
- GreenBrowser
- gridwell (http://search.gridwell.com)
- GrigorBot
- GrigorBot 0.8 (http://www.grigor.biz/bot.html)
- Gromit
- Gromit/1.0
- grub crawler(http://www.grub.org)
- grub-client
- gsa-crawler
- gsa-crawler (Enterprise; GID-01422; jplastiras@google.com)
- gsa-crawler (Enterprise; GID-01742;gsatesting@rediffmail.com)
- gsa-crawler (Enterprise; GIX-02057; dm@enhesa.com)
- gsa-crawler (Enterprise; GIX-03519; cknuetter@stubhub.com)
- gsa-crawler (Enterprise; GIX-0xxxx; enterprise-training@google.com)
- GSiteCrawler/v1.xx rev. xxx (http://gsitecrawler.com/)
- Guestbook Auto Submitter
- Gulliver
- Gulliver/1.3
- Gulper Web Bot 0.2.4 (www.ecsl.cs.sunysb.edu/~maxim/cgi-bin/Link/GulperBot)
- Gungho/0.08004 (http://code.google.com/p/gungho-crawler/wiki/Index)
- GurujiBot
- GurujiBot/1.0 (+http://www.guruji.com/WebmasterFAQ.html)
- GurujiImageBot/1.0 (+http://www.guruji.com/en/WebmasterFAQ.html)
- Haier-T10C/1.0 iPanel/2.0 WAP2.0 (compatible; UP.Browser/6.2.2.4; UPG1; UP/4.0; Embedded)
- HappyFunBot
- HappyFunBot/1.1
- Harvest
- Harvest-NG
- Harvest-NG/1.0.2
- Haste/0.12 (HOME: http://haste.kytoon.com/)
- Hatena
- Hatena Antenna/0.4 (http://a.hatena.ne.jp/help#robot)
- Hatena Mobile Gateway/1.0
- Hatena Pagetitle Agent/1.0
- Hatena RSS/0.3 (http://r.hatena.ne.jp)
- HatenaScreenshot/1.0 (checker)
- hbtronix.spider.2 -- http://hbtronix.de/spider.php
- HeinrichderMirago
- HeinrichderMiragoRobot
- HeinrichderMiragoRobot (http://www.miragorobot.com/scripts/deinfo.asp)
- Helix
- Helix/1.x ( http://www.sitesearch.ca/helix/)
- HenriLeRobotMirago
- HenriLeRobotMirago (http://www.miragorobot.com/scripts/frinfo.asp)
- HenrytheMirago
- HenrytheMiragoRobot
- HenryTheMiragoRobot (http://www.miragorobot.com/scripts/mrinfo.asp)
- heritrix
- hgrepurl/1.0
- Hi! I'm CsCrawler my homepage: http://www.kde.cs.uni-kassel.de/lehre/ss2005/googlespam/crawler.html RPT-HTTPClient/0.3-3
- HiDownload
- Hippias
- Hippias/0.9 Beta
- HitList
- Hitwise Spider
- Hitwise Spider v1.0 http://www.hitwise.com
- hl_ftien_spider
- HLoader
- hoge
- holmes
- holmes/3.11 (http://morfeo.centrum.cz/bot)
- holmes/3.9
- holmes/3.9 (onet.pl)
- holmes/3.xx (OnetSzukaj/5.0; +http://szukaj.onet.pl)
- holmes/x.x
- HolmesBot (http://holmes.ge)
- HomePageSearch(hpsearch.uni-trier.de)
- Homerbot: www.homerweb.com
- Honda-Search
- Honda-Search/0.7.2 (Nutch; http://lucene.apache.org/nutch/bot.html; search@honda-search.com)
- HooWWWer/2.1.3 (debugging run) (+http://cosco.hiit.fi/search/hoowwwer/ | mailto:crawler-info
- HooWWWer/2.1.3 (debugging run) (+http://cosco.hiit.fi/search/hoowwwer/ | mailto:crawler-infohiit.fi)
- HooWWWer/2.1.x ( http://cosco.hiit.fi/search/hoowwwer/ | mailto:crawler-info
- HooWWWer/2.1.x ( http://cosco.hiit.fi/search/hoowwwer/ | mailto:crawler-infohiit.fi)
- HotJava/1.0.1/JRE1.1.x
- Hotzonu/x.0
- HPL/Nutch-0.9 -
- htdig/3.1.6 (http://computerorgs.com)
- htdig/3.1.6 (unconfigured@htdig.searchengine.maintainer)
- htdig/3.1.x (root@localhost)
- Html Link Validator (www.lithopssoft.com)
- HTML2JPG Blackbox, http://www.html2jpg.com
- HTML2JPG Enterprise
- HTMLParser/1.x
- HTTP Retriever
- HTTP::Lite/2.x.x
- http://Anonymouse.org/ (Unix)
- http://Ask.24x.Info/ (http://narres.it/)
- http://hilfe.acont.de/bot.html ACONTBOT
- http://OzySoftware.com/Index.html
- http://www.almaden.ibm.com
- http://www.almaden.ibm.com/cs/crawler
- http://www.almaden.ibm.com/cs/crawler [rc1.wf.ibm.com]
- http://www.almaden.ibm.com/cs/crawler [wf216]
- http://www.ip2location.com
- http://www.istarthere.com
- http://www.istarthere.com_spider@istarthere.com
- http://www.monogol.de
- http://www.sygol.com
- http://www.timelyweb.com/
- http://www.trendtech.dk/spider.asp
- http://www.trendtech.dk/spider.asp)
- HTTPEyes
- httplib
- HTTPResume v. 1.x
- httpunit/1.5
- httpunit/1.x
- httrack
- humanlinks
- Hybrid/1.2 [en] (OS Independent)
- HyperEstraier/1.x.xx
- i1searchbot
- i1searchbot/2.0 (i1search web crawler; http://www.i1search.com; crawler@i1search.com)
- ia_archiver
- ia_archiver-web.archive.org
- ia_archiver/1.6
- IAArchiver-1.0
- iaskspider
- iaskspider2 (iask@staff.sina.com.cn)
- IBrowse/2.2 (AmigaOS 3.5)
- IBrowse/2.2 (Windows 3.1)
- iCab/2.5.2 (Macintosh; I; PPC)
- ICC-Crawler(Mozilla-compatible; http://kc.nict.go.jp/icc/crawl.html; icc-crawl(at)ml(dot)nict(dot)go(dot)jp)
- ICC-Crawler(Mozilla-compatible;http://kc.nict.go.jp/icc/crawl.html;icc-crawl-contact(at)ml(dot)nict(dot)go(dot)jp)
- iCCrawler
- ICCrawler - ICjobs (http://www.icjobs.de/bot.htm)
- iCCrawler (http://www.iccenter.net)
- ICE Browser/5.05 (Java 1.4.0; Windows 2000 5.0 x86)
- ichiro
- ichiro/x.0 (http://help.goo.ne.jp/door/crawler.html)
- ichiro/x.0 (ichiro@nttr.co.jp)
- IconSurf
- IconSurf/2.0 favicon finder (see http://iconsurf.com/robot.html)
- IconSurf/2.0 favicon monitor (see http://iconsurf.com/robot.html)
- ICOO Loader v.x.x.x
- ICRA_label_spider
- ICRA_label_spider/x.0
- icsbot-0.1
- IDA
- ideare - SignSite/1.x
- iearthworm/1.0, iearthworm@yahoo.com.cn
- IEFav172Free
- iFeed.jp/2.0 (www.psychedelix.com/agents/agents.rss; 0 subscribers)
- igdeSpyder (compatible; igde.ru; +http://igde.ru/doc/tech.html)
- iGetter/1.x (Macintosh;G;PPC)
- iGetter/2 (Macintosh; U; PPC Mac OS X; en)
- IIITBOT
- IIITBOT/1.1 (Indian Language Web Search Engine; http://webkhoj.iiit.net; pvvpr at iiit dot ac dot in)
- Ilial
- ilial/Nutch-0.9 (Ilial, Inc. is a Los Angeles based Internet startup company. For more information please visit http://www.ilial.com/crawler; http://www.ilial.com/crawler; crawl@ilial.com)
- ilial/Nutch-0.9-dev
- IlseBot
- IlseBot/1.x
- IlTrovatore
- IlTrovatore-Setaccio ( http://www.iltrovatore.it)
- Iltrovatore-Setaccio/0.3-dev (Indexing; http://www.iltrovatore.it/bot.html; info@iltrovatore.it)
- IlTrovatore-Setaccio/1.2 ( http://www.iltrovatore.it/aiuto/faq.html)
- Iltrovatore-Setaccio/1.2 (It-bot; http://www.iltrovatore.it/bot.html; info@iltrovatore.it)
- iltrovatore-setaccio/1.2-dev (spidering; http://www.iltrovatore.it/aiuto/.....)
- IlTrovatore/1.2 (IlTrovatore; http://www.iltrovatore.it/bot.html; bot@iltrovatore.it)
- ImageVisu/v4.x.x
- ImageWalker
- ImageWalker/2.0 (www.bdbrandprotect.com)
- imedixbot
- Incutio HttpClient v0.x
- IncyWincy
- IncyWincy data gatherer(webmaster@loopimprovements.com
- IncyWincy page crawler(webmaster@loopimprovements.com
- IncyWincy(http://www.look.com)
- IncyWincy(http://www.loopimprovements.com/robot.html)
- IncyWincy/2.1(loopimprovements.com/robot.html)
- IndexTheWeb.com
- IndexTheWeb.com Crawler7
- Industry Program 1.0.x
- Indy Library
- Inet library
- InetURL/1.0
- info@pubblisito.com- (http://www.pubblisito.com) il Sud dei Motori di Ricerca
- Infoaxe./Nutch-0.9
- infoConveraCrawler/0.8 ( http://www.authoritativeweb.com/crawl)
- InfoFly/1.0 (http://www.versions-project.org/)
- InfoLink/1.x
- INFOMINE
- INFOMINE/8.0 Adders
- INFOMINE/8.0 RemoteServices
- INFOMINE/8.0 VLCrawler (http://infomine.ucr.edu/useragents)
- InfoNaviRobot
- InfoNaviRobot(F107)
- InfoSeek
- InfoSeek Sidewinder/0.9
- InfoSeek Sidewinder/1.0A
- InfoSeek Sidewinder/1.1A
- Infoseek SideWinder/1.45 (Compatible; MSIE 10.0; UNIX)
- Infoseek SideWinder/2.0B (Linux 2.4 i686)
- INGRID/3.0 MT
- INGRID/3.0 MT (webcrawler@NOSPAMexperimental.net; http://webmaster.ilse.nl/jsp/webmaster.jsp)
- Inktomi
- Inktomi Search
- InnerpriseBot
- InnerpriseBot/1.0 (http://www.innerprise.com/)
- Insitor
- Insitor.com search and find world wide!
- Insitornaut
- InstallShield DigitalWizard
- integrity/1.6
- Intelix
- Intelix/0.x (cs; http://www.microton.cz/intelix/; microton@@microton.cz)
- Interarchy/x.x.x (InterarchyCrawler)
- Internet Ninja x.0
- InternetArchive (BOT)
- InternetArchive/0.8-dev(Nutch;http://lucene.apache.org/nutch/bot.html;nutch-agent@lucene.apache
- InternetLinkAgent/3.1
- InternetSeer.com
- intraVnews/1.x
- IOI/2.0 (ISC Open Index crawler; http://index.isc.org/; bot@index.isc.org)
- IP*Works! V5 HTTP/S Component - by /n software - www.nsoftware.com
- IP2LocationBot/1.0 http://www.ip2location.com
- IP2MapBot/1.1
- IP2MapBot/1.1 http://www.ip2map.com
- IPiumBot laurion(dot)com
- IpselonBot
- IpselonBot/0.xx-beta (Ipselon; http://www.ipselon.com; ipselonbot@ipselon.com)
- Iria/1.xxa
- IRLbot
- IRLbot/1.0 ( http://irl.cs.tamu.edu/crawler)
- IRLbot/3.0 (compatible; MSIE 6.0; http://irl.cs.tamu.edu/crawler/)
- IrssiUrlLog/0.2
- Irvine/1.x.x
- ISC Systems iRc Search 2.1
- isilox
- iSiloX/4.xx Windows/32
- isurf (tszhu@canada.com)
- iTunes/x.x.x
- IUPUI Research Bot v 1.9a
- iVia Page Fetcher (http://ivia.ucr.edu/useragents.shtml)
- iVia/4.0 CanonizeUrl (http://infomine.ucr.edu/iVia/useragents.shtml
- IWAgent/ 1.0 - www.brandprotect.com
- J-PHONE/3.0/J-SH07
- Jabot
- Jabot/6.x (http://odin.ingrid.org/)
- Jabot/7.x.x (http://odin.ingrid.org/)
- Jack
- Jakarta Commons-HttpClient/2.0xxx
- Jakarta Commons-HttpClient/3.0-rcx
- Jambot
- Jambot/0.1.x (Jambot; http://www.jambot.com/blog; crawler@jambot.com)
- Jambot/0.2.1 (Jambot; http://www.jambot.com/blog/static.php?page=webmaster-robot; crawler@jambot.com)
- Java 1.1
- Java/1.4.1_01
- Java1.0.21.0
- Java1.1.xx.x
- Java1.3.0rc1
- Java1.3.x
- Java1.4.0
- Jayde Crawler
- Jayde Crawler. http://www.jayde.com
- JBH Agent 2.0
- jBrowser/J2ME Profile/MIDP-1.0 Configuration/CLDC-1.0 (Google WAP Proxy/1.0)
- JCheckLinks/0.1 RPT-HTTPClient/0.3-1
- JDK/1.1
- JennyBot
- Jeode/1.x.x
- Jetbot
- Jetbot/1.0
- JetBrains Omea Reader 1.0.x (http://www.jetbrains.com/omea_reader/)
- JetBrains Omea Reader 2.0 Release Candidate 1 (http://www.jetbrains.com/omea_reader/)
- JetCar
- Jigsaw/2.2.x W3C_CSS_Validator_JFouffa/2.0
- JoBo/@JOBO_VERSION@(http://www.matuschek.net/jobo.html)
- JoBo/1.x (http://www.matuschek.net/jobo.html)
- JobSpider_BA/1.1
- JOC Web Spider
- JordoMedia/1.0 RSS File Reader (http://www.jordomedia.com)
- Journster [alpha] (http://journster.com/)
- Journster.com RSS/Atom aggregator 0.5 (http://www.journster.com/bot.phtml)
- JRTS Check Favorites Utility
- JRTwine Software Check Favorites Utility
- jyxobot
- Jyxobot/x
- K-Meleon/0.6 (Windows; U; Windows NT 5.1; en-US; rv:0.9.5) Gecko/20011011
- k2spider
- KAIST AITrc Crawler
- KakleBot - www.kakle.com/0.1 (KakleBot - www.kakle.com; http:// www.kakle.com/bot.html; support@kakle.com)
- kalooga/kalooga-4.0-dev-datahouse (Kalooga; http://www.kalooga.com; info@kalooga.com)
- kalooga/KaloogaBot (Kalooga; http://www.kalooga.com/info.html?page=crawler; crawler@kalooga.com)
- Kapere (http://www.kapere.com)
- Kazehakase/0.x.x.[x]
- KDDI-SN22 UP.Browser/6.0.7 (GUI) MMP/1.1 (Google WAP Proxy/1.0)
- KE_1.0/2.0
- KE_1.0/2.0 libwww/5.2.8
- Kenjin Spider
- Kevin http://dznet.com/kevin/
- Kevin http://websitealert.net/kevin/
- Keyword Density
- KFSW-Bot
- KFSW-Bot (Version: 1.01 powered by KFSW www.kfsw.de)
- Kinja
- kinja-imagebot (http://www.kinja.com/)
- kinjabot (http://www.kinja.com)
- KIT-Fireball
- KIT-Fireball/2.0
- KIT-Fireball/2.0 (compatible; Mozilla 4.0; MSIE 5.5)
- Klondike/1.50 (WSP Win32) (Google WAP Proxy/1.0)
- KnowItAll(knowitall@cs.washington.edu)
- Knowledge.com
- Knowledge.com/0.x
- Kontiki Client x.xx
- Krugle/Krugle,Nutch/0.8+ (Krugle web crawler; http://www.krugle.com/crawler/info.html; webcrawler@krugle.com)
- KSbot/1.0 (KnowledgeStorm crawler; http://www.knowledgestorm.com/resources/content/crawler/index.html; crawleradmin@knowledgestorm.com)
- Kuloko
- kuloko-bot/0.x
- kulokobot www.kuloko.com kuloko@backweave.com
- kulturarw3
- kulturarw3/0.1
- KummHttp/1.1 (compatible; KummClient; Linux rulez)
- KWC-KX9/1109 UP.Browser/6.2.3.9.g.1.107 (GUI) MMP/2.0 UP.Link/6.3.0.0.0
- Labrador/0.2; http://ir.dcs.gla.ac.uk/labrador; craigm@dcs.gla.ac.uk
- Lachesis
- lanshanbot/1.0
- lanshanbot/1.0 (+http://search.msn.com/msnbot.htm)
- LapozzBot
- LapozzBot/1.4 ( http://robot.lapozz.com)
- LapozzBot/1.5 (+http://robot.lapozz.hu)
- larbin
- larbin (samualt9@bigfoot.com)
- larbin_2.1.1 larbin2.1.1@somewhere.com
- larbin_2.2.0 (crawl@compete.com)
- larbin_2.2.1_de_Viennot (Laurent.Viennot@inria.fr)
- larbin_2.2.2 (sugayama@lab7.kuis.kyoto-u.ac.jp)
- larbin_2.2.2_guillaume (guillaume@liafa.jussieu.fr)
- larbin_2.6_basileocaml (basile.starynkevitch@cea.fr)
- larbin_2.6.0 (larbin2.6.0@unspecified.mail)
- larbin_2.6.1 (larbin2.6.1@unspecified.mail)
- larbin_2.6.2 (hamasaki@grad.nii.ac.jp)
- larbin_2.6.2 (larbin2.6.2@unspecified.mail)
- larbin_2.6.2 (listonATccDOTgatechDOTedu)
- larbin_2.6.2 (pimenas@systems.tuc.gr)
- larbin_2.6.2 (tom@lemurconsulting.com)
- larbin_2.6.2 (vitalbox1@hotmail.com)
- larbin_2.6.3 (ltaa_web_crawler@groupes.epfl.ch)
- larbin_2.6.3 (wgao@genieknows.com)
- larbin_2.6.3_for_(http://cosco.hiit.fi/search/) tsilande@hiit.fi
- larbin_devel (http://pauillac.inria.fr/~ailleret/prog/larbin/)
- LARBIN-EXPERIMENTAL (efp@gmx.net)
- lawinfo-crawler/Nutch-0.9-dev (Crawler for lawinfo.com pages; http://www.lawinfo.com; webmaster@lawinfo.com)
- lc/$ROADS::Version libwww-perl/5.00
- lcabotAccept: */*
- LeapTag/0.8.1.beta081.r3750 (compatible; Mozilla 4.0; MSIE 5.5; robot@yoriwa.com)
- LECodeChecker/3.0 libgetdoc/1.0
- LeechGet 200x (www.leechget.de)
- LEIA/
- LEIA/2.90
- LEIA/3.01pr (LEIAcrawler; [SNIP])
- LetsCrawl.com/1.0 +http://letscrawl.com/
- LexiBot
- LexiBot/1.00
- LG-LX260 POLARIS-LX260/2.0 MMP/2.0 Profile/MIDP-2.0 Configuration/CLDC-1.1
- LG/U8138/v1.0
- Libby_1.1/libwww-perl/5.47
- libcurl-agent/1.0
- LibertyW (+http://www.lw01.com)
- libWeb/clsHTTP
- libWeb/clsHTTP -- hiongun@kt.co.kr
- libwww-perl/5.41
- libwww-perl/5.45
- libwww-perl/5.48
- libwww-perl/5.50
- libwww-perl/5.52 FP/2.1
- libwww-perl/5.52 FP/4.0
- libwww-perl/5.53
- libwww-perl/5.63
- libwww-perl/5.64
- libwww-perl/5.65
- libwww-perl/5.800
- libwww/5.3.2
- Liferea/0.x.x (Linux; en_US.UTF-8; http://liferea.sf.net/)
- Liferea/1.x.x (Linux; es_ES.UTF-8; http://liferea.sf.net/)
- LightningDownload/1.0beta2
- LightningDownload/1.x.x
- LightningDownload/1.x.x [Accelerated x]