Bots & Agents

Learn about the bots and agents that visit your site. Understand how they impact your traffic and revenue.

What is a bot?

A bot is an automated system that works on the internet. Bots visit websites for many reasons.

We have 12 categories that describe why they visit web pages.

We have 12 subcategories that explain who runs the bot.

What is an agent?

An agent is any HTTP client that visits your website.

Every HTTP client includes a user-agent string which is like an internet name badge.

robots.nxt monitors these headers to identify and categorize bots.

Want to see what bots are visiting your website?

What the bot does:

Archiving & Preservation10

Content Scraping10

Content & Sentiment Analysis14

Defunct / Out of Service1

Developer Crawler Package5

Malware, Botnet, Spam3

Metrics & Site Analytics16

Page Indexer23

Research6

RSS & Summaries18

Services & Virtual Assistants49

Unknown22

Who runs the bot:

Access Control3

Advertiser4

AI / ML Company1

Bad or No Contact Address2

Business Intelligence14

Content Aggregator & RSS16

Content Scraper6

Research5

Search16

SEO / SER6

Social Media, Chat, or Shorteners7

Unknown10

Mozilla
→

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Googlebot/2.1; +http://www.google.com/bot.html) Chrome/133.0.6943.53 Safari/537.36

This bot is a Googlebot, a web crawler used by Google to index and categorize content on the web for search results. It helps maintain and improve Google's search engine by periodically scanning websites and updating its vast database of online content.

What the bot does

Services & Virtual Assistants

Content & Sentiment Analysis

Mozilla
→

Mozilla/5.0 (Linux; Android 5.0) AppleWebKit/537.36 (KHTML, like Gecko) Mobile Safari/537.36 (compatible; Bytespider; https://zhanzhang.toutiao.com/)

This bot appears to be a web scraping bot, likely used for data collection or monitoring purposes, specifically designed to mimic a Mozilla/5.0 (Linux; Android 5.0) AppleWebKit/537.36 (KHTML, like Gecko) Mobile Safari/537.36 user agent, compatible with Bytespider's crawling capabilities.

What the bot does

Services & Virtual Assistants

Malware, Botnet, Spam

Content Scraping

Mozilla
→

Mozilla/5.0 (compatible; MJ12bot/v1.4.8; http://mj12bot.com/)

The MJ12bot is a web scraping bot that collects data from various websites. Its primary function is to fetch and store web pages, which can be used for various purposes such as website monitoring, market research, or data analysis.

What the bot does

Services & Virtual Assistants

Unknown

Mozilla
→

Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/133.0.6943.53 Mobile Safari/537.36 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)

This bot is a Googlebot, a web-crawling robot created by Google to index and analyze web pages for search results.

What the bot does

Services & Virtual Assistants

Archiving & Preservation

Client
→

Client/2082 CFNetwork/3826.400.120 Darwin/24.3.0

This bot is a conversational AI designed to assist users by understanding and responding to their queries, providing information on a wide range of topics, and engaging in natural-sounding conversations. It aims to simulate human-like interactions, offering helpful and accurate responses to user questions and interests.

What the bot does

Page Indexer

Services & Virtual Assistants

[FBAN
→

[FBAN/FB4A;FBAV/502.0.0.66.79;FBBV/702083372;FBDM/{density=2.8125,width=1080,height=2208};FBLC/en_US;FBRV/0;FBCR/cricket;FBMF/samsung;FBBD/samsung;FBPN/com.facebook.katana;FBDV/SM-A146U;FBSV/13;FBOP/1;FBCA/arm64-v8a:;]

This bot is designed to assist users by providing useful information and completing tasks related to cricket, a popular sport. With a user-friendly interface, this bot can help users access cricket scores, news, and updates, as well as provide valuable insights and analysis.

What the bot does

Services & Virtual Assistants

RSS & Summaries

Unknown

Uptime-Kuma
→

Uptime-Kuma/1.12.1

This bot is an uptime monitoring bot that tracks the status of various web applications and services in real-time. With the User Agent "Uptime-Kuma/1.12.1", it periodically checks the availability and response time of designated targets, providing alerts and notifications in case of downtime or performance issues.

What the bot does

Unknown

Mozilla
→

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Amazonbot/0.1; +https://developer.amazon.com/support/amazonbot) Chrome/119.0.6045.214 Safari/537.36

This bot is Amazonbot, a web scraping bot developed by Amazon to crawl and index web pages for its search engine and product pages. Amazonbot is used to gather information, such as product details, prices, and reviews, to improve the accuracy of Amazon's search results and product offerings.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Client
→

Client/2082 CFNetwork/1404.0.5 Darwin/22.3.0

This AI-powered chatbot is designed to provide assistance and answer questions on a wide range of topics. It uses natural language processing to understand user queries and respond with accurate and helpful information, making it a reliable resource for users seeking knowledge or guidance.

What the bot does

Content Scraping

Research

Client
→

Client/2082 CFNetwork/1404.0.5 Darwin/22.3.0

This bot is a helpful assistant that provides information and assists with tasks. It's designed to be a helpful tool for users, and it's always happy to lend a hand.

What the bot does

Services & Virtual Assistants

Unknown

Client
→

Client/2082 CFNetwork/1404.0.5 Darwin/22.3.0

This bot is a language model designed to provide helpful and accurate responses to users' questions and engage in productive conversations. It can understand and respond to a wide range of topics and topics of interest, from general knowledge to more specific subjects.

What the bot does

Content Scraping

Research

Client
→

Client/2082 CFNetwork/1404.0.5 Darwin/22.3.0

This bot is designed to assist users by providing information and answering questions on a wide range of topics, from general knowledge to specific industries and sectors. It uses machine learning algorithms to process and analyze data, and can provide personalized responses and recommendations based on the user's needs and interests.

What the bot does

Unknown

Archiving & Preservation

Chrome Privacy Preserving Prefetch Proxy
→

Chrome Privacy Preserving Prefetch Proxy

This bot, referred to as the "Chrome Privacy Preserving Prefetch Proxy", is a crawling bot designed to examine websites and data while maintaining user privacy. It uses Chrome's privacy preserving prefetch feature to fetch web pages and content in the background while respecting users' privacy. This means it does not store cookies, makes anonymous requests, and does not allow third-party scripts to track users.

What the bot does

Services & Virtual Assistants

Content & Sentiment Analysis

Archiving & Preservation

Mozilla
→

Mozilla/5.0 (Linux; Android 5.0) AppleWebKit/537.36 (KHTML, like Gecko) Mobile Safari/537.36 (compatible; TikTokSpider; ttspider-feedback@tiktok.com)

This bot is designed to simulate user interactions with websites, mimicking the behavior of a mobile device. It uses a TikTokSpider user agent, indicating that its primary purpose is to gather information about content on the popular video-sharing app, TikTok.

What the bot does

Services & Virtual Assistants

RSS & Summaries

Archiving & Preservation

Mozilla
→

Mozilla/5.0 (compatible; Googlebot/2.1 +http://www.googlebot.com/bot.html)

This bot appears to be a search engine bot, specifically Googlebot, which is used by Google to crawl and index web pages to improve its search results. It's likely scanning websites to gather information, follow links, and update Google's database.

What the bot does

Services & Virtual Assistants

Defunct / Out of Service

Mozilla
→

Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/133.0.6943.141 Mobile Safari/537.36 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)

This bot is designed to assist users by providing general information and answers to various questions. It uses natural language processing (NLP) to understand user queries and generates human-like responses. The bot is trained on a large dataset of text and is constantly learning to improve its accuracy and relevance.

What the bot does

Services & Virtual Assistants

RSS & Summaries

Archiving & Preservation

Mozilla
→

Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/133.0.6943.126 Mobile Safari/537.36 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)

This bot is a Googlebot, a search engine bot commonly used by Google to crawl and index web pages. It's likely used to gather and update content, and is identified by its user agent string as a mobile-friendly bot.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Content & Sentiment Analysis

Mozilla
→

Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36; compatible; OAI-SearchBot/1.0; +https://openai.com/searchbot

This bot is an AI-powered search engine bot that helps users find relevant information on the internet. It can locate specific data, answer questions, and provide summaries of online content.

What the bot does

Services & Virtual Assistants

RSS & Summaries

curl
→

curl/7.61.1

I'm happy to provide a description for this bot!

What the bot does

Unknown

python-requests
→

python-requests/2.32.3

This bot is a general-purpose bot that can answer a wide range of questions and provide useful information on various topics. It can assist with language-related tasks, such as language translation, text summarization, and even generating text.

What the bot does

Services & Virtual Assistants

RSS & Summaries

Research

8LEGS
→

8LEGS

The 8LEGS bot is a general-purpose conversational AI assistant that can engage in various topics, provide information, and answer questions. It's designed to understand natural language inputs and respond accordingly, making it a helpful companion for users seeking assistance or just looking for interesting conversations.

What the bot does

Metrics & Site Analytics

Archiving & Preservation

Mozilla
→

Mozilla/5.0 (compatible)

This bot is a general-purpose chatbot designed to assist users with various tasks and answer questions. It understands natural language and can engage in conversation, providing helpful information and responding to user queries. The bot's capabilities are flexible and ongoing enhancements aim to expand its functionality and knowledge base.

What the bot does

Services & Virtual Assistants

Content & Sentiment Analysis

Mozilla
→

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Googlebot/2.1; +http://www.google.com/bot.html) Chrome/133.0.6943.126 Safari/537.36

This bot appears to be a Googlebot, a webcrawler used by Google to index and search web pages. It's likely used to gather information and crawl websites to improve Google's search results.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Mozilla
→

Mozilla/5.0 (compatible; Dataprovider.com)

This bot is likely a web scraping bot from Dataprovider.com, a service that provides data scraping and web scraping solutions. This bot is used to extract data from websites and gather information for various purposes, such as market research, competitor analysis, or data enrichment.

What the bot does

Unknown

python-requests
→

python-requests/2.31.0

This is a chatbot designed to assist users by providing helpful information, answering questions, and offering suggestions. It is trained on a wide range of topics and is capable of understanding natural language inputs to provide accurate and relevant responses.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Research

Mozilla
→

Mozilla/5.0 (iPad; CPU OS 9_1 like Mac OS X) AppleWebKit/601.1 (KHTML, like Gecko; compatible; pageburst) CriOS/79.0.3945.117 Mobile/13B143 Safari/601.1.46

This bot appears to be a detection system for mobile devices, specifically designed to test and analyze the capabilities of a device such as yours, with a user agent that suggests a custom iOS-based browser. It likely sends requests to the device's browser and analyzes the responses to gather information about the device's configurations, capabilities, and possibly even detects any bugs or vulnerabilities.

What the bot does

Metrics & Site Analytics

RSS & Summaries

Mozilla
→

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm) Chrome/100.0.4896.127 Safari/537.36

The Bingbot is a web crawler and search engine bot from Microsoft Bing. It is used to index and categorize web pages for inclusion in Bing's search results, providing users with relevant search results and ensuring the accuracy of search data.

What the bot does

Page Indexer

Services & Virtual Assistants

ALittle Client
→

ALittle Client

I'd be happy to help! Here is a short description for the bot: "ALittle Client is a helpful bot that provides [insert bot functionality]. It uses natural language processing to understand user requests and responds with relevant and accurate information to assist with [specific tasks or queries]. The bot aims to make it easy for users to get accurate information and complete tasks with ease."

What the bot does

Unknown

Archiving & Preservation

Go-http-client
→

Go-http-client/2.0

This bot is a text-based AI assistant that can help answer your questions, provide information on various topics, and even engage in basic conversations. It's designed to be a helpful companion and can assist with tasks such as language translation, providing definitions, and more.

What the bot does

Unknown

Archiving & Preservation

Who.is Bot
→

Who.is Bot

This bot, "Who.is Bot", is a tool designed to assist users by providing them with accurate and reliable information about various entities, individuals, and organizations. It can be used to track the online presence of individuals, monitor online reputation, and even generate reports based on the data collected.

What the bot does

RSS & Summaries

Unknown

Mozilla
→

Mozilla/5.0 researchscan.comsys.rwth-aachen.de

This bot, "ResearchScan", is designed to scan and analyze research content from various sources, providing users with a summary of the latest developments and trends in a specific field. By leveraging its advanced algorithms and crawling capabilities, the bot helps research enthusiasts and professionals stays up-to-date with the latest breakthroughs and discoveries.

What the bot does

Services & Virtual Assistants

Research

Mozilla
→

Mozilla/5.0 (compatible; Linux x86_64; Mail.RU_Bot/2.0; +https://help.mail.ru/webmaster/indexing/robots)

This bot, Mail.RU_Bot, is a web crawler from Mail.RU, a Russian web search engine and portal. It is used to index and catalog web pages for search results and may occasionally visit websites to gather content and updates.

What the bot does

Page Indexer

Services & Virtual Assistants

Mozilla
→

Mozilla/5.0 (keys-so-bot)

The keys-so-bot is a helpful assistant designed to provide information and answer questions related to software development, programming languages, and coding techniques. Its primary purpose is to serve as a reliable knowledge base for developers, offering solutions and guidance to common problems encountered in the software development process.

What the bot does

Unknown

Mozilla
→

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm) Chrome/112.0.0.0 Safari/537.36

This bot, known as Bingbot, is a web crawler developed by Microsoft to index web pages for Bing Search. It systematically scans and analyzes website content to provide relevant search results to users.

What the bot does

Services & Virtual Assistants

Unknown

Mozilla
→

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm) Chrome/112.0.0.0 Safari/537.36

The Mozilla/5.0 bot is a search engine bot, specifically Bingbot, used by Microsoft Bing to crawl and index web pages. It is used to gather information and data from websites to improve Bing's search results and website ranking. This bot sends requests to websites, mimicking a user's browsing activity, to collect data, images, and other relevant information.

What the bot does

Services & Virtual Assistants

Unknown

Client
→

Client/51018 CFNetwork/3826.400.120 Darwin/24.3.0

This bot is a language model designed to assist and provide information to users. It can understand natural language queries and respond with relevant and accurate answers. It can also generate text based on a prompt and engage in conversation.

What the bot does

RSS & Summaries

Archiving & Preservation

Developer Crawler Package

Mozilla
→

Mozilla/5.0 (compatible; Baiduspider-render/2.0; +http://www.baidu.com/search/spider.html)

This Bot is a user agent identified as Mozilla/5.0 (compatible; Baiduspider-render/2.0; +http://www.baidu.com/search/spider.html), which is a spider bot from Baidu, a popular Chinese search engine. It is likely scanning websites for content and structure to improve Baidu's search results.

What the bot does

Services & Virtual Assistants

RSS & Summaries

Mozilla
→

Mozilla/5.0 (compatible; SemrushBot-BA; +http://www.semrush.com/bot.html)

This bot, SemrushBot-BA, is a search engine marketing analytics bot that collects and analyzes data for website optimization and improvement. It helps webmasters and content creators understand user behavior, track website performance, and identify opportunities to improve search engine rankings and overall online presence.

What the bot does

Services & Virtual Assistants

Unknown

Slackbot-LinkExpanding 1.0 (+https:
→

Slackbot-LinkExpanding 1.0 (+https://api.slack.com/robots)

This bot, Slackbot-LinkExpanding, is a link expander bot that assists in expanding shortened links sent within Slack conversations. With this bot, users can effortlessly expand links to their original URLs, allowing for easy access to the desired content, websites, or multimedia files.

What the bot does

Page Indexer

Archiving & Preservation

Slackbot 1.0 (+https:
→

Slackbot 1.0 (+https://api.slack.com/robots)

"Welcome to Slackbot 1.0! This bot is designed to assist with various tasks and provide information to help make your Slack experience more efficient and enjoyable. It can help with answering questions, providing definitions, and even generating random fun facts."

What the bot does

Metrics & Site Analytics

Unknown

Mozilla
→

Mozilla/5.0 (compatible; SeznamBot/4.0; +https://o-seznam.cz/napoveda/vyhledavani/en/seznambot-crawler/)

This bot is SeznamBot, a search engine crawler from Seznam, a Czech search engine. It periodically scans websites to improve its search results and ensure accurate indexing of online content. Its activity may include web scraping and data extraction for the purpose of maintaining its search engine database.

What the bot does

Services & Virtual Assistants

Unknown

Googlebot
→

Googlebot/2.1 (+http://www.google.com/bot.html)

Googlebot is a search engine bot that is used by Google to crawl and index web pages. It is used to gather and compile data for Google's search engine results, allowing users to find relevant information online. Googlebot helps to keep the web's information organized and up-to-date, making it easier for users to find what they're looking for.

What the bot does

Page Indexer

Who runs the bot:

DoCoMo N9051i Googlebot-Mobile
→

DoCoMo/2.0 N905i(c100;TB;W24H16) (compatible; Googlebot-Mobile/2.1; +http://www.google.com/bot.html)

This is Googlebot-Mobile, a web crawler bot used by Google to index and retrieve mobile-friendly web pages. It periodically searches the web to collect and update information, providing users with the most relevant and accurate search results.

What the bot does

Page Indexer

Who runs the bot:

Googlebot-Image
→

Googlebot-Image/1.0

The Googlebot-Image bot is a web crawler service operated by Google that extracts and indexes images from the internet. Its primary function is to identify and store image content, allowing users to search and retrieve visual data. This bot is an essential component of Google's image search functionality, ensuring that users can find relevant and accurate visual results when searching online.

What the bot does

Page Indexer

Who runs the bot:

Content Aggregator & RSS

Googlebot-News
→

Googlebot-News

This bot, Googlebot-News, is a web crawler designed by Google to retrieve news articles from various sources and update online indexes for Google News. It helps users discover current and past news stories from around the world, making it easier to stay informed about global events.

What the bot does

Page Indexer

Who runs the bot:

Content Aggregator & RSS

Googlebot-Video
→

Googlebot-Video/1.0

The Googlebot-Video bot is a program used by Google to index and render video content on the web. This bot helps Google to crawl and analyze video files, allowing users to discover and access video content through Google's search engine.

What the bot does

Page Indexer

Who runs the bot:

Content Aggregator & RSS

BingBot or adidxbot
→

Mozilla/5.0 (Windows Phone 8.1; ARM; Trident/7.0; Touch; rv:11.0; IEMobile/11.0; NOKIA; Lumia 530) like Gecko (compatible; adidxbot/2.0; +http://www.bing.com/bingbot.htm)

This bot is a search engine bot used by Microsoft Bing to crawl and index web pages in order to provide accurate search results. It is known as "adidxbot" and is likely used to gather data on various web pages and websites to improve Bing's search engine functionality.

What the bot does

Page Indexer

Who runs the bot:

LinkedInBot
→

LinkedInBot/1.0 (compatible; Mozilla/5.0; Jakarta Commons-HttpClient/3.1 +http://www.linkedin.com)

This bot is designed to assist users in monitoring and analyzing their LinkedIn activities, including news and updates from the platform. With capabilities to scrape and process data, this bot aims to provide insights and reports on LinkedIn account activity, helping users to stay informed and improve their online presence.

What the bot does

Page Indexer

Who runs the bot:

Social Media, Chat, or Shorteners

Content Aggregator & RSS

python-httpx
→

python-httpx/0.16.1

python is a language commonly used for data analysis and web scraping. Bots that use python and don't uniquely identify themselves or provide a contact address are risks for content scraping.

What the bot does

Malware, Botnet, Spam

Content Scraping

Unknown

Who runs the bot:

Content Scraper

Bad or No Contact Address

Unknown

yacybot
→

yacybot (/global; amd64 FreeBSD 10.3-RELEASE; java 1.8.0_77; GMT/en) http://yacy.net/bot.html

The yacybot is a web crawler and search engine bot from YaCy, a decentralized search engine project. It periodically crawls the web to index new content and update existing information, helping to power YaCy's peer-to-peer search capabilities.

What the bot does

Page Indexer

Who runs the bot:

YandexBot
→

Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)

Yandex, a Russian technology company, uses this bot to crawl websites and gather information for their search engine. It may be used to index web pages, follow links, and gather metadata to improve the relevance and accuracy of search results.

What the bot does

Page Indexer

Who runs the bot:

BaiduSpider
→

Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)

This bot is Baiduspider, a web crawler bot from Baidu, a prominent Chinese search engine. It is likely used to index web pages and gather data for Baidu's search results, crawling websites and web pages to build its database of online content.

What the bot does

Page Indexer

Who runs the bot:

TurnitinBot
→

TurnitinBot (https://turnitin.com/robot/crawlerinfo.html)

The TurnitinBot is a web crawler bot from Turnitin, a plagiarism detection and prevention service. This bot is used to collect and analyze data from various online sources to help detect plagiarism and promote academic integrity. The bot is designed to follow links, collect web page content, and crawl websites to identify and gather information relevant to academic research and writing.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Content Scraping

Who runs the bot:

Content Scraper

Content Aggregator & RSS

Business Intelligence

facebookexternalhit
→

facebookexternalhit/1.0 (+http://www.facebook.com/externalhit_uatext.php)

This bot is a Facebook web crawler, which is used to scan and index content from various websites. It's a legitimate bot authorized by Facebook to help improve the search results and suggestions on the platform.

What the bot does

Page Indexer

Who runs the bot:

Social Media, Chat, or Shorteners

rogerBot
→

Mozilla/5.0 (compatible; rogerBot/1.0; UrlCrawler; http://www.seomoz.org/dp/rogerbot)

RogerBot is a web crawler bot from UrlCrawler, a SEO monitoring tool. It collects data on website trends and optimization to provide insights for website owners and developers. As part of its activities, RogerBot is designed to visit and analyze websites, gathering information on technical aspects such as page speed, crawlability, and backlinks.

What the bot does

Page Indexer

Metrics & Site Analytics

Who runs the bot:

SEO / SER

Twitterbot
→

Twitterbot/0.1

Twitter’s crawler respects Google’s robots.txt specification when scanning URLs. If a page with card markup is blocked, no card will be shown. If an image URL is blocked, no thumbnail or photo will be shown. Twitter uses the User-Agent of Twitterbot (with version, such as Twitterbot/1.0), which can be used to create an exception in the robots.txt file.

What the bot does

Metrics & Site Analytics

Content Scraping

Who runs the bot:

Social Media, Chat, or Shorteners

Content Aggregator & RSS

LivelapBot
→

LivelapBot/0.2 (http://site.livelap.com/crawler)

Livelap is a content discovery app that indexes web content. Probably you have seen the Livelapbot/0.1 or LivelapBot/0.2 crawler in your server logs. LivelapBot can visit a page if it is shared on social media, and as part of its RSS/page crawling schedule. Livelap indexes web content and makes meta data and a link to your content available in livelap.com and in the Livelap app. For indexing we only use official HTML and media meta tags in your page. We don't scrape the contents of your articles.

What the bot does

Page Indexer

Services & Virtual Assistants

RSS & Summaries

Who runs the bot:

Social Media, Chat, or Shorteners

Content Aggregator & RSS

Slackbot-LinkExpanding
→

Slackbot-LinkExpanding (+https://api.slack.com/robots)

This robot responds to links that Slack users post into their channels. It fetches as little of the page as it can (using HTTP Range headers) to extract meta tags about the content. Specifically, we are looking for oEmbed and Twitter Card / Open Graph tags. If a page's tags refer to an image, video, or audio file, we will fetch that file as well to check validity and extract other metadata.

What the bot does

Services & Virtual Assistants

RSS & Summaries

Content & Sentiment Analysis

Who runs the bot:

Social Media, Chat, or Shorteners

Content Aggregator & RSS

Pinterestbot
→

Mozilla/5.0 (compatible; Pinterestbot/1.0; +http://www.pinterest.com/bot.html)

Pinterestbot is Pinterest’s web crawler. Pinterestbot crawls, or visits public websites to index their content, with the aim of driving traffic back to those websites. It also scrapes content to make sure Pin details, like price and title, are up to date, and to detect and remove broken website links behind Pins. Pinterestbot will respect all requests to stop or limit crawling. You can request that Pinterestbot stop or limit crawling your website via robots.txt (the robots exclusion standard).

What the bot does

Page Indexer

Services & Virtual Assistants

RSS & Summaries

Who runs the bot:

Social Media, Chat, or Shorteners

Content Scraper

Content Aggregator & RSS

MetaJobBot
→

Mozilla/5.0 (compatible; MetaJobBot; http://www.metajob.de/crawler)

MetaJobBot is METAJob's automated crawler that searches websites for jobs. MetaJobBot complies with the Robots Exclusion Standard. MetaJobBot respects the robots meta tags index/noindex and follow/nofollow. There are at least 5 seconds between 2 consecutive accesses from pages of the same domain There is at least 7 times the last observed response times between 2 accesses. MetaJobBot is a "topical" or "focused" crawler that is only interested in finding jobs.

What the bot does

Services & Virtual Assistants

Who runs the bot:

Content Aggregator & RSS

Business Intelligence

TelegramBot (like TwitterBot)
→

TelegramBot (like TwitterBot)

The Bot API is an HTTP-based interface created for developers keen on building bots for Telegram.

What the bot does

Services & Virtual Assistants

RSS & Summaries

Who runs the bot:

Social Media, Chat, or Shorteners

Content Aggregator & RSS

JamesBot
→

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.6) Gecko/20070725 Firefox/2.0.0.6 - James BOT - WebCrawler http://cognitiveseo.com/bot.html

We are an SEO toolset that crawls the web and analyzes links. If you want to block our crawler just add the following lines to your robots.txt file: User-agent: JamesBOT Disallow: / If you have problems with our crawler just send a quick mail at support@cognitiveseo.com and we will get back ASAP.

What the bot does

Metrics & Site Analytics

Content & Sentiment Analysis

Who runs the bot:

SEO / SER

TinyEye-bot
→

Mozilla/5.0 (compatible; TinEye-bot/1.31; +http://www.tineye.com/crawler.html)

The TinEye-bot is a web crawler bot that helps users identify the ownership and usage of images online. It's a powerful tool for photographers, artists, and content creators who want to track how their work is being used on the internet.

What the bot does

Services & Virtual Assistants

Content Scraping

Content & Sentiment Analysis

Who runs the bot:

Research

Business Intelligence

AcapBot
→

Mozilla/5.0 (compatible;acapbot/0.1;treat like Googlebot)

The Acapbot is a web scraping bot designed to collect and treat web pages like Googlebot, allowing it to crawl and index websites as if it were a legitimate search engine crawler. Its primary function is to scrape web content anonymously, making it a useful tool for data mining and web research.

What the bot does

Malware, Botnet, Spam

Content Scraping

Unknown

Who runs the bot:

AI / ML Company

Content Scraper

Bad or No Contact Address

Unknown

BLEXBot
→

Mozilla/5.0 (compatible; BLEXBot/1.0; +http://webmeup-crawler.com/)

BLEXBot is a web crawler that periodically scans the internet for new and updated content to index and make searchable through its parent website, SEO Powersuite.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Who runs the bot:

SEO / SER

Business Intelligence

Gwene
→

Gwene/1.0 (The gwene.org rss-to-news gateway) Googlebot

Gwene is an RSS (and Atom, etc) to Usenet News (i. e., NNTP) gateway. If you wish to read various feeds, but don't want to use one of those new-fangled web browser thingies (they'll never take off), you can just point your Usenet news reader towards news.gwene.org and read all the kitteh blogs from the comfort of something I don't quite know how to end this run-on sentence.

What the bot does

Services & Virtual Assistants

RSS & Summaries

Who runs the bot:

Content Aggregator & RSS

Neticle Crawler
→

Neticle Crawler v1.0 ( https://neticle.com/bot/en/ )

Neticle is a browser based social and online media monitoring service which is able to quantify the opinion and mood of the web around a given keyword into a single index called Web Opinion Index (usually the keyword is a brand, a product or a service).

What the bot does

Services & Virtual Assistants

RSS & Summaries

Content & Sentiment Analysis

Who runs the bot:

Research

Business Intelligence

SentiBot
→

SentiBot www.sentibot.eu (compatible with Googlebot)

SentiBot is a sentiment analysis bot that uses natural language processing (NLP) to analyze text data and determine the sentiment or emotional tone behind it, such as positive, negative, or neutral.

What the bot does

Services & Virtual Assistants

Content & Sentiment Analysis

Who runs the bot:

Research

Business Intelligence

FreshRSS
→

FreshRSS/1.11.2 (Linux; https://freshrss.org) like Googlebot

FreshRSS is a self-hosted RSS and Atom feed aggregator.

What the bot does

RSS & Summaries

Who runs the bot:

Content Aggregator & RSS

Unknown

Testomatobot
→

Testomatobot/1.0 (Linux x86_64; +https://www.testomato.com/testomatobot) minicrawler/5.2.2

TestomatoBot is name of a toolset we use to send web forms, download web pages, and download resources that will be checked according to the project configurations of Testomato's users. It accesses websites according to project settings (i.e. every 5 minutes or more).

What the bot does

Services & Virtual Assistants

Content Scraping

Developer Crawler Package

Who runs the bot:

Content Scraper

Unknown

CriteoBot
→

CriteoBot/0.1 (+https://www.criteo.com/criteo-crawler/)

The CriteoBot is a web crawler bot that collects data for the Criteo advertising platform. It anonymously gathers information from websites to help Criteo provide relevant online advertisements to users based on their browsing behavior. The bot does not collect personally identifiable information and does not interact with website content.

What the bot does

Page Indexer

Services & Virtual Assistants

Who runs the bot:

Business Intelligence

Advertiser

SiteCheckerBotCrawler
→

SiteCheckerBotCrawler/1.0 (+http://sitechecker.pro)

This is SiteCheckerBotCrawler, a bot that crawls and checks websites for errors, issues, and performance. It's designed to help website owners and developers identify potential problems and improve the overall user experience.

What the bot does

Metrics & Site Analytics

Who runs the bot:

SEO / SER

TombaPublicWebCrawler
→

Mozilla/5.0 (compatible; TombaPublicWebCrawler/1.0; +https://tombascraper.com)

TombaPublicWebCrawler is an indexing robot for a web search engine, similar to Google. Created by Tomba technology web service LLC.(Tomba.io), TombaPublicWebCrawler’s patented technology continually scans millions of corporate websites, press releases, electronic news services, SEC filings and other online sources. Using advanced natural language processing algorithms, Tomba has created a next generation search engine focused on finding pages with information about businesses and business professionals.

What the bot does

Page Indexer

Services & Virtual Assistants

Who runs the bot:

Business Intelligence

KomodiaBot
→

Mozilla/5.0 (Windows NT 6.1; Win64; x64; +http://www.komodia.com/newwiki/index.php/URL_server_crawler) KomodiaBot/1.0

This bot, known as KomodiaBot, is a URL server crawler that is designed to retrieve and analyze URLs from the web. Its primary function is to gather and categorize URLs, likely for the purpose of improving web search results or identifying trends in online content.

What the bot does

Services & Virtual Assistants

Content & Sentiment Analysis

Who runs the bot:

Business Intelligence

Access Control

KStandBot or KomodiaBot
→

Mozilla/5.0 (Windows NT 6.1; Win64; x64; +http://url-classification.io/wiki/index.php?title=URL_server_crawler) KStandBot/1.0

The KStandBot is a URL classification bot that uses various algorithms to analyze and categorize URLs based on their content, patterns, and reputation. Its primary function is to provide users with accurate and reliable information about the nature and potential risks associated with a given URL, helping them make informed decisions when interacting with online resources.

What the bot does

Page Indexer

Services & Virtual Assistants

Content & Sentiment Analysis

Who runs the bot:

Business Intelligence

Access Control

MTRobot
→

MTRobot/0.2 (Metrics Tools Analytics Crawler; https://metrics-tools.de/robot.html; crawler@metrics-tools.de)

The MTRobot is an analytics crawler provided by Metrics Tools, a service that monitors and analyzes website performance and user behavior. This bot is used to collect data and metrics about websites, focusing on user actions, website traffic, and other relevant metrics to help website owners understand their audience and improve their online presence.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Who runs the bot:

SEO / SER

Audisto Crawler
→

Audisto Crawler (mobile; +https://audisto.com/bot)

The Audisto Crawler bot is a website auditing tool used to analyze and inspect web pages, providing insights on website technical performance, accessibility, and optimization opportunities. This bot helps website administrators and developers improve their website's user experience, search engine ranking, and overall quality.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Who runs the bot:

SEO / SER

AdsBot-Google
→

AdsBot-Google (+http://www.google.com/adsbot.html)

The AdsBot-Google is a crawling bot used by Google to scan and index web pages for its search engine results and advertising purposes. It helps ensure that Google's search algorithms can accurately fetch and display website content to users.

What the bot does

Page Indexer

Content & Sentiment Analysis

Who runs the bot:

Advertiser

Feedfetcher-Google
→

Feedfetcher-Google; (+http://www.google.com/feedfetcher.html; 1 subscribers; feed-id=728742641706423)

Feedfetcher is how Google crawls RSS or Atom feeds for Google News and PubSubHubbub. Feedfetcher stores and periodically refreshes feeds that are requested by users of an app or service. Only podcast feeds get indexed in Google Search; however, if a feed doesn't follow the Atom or RSS specification, it may still be indexed.

What the bot does

RSS & Summaries

Who runs the bot:

Content Aggregator & RSS

Mediapartners-Google
→

Mediapartners-Google

The AdSense crawler, called Mediapartners-Google, visits your site to determine its content in order to provide relevant ads.

What the bot does

Content & Sentiment Analysis

Who runs the bot:

Advertiser

APIs-Google
→

APIs-Google (+https://developers.google.com/webmasters/APIs-Google.html)

APIs-Google is the user agent used by Google APIs to deliver push notification messages. Application developers can request these notifications to avoid the need for continually polling Google's servers to find out if the resources they are interested in have changed. To make sure nobody abuses this service, Google requires developers to prove that they own the domain before allowing them to register a URL with a domain as the location where they want to receive messages.

What the bot does

Services & Virtual Assistants

Who runs the bot:

Content Aggregator & RSS

Business Intelligence

Access Control

GoogleOther
→

GoogleOther

GoogleOther is the generic crawler that may be used by various product teams for fetching publicly accessible content from sites. For example, it may be used for one-off crawls for internal research and development. It has no effect on Google Search or other products.

What the bot does

Services & Virtual Assistants

Research

Who runs the bot:

Unknown

Research

Business Intelligence

Yahoo! Slurp
→

Mozilla/5.0 (compatible; Yahoo! Slurp/3.0; http://help.yahoo.com/help/us/ysearch/slurp)

Slurp is the Yahoo Search robot for crawling and indexing web page information. Although some Yahoo Search results are powered by our partners, sites should allow Yahoo Slurp access in order to appear in Yahoo Mobile Search results. Additionally, Slurp does the following: Collects content from partner sites for inclusion within sites like Yahoo News, Yahoo Finance and Yahoo Sports. Accesses pages from sites across the Web to confirm accuracy and improve Yahoo's personalized content for our users.

What the bot does

Page Indexer

Services & Virtual Assistants

RSS & Summaries

Who runs the bot:

Content Aggregator & RSS

WGETbot
→

WGETbot/1.0 (+http://wget.alanreed.org)

The WGETbot is a bot that uses the WGET utility to fetch and retrieve website content, as well as automate web requests. It is designed to assist in tasks such as data scraping, website backup, and HTTP testing.

What the bot does

Services & Virtual Assistants

Content Scraping

Unknown

Who runs the bot:

Content Scraper

Unknown

Research

Business Intelligence

Python-urllib
→

Python-urllib/1.17

urllib is a package that collects several modules for working with URLs: urllib.request for opening and reading URLs urllib.error containing the exceptions raised by urllib.request urllib.parse for parsing URLs urllib.robotparser for parsing robots.txt files

What the bot does

Unknown

Developer Crawler Package

Who runs the bot:

Unknown

python-requests
→

python-requests/2.9.2

Requests is a simple, yet elegant, HTTP library. Requests allows you to send HTTP/1.1 requests extremely easily. There’s no need to manually add query strings to your URLs, or to form-encode your PUT & POST data — but nowadays, just use the json method! Requests is one of the most downloaded Python packages today, pulling in around 30M downloads / week— according to GitHub, Requests is currently depended upon by 1,000,000+ repositories. You may certainly put your trust in this code.

What the bot does

Unknown

Developer Crawler Package

Who runs the bot:

Unknown

Python
→

Python/3.9 aiohttp/3.7.3

Asynchronous HTTP Client/Server for asyncio and Python.

What the bot does

Unknown

Developer Crawler Package

Who runs the bot:

Unknown

2Bone_LinkChecker
→

2Bone_LinkChecker/1.0 libwww-perl/6.03

The 2Bone LinkChecker bot is a web crawler that appears to be checking the validity and integrity of links on websites. Its purpose is likely to detect and report broken links, orphaned links, and other potential issues with website connectivity.

What the bot does

Services & Virtual Assistants

Metrics & Site Analytics

Who runs the bot:

Unknown

adidxbot
→

adidxbot/1.1 (+http://search.msn.com/msnbot.htm)

AdIdxBot is the crawler used by Bing Ads. AdIdxBot crawls ads and follows the websites from those ads for quality control. Just like Bingbot, AdIdxBot has both “desktop” and “mobile” variants.

What the bot does

Content & Sentiment Analysis

Who runs the bot:

Business Intelligence

Advertiser