Web¶
Purpose¶
This plugin allows you to view website information, like the title of a page. Also provides a Snarfer for URLs.
Usage¶
Add the help for ‘help Web’ here.
Commands¶
doctype <url>
Returns the DOCTYPE string of <url>. Only HTTP urls are valid, of course.
fetch <url>
Returns the contents of <url>, or as much as is configured in supybot.plugins.Web.fetch.maximum. If that configuration variable is set to 0, this command will be effectively disabled.
headers <url>
Returns the HTTP headers of <url>. Only HTTP urls are valid, of course.
location <url>
If the <url> is redirected to another page, returns the URL of that page. This works even if there are multiple redirects. Only HTTP urls are valid. Useful to “un-tinify” URLs.
size <url>
Returns the Content-Length header of <url>. Only HTTP urls are valid, of course.
title [--no-filter] <url>
Returns the HTML <title>…</title> of a URL. If –no-filter is given, the bot won’t strip special chars (action, DCC, …).
urlquote <text>
Returns the URL quoted form of the text.
urlunquote <text>
Returns the text un-URL quoted.
Configuration¶
- supybot.plugins.Web.checkIgnored
This config variable defaults to “True”, is network-specific, and is channel-specific.
Determines whether the title snarfer checks if the author of a message is ignored.
- supybot.plugins.Web.fetch
This is a group of:
- supybot.plugins.Web.fetch.maximum
This config variable defaults to “0”, is not network-specific, and is not channel-specific.
Determines the maximum number of bytes the bot will download via the ‘fetch’ command in this plugin.
- supybot.plugins.Web.fetch.timeout
This config variable defaults to “5”, is not network-specific, and is not channel-specific.
Determines the maximum number of seconds the bot will wait for the site to respond, when using the ‘fetch’ command in this plugin. If 0, will use socket.defaulttimeout
- supybot.plugins.Web.nonSnarfingRegexp
This config variable defaults to “”, is network-specific, and is channel-specific.
Determines what URLs matching the given regexp will not be snarfed. Give the empty string if you have no URLs that you’d like to exclude from being snarfed.
- supybot.plugins.Web.public
This config variable defaults to “True”, is not network-specific, and is not channel-specific.
Determines whether this plugin is publicly visible.
- supybot.plugins.Web.snarfMultipleUrls
This config variable defaults to “False”, is network-specific, and is channel-specific.
Determines whether the title snarfer will query all URLs in a message, or only the first one.
- supybot.plugins.Web.snarferPrefix
This config variable defaults to “Title:”, is network-specific, and is channel-specific.
Determines the string used at before a web page’s title.
- supybot.plugins.Web.snarferReportIOExceptions
This config variable defaults to “False”, is network-specific, and is channel-specific.
Determines whether the bot will notfiy the user about network exceptions like hostnotfound, timeout ….
- supybot.plugins.Web.snarferShowDomain
This config variable defaults to “True”, is network-specific, and is channel-specific.
Determines whether domain names should be displayed by the title snarfer.
- supybot.plugins.Web.snarferShowTargetDomain
This config variable defaults to “False”, is network-specific, and is channel-specific.
Determines whether the domain name displayed by the snarfer will be the original one (posted on IRC) or the target one (got after following redirects, if any).
- supybot.plugins.Web.timeout
This config variable defaults to “5”, is not network-specific, and is not channel-specific.
Determines the maximum number of seconds the bot will wait for the site to respond, when using a command in this plugin other than ‘fetch’. If 0, will use socket.defaulttimeout
- supybot.plugins.Web.titleSnarfer
This config variable defaults to “False”, is network-specific, and is channel-specific.
Determines whether the bot will output the HTML title of URLs it sees in the channel.
- supybot.plugins.Web.urlWhitelist
This config variable defaults to “ “, is not network-specific, and is not channel-specific.
If set, bot will only fetch data from urls in the whitelist, i.e. starting with http://domain/optionalpath/. This will apply to all commands that retrieve data from user-supplied URLs, including fetch, headers, title, doctype.