The Hide
, Group
and Ignore
and
Include
keywords allow you to change the way
Sites
, URLs
, Referrers
, User Agents
and Usernames
are manipulated.
The Ignore,Include
keywords will cause The Webalizer to completely
ignore records as if they didn't exist (and thus not counted in the main site
totals).
The Hide
keywords will prevent things from being displayed
in the 'Top' tables, but will still be counted in the main totals.
The Group
keywords allow
grouping similar objects as if they were one. Grouped records are displayed
in the 'Top' tables and can optionally be displayed in BOLD and/or shaded.
Groups cannot be hidden, and are not counted in the main totals. The
Group
options do not, by default, hide all the items that it
matches. If you want to hide the records that match (so just the grouping
record is displayed), follow with an identical Hide
* keyword
with the same value. (see example below) In addition, Group* keywords may
have an optional label which will be displayed instead of the keywords value.
The label should be seperated from the value by at least one 'white-space'
character, such as a space or tab.
The value can have either a leading or trailing '*' wildcard character. If no wildcard is found, a match can occur anywhere in the string. Given a string "www.yourmama.com", the values "your", "*mama.com" and "www.your*" will all match.
# Your own site should be hidden #HideSite *mrunix.net #HideSite localhostYour own site gives most referrals
#HideReferrer mrunix.net/This one hides non-referrers ("-" Direct requests)
#HideReferrer Direct Request
# Usually you want to hide these HideURL *.gif HideURL *.GIF HideURL *.jpg HideURL *.JPG HideURL *.png HideURL *.PNG HideURL *.raHiding agents is kind of futile
#HideAgent RealPlayer# You can also hide based on authenticated username
#HideUser root #HideUser admin# HideAllSites allows forcing individual sites to be hidden in the report. This is particularly useful when used in conjunction with the "GroupDomain" feature, but could be useful in other situations as well, such as when you only want to display grouped sites (with the GroupSite keywords...).
#HideAllSites no
#GroupURL /cgi-bin/* CGI Scripts #GroupURL /images/* Images #GroupSite *.aol.com #GroupSite *.compuserve.com #GroupReferrer yahoo.com/ Yahoo! #GroupReferrer excite.com/ Excite #GroupReferrer infoseek.com/ InfoSeek #GroupReferrer webcrawler.com/ WebCrawler #GroupUser root Admin users #GroupUser admin Admin users #GroupUser wheel Admin users# The following is a great way to get an overall total for browsers, and not display all the detail records. (You should use MangleAgent to refine further...)
#GroupAgent MSIE Micro$oft Internet Exploder #HideAgent MSIE #GroupAgent Mozilla Netscape #HideAgent Mozilla #GroupAgent Lynx* Lynx #HideAgent Lynx*The GroupDomains keyword allows you to group individual hostnames into their respective domains. The value specifies the level of grouping to perform, and can be thought of as 'the number of dots' that will be displayed. For example, if a visiting host is named cust1.tnt.mia.uu.net, a domain grouping of 1 will result in just "uu.net" being displayed, while a 2 will result in "mia.uu.net".
#GroupDomains 0The GroupShading allows grouped rows to be shaded in the report. Useful if you have lots of groups and individual records that intermingle in the report, and you want to diferentiate the group records a little more.
#GroupShading yes# GroupHighlight allows the group record to be displayed in BOLD.
#GroupHighlight yesThe Ignore keywords allow you to completely ignore log records based on hostname, URL, user agent, referrer or username. I hessitated in adding these, since the Webalizer was designed to generate _accurate_ statistics about a web servers performance. By choosing to ignore records, the accuracy of reports become skewed, negating why I wrote this program in the first place. However, due to popular demand, here they are. Use the same as the Hide* keywords, where the value can have a leading or trailing wildcard '*'. Use at your own risk ;)
#IgnoreSite bad.site.net #IgnoreURL /test* #IgnoreReferrer file:/* #IgnoreAgent RealPlayer #IgnoreUser root# The Include keywords allow you to force the inclusion of log records based on hostname, URL, user agent, referrer or username. They take precidence over the Ignore keywords. Note: Using Ignore/Include combinations to selectivly process parts of a web site is _extremely inefficent_!!! Avoid doing so if possible (ie: grep the records to a seperate file if you really want that kind of report).
# Example: Only show stats on Joe User's pages...
#IgnoreURL * #IncludeURL ~joeuser*# Or based on an authenticated username
#IgnoreUser * #IncludeUser someuser
#MangleAgents 0The SearchEngine keywords allow specification of search engines and their query strings on the URL. These are used to locate and report what search strings are used to find your site. The first word is a substring to match in the referrer field that identifies the search engine, and the second is the URL variable used by that search engine to define it's search terms.
SearchEngine yahoo.com p= SearchEngine altavista.com q= SearchEngine google.com q= SearchEngine eureka.com q= SearchEngine lycos.com query= SearchEngine hotbot.com MT= SearchEngine msn.com MT= SearchEngine infoseek.com qt= SearchEngine webcrawler searchText= SearchEngine excite search= SearchEngine netscape.com search= SearchEngine mamma.com query= SearchEngine alltheweb.com query= SearchEngine northernlight.com qr=For a full list see ftp://ftp.mrunix.net/pub/webalizer/sample.conf