GENUKI Maintainers' Pages

Version 1.15

How to Direct the Spider

Meta Statements

There will always be a few pages that need to be treated in a special way by the spider, and for which we don't want specific errors to be reported. For HTML files this is achieved by the use of one or more <meta> statements in the HTML code for each page that needs to be treated specially. The <meta> statements should be placed in the header section of the HTML page, and the name "genuki" has been chosen for use in all GENUKI meta statements.

The syntax of a GENUKI meta statement is as follows:

<meta name="genuki" content="genuki-directives">

where genuki-directives are one or more directives which instruct the spider to carry out the specified action with respect to the page.

The spider treats the GENUKI meta statement and its attributes in a case insensitive manner. The directives instructing the spider appear in the content attribute of the meta statement. If more than one directive is required, then the directives can be separated by spaces in the content attribute, or multiple meta statements can be used.

The GENUKI directives that the spider recognises are:

Examples of the use of GENUKI meta statements follow:

Href Class Attributes

The finest level of control over the spider is exercised at the link level where maintainers can direct the spider to treat the link in a non-standard way.

A link is specified in HTML as an anchor, the syntax for which is fully defined so the spider uses an existing anchor attribute defined for another purpose. The attribute chosen is the class attribute. Class is normally used to specify style sheets or javascript but this misuse is unlikely to cause problems: multiple classes can be specified if the need arises.

The classes and their meanings are as follows: