Htdig

A search engine is configured for use with your website. The following information provides a quick guide to setting up a basic search page. More detailed documentation for this program is available here.

Before you can use the search engine, we need to add it to your web hosting account. Please send an email to support@anchor.net.au requesting to have htdig enabled on your account.

How to create a search page

The search page is generated using the following html:


  • <form method="get" action="/cgi-bin/htsearch">
        <input type="hidden" name="config" value="htdig" />
        <input type="hidden" name="method" value="and" />
        <input type="hidden" name="sort" value="score" />
        <input type="text" name="words" size="14" />
    </form>


You can also use the following options to restrict the pages which are returned in the search results, where URL is in the form 'http://www.yourdomain.com/non-search-area', it can include specific files or entire directories.


  • <input type="hidden" name="restrict" value="URL" />
    <input type="hidden" name="exclude" value="URL" />


How the search page works

The search engine uses an Open Source program called htdig (see www.htdig.org). Every day our servers automatically run the htdig and htmerge program which catalogue your website. So if you add new pages, or updated content it will not appear in the search results until the following day.

The search program reads a configuration file located in your account at /home/USERNAME/.htdig/htdig.conf, this directory also contains the database files used by the search engines. You can control the operation of the search engine by modifying the configuration file.

The files used to display the search results are located in /home/USERNAME/public_html/htdig/ results-header.html \& result-footer.html are used to present the search results. You can edit these files so that the results pages match the rest of your site.

results-template.html controls the formatting of each of the search results.

The images in the htdig directory are used on the results pages.

How do I stop pages on my site being indexed?

To exclude an entire file or directory add the following line to your htdig.conf file exclude_urls: filename

To exclude content within pages on your site you can add the following comments to your html.

''<!--htdig_nodindex-->content to be ignored goes here<!--/htdig_nodindex--> ''

For more information on configuring the search engine please refer to http://www.htdig.org.