Bigdata
Now Reading
ht-//Dig
0
Review

ht-//Dig

Overview
Synopsis

The ht://Dig system is a complete world wide web indexing and searching system for a domain or intranet. This system is not meant to replace the need for powerful internet-wide search systems like Lycos, Infoseek, Google and AltaVista.

Category

Search Engine Server Free

Features

• Intranet searching
• ht://Dig has the ability to search through many servers on a network by acting as a WWW browser.
• It is free
• The whole system is released under the GNU General Public License
• Robot exclusion is supported
• The Standard for Robot Exclusion is supported by ht://Dig.
• Boolean expression searching
• Searches can be arbitrarily complex using boolean expressions.
• Configurable search results
• The output of a search can easily be tailored to your needs by means of providing HTML templates.
• Fuzzy searching
• Searches can be performed using various configurable algorithms. Currently the following algorithms are supported (in any combination):
• exact
• soundex
• metaphone
• common word endings (stemming)
• synonyms
• accent stripping
• substring and prefix
• Searching of HTML and text files
• Both HTML documents and plain text files can be searched. Searching of other file types will be supported in future versions.
• Keywords can be added to HTML documents
• Any number of keywords can be added to HTML documents which will not show up when the document is viewed. This is used to make a document more like to be found and also to make it appear higher in the list of matches.
• Email notification of expired documents
• Special meta information can be added to HTML documents which can be used to notify the maintainer of those documents at a certain time. It is handy to get reminded when to remove the "New" images from a certain page, for example.
• A Protected server can be indexed
• ht://Dig can be told to use a specific username and password when it retrieves documents. This can be used to index a server or parts of a server that are protected by a username and password.
• Searches on subsections of the database
• It is easy to set up a search which only returns documents whose URL matches a certain pattern. This becomes very useful for people who want to make their own data searchable without having to use a separate search engine or database.

License

Open Source

Price

Free

Pricing

Subscription

Free Trial

Available

Users Size

Small (<50 employees), Medium (50 to 1000 Enterprise (>1001 employees)

Website
Company

ht-//Dig

What is best?

• Intranet searching
• ht://Dig has the ability to search through many servers on a network by acting as a WWW browser.
• It is free
• The whole system is released under the GNU General Public License
• Robot exclusion is supported
• The Standard for Robot Exclusion is supported by ht://Dig.
• Boolean expression searching

What are the benefits?

• Full source code included
• The search engine comes with full source code. The whole system is released under the terms and conditions of the GNU Public License version 2.0
• The depth of the search can be limited
• Instead of limiting the search to a set of machines, it can also be restricted to documents that are a certain number of "mouse-clicks" away from the start document.
• Full support for the ISO-Latin-1 character set
• Both SGML entities like 'à' and ISO-Latin-1 characters can be indexed and searched.

PAT Rating™
Editor Rating
Aggregated User Rating
Rate Here
Ease of use
7.6
9.1
Features & Functionality
7.6
8.8
Advanced Features
7.6
9.2
Integration
7.6
9.2
Performance
7.6
9.1
Training
9.1
Customer Support
7.6
Implementation
Renew & Recommend
Bottom Line

ht://Dig requires a configuration file and several HTML files to operate correctly.

7.6
Editor Rating
9.1
Aggregated User Rating
1 rating
You have rated this

The ht://Dig system is a complete world wide web indexing and searching system for a domain or intranet. This system is not meant to replace the need for powerful internet-wide search systems like Lycos, Infoseek, Google and AltaVista. Instead it is meant to cover the search needs for a single company, campus, or even a particular sub section of a web site. As opposed to some WAIS-based or web-server based search engines, ht://Dig can easily span several web servers. The type of these different web servers doesn't matter as long as they understand common protocols like HTTP. ht://Dig was developed under Unix using C++. For this reason, users will need a Unix machine, a C compiler and a C++ compiler. (The C compiler is needed to compile some of the GNU libraries). Unfortunately the developers only have access to a couple of different Unix machines. Most development is done on Linux systems with gcc/g++, but ht://Dig has been tested.

ht://Dig requires a configuration file and several HTML files to operate correctly. Fortunately, when ht://Dig is installed, a very reasonable configuration is created and in most cases only minor modifications to the files are necessary. The ht://Dig package can be built on the Win32 platform when using the Cygwin package. The ht://Dig project currently uses SourceForge for a mirror of the main website at htdig.sourceforge.net as well as a mirror of ht://Dig releases and contributed work. All ht://Dig programs feature a -v option to get some debugging output. The more of these options users put on the command line, the more output users will usually get.

 

Filter reviews
User Ratings





User Company size



User role





User industry





Ease of use
Features & Functionality
Advanced Features
Integration
Performance
Training
Customer Support
Implementation
Renew & Recommend

What's your reaction?
Love It
0%
Very Good
0%
INTERESTED
0%
COOL
0%
NOT BAD
0%
WHAT !
0%
HATE IT
0%