Sign in to see all reviews and comparisons. It's Free!
The ht://Dig system is a complete world wide web indexing and searching system for a domain or intranet. This system is not meant to replace the need for powerful internet-wide search systems like Lycos, Infoseek, Google and AltaVista.
Search Engine Server Free
• Intranet searching • ht://Dig has the ability to search through many servers on a network by acting as a WWW browser. • It is free • The whole system is released under the GNU General Public License • Robot exclusion is supported • The Standard for Robot Exclusion is supported by ht://Dig. • Boolean expression searching • Searches can be arbitrarily complex using boolean expressions. • Configurable search results • The output of a search can easily be tailored to your needs by means of providing HTML templates. • Fuzzy searching • Searches can be performed using various configurable algorithms. Currently the following algorithms are supported (in any combination): • exact • soundex • metaphone • common word endings (stemming) • synonyms • accent stripping • substring and prefix • Searching of HTML and text files • Both HTML documents and plain text files can be searched. Searching of other file types will be supported in future versions. • Keywords can be added to HTML documents • Any number of keywords can be added to HTML documents which will not show up when the document is viewed. This is used to make a document more like to be found and also to make it appear higher in the list of matches. • Email notification of expired documents • Special meta information can be added to HTML documents which can be used to notify the maintainer of those documents at a certain time. It is handy to get reminded when to remove the "New" images from a certain page, for example. • A Protected server can be indexed • ht://Dig can be told to use a specific username and password when it retrieves documents. This can be used to index a server or parts of a server that are protected by a username and password. • Searches on subsections of the database • It is easy to set up a search which only returns documents whose URL matches a certain pattern. This becomes very useful for people who want to make their own data searchable without having to use a separate search engine or database.
Small (<50 employees), Medium (50 to 1000 employees), Enterprise (>1001 employees)
• Intranet searching • ht://Dig has the ability to search through many servers on a network by acting as a WWW browser. • It is free • The whole system is released under the GNU General Public License • Robot exclusion is supported • The Standard for Robot Exclusion is supported by ht://Dig. • Boolean expression searching
What are the benefits?
• Full source code included • The search engine comes with full source code. The whole system is released under the terms and conditions of the GNU Public License version 2.0 • The depth of the search can be limited • Instead of limiting the search to a set of machines, it can also be restricted to documents that are a certain number of "mouse-clicks" away from the start document. • Full support for the ISO-Latin-1 character set • Both SGML entities like 'à' and ISO-Latin-1 characters can be indexed and searched.
Aggregated User Rating
Ease of use
Features & Functionality
Renew & Recommend
ht://Dig requires a configuration file and several HTML files to operate correctly.
Aggregated User Rating
You have rated this
The ht://Dig system is a complete world wide web indexing and searching system for a domain or intranet. This system is not meant to replace the need for powerful internet-wide search systems like Lycos, Infoseek, Google and AltaVista. Instead it is meant to cover the search needs for a single company, campus, or even a particular sub section of a web site. As opposed to some WAIS-based or web-server based search engines, ht://Dig can easily span several web servers. The type of these different web servers doesn't matter as long as they understand common protocols like HTTP. ht://Dig was developed under Unix using C++. For this reason, users will need a Unix machine, a C compiler and a C++ compiler. (The C compiler is needed to compile some of the GNU libraries). Unfortunately the developers only have access to a couple of different Unix machines. Most development is done on Linux systems with gcc/g++, but ht://Dig has been tested.
ht://Dig requires a configuration file and several HTML files to operate correctly. Fortunately, when ht://Dig is installed, a very reasonable configuration is created and in most cases only minor modifications to the files are necessary. The ht://Dig package can be built on the Win32 platform when using the Cygwin package. The ht://Dig project currently uses SourceForge for a mirror of the main website at htdig.sourceforge.net as well as a mirror of ht://Dig releases and contributed work. All ht://Dig programs feature a -v option to get some debugging output. The more of these options users put on the command line, the more output users will usually get.
PAT RESEARCH is a B2B discovery platform which provides Best Practices, Buying Guides, Reviews, Ratings, Comparison, Research, Commentary, and Analysis for Enterprise Software and Services. We provide Best Practices, PAT Index™ enabled product reviews and user review comparisons to help IT decision makers such as CEO’s, CIO’s, Directors, and Executives to identify technologies, software, service and strategies.