The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine and ClueWeb09 dataset.
Search Engine Server Free
• Cross-platform, fast and modular code written in C++
• C++, Java and C# APIs
• Free and open-source software
• In use for over 6 years by a large and growing user community
• Sophisticated structured query languages (using InQuery and Indri)
• Support for XML and structured document retrieval
• Used commonly with a wide range of research test collections (e.g., TREC CDs 1-5, wt10g, RCV1, gov, gov2)
• Index your web pages with an "out-of-the-box" site search capability
• Interactive interfaces for Windows, Linux, and Web
• Distributed information retrieval and document clustering applications
The Lemur Toolkit is designed to facilitate research in language modeling and information retrieval (IR), where IR is broadly interpreted to include such technologies as ad hoc and distributed retrieval with structured queries, cross-language IR, summarization, filtering, and categorization. The system's underlying architecture was built to support these technologies. The toolkit ships with many useful sample applications, and it is designed so that users can easily program their own customizations and applications.
The Lemur include files are installed in /include, the Lemur library in /lib/liblemur.a, and the applications in /bin. To install only the applications, use make install-bin. To install only the library and header files, use make install-lib. If Lemur is configured with --enable-java, documentation for the Lemur JNI is installed in /share/lemur/JNIdoc, where the file index.html points into the Javadoc-generated documentation. Four additional jar files are installed:
• RetUI.jar provides a basic document retrieval GUI for interactive queries, using the Indri API.
• IndexUI.jar provides a basic collection indexing GUI for building an Indri repository.
• LemurRet.jar provides a basic document retrieval GUI for interactive queries, using the Lemur API.
• LemurIndex.jar provides a basic collection indexing GUI for building Lemur indexes.
All are installed in /share/lemur and can be run with.
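The install layout described above can be summarized as a small lookup, which may be handy for a post-install sanity check. This is a minimal sketch, not part of the Lemur distribution: the function names, the `prefix` argument, and the example prefix `/opt/lemur` are hypothetical, since the text does not state which directory the listed paths sit under. It also assumes that the four jar files are installed only alongside a --enable-java build, which the text does not say explicitly.

```python
import os
from pathlib import PurePosixPath

# Jar files named in the installation notes.
JAR_NAMES = ("RetUI.jar", "IndexUI.jar", "LemurRet.jar", "LemurIndex.jar")


def expected_install_paths(prefix, with_java=False):
    """Return the locations the installation notes describe, under `prefix`.

    `prefix` is a hypothetical parameter: the notes list /include, /lib and
    /bin without naming the directory they are relative to.
    """
    root = PurePosixPath(prefix)
    paths = {
        "headers": str(root / "include"),
        "library": str(root / "lib" / "liblemur.a"),
        "applications": str(root / "bin"),
    }
    if with_java:
        # Assumption: the jars accompany a build configured with --enable-java.
        share = root / "share" / "lemur"
        paths["jni_docs"] = str(share / "JNIdoc")
        paths["jars"] = [str(share / name) for name in JAR_NAMES]
    return paths


def missing_paths(paths):
    """List every expected path that does not exist on this machine."""
    flat = []
    for value in paths.values():
        flat.extend(value if isinstance(value, list) else [value])
    return [p for p in flat if not os.path.exists(p)]
```

For example, `missing_paths(expected_install_paths("/opt/lemur", with_java=True))` would return an empty list only if the headers, library, applications directory, JNI documentation, and all four jars are present under that hypothetical prefix.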