Isearch
Isearch is software for indexing and searching text documents. It supports full text and field based search, relevance ranked results, Boolean queries, and heterogeneous databases. Isearch can parse many kinds of documents "out of the box," including HTML, mail folders, list digests, SGML-style tagged data, and USMARC. It can be extended to support other formats by creating descendant classes in C++ that define the document structure. It is pretty easy to customize in this way, provided that you know some C++ (and you will need to ftp the source code). A CGI interface is also included for web based searching.
COMPILATION INSTRUCTIONS (unless you downloaded precompiled binaries): Type `make' to compile. That's it! (Make sure you have gcc and the g++ library installed first.)
INSTALLATION INSTRUCTIONS: Type `make install' to copy binaries into /usr/local/bin/.
MORE DOCUMENTATION:
http://www.etymon.com/Isearch/
This software was made possible by the National Science Foundation, MCNC/CNIDR, and others. I also need to acknowledge several people that contributed to the initial development phase of Isearch:
Jim Fullton
Erik Scott (Scott Technologies)
Kevin Gamiel (Island Edge Research)
Archie Warnock (A/WWW Enterprises)
Many other people have contributed in various ways. Thanks to all of you.
Nassib Nassar <nassar@etymon.com>
This material is based on work sponsored by the National Science Foundation under Cooperative Agreement No. NCR-9216963. The Government has certain rights in this material.
Any opinions, findings and conclusions or recommendations expressed in this publication are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
