Stanford University

CS276A / SYMBSYS 239I / LING 239I
Text Information Retrieval, Mining, and Exploitation
Fall 2002


Installed Libraries README

Quick start:

Run 'uname -a' on the machine to find out whether it is running Sun/Solaris or Linux.
Make sure the output of 'echo $OSTYPE' agrees; otherwise the following steps won't work.
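
The platform check can also be scripted. The following is a minimal sketch for a Bourne-style shell (sh or bash), not the contents of setpaths.sh. It assumes that 'uname -s' prints SunOS on Solaris and Linux on Linux; the helper name arch_dir is hypothetical.

```shell
#!/bin/sh
# Map a kernel name (as printed by 'uname -s') to the directory name used
# under software/arch/. arch_dir is a hypothetical helper, not part of the
# course software.
arch_dir() {
    case "$1" in
        SunOS) echo solaris ;;
        Linux) echo linux ;;
        *)     echo unknown ;;
    esac
}

kernel=$(uname -s)
arch=$(arch_dir "$kernel")
# Print both values side by side so a mismatch with OSTYPE is easy to spot.
echo "kernel=$kernel arch=$arch OSTYPE=${OSTYPE:-unset}"
```

If the reported arch and OSTYPE disagree, the architecture-specific paths later in this README are the ones most likely to be wrong.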

To add all of the installed software to your paths, run the following command. You can add it to your .cshrc so that you don't have to rerun it each time you log in.

source /afs/ir/class/cs276a/software/setpaths.sh
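
After sourcing the script, one way to confirm that the paths point at real files is to walk each entry and test whether it exists. This is a sketch for a Bourne-style shell (sh or bash); check_path_list is a hypothetical helper and is not part of the course software.

```shell
#!/bin/sh
# Print each entry of a ':'-separated path list with a marker showing
# whether it exists on disk. check_path_list is a hypothetical helper.
check_path_list() {
    old_ifs=$IFS
    IFS=:
    # $1 is left unquoted on purpose so the shell splits it on ':'.
    for entry in $1; do
        if [ -e "$entry" ]; then
            echo "ok      $entry"
        else
            echo "missing $entry"
        fi
    done
    IFS=$old_ifs
}

check_path_list "${CLASSPATH:-}"
check_path_list "${LD_LIBRARY_PATH:-}"
```

An empty or unset variable simply produces no output.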

Longer explanation:

I've installed the following packages in the /afs/ir/class/cs276a/software/ directory. If you want to customize any of them, feel free to download it from the listed website and install it locally in your own directory. You may also download and install other free libraries in your own directories. Don't reinvent the wheel.

Descriptions of each of the separately installed packages follow. If you used the setpaths.sh script as described in the Quick start, you don't need to manually set the CLASSPATH and LD_LIBRARY_PATH entries below.

Lucene: Java text indexing library
Google API: Java API for accessing Google
Berkeley DB: Java API for the Berkeley DB database library
WebBase: Java API to Stanford WebBase crawl data
Java WordNet Library: Java API to WordNet

Lucene API:

http://jakarta.apache.org/lucene/
Local documentation

Installed in /afs/ir.stanford.edu/class/cs276a/software/lucene-1.2.

Add the following to CLASSPATH:

/afs/ir.stanford.edu/class/cs276a/software/lucene-1.2/lucene-1.2.jar
/afs/ir.stanford.edu/class/cs276a/software/lucene-1.2/lucene-demos-1.2.jar

These jars provide the indexing library and the demo programs, respectively.
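
If you set CLASSPATH by hand rather than through setpaths.sh, the two Lucene entries can be appended as sketched here. The sketch assumes a Bourne-style shell (sh or bash); in csh or tcsh you would use setenv instead. append_classpath and LUCENE are hypothetical names.

```shell
#!/bin/sh
# Append one entry to CLASSPATH, avoiding a stray leading ':' when the
# variable is empty. append_classpath is a hypothetical helper, not part
# of the course software.
append_classpath() {
    if [ -z "${CLASSPATH:-}" ]; then
        CLASSPATH="$1"
    else
        CLASSPATH="$CLASSPATH:$1"
    fi
    export CLASSPATH
}

LUCENE=/afs/ir.stanford.edu/class/cs276a/software/lucene-1.2
append_classpath "$LUCENE/lucene-1.2.jar"
append_classpath "$LUCENE/lucene-demos-1.2.jar"
echo "$CLASSPATH"
```

The same helper works for the other jar files listed in this README.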

Google API:

http://www.google.com/apis/
Local documentation

The Google API is installed in /afs/ir/class/cs276a/software/googleapi.

Students/groups need to obtain their own license keys from http://www.google.com/apis/. The API license allows 1,000 queries per day. If you plan to use the Google API, keep that limit in mind when designing your project.
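
To see what the daily limit means for a project plan, it helps to work out how many days a given number of queries would take. The sketch below does that arithmetic in a Bourne-style shell (sh or bash); days_needed is a hypothetical helper, and the figure of 2500 queries is only an illustration.

```shell
#!/bin/sh
# Days needed to issue a given number of queries under a per-day limit,
# rounding up. The limit defaults to 1000, the figure stated above.
# days_needed is a hypothetical helper.
days_needed() {
    total=$1
    per_day=${2:-1000}
    echo $(( (total + per_day - 1) / per_day ))
}

days_needed 2500   # prints 3
```

Caching query results locally is one way to keep the total down if the estimate comes out too high.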

Add the following to CLASSPATH:

/afs/ir/class/cs276a/software/googleapi/googleapi.jar

BerkeleyDB API:

http://www.sleepycat.com/
http://www.sleepycat.com/docs/ref/java/conf.html
Local documentation

Installed in /afs/ir/class/cs276a/software/arch/{solaris,linux}/site/berkeley-db/, where {solaris,linux} stands for the directory that matches your machine.

Add the following to CLASSPATH:

/afs/ir/class/cs276a/software/arch/{solaris,linux}/site/berkeley-db/lib/db.jar
/afs/ir/class/cs276a/software/arch/{solaris,linux}/site/berkeley-db/lib/dbexamples.jar

and the following to your LD_LIBRARY_PATH:

/afs/ir/class/cs276a/software/arch/{solaris,linux}/site/berkeley-db/lib/
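
The {solaris,linux} part of these paths has to be replaced by one platform name before use. The following sketch, for a Bourne-style shell (sh or bash), shows one way to do that by hand. The ARCH and BDB variable names are illustrative, and the default of linux is an assumption; set ARCH=solaris on a Sun machine.

```shell
#!/bin/sh
# ARCH selects the platform directory: 'linux' or 'solaris'.
# The variable names here are illustrative, not part of setpaths.sh.
ARCH=${ARCH:-linux}
BDB=/afs/ir/class/cs276a/software/arch/$ARCH/site/berkeley-db

# The jar files go on CLASSPATH; the library directory goes on
# LD_LIBRARY_PATH. ${VAR:+$VAR:} keeps any existing value and adds the
# ':' separator only when there is something to separate.
CLASSPATH="${CLASSPATH:+$CLASSPATH:}$BDB/lib/db.jar:$BDB/lib/dbexamples.jar"
LD_LIBRARY_PATH="${LD_LIBRARY_PATH:+$LD_LIBRARY_PATH:}$BDB/lib/"
export CLASSPATH LD_LIBRARY_PATH
echo "$LD_LIBRARY_PATH"
```

The same substitution applies to every other path in this README that contains {solaris,linux}.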

Stanford WebBase Crawl API:

Local documentation

Java interface installed in /afs/ir/class/cs276a/software/webbase-code/.

Add the following to your CLASSPATH:

/afs/ir/class/cs276a/software/webbase-code/

and the following to your LD_LIBRARY_PATH:

/afs/ir/class/cs276a/software/arch/{solaris,linux}/site/webbase/lib/
/afs/ir/class/cs276a/software/arch/{solaris,linux}/site/libwww/lib/

Java WordNet Library:

http://sourceforge.net/projects/jwordnet/
Local documentation
WordNet Documentation
WordNet Glossary

The Java WordNet Library (Java API for WordNet) is installed in /afs/ir/class/cs276a/software/jwnl/. The WordNet-1.7.1 distribution itself is installed in /afs/ir/class/cs276a/software/arch/{solaris,linux}/site/WordNet-1.7.1. Example usage of the JWNL API is in /afs/ir/class/cs276a/examples/. To get a quick glimpse of WordNet, you can start the WordNet Browser by running wnb.

Add the following to your CLASSPATH:

/afs/ir/class/cs276a/software/jwnl/jwnl.jar

Last modified: Sun Sep 29 18:49:35 PDT 2002