The following is a bibliography from an ACM SIGIR 2002 tutorial given by Bharat, Broder, Hawking, and Raghavan.

Bibliography

Abit97
S. Abiteboul, D. Quass, J. McHugh, and J. Wiener.
The Lorel query language for semistructured data.
International Journal on Digital Libraries, 1(1):68-88, 1997.
http://www-db.stanford.edu/pub/papers.

Albe99
R. Albert, H. Jeong, and A.-L. Barabasi.
Diameter of the world wide web.
Nature, 401:130-131, 1999.

Amen00
B. Amento, L. Terveen, and W. Hill.
Does "authority" mean quality? predicting expert quality ratings of web documents.
In Proceedings of ACM SIGIR'00, pages 296-303, Athens, Greece, 2000.

Amit98
E. Amitay.
Using common hypertext links to identify the best phrasal description of target web documents.
In ACM SIGIR 98 Workshop on Hypertext IR for the Web, Melbourne, 1998.

Amit00
E. Amitay and C. Paris.
Automatically summarizing web sites - is there a way around it?
In ACM 9th International Conference on Information and Knowledge Management (CIKM 2000), Washington, DC, 2000.

Aroc97
G. O. Arocena, A. O. Mendelzon, and G. A. Mihaila.
Applications of a Web query language.
Computer Networks and ISDN Systems, 29(8-13):1305-1315, 1997.
http://www.cs.toronto.edu/~websql/www-conf/wsql/PAPER267.html.

Babe97
Babel Team.
Web languages hit parade, June 1997.
http://babel.alis.com/palmares.html.

Baez99
R. Baeza-Yates and B. Ribeiro-Neto.
Modern Information Retrieval.
Addison-Wesley, 1999.

Bail01
P. Bailey, N. Craswell, and D. Hawking.
Engineering a multi-purpose test collection for web retrieval experiments.
In press. http://www.ted.cmis.csiro.au/~dave/cwc.ps.gz, 2001.

BarI02
J. Bar-Ilan.
Methods for measuring search engine performance over time.
JASIST, 53(4):308-319, 2002.
http://www.asis.org/Publications/JASIS/vol53n04.html.

BarY00
Z. Bar-Yossef, A. Berg, S. Chien, J. Fakcharoenphol, and D. Weitz.
Approximating aggregate queries about Web pages via random walks.
In VLDB 2000, Proceedings of 26th International Conference on Very Large Data Bases, September 10-14, 2000, Cairo, Egypt, pages 535-544. Morgan Kaufmann Publishers, 2000.
http://www.vldb.org/dblp/db/conf/vldb/Bar-YossefBCFW00.html.

Bara99
A. Barabasi and R. Albert.
Emergence of scaling in random networks.
Science, 286:509, 1999.

Bhar00b
K. Bharat.
Searchpad: explicit capture of search context to support web search.
In Proceedings of WWW9, pages 493-501, 2000.
http://www9.org/w9cdrom/173/173.html.

Bhar98a
K. Bharat and A. Broder.
A technique for measuring the relative size and overlap of public web search engines.
In Proceedings of WWW7, pages 379-388, 1998.
http://www-sor.inria.fr/mirrors/www7/programme/fullpapers/1937/com1937.htm.

Bhar98b
K. Bharat and A. Broder.
A technique for measuring the relative size and overlap of public web search engines.
In Proceedings of WWW7, pages 379-388, 1998.

Bhar99
K. Bharat and A. Broder.
Mirror, mirror on the web: A study of host pairs with replicated content.
In Proceedings of WWW8, pages 501-512, Toronto, 1999.
http://www8.org/w8-papers/4c-server/mirror/mirror.html.

Bhar00c
K. Bharat, A. Z. Broder, J. Dean, and M. R. Henzinger.
A comparison of techniques to find mirrored hosts on the WWW.
Journal of the American Society for Information Science, 51(12):1114-1122, 2000.

Bhar01b
K. Bharat, B. Chang, M. Henzinger, and M. Ruhl.
Who links to whom: Mining linkage between web sites.
In Proceedings of IEEE ICDM-01, pages 51-58, 2001.
http://theory.lcs.mit.edu/~ruhl/papers/2001-icdm.html.

Bhar98
K. Bharat and M. Henzinger.
Improved algorithms for topic distillation in a hyperlinked environment.
In Proceedings of ACM SIGIR'98, pages 104-111, 1998.

Bhar00
K. Bharat and G. Mihaila.
Hilltop: A search engine based on expert documents.
In Poster proceedings of WWW9, pages 72-73, 2000.

Boro01
A. Borodin, G. Roberts, J. Rosenthal, and P. Tsaparas.
Finding authorities and hubs from link structures on the WWW.
In Proceedings of WWW10, May 2001.
http://www10.org/cdrom/papers/pdf/p314.pdf.

Brin98
S. Brin and L. Page.
The anatomy of a large-scale hypertextual Web search engine.
In Proceedings of WWW7, pages 107-117, 1998.
http://www7.scu.edu.au/programme/fullpapers/1921/com1921.htm.

Brod98
A. Broder.
On the resemblance and containment of documents.
In Compression and Complexity of Sequences (SEQUENCES'97), pages 21-29. IEEE Computer Society, 1998.
ftp://ftp.digital.com/pub/DEC/SRC/publications/broder/positano-final-wpnums.pdf.

Brod97
A. Broder, S. Glassman, M. Manasse, and G. Zweig.
Syntactic clustering of the web.
In Proceedings of WWW6, pages 391-404, 1997.

Brod00
A. Broder, S. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Wiener.
Graph structure in the web: experiments and models.
In Proceedings of WWW9, Amsterdam, 2000.
http://www.www9.org/w9cdrom/160/160.html.

Bruz00
P. Bruza, R. McArthur, and S. Dennis.
Interactive internet search: keyword, directory and query reformulation mechanisms compared.
In Proceedings of ACM SIGIR'2000, Athens, Greece, 2000.

Buck00
C. Buckley and E. Voorhees.
Evaluating evaluation measure stability.
In Proceedings of ACM SIGIR'00, pages 33-40, Athens, Greece, 2000.

Call99
J. Callan, M. Connell, and A. Du.
Automatic discovery of language models for text databases.
In Proceedings of ACM SIGMOD'99, pages 479-490, New York, 1999.

Call95
J. P. Callan, Z. Lu, and W. B. Croft.
Searching distributed collections with inference networks.
In Proceedings of ACM SIGIR'95, pages 12-20, Seattle, WA, 1995.

Carr97
J. Carriere and R. Kazman.
Webquery: Searching and visualizing the web through connectivity.
In Proceedings of WWW6, 1997.
http://www.cgl.uwaterloo.ca/Projects/Vanish/webquery-1.html.

Chak98b
S. Chakrabarti, B. Dom, D. Gibson, R. Kumar, P. Raghavan, A. Tomkins, and S. Rajagopalan.
Spectral filtering for resource discovery.
In ACM SIGIR Workshop on Hypertext IR for the Web, Melbourne, 1998.

Chak99c
S. Chakrabarti, B. Dom, and P. Indyk.
Enhanced hypertext classification using hyperlinks.
In Proceedings of ACM SIGMOD'98, 1998.

Chak98
S. Chakrabarti, B. Dom, P. Raghavan, S. Rajagopalan, D. Gibson, and J. Kleinberg.
Automatic resource compilation by analyzing hyperlink structure and associated text.
In Proceedings of WWW7, pages 65-74, Brisbane, 1998.
http://www7.scu.edu.au/programme/fullpapers/1898/com1898.html.

Chak01
S. Chakrabarti, M. Joshi, and V. Tawde.
Enhanced topic distillation using text, markup tags, and hyperlinks.
In Proceedings of ACM SIGIR'2001, pages 208-216, New Orleans, USA, 2001.

Chak99
S. Chakrabarti, M. van den Berg, and B. Dom.
Focused crawling: A new approach to topic-specific web resource discovery.
In Proceedings of WWW8, Toronto, 1999.

Chi01
E. H. Chi, P. Pirolli, K. Chen, and J. Pitkow.
Using information scent to model user information needs and actions on the web.
In Proceedings of ACM CHI 2001, pages 490-497, Seattle, 2001.

Cho00
J. Cho and H. Garcia-Molina.
The evolution of the web and implications for an incremental crawler.
In Proceedings of the Twenty-sixth International Conference on Very Large Databases, 2000.
http://rose.cs.ucla.edu/~cho/papers/cho-evol.pdf.

Cho98
J. Cho, H. Garcia-Molina, and L. Page.
Efficient crawling through URL ordering.
In Proceedings of WWW7, pages 161-172, Brisbane, 1998.
http://www7.scu.edu.au/programme/fullpapers/1919/com1919.htm.

Cras99
N. Craswell, P. Bailey, and D. Hawking.
Is it fair to evaluate web systems using TREC ad hoc methods?
ACM SIGIR '99 Workshop on Web Retrieval, 1999.
http://pastime.anu.edu.au/nick/pubs/sigir99ws.ps.gz.

Cras00
N. Craswell, P. Bailey, and D. Hawking.
Server selection on the world wide web.
In Proceedings of the ACM Digital Libraries Conference, San Antonio, Texas, pages 37-46, June 2000.

Cras01
N. Craswell, D. Hawking, and K. Griffiths.
Which search engine is best at finding airline site home pages?
Technical Report 01/45, CSIRO Mathematical and Information Sciences, 2001.
http://www.ted.cmis.csiro.au/~nickc/pubs/airlines.pdf.

Cras01a
N. Craswell, D. Hawking, and S. Robertson.
Effective site finding using link anchor information.
In Proceedings of ACM SIGIR 2001, pages 250-257, New Orleans, 2001.
http://www.ted.cmis.csiro.au/nickc/pubs/sigir01.pdf.

Cras99b
N. Craswell, D. Hawking, and P. Thistlewaite.
Merging results from isolated search engines.
In Proceedings of the 10th Australasian Database Conference, Auckland, NZ, pages 189-200. Springer-Verlag, 1999.
http://www.ted.vic.cmis.csiro.au/~nickc/pubs/adc99.ps.gz.

WEBT01
CSIRO.
TREC web tracks home page.
http://www.ted.cmis.csiro.au/TRECWeb/, 2001.

Davi00
B. Davison.
Recognizing nepotistic links on the web.
In AAAI workshop on AI in web search, 2000.
http://archive.org/pub/aaai2000/BDavison2000.ps.

Davi00b
B. Davison.
Topical locality in the web.
In Proceedings of ACM SIGIR'2000, pages 272-279, Athens, Greece, 2000.
http://www.cs.rutgers.edu/~davison/pubs/2000/sigir/.

Davi00a
B. Davison.
Topical locality in the web: Experiments and observations.
Technical report, Department of Computer Science, Rutgers University, 2000.
http://www.cs.rutgers.edu/pub/technical-reports/dcs-tr-414.ps.Z.

Dean99
J. Dean and M. R. Henzinger.
Finding related pages in the World Wide Web.
Computer Networks (Amsterdam, Netherlands: 1999), 31(11-16):1467-1479, 1999.
http://research.compaq.com/SRC/WebArcheology/papers/companion.ps.

Deer90
S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman.
Indexing by latent semantic analysis.
Journal of the American Society for Information Science, 41(6):391-407, 1990.
http://www.si.umich.edu/%7Efurnas/POSTSCRIPTS/LSI.JASIS.paper.ps.

Drei97
D. Dreilinger and A. E. Howe.
Experiences with selecting search engines using metasearch.
ACM Transactions on Information Systems, 15(3):195-222, 1997.

Eggh90
L. Egghe and R. Rousseau.
Introduction to Informetrics.
Elsevier, 1990.

Ethn01
Ethnologue.
Ethnologue language name index.
http://www.ethnologue.com/language_index.asp.

Fagi00
R. Fagin, A. Karlin, J. Kleinberg, P. Raghavan, S. Rajagopalan, R. Rubinfeld, M. Sudan, and A. Tomkins.
Random walks with "back buttons" (extended abstract).
In Proceedings of STOC 2000, pages 484-493, 2000.

Frak92
W. Frakes.
Stemming algorithms.
In W. B. Frakes and R. Baeza-Yates, editors, Information Retrieval. Data Structures and Algorithms, pages 131-160. Prentice Hall, Upper Saddle River NJ, 1992.

Garf72
E. Garfield.
Citation analysis as a tool in journal evaluation.
Science, 178:471-479, 1972.

Gauc96
S. Gauch and G. Wang.
Information fusion with ProFusion.
In Proceedings of WebNet '96: The First World Conference of the Web Society, pages 174-179, October 1996.
Also at http://www.designlab.ukans.edu/ProFusion.html.

Gilb97
N. Gilbert.
A simulation of the structure of academic science.
Sociological Research Online, 2(2), 1997.

Gord99
M. Gordon and P. Pathak.
Finding information on the world wide web: The retrieval effectiveness of search engines.
Information Processing and Management, 35(2):141-180, March 1999.

Grav97
L. Gravano, K. Chang, H. Garcia-Molina, C. Lagoze, and A. Paepcke.
STARTS - Stanford protocol proposal for internet retrieval and search.
http://www-db.stanford.edu/~gravano/starts.html, January 1997.

Harm92
D. Harman.
Evaluation issues in information retrieval.
Information Processing and Management, 28(4):439-440, 1992.

Harm95
D. Harman.
The TREC conferences.
In R. Kuhlen and M. Rittberger, editors, Proceedings of HIM 95, 1995.

Have02
T. H. Haveliwala.
Topic-sensitive PageRank.
In Proceedings of WWW2002, Honolulu, May 2002.
http://www2002.org/CDROM/refereed/127/.

Hawk01
D. Hawking, N. Craswell, P. Bailey, and K. Griffiths.
Measuring search engine quality.
Information Retrieval, 4(1):33-59, 2001.
Pre-press version at http://www.ted.cmis.csiro.au/~dave/INRT83-00.ps.gz.

Hawk01a
D. Hawking, N. Craswell, and K. Griffiths.
Which search engine is best at finding online services?
In Poster Proceedings of WWW10, May 2001.
http://www.ted.cmis.csiro.au/~dave/www10poster.pdf.

Hawk99a
D. Hawking, N. Craswell, P. Thistlewaite, and D. Harman.
Results and challenges in web search evaluation.
Proceedings of WWW8, 31:1321-1330, 1999.
http://www8.org/w8-papers/2c-search-discover/results/results.html.

Hawk02
D. Hawking and S. Robertson.
On collection size and retrieval effectiveness.
Information Retrieval, To appear.

Hawk99b
D. Hawking and P. Thistlewaite.
Methods for information server selection.
ACM Transactions on Information Systems, 17(1):40-76, 1999.

Hawk99
D. Hawking, E. Voorhees, N. Craswell, and P. Bailey.
Overview of TREC-8 web track.
In Proceedings of TREC-8, pages 131-150, Gaithersburg MD, November 1999.
http://trec.nist.gov/pubs/trec8/t8_proceedings.html.

Hear95
M. Hearst.
Tilebars: Visualization of term distribution in full text information access.
In Proceedings of ACM SIGCHI'95, pages 59-66, Denver, 1995.

Henz99
M. R. Henzinger, A. Heydon, M. Mitzenmacher, and M. Najork.
Measuring index quality using random walks on the web.
In Proceedings of the Eighth International World-Wide Web Conference, 1999.
http://www9.org/w9cdrom/88/88.html.

Hube98
B. Huberman, P. Pirolli, J. Pitkow, and R. Lukose.
Strong regularities in world wide web surfing.
Science, 280:95-97, 1998.

Hull96
D. Hull.
Stemming algorithms: A case study for detailed evaluation.
Journal of the American Society for Information Science, 47:70-84, 1996.

Jans98
B. J. Jansen, A. Spink, J. Bateman, and T. Saracevic.
Real life information retrieval: A study of user queries on the Web.
ACM SIGIR Forum, 32(1):5-17, 1998.

Jarv00
K. Järvelin and J. Kekäläinen.
IR methods for retrieving highly relevant documents.
In Proceedings of ACM SIGIR'00, pages 41-48, Athens, Greece, 2000.

Bhar01
K. Bharat and G. Mihaila.
When experts agree: Using non-affiliated experts to rank popular topics.
In Proceedings of WWW10, Hong Kong, 2001.
(To appear in ACM TOIS.) http://www10.org/cdrom/papers/474/.

Kess63
M. M. Kessler.
Bibliographic coupling between scientific papers.
American Documentation, 14, 1963.

Kirs97
S. T. Kirsch.
Distributed search patent.
U.S. Patent 5,659,732, August 1997.
Infoseek Corporation. http://software.infoseek.com/patents/dist_search/patents.htm.

Kist98
T. Kistler and H. Marais.
WebL - a programming language for the web.
In Proceedings of WWW7, pages 259-270, Brisbane, 1998.
http://research.compaq.com/SRC/WebL/related/papers/www7/paper.html.

Klei98
J. Kleinberg.
Authoritative sources in a hyperlinked environment.
In Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms, pages 668-677, 1998.
http://www.cs.cornell.edu/home/kleinber/auth.ps. Also in Journal of the ACM, 46(5):604-632, 1999.

Kuma00
S. Kumar, P. Raghavan, S. Rajagopalan, D. Sivakumar, A. Tomkins, and E. Upfal.
The web as a graph: Measurements, models and methods.
In Proceedings of ACM Symposium on Principles of Database Systems, pages 1-10, 2000.

Kuma99b
S. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins.
Extracting large-scale knowledge bases from the web.
In Proceedings of VLDB, pages 639-650, 1999.

Kuma99
S. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins.
Trawling the web for emerging cyber communities.
In Proceedings of WWW8, Toronto, 1999.

Lars96
R. Larson.
Bibliometrics of the world wide web: An exploratory analysis of the intellectual structure of cyberspace.
In Annual Meeting of the American Society for Information Science, 1996.

Lawr98b
S. Lawrence and C. Giles.
Inquirus, the NECI meta search engine.
In Proceedings of WWW7, pages 95-105, 1998.

Lawr98
S. Lawrence and C. Giles.
Searching the world wide web.
Science, 280:98-100, Apr. 1998.

Lawr99b
S. Lawrence and C. Giles.
Accessibility of information on the web.
Nature, 400:107-109, 8 July 1999.

Lawr99
S. Lawrence, C. Giles, and K. Bollacker.
Digital libraries and autonomous citation indexing.
IEEE Computer, 32(6):67-71, 1999.
http://www.neci.nj.nec.com/homepages/lawrence/citeseer.html.

LeCa2000
A. LeCalve and J. Savoy.
Database merging strategy based on logistic regression.
Information Processing and Management, 36(3):341-359, 2000.

Lotk26
A. Lotka.
The frequency distribution of scientific productivity.
Journal of the Washington Academy of Sciences, 16(12):317-323, 1926.

Marc97
M. Marchiori.
The quest for correct information on the web: Hyper search engines.
In Proceedings of WWW6, Santa Clara, 1997.

Mcbr94
O. McBryan.
GENVL and WWWW: Tools for taming the web.
In Proceedings of WWW1, 1994.

Mend97
A. Mendelzon, G. Mihaila, and T. Milo.
Querying the world wide web.
International Journal on Digital Libraries, 1(1):54-67, 1997.

Mill56
G. Miller.
The magical number seven, plus or minus two: Some limits on our capacity for processing information.
The Psychological Review, 63:81-97, 1956.
Reproduced at: http://www.well.com/user/smalin/miller.html.

Netc02
Netcraft.
Netcraft web server survey.
http://www.netcraft.com/survey.

TREC01
NIST.
TREC home page.
http://trec.nist.gov/, 2001.

Note02
G. Notess.
Search engine showdown.
http://notess.com/.

ONei97
E. T. O'Neill, P. D. McClain, and B. F. Lavoie.
A methodology for sampling the world wide web.
Technical report, OCLC Annual Review of Research, 1997.
http://www.oclc.org/research/publications/arr/1997/oneill/o0213.htm.

Page98
L. Page, S. Brin, R. Motwani, and T. Winograd.
The PageRank citation ranking: Bringing order to the Web.
Technical report, Stanford Digital Library Technologies Project, 1998.
http://www-db.stanford.edu/~backrub/pageranksub.ps.

Pa1897
V. Pareto.
Cours d'économie politique.
Rouge, Lausanne and Paris, 1897.

Piro96
P. Pirolli, J. Pitkow, and R. Rao.
Silk from a sow's ear: Extracting usable structures from the web.
In Proceedings of CHI 96, pages 118-125, 1996.

Pitk97
J. Pitkow.
Characterizing World Wide Web ecologies.
PhD thesis, Georgia Institute of Technology, June 1997.

Port80
M. Porter.
An algorithm for suffix stripping.
Program, 14:130-137, 1980.

Rade02
D. R. Radev, K. Libner, and W. Fan.
Getting answers to natural language questions on the web.
JASIST, 53(5), 2002.
http://www.asis.org/Publications/JASIS/vol53n05.html.

Rand01
K. Randall, R. Stata, R. Wickremesinghe, and J. Wiener.
The link database: Fast access to graphs of the web.
Technical Report Research Report 175, Compaq, Systems Research Center, Palo Alto, CA, 2001.
http://gatekeeper.research.compaq.com/pub/DEC/SRC/research-reports/abstracts/src-rr-175.html.

Raso02
Y. Rasolofo, D. Hawking, and J. Savoy.
Result merging strategies for a current news metasearcher.
In submission, 2002.

Rocc71
J. Rocchio.
Relevance feedback in information retrieval.
In G. Salton, editor, The SMART System: Experiments in Automatic Document Processing, pages 313-323. Prentice Hall, Englewood Cliffs, NJ, 1971.

Rusm01
P. Rusmevichientong, D. M. Pennock, S. Lawrence, and C. L. Giles.
Methods for sampling pages uniformly from the world wide web.
In AAAI Fall Symposium on Using Uncertainty Within Computation, pages 121-128, 2001.

Sala99
M. Salampasis and J. Tait.
A link-based collection fusion strategy.
Information Processing and Management, 35(5):691-711, 1999.

Salt88
G. Salton and C. Buckley.
Term-weighting approaches in automatic text retrieval.
Information Processing and Management, 24:513-523, 1988.

Salt90
G. Salton and C. Buckley.
Improving retrieval performance by relevance feedback.
Information Processing and Management, 26:73-92, 1990.

Savo97
J. Savoy.
Statistical inference in retrieval effectiveness evaluation.
Information Processing and Management, 33(4):495-512, 1997.

Schu97
H. Schütze and C. Silverstein.
Projections for efficient document clustering.
In Proceedings of ACM SIGIR'97, pages 74-81, Philadelphia, 1997.

Selb95
E. Selberg and O. Etzioni.
Multi-service search and comparison using the MetaCrawler.
In Proceedings of WWW4, Boston MA, 1995.

Shak97
J. Shakes, M. Langheinrich, and O. Etzioni.
Dynamic reference sifting: A case study in the homepage domain.
In Proceedings of WWW6, pages 1193-1204, Santa Clara, 1997.
http://huskysearch.cs.washington.edu:6060/doc/presentation/.

Shiv99b
N. Shivakumar and H. Garcia-Molina.
Finding near-replicas of documents on the web.
In WEBDB: International Workshop on the World Wide Web and Databases, WebDB. LNCS, 1999.
http://www-db.stanford.edu/~shiva/Pubs/web.ps.

Shiv99
N. Shivakumar, J. Cho, and H. Garcia-Molina.
Finding replicated web collections.
Technical report, Department of Computer Science, Stanford University, 1999.
http://www-db.stanford.edu/pub/papers/cho_mirror.ps.

Shiv95
N. Shivakumar and H. Garcia-Molina.
SCAM: A copy detection mechanism for digital documents.
In Proceedings of ACM DL'95, Austin TX, 1995.

Silv98
C. Silverstein, M. Henzinger, H. Marais, and M. Moricz.
Analysis of a very large web search engine query log.
SIGIR Forum, 33(1):6-12, 1999.
Previously available as Digital Systems Research Center TR 1998-014 at http://www.research.digital.com/SRC.

Sing01
A. Singhal and M. Kaszkiel.
A case study in web search using TREC algorithms.
In Proceedings of WWW10, pages 708-716, Hong Kong, 2001.
http://www.www10.org/cdrom/papers/pdf/p317.pdf.

Smal73
H. Small.
Co-citation in the scientific literature: A new measure of the relationship between two documents.
Journal of the American Society for Information Science, 24, 1973.

Spar98
K. Sparck Jones, S. Walker, and S. Robertson.
A probabilistic model of information retrieval: Development and status.
Technical Report TR 446, Cambridge University Computer Laboratory, September 1998.

Sper97
E. Spertus.
Parasite: Mining structural information on the web.
In Proceedings of WWW6, Santa Clara, 1997.
http://www6.nttlabs.com/HyperNews/get/PAPER206.html.

Spin02
A. Spink, editor.
Special Issue on Web Research.
JASIST, volume 53. Pergamon Press, 2002.
http://www.asis.org/Publications/JASIS/vol53n02.html.

Spin02a
A. Spink.
A user-centered approach to evaluating human interaction with web search engines: an exploratory study.
Information Processing and Management, 38(3):401-426, 2002.

Spin00
A. Spink and J. Qin, editors.
Special Issue on Web-based Information Retrieval Research.
Information Processing and Management, volume 36. Pergamon Press, 2000.

Stat00
R. Stata, K. Bharat, and F. Maghoul.
The term vector database: fast access to indexing terms for web pages.
In Proceedings of WWW9, pages 247-255, Amsterdam, 2000.
http://www.www9.org/w9cdrom/159/159.html.

Sten99
D. Stenmark.
Method for intranet search engine evaluations.
In Proceedings of IRIS22, Department of CS/IS, University of Jyväskylä, Finland, August 1999.
http://w3.informatik.gu.se/~dixi/publ/method.pdf.

Stev46
S. Stevens.
On the theory of scales of measurement.
Science, 103(2684):677-680, 1946.

Trav01
R. Travis and A. Broder.
Web search quality vs. informational relevance.
In Proceedings of the 2001 Infonortics Search Engines Meeting, Boston, 2001.
http://www.infonortics.com/searchengines/sh01/slides-01/travis.html.

Turp01
A. Turpin and W. Hersh.
Why batch and user evaluations do not give the same results.
In Proceedings of ACM SIGIR'01, New Orleans, LA, 2001.

Voor98b
E. Voorhees.
Using WordNet for text retrieval.
In C. Fellbaum, editor, WordNet: An Electronic Lexical Database, pages 285-303. The MIT Press, Cambridge MA, 1998.

Voor98
E. Voorhees.
Variations in relevance judgments and the measurement of retrieval effectiveness.
In Proceedings of ACM SIGIR'98, pages 315-323, 1998.
http://www.itl.nist.gov/iaui/894.02/works/papers/sigir98.dvi.ps.

Voor01
E. Voorhees.
Evaluation by highly relevant documents.
In Proceedings of ACM SIGIR'01, New Orleans, LA, 2001.

Voor95
E. M. Voorhees, N. K. Gupta, and B. Johnson-Laird.
Learning collection fusion strategies.
In Proceedings of ACM SIGIR'95, pages 172-179, Seattle, WA, 1995.

Whit89
H. White and K. McCain.
Bibliometrics.
In Annual Review of Information Science and Technology, pages 119-186. Elsevier, 1989.

W3C01
World Wide Web Consortium.
W3C internationalization/localization: Character sets supported by popular web applications.
http://www.w3.org/International/O-charset-list.html.

Yule44
G. Yule.
Statistical Study of Literary Vocabulary.
Cambridge University Press, 1944.

Zami98
O. Zamir and O. Etzioni.
Web document clustering: A feasibility demonstration.
In Proceedings of ACM SIGIR'98, pages 46-54, Melbourne, 1998.

Zami99
O. Zamir and O. Etzioni.
Grouper: A dynamic clustering interface to web search results.
In Proceedings of WWW8, pages 1361-1374, Toronto, 1999.

Zipf49
G. Zipf.
Human Behavior and the Principle of Least Effort.
Addison-Wesley, 1949.