LSI-Related Publications (Articles, Reports, and Books)

(updated 11/21/08)

Computational Methods for Intelligent Information Access
M.W. Berry, S.T. Dumais, and T.A. Letsche.
Proceedings of Supercomputing'95, San Diego, CA, December 1995.

ut-cs-95-284
Low-Rank Orthogonal Decompositions for Information Retrieval Applications.
Michael W. Berry and Ricardo D. Fierro, April 1995. Numerical Linear Algebra with Applications 3:4 (1996), pp. 301-328.

ut-cs-95-271
A Case Study of Latent Semantic Indexing.
M.W. Berry, S.T. Dumais, and A.T. Shippy, January 1995.

ut-cs-94-270
Using Linear Algebra for Intelligent Information Retrieval.
Michael W. Berry and Susan T. Dumais, and Gavin W. O'Brien, December 1994. Published in SIAM Review 37:4 (1995), pp. 573-595.

ut-cs-94-264
XLSI - A Graphical User Interface for a Conceptual Retrieval System.
Susan C. Allen, December 1994.

ut-cs-94-259
Cross-Language Information Retrieval Using Latent Semantic Indexing.
Paul G. Young, October 1994.

ut-cs-94-259
Information Management Tools for Updating an SVD-Encoded Indexing Scheme.
Gavin W. O'Brien, October 1994.

ut-cs-93-194
SVDPACKC (Version 1.0) User's Guide.
Michael Berry, Theresa Do, Gavin O'Brien, Vijay Krishna, and Sowmini Varadhan, April 1993.

umcp-csd:cs-tr-3713
Large Latent Semantic Indexing via a Semi-Discrete Matrix Decomposition.
T.G. Kolda and D.P. O'Leary. Technical Report No. UMCP-CSD CS-TR-3713, Department of Computer Science, Univ. of Maryland,
November 1996.

umcp-csd:cs-tr-3724
A Semi-Discrete Matrix Decomposition for Latent Semantic Indexing in Information Retrieva T.G. Kolda and D.P. O'Leary. Technical Report No. UMCP-CSD CS-TR-3724, Department of Computer Science, Univ. of Maryland, December 1996.

Large-Scale Information Retrieval with Latent Semantic Indexing
T.A. Letsche and M.W. Berry.
Information Sciences - Applications 100 (1997), pp. 105-137.

MS Thesis,
Toward Large-Scale Information Retrieval Using Latent Semantic Indexing. Todd A. Letsche, Department of Computer Science, University of Tennessee, August 1996.

Ph.D. Dissertation
Adaptive Vector Space Text Filtering for Monolingual and Cross-Language Applications, D. Oard, Department of Electrical Engineering, Univ. of Maryland, August 1996.

MS Thesis,
Using Latent Semantic Indexing for Data Mining.
Jingqian Jiang, Department of Computer Science, University of Tennessee, December 1997.

CSE-97-011
On Updating Problems in Latent Semantic Indexing, H. Simon and Hongyuan Zha. Technical Report No. CSE-97-011, Department of Computer Science and Engineering, Pennsylvania State University, 1997.

CSE-98-002
A Subspace-Based Model for Information Retrieval with Applications in Latent Semantic Indexing, Hongyuan Zha. Technical Report No. CSE-98-002, Department of Computer Science and Engineering, Pennsylvania State University, 1998.

CSE-98-012
On Matrices with Low-rank-plus-shift Structures: Partial SVD and Latent Semantic Indexing Hongyuan Zha and Zhenyue Zhang, Technical Report No. CSE-98-012, Department of Computer Science and Engineering, Pennsylvania State University, 1998.

MS Thesis,
Downdating the Latent Semantic Indexing Model for Information Retrieval
Dian I. Witter, Department of Computer Science, University of Tennessee, December 1997.

Senior Thesis,
Evaluation of a PC-based LSI Search Engine.
Safeer Ladha, Department of Computer Science, University of Tennessee, April 1998.

PhD. Dissertation,
Information Retrieval and Filtering Using the Riemannian SVD
Eric P. Jiang, Department of Computer Science, University of Tennessee, August 1998.

Conference Paper
Information Filtering Using the Riemannian SVD (R-SVD).
Eric P. Jiang and Michael W. Berry, August 1998. Proceedings of the Fifth International Symposium on: Solving Irregularly Structured Problems in Parallel, Lecture Notes in Computer Science 1457 (1998), pp. 386-395.

Submitted Paper
Concept Decompositions for Large Sparse Text Data Using Clustering.
Inderjit S. Dhillon and Dharmendra S. Modha, IBM Almaden Research Center, San Jose, CA, May 1999.

Matrices, Vector Spaces, and Information Retrieval
M.W. Berry, Z. Drmac, and E.R. Jessup. SIAM Review 41:2, (1999), pp. 335-362.

Results Ranking in Web Search Engines
M.P. Courtois and M.W. Berry. Online 23:3, (1999), pp. 39-46.

Mining Consumer Product Data Via Latent Semantic Indexing
J. Jiang, M. W. Berry, J. M. Donato, G. Ostrouchov. Intelligent Data Analysis 3:5, (November 1999), pp. 377-398.

A Similarity-based Probability Model for Latent Semantic Indexing
Chris H.Q. Ding. Proc. of 22nd ACM SIGIR'99 Conference, (August 1999), pp. 59-65.
Click here for an updated version of this paper.

Approximate Dimension Equalization in Vector-based Information Retrieval
Fan Jiang and Michael L. Littman, Department of Computer Science, Duke University, (1999), Pre-print.

Understanding Search Engines: Mathematical Modeling and Text Retrieval
M.W. Berry and M. Browne, SIAM Book Series: Software, Environments, and Tools, (June 1999), ISBN: 0-89871-437-0. Book Cover

Computational Information Retrieval
M. Berry (Ed.), Proceedings of CIR'00 (Raleigh, NC), SIAM Proceedings in Applied Mathematics, SIAM, Philadelphia, 2001, 185 p., ISBN: 0-89871-500-8.

Solving Total Least Squares Problems in Information Retrieval
E.P. Jiang and M.W. Berry, Linear Algebra and Its Application 316, (2000), pp. 136-157.

Level Search Schemes for Information Filtering and Retrieval
X. Zhang, M.W. Berry, and P. Raghavan, Information Processing & Management 37:2 (2001), pp. 313-334.

Efficient Computation of the Riemannian SVD in TLS Problems in Information Retrieval
R.D. Fierro and M.W. Berry, in Total Least Squares and Errors-In-Variables Modeling: Analysis, Algorithms, and Applications, S. van Huffel and P. Lemmerling (Eds.), Kluwer Academic Publishers, Boston, (2002), pp. 349-360.

GTP (General Text Parser) Software for Text Mining
J.T. Giles, L. Wo, and M.W. Berry, in Statistical Data Mining and Knowledge Discovery, H. Bozdogan (Ed.), CRC Press, Boca Raton, (2003), pp. 455-471.

Retrieving Images as Text
MS Thesis, Jan van Gemert Intelligent Sensory Information Systems, Informatics Institute, Faculty of Science, University of Amsterdam, Kruislaan 403, 1098 SJ Amsterdam, The Netherlands, April 2003.

The Fight Against Spam, Part 2
Article by François Joseph de Keradec, May 18, 2004.

Neural memories and search engines
E. Mizraji, International Journal of General Systems, March 31, 2008 (online). (2008 Best Paper Award for IJGS)

Latent semantic analysis
T.K. Landauer and S. Dumais, Scholarpedia, July 2008 (online).

Latent semantic analysis
Wikipedia entry, November 2008 (online).