CMSC 828K
Data Structures and Algorithms for Information Retrieval
Instructors: Tapas Kanungo and David Mount
Spring 2000
Schedule of Presentations


This was last updated Feb 14, 2000.

DatePresenterTopicReadings Slides
Feb 1   Course Introduction    
Feb 8 Tapas Kanungo General Overview    
Feb 15 Tapas Kanungo Models for IR Chapter 2 (Modeling) in Baeza-Yates and Ribeiro-Neto, Modern Information Retrieval.  
Feb 22 Bill Pugh Searching on the Web Sergey Brin and Lawrence Page, "Page rank citation ranking: bringing order to the web", (Postscript File).

Sergey Brin and Lawrence Page, "The Anatomy of a Large-Scale Hypertextual Web Search Engine", WWW7 conference (Html Source).

J. Kleinberg, "Authoritative sources in a hyperlinked environment", Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998 (Extended version in Journal of the ACM 46, 1999), (PDF File).

 
Feb 29 Dave Mount Dimension reduction techniques G. Hristescu and M. Farach-Colton, "Cluster-preserving embedding of proteins", DIMACS TR 99-50, (postscript)

C. Faloutsos and K.-I. Lin, "FastMap: A Fast Algorithm for Indexing, Data-Mining and Visualization of Traditional and Multimedia Datasets", ACM SIGMOD, May 1995, San Jose, CA, pp. 163-174, (postscript)

Power Point

Postscript

PDF

Mar 7 Doug Oard Collaborative Filtering Herlocker, Jon et al., 1999 An algorithmic Framework for Performing Collaborative Filtering (pdf)

Pazzani, Michael, 2000 A Framework for Collaborative, Content-Based and Demographic Filtering (pdf)

 
Mar 14 Alex Dekhtyar Probabilistic IR F. Crestani, M. Lalmas, C. J. Can Rijsbergen, I. Campbell, "'Is this document relevant?...probably': A survey of probabilistic methods in information retrieval," ACM Computing Surveys, Vol. 30, pp. 528--552, 1998.

C.J. van Rijsbergen, "A Non-Classical Logic for Information Retrieval," Computer Journal, vol 29, pages 481-485, 1986

S.E. Robertson, "The Probability Ranking Principle in IR," Journal of Documentation, vol. 33, pages 294--304, 1977

Part 1 (PPT)

Part 1 (PDF)

Part 1 (postscript)

Part 2 (PPT)

Part 2 (PDF)

Part 2 (postscript)

Mar 21 Spring Break Spring Break Spring Break Spring Break
Mar 28 Doe-Wan Kim Nearest Neighbor Searching Jon Louis Bentley, "Multidimensional Binary Search Trees Used for Associative Searching", Communications of the ACM, vol.18, no.9, pp. 509-517, September 1975

David A. White, Ramesh Jain, "Similarity Indexing: Algorithms and Performance", Proceedings of the SPIE, vol.2670, pp. 62-73, 1996

Postscript

PDF

Apr 4 Thomas Baby Indexing Chapter 3 in Managing Gigabytes by Ian H. Witten, Alistair Moffat and Timothy C. Bell, Morgan Kaufmann Publishers, Inc., 1999 PPT

PDF

postscript

Apr 11 Tapas Kanungo HMM-based Methods L. R. Rabiner, "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition," Proceedings of IEEE, vol. 77, pp. 257-286, 1989

D. R. H. Miller, T. Leek, and R. M. Schwartz, "A Hidden Markov Model Information Retrieval System," Proc. of SIGIR, pp. 214-221, Berkeley, CA, 1999. (PDF)

Postscript

PDF

Apr 18 Harry Hochheiser Approximate sequence identification using string algorithms B. S. Baker, "A Theory of Parmeterized Pattern Matching: Algorithms and Applications," ACM STOC, 1993

Graham A. Stephen, "String Searching Algorithms," World Scientific Publishing Co. 1994. (pages 87--100)

Edward M. McCreight, " A Space-Economical Suffix Tree Construction Algorithm," JACM, vol 23, pp. 262-272, 1976

HTML

PPT

PDF

postscript

Apr 25 Kyongil Yoon Video Searching Chapter 7 (Video Indexing and Retrieval) in "Multimedia Database Management Systems," by Guojun Lu

J. D. Courtney, "Automatic Video Indexing Via Object Motion Analysis," Pattern Recognition, vol. 30, pp. 607-625, 1997

PPT

PDF

postscript

May 2 Clara Cabezas Query and Document Expansion Chapter 5 (Query Operations) in Baeza-Yates and Ribeiro-Neto, Modern Information Retrieval.

A. Singhal and F. Pereira, "Document Expansion for Speech Retrieval," Proc. of SIGIR, Berkeley, 1999 (postscript)

L. Ballesteros and W. B. Croft, "Resolving Ambiguity for Cross-language Retrieval," in Proc. of ACM SIGIR, Melbourne, Australia, 1998 (pdf)

L. Ballesteros and W. B. Croft, "Phrasal Translationa and Query Expansion Techniques for Cross-Language Information Retrieval," in Proc. of ACM SIGIR, Philadelphia, 1997. (pdf)

PPT

PDF

postscript

May 9 Fatma Ozcan Information Filtering Tak W. Yan and Hector Garcia-Molina, "Index Structures for Information Filtering Under the Vector Space Model," pp. 337-347, Proc. of IEEE Int. Conf. on Data Engineering (ICDE) 1994. (postscript)

M. Altinel and M. Franklin, "Efficient Filtering of XML Documents for Selective Dissemination of Information," Submitted for publication, February, 2000 (pdf)

U. Cetintemel, M. Franklin, and C. Lee Giles, "Self-Adaptive User Profiles for Large-Scale Data Delivery," ICDE 2000. (postscript)

postscript

PPT

May 16 Anil Akurathi Parallel and Distributed Algorithms Chapter 9, (Parallel and Distributed IR) in Modern Information Retrieval, by Baeza-Yates and Ribeiro-Neto.

B.Ribeiro-Neto, E.S.Moura, M.S.Neubert and N.Ziviani. Efficient Distributed Algorithms to Build Inverted Files. In SIGIR'99, Berkley, USA (pdf) (postscript)

B.Ribeiro-Neto, J.P.Kitajima, G.Navarro, C.Santana and N.Ziviani. Parallel generation of inverted files for distributed text collections. In Proc. of Int. Conf. of the Chilean Society of Computer Science, (SCCC'98) pages 149-15, Antofagasta, Chile, 1998. (pdf) (postscript)

B.Cahoon and K.S.Mckinley, "Performance Evaluation of a Distributed Architecture for Information Retrieval," ACM SIGIR, Switzerland, Aug., 1996. (postscript)

PPT

PDF

postscript

Copy all the slides (tar+gz)

Back to CMSC 828K home page