Publications

Papers by topic


    Edited Volumes:

  1. Document Recognition and Retrieval X
    SPIE, Bellingham, WA, 2003.
    T. Kanungo, E. H. B. Smith, J. Hu, and P. B. Kantor (editors)

  2. Document Recognition and Retrieval IX
    SPIE, Bellingham, WA, 2002.
    P. B. Kantor, J. Zhou, and T. Kanungo (editors)

    Journal Special Issue:

  3. Special issue on Performance Evaluation: Theory, Practice and Impact
    Int. Journal of Document Analysis and Recognition, vol. 4, no. 3, 2002.
    by T. Kanungo, H. S. Baird and R. M. Haralick

    Journal Papers:

  4. On the Use of Error Propagation for Statistical Validation of Computer Vision Software
    IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 10, 2005.
    by X. Liu, T. Kanungo, R. M. Haralick
    Download pdf
  5. The Bible and Multilingual Optical Character Recognition
    Communications of the ACM, vol. 48, no. 6, pp. 124-130, 2005.
    by T. Kanungo, P. Resnik, S. Mao, D. Kim and Q. Zheng
    Download pdf
  6. A Local Search Approximation Algorithm for k-Means Clustering
    Computational Geometry: Theory and Applications, vol. 28, pp. 89-112, 2004 (invited paper)
    by T. Kanungo, D. M. Mount, N. S. Netanyahu, C. Piatko, R. Silverman and A. Y. Wu
    Download pdf
  7. Estimating Degradation Model Parameters using Neighborhood Pattern Distributions: An Optimization Approach
    IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 4, pp. 520-524, 2004.
    by T. Kanungo and Q. Zheng
    Download pdf
  8. A Case for Automated Large Scale Semantic Annotation
    Journal of Web Semantics, Elsevier Press, vol. 1, no. 1, pp. 115-132, 2003 (invited paper).
    by S. Dill, N. Eiron, D. Gibson, D. Gruhl, R. Guha, A. Jhingran, T. Kanungo, K. S. McCurley, S. Rajagopalan, A. Tomkins, J. Tomlin, J. Zien
    Download pdf
  9. Stochastic Language Models for Style-Directed Physical Layout Analysis of Documents
    IEEE Transactions on Image Processing, vol. 12, no. 5, pp. 583-596, 2003.
    by T. Kanungo and S. Mao
    Download pdf
  10. The Architecture of TRUEVIZ: A GroundTRUth/Metadata Editing and VISualiZing Toolkit
    Pattern Recognition, vol. 36, no. 3, pp. 811-825, 2003.
    by C. H. Lee and T. Kanungo
    Download pdf
  11. Attributed Point Matching for Automatic Groundtruth Registration
    Int. Journal on Document Analysis and Recognition, vol. 5, no. 1, pp. 47-66, 2002.
    by D. Kim and T. Kanungo

  12. An Efficient k-Means Clustering Algorithm: Analysis and Implementation
    IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 881-892, 2002.
    by T. Kanungo, D. M. Mount, N. S. Netanyahu, C. Piatko, R. Silverman and A. Y. Wu
    Download pdf
  13. Software Architecture of PSET: A Page Segmentation Evaluation Toolkit
    Int. Journal on Document Analysis and Recognition, vol. 4, no. 3, pp. 205-217, 2002.
    by S. Mao and T. Kanungo
    Download pdf
  14. Approximating Large Convolutions in Digital Images
    IEEE Transactions on Image Processing, vol. 10, no. 12, pp. 1826-1835, 2001
    by D. M. Mount, T. Kanungo, N. S. Netanyahu, C. Piatko, R. Silverman and A. Y. Wu
    Download pdf
  15. Empirical Performance Evaluation Methodology and its Application to Page Segmentation Algorithms
    IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 3, pp. 242-256, March 2001
    by S. Mao and T. Kanungo
    Download pdf
  16. Statistical, Nonparametric Methodology for Document Degradation Model Validation
    IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 11, pp. 1209-1223, November 2000
    by T. Kanungo, R. M. Haralick, H. S. Baird, W. Stuetzle, and D. Madigan
    Download pdf
  17. An Automatic Closed-Loop Methodology for Generating Character Groundtruth for Scanned Documents
    IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 21, no. 2, pp. 179-183, February 1999
    by T. Kanungo and R. M. Haralick
    Download pdf
  18. Grayscale Structuring Element Decomposition
    IEEE Transactions on Image Processing vol. 5, no. 1, pp. 111-120, 1996
    by O. I. Camps, T. Kanungo, and R. M. Haralick
    Download pdf
  19. A Methodology for Quantitative Performance Evaluation of Detection Algorithms
    IEEE Transactions on Image Processing, vol. 4, no. 12, pp. 1667-1674, 1995
    by T. Kanungo, M. Y. Jaisimha, J. Palmer, and R. M. Haralick
    Download pdf
  20. Statistical Morphology
    Journal of Applied Statistics, Vol. 21(1/2), suppl. 2, pp. 341-354, 1994
    by R. M. Haralick, E. Dougherty, J. Ha, T. Kanungo, S. Karasu, C. K. Lee, L. Rystrom, V. Ramesh, I. Phillips
    Download postscript
    Reprinted in "Selected Papers on Morphological Image Processing," Tomasz Szoplik (ed).
  21. Nonlinear Local and Global Document Degradation Models
    Int. Journal of Imaging Systems and Technology Vol. 5, no. 4, John Wiley, pp. 220-30, Fall 1994
    by T. Kanungo, R. M. Haralick, and I. Phillips
    Download pdf
  22. Vector Space Solution of a Morphological Shape Decomposition Problem
    Journal of Mathematical Imaging and Vision, Kluwer Academic Publishers, Vol. 2, no. 1, pp. 51-82, 1992
    by T. Kanungo and R. M. Haralick
    Download pdf

    Book Chapters:

  23. IBM Research Activities at TREC
    in TREC: Experiment and Evaluation in Information Retrieval, E. M. Voorhees and D. K. Harman (eds.), MIT Press, pp. 373-396, 2005
    by E. Brown, D. Carmel, M. Franz, A. Ittycheriah, T. Kanungo, Y. Maarek, J. S. McCarley, R. L. Mack, J. M. Prager, J. R. Smith, A. Soffer, J. Y. Zien, A. D. Marwick
    Download pdf
  24. A Fast Algorithm for MDL-Based Multi-Band Image Segmentation
    in Image Technology, Jorge Sanz (ed.) Springer-Verlag, 1996
    by T. Kanungo, B. Dom, W. Niblack, and D. Steele
    Download technical report ps
  25. Discrete Half-Plane Morphology for Restricted Domains
    in Mathematical Morphology in Image Processing, E. Dougherty (ed.), Marcel Dekker, pp. 289-321, 1993
    by T. Kanungo and R. M. Haralick

  26. A Quick Way for Relational Matching: Morphology
    Advances in Syntactic and Structural Pattern Recognition, H. Bunke, ed., Bern, Switzerland, pp. 47-62, August 26-28, 1992
    by T. Kanungo, R. M. Haralick, and J. Strupp

    Refereed Conference Papers:

  27. Thresholding strategies for text classifiers: TREC-2005 Biomedical Triage Task Experiments
    Proc. of the Text Retrieval Conference (TREC), Gaithersburg, MD, 2005
    by L. Si and T. Kanungo
    Download pdf
  28. Boosting Performance of Bio-Entity Recognition by Combining Results from Multiple Systems
    Proc. BioKDD Workshop, Chicago, IL, 2005.
    by L. Si, T. Kanungo and X. Huang
    Download pdf
  29. Satisfying database service level agreements while minimizing cost through storage QoS
    Proc. of Int. Conf. on Services Computing, Orlando, FL, 2005.
    by F. Reiss and T. Kanungo
    Download pdf
  30. IDG: A business information extraction, management, and routing front-end for content management systems
    Proc. of SDIUT, College Park, MD, 2005.
    by V. Krishna, S. Srinivasan, N. Boyette, I. Cheng, J. Kreulen, T. Kanungo

  31. SemTag and SEEKER: Bootstrapping the Semantic Web via Automated Semantic Annotation
    Proc. of the World Wide Web Conference, Budapest, Hungary, 2003
    by S. Dill, N. Eiron, D. Gibson, D. Gruhl, R. Guha, T. Kanungo, A. Jhingran, S. Rajagopalan, A. Tomkins, J. Tomlin, J. Zien
    Download pdf
  32. Stochastic attributed K-D tree modeling of technical paper title pages
    Proc. of IEEE Int. Conf. on Image Processing, Barcelona, Spain, September 2003.
    by S. Mao, A. Rosenfeld and T. Kanungo

  33. A Characterization of the Sensitivity of Query Optimization to Storage Cost Parameters
    Proc. of the ACM SIGMOD Conference, San Diego, CA, 2003.
    by F. R. Reiss and T. Kanungo
    Download pdf
  34. Document Structure Analysis: A Survey
    Proc. of SPIE Conf. on Document Recognition and Retrieval, Santa Clara, CA, pp. 197-207, January 2003.
    by S. Mao, A. Rosenfeld, and T. Kanungo
    Download pdf
  35. A Local Search Approximation Algorithm for k-Means Clustering
    Proc. of ACM Symposium on Computational Geometry, Barcelona, Spain, pp. 10-18, June, 2002.
    by T. Kanungo, D. M. Mount, N. S. Netanyahu, C. Piatko, R. Silverman and A. Y. Wu

  36. Stochastic Language Models for Analyzing Physical Layout of Documents
    Proc. of SPIE Conf. on Document Recognition and Retrieval, San Jose, CA, pp. 28-36, January 2002.
    by T. Kanungo and S. Mao

  37. Integrating Link Structure and Content Information for Ranking Web Documents
    Proc. of the Text Retrieval Conference, Gaithersburg, MD, September, 2001.
    by T. Kanungo and J. Y. Zien
    Download pdf
  38. Model-based Restoration of Degraded Documents
    Proc. of IEEE Int. Conf. on Image Processing, Greece, pp. 193-196, 2001.
    by Q. Zheng and T. Kanungo
    Download pdf
  39. Stochastic Language Models for Automatic Acquisition of Lexicons from Printed Bilingual Dictionaries
    Proc. of Document Layout Interpretation and its Applications, Seattle, WA, Sept. 9, 2001.
    by S. Mao and T. Kanungo

  40. What Fraction of Images on the Web Contain Text?
    Proc. of Int. Workshop on Web Document Analysis, Seattle, WA, Sept. 8, 2001.
    by T. Kanungo, C. H. Lee and R. Bradford
    Download pdf
  41. Estimation of Morphological Degradation Model Parameters
    Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Salt Lake City, UT, pp. 1961-1964, May 2001
    by T. Kanungo and Q. Zheng
    Download pdf
  42. TRUEVIZ: A groundTRUth/metadata Editing and VisualiZing Toolkit for OCR
    Proc. of SPIE Conf. on Document Recognition and Retrieval, San Jose, CA, January 2001.
    by T. Kanungo, C. Lee, J. Czorapinski and I. Bella

  43. Baseline Experiments for OCR-Based Arabic Named-Entity Extraction
    Proc. of IAPR Int. Workshop on Document Analysis Systems, Rio de Janeiro, Brazil, December 2000
    by T. Kanungo and O. Bulbul
    Download pdf
  44. PSET: A Page Segmentation Evaluation Toolkit
    Proc. of IAPR Int. Workshop on Document Analysis Systems, Rio de Janeiro, Brazil, December 2000.
    by S. Mao and T. Kanungo
    Download pdf
  45. A Point Matching Algorithm for Automatic Generation of Groundtruth for Document Images
    Proc. of IAPR Int. Workshop on Document Analysis Systems, Rio de Janeiro, Brazil, December 2000.
    by D. W. Kim and T. Kanungo
    Download pdf
  46. Automatic Training of Page Segmentation Algorithms: An Optimization Approach
    Proc. of IAPR Int. Conf. on Pattern Recognition Barcelona, Spain, September 3-8, 2000
    by S. Mao and T. Kanungo
    Download ps
  47. The Analysis of a Simple K-Means Clustering Algorithm
    Proc. of ACM Symposium on Computational Geometry Hong Kong, June 12-14, 2000
    by T. Kanungo, D. M. Mount, N. S. Netanyahu, C. Piatko, R. Silverman, and A. Y. Wu
    Download ps
  48. Empirical Performance Evaluation of Page Segmentation Algorithms
    Proc. of SPIE Conf. on Document Recognition and Retrieval (VII) San Jose, CA, January 26-27, 2000
    by Song Mao and Tapas Kanungo
    Download ps
  49. OmniPage vs. Sakhr: Paired Model Evaluation of Two Arabic OCR Products
    Proc. of SPIE Conf. on Document Recognition and Retrieval (VI), vol. 3651 San Jose, CA, January 27-28, 1999
    by T. Kanungo, G. A. Marton and O. Bulbul
    Download ps
  50. The Bible, Truth, and Multilingual OCR Evaluation
    Proc. of SPIE Conf. on Document Recognition and Retrieval (VI), vol. 3651, San Jose, CA, January 27-28, 1999
    by T. Kanungo and P. Resnik
    Download ps
  51. Computing Nearest Neighbors for Moving Points and Applications to Clustering
    Proc. of 10th Annual ACM-SIAM Symposium on Discrete Algorithms Baltimore, MD, Jan. 17-19, 1999
    by T. Kanungo, D. M. Mount, N. Netanyahu, C. Piatko, R. Silverman, and A. Wu
    Download ps
  52. Performance Evaluation of Two Arabic OCR Products
    Proc. of AIPR Workshop on Advances in Computer Assisted Recognition, SPIE vol. 3584 Washington DC, October 14-16, 1998.
    by T. Kanungo, G. E. Marton and O. Bulbul
    Download ps
  53. Approximating Large Convolutions in Digital Images
    Proc. of Vision Geometry VII, R. A. Melter, L. J. Latecki, A. Y. Wu, Editors, SPIE vol. 3454, pp. 216-227, San Diego, June 1998
    by T. Kanungo, D. M. Mount, N. S. Netanyahu, R. Silverman, and A. Y. Wu
    Download ps
  54. Hierarchical Organization of Appearance-Based Parts and Relations for Object Recognition
    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, Santa Barbara, June 1998
    by O. I. Camps, C. Y. Huang and T. Kanungo

  55. Object Recognition Using Appearance-Based Parts and Relations
    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, Puerto Rico, 1997
    by Chien-Yuan Huang, O. I. Camps and T. Kanungo
    Download ps
  56. Automatic Generation of Character Groundtruth for Scanned Documents: A Closed-Loop Approach
    Proc. of Int. Conf. on Pattern Recognition, Vienna, Austria, 1996
    by T. Kanungo and R. M. Haralick
    Download ps
  57. Statistical Validation of Computer Vision Software
    Proc. of ARPA Image Understanding Workshop Palm Springs, CA, 1996, pp. 1533-1540
    by X. Liu, T. Kanungo, and R. M. Haralick

  58. Constrained Monotone Regression of Receiver Operating Curves and Histograms Using Splines and Polynomials
    Proc. of IEEE Int. Conf. on Image Processing, Washington, D.C., 1995, pp. 292-5, Vol. 2
    by T. Kanungo, D. M. Gay, and R. M. Haralick
    Download pdf
  59. Receiver Operating Curves and Optimal Bayesian Operating Points
    Proc. of IEEE Int. Conf. on Image Processing, Washington, D.C., 1995, pp. 256-9, Vol. 3
    by T. Kanungo and R. M. Haralick
    Download pdf
  60. Power Functions and their Use in Selecting Distance Functions for Document Degradation Model Validation
    Proc. of Int. Conf. on Document Analysis and Recognition, Montreal, Canada, 1995, pp. 734-9, Vol. 2
    by T. Kanungo, R. M. Haralick, and H. S. Baird
    Download ps
  61. Reconstruction of CAD Objects from Engineering Drawings: A Survey
    Proc. of First IAPR Workshop on Graphics Recognition University Park, PA, 1995, pp. 217-228
    by T. Kanungo, R. M. Haralick, and D. Dori
    Download ps pdf
  62. Estimation and Validation of Document Degradation Models
    Proc. of Symposium on Document Analysis and Information Retrieval Las Vegas, NV, 1995, pp. 217-228
    by T. Kanungo, H. S. Baird and R. M. Haralick

  63. Estimation of Morphological Degradation Parameters
    Proc. SPIE Conference on Nonlinear Imaging, San Jose, CA, pp. 86-95, vol. 2424, February 6–9, 1995
    by T. Kanungo and R. M. Haralick

  64. Document Degradation Models: Parameter Estimation and Model Validation
    Proc. of IAPR Workshop on Machine Vision and Applications Kawasaki, Japan, 1994, pp. 552-7
    by T. Kanungo, R. M. Haralick, H. S. Baird, W. Stuetzle, and D. Madigan

  65. A Fast Algorithm for MDL-Based Multi-Band Image Segmentation
    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition pp. 609-616, Seattle, June 21-23, 1994
    by T. Kanungo, B. Dom, W. Niblack, and D. Steele
    Download pdf
  66. Statistical Morphology
    Proc. of SPIE Conf. on Image Algebra and Morphological Image Processing IV, Vol. 2030, pp. 191-202, San Diego, July 12-13, 1993
    by R. M. Haralick, E. Dougherty, J. Ha, T. Kanungo, S. Karasu, C. K. Lee, L. Rystrom, V. Ramesh, I. Phillips
    Download ps
  67. Global and Local Document Degradation Models
    Proc. of Int. Conf. on Document Analysis and Recognition, IEEE Press, Tsukuba, Japan, pp. 730-734, October 20-22, 1993
    by T. Kanungo, R. M. Haralick, I. Phillips
    Download ps
  68. A Quantitative Methodology for Analyzing the Performance of Detection Algorithms
    Proc. of IEEE Int. Conf. on Computer Vision, Berlin, Germany, pp. 247-252, May 11-14, 1993
    by T. Kanungo, R. M. Haralick, M.Y. Jaisimha, and J. Palmer

  69. Morphological Image Processing on a Token Passing Pyramid Computer
    Proc. of Int. Conf. on Pattern Recognition, Hague, Netherlands IEEE Press, pp. 83-86, August 30-September 3, 1992
    by T. Kanungo, R. M. Haralick, G. Chiou, A. Somani

  70. Grayscale Structuring Element Decomposition
    Proc. of Int. Conf. on Pattern Recognition, Hague, Netherlands, IEEE Press, pp. 260-263, August 30-September 3, 1992
    by O. I. Camps, R. M. Haralick, and Tapas Kanungo

  71. Morphological Decomposition of Restricted Domains: A Vector Space Solution
    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition Champaign, IL, pp. 627-629, June 15-18, 1992
    by T. Kanungo and R. M. Haralick
    Download ps
  72. Recursive Morphological Opening Transform
    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, Champaign, June 1992, pp. 560-565.
    by Robert M. Haralick, S. Chen and T. Kanungo

  73. Model-based Character Recognition
    Proc. of DARPA Workshop on Document Understanding Palo Alto, CA, May 1992
    by T. Kanungo and R. M. Haralick

  74. Character Recognition Using Mathematical Morphology
    Proc. of USPS Fourth Advanced Technology Conference Washington, D.C., pp. 973-986, Nov. 5-7, 1990
    by T. Kanungo and R. M. Haralick
    Download ps
  75. A Methodology for Characterization of a Line Detection Algorithm
    Proc. of SPIE Symposium on Advances in Intelligent Systems, Boston, Nov. 4-9, 1990.
    by T. Kanungo, M. Y. Jaisimha, R. M. Haralick and J. Palmer

  76. B-Code Dilation and Decomposition of Restricted Convex Shapes
    Proc. of SPIE Conf. on Image Algebra and Morphological Image Processing Vol. 1350, pp. 419-430, San Diego, July 10-12, 1990
    by T. Kanungo, R. M. Haralick and X. Zhuang

    Theses:

  77. Document Degradation Models and a Methodology for Degradation Model Validation
    Doctoral Dissertation, University of Washington, Seattle, WA, 1996 (Advisor: R. M. Haralick)
    by T. Kanungo
    Download ps
  78. Discrete Half-plane Morphology and Decomposition of Restricted Domains
    Master's Thesis, University of Washington, Seattle, WA, 1990 (Advisor: R. M. Haralick)
    by T. Kanungo

    Patents:

    Granted:

  79. Automatic language identification system for multilingual optical character recognition
    US patent US6047251, Caere Corporation, 4 April, 2000.
    by L. K. Pon, T. Kanungo, J. Yang, K. C. Choy and M. R. Bokser
    Download pdf
  80. Automatic language identification system for multilingual optical character recognition
    European patent, EP1016033B1, Scansoft, Inc., 18 June 2003.
    by L. K. Pon, T. Kanungo, J. Yang, K. C. Choy and M. R. Bokser
    Download pdf

    Pending:

  81. System and method for extracting entities from unstructured text using n-gram language models,
    IBM, filed 2006
    by T. Kanungo and J. Rhodes

  82. System and method for using InChi codes to identify similar molecules,
    IBM, filed 2006
    by S. Boyer, G. Breyta, T. Kanungo, J. Kreulen, J. Rhodes

  83. A method for annotating, indexing, and classifying patents with MeSH codes
    IBM, filed 2005
    by S. Boyer et al.

  84. Focussed Sampling: Computing Topical Web Statistics,
    IBM, filed 2004
    by Z. Bar-Yossef, T. Kanungo, and R. Krauthgamer

  85. IO-Saver: A System and Method for Saving IOs in RAID Storage Systems,
    IBM, filed 2004
    by J. L. Hafner, J. R. Hartline, and T. Kanungo

  86. RAID 5-I: A System and Method with Low Parity In-Degree for Tolerating Multiple Disk Failures,
    IBM, filed 2004
    by J. L. Hafner, J. R. Hartline, and T. Kanungo

  87. RAID 5X0: A System and Method for Tolerating Multiple Disk Failures While Achieving Near-Optimal Efficiency,
    IBM, filed 2004
    by J. L. Hafner, J. R. Hartline, and T. Kanungo

    Technical Reports:

    (Not published as conference or journal papers.)
  88. On the Use of Hierarchy Information in Mapping Patents to Biomedical Ontologies
    IBM Technical Report RJ10365, October 2005.
    by Luo Si and T. Kanungo
    Download pdf
  89. Focused Sampling: Computing Topical Web Statistics
    IBM Research Report RJ10339, February, 2005.
    by Z. Bar-Yossef, T. Kanungo, R. Krauthgamer
    Download pdf
  90. Performance Metrics for Erasure Codes in Storage Systems
    IBM Research Report RJ10321, August, 2004.
    by J. L. Hafner, V. Deenadayalan, T. Kanungo, KK Rao
    Download pdf
  91. R5X0: An Efficient High Distance Parity-Based Code with Optimal Update Complexity
    IBM Research Report RJ 10322, August, 2004.
    by J. R. Hartline, T. Kanungo and J. L. Hafner
    Download pdf
  92. Full-Text Access to Historical Newspapers
    CS-TR-4014, University of Maryland, College Park, MD, April 1999.
    by T. Kanungo and R. B. Allen
    Download pdf