This page has pointer to software packages that I have been involved with in some way. Either I wrote it, or was involved in its design, or I used it a paper that I co-authored. If you have questions or comments regarding my software, please feel
free to email me -- see contacts page.
- UMDHMM -- Hidden Markov Model Toolkit
- TRUEVIZ -- Groudtruth and Visualization for Document Images (Chang Ha Lee)
- PSET -- Software for Performance Evaluation of Document Segmentation Algorithms. (Song Mao)
- MVNTEST --
Statistical Hypothesis Testing Software
- KMEANS -- K-Means Clustering (David Mount)
UMDHMM
Hidden Markov Model (HMM) Software: Implementation of Forward-Backward,
Viterbi, and Baum-Welch algorithms.
The software has been compiled and tested on UNIX platforms
(sun solaris, dec osf and linux) and PC NT running the GNU package
from Cygnus (has gcc, sh, etc.). A tar file can be found at:
(tar file). If you need a zip file:
zip file .
The README file.
Postscript slides
for tutorial talks that I gave on HMM. The PDF
version of the tutorial.
If for some reason you need the older verison of the code, the
files are here -- zip file and shar
file
To refer:
-
Tapas Kanungo, "UMDHMM: Hidden Markov Model Toolkit,"
in "Extended Finite State Models of Language," A. Kornai (editor), Cambridge University Press, 1999. http://www.kanungo.com/software/software.html.
Also:
-
Philip Resnik implemented a neat part-of-speech tagger
(tutorial
and scripts) using my HMM package.
- Here is a system that uses my HMM package for
predicting the toplogy of trans membrane helical protiens:
[system] [paper]
- If you want to use the software for bioinformatics see:
link.
- If you want to use the software for speech recognition work see (prob. 2):
link.
TRUEVIZ
TRUEVIZ: Groundtruth Editing and Visulization Software.
Software was co-developed by my student
Chang Ha Lee and Jeff Czorapinski.
Software: trueviz-1_02.tar.gz
Readme: README
Papers:
- C. H. Lee and T. Kanungo,
"The Architecture of TRUEVIZ: A GroundTRUth/Metadata Editing and VISualiZing Toolkit," Pattern Recognition, vol. 36, no. 3, pp. 811-825, 2003.
Download pdf
-
Kanungo, Lee, Czorapinsky, Bella,
"TRUEVIZ: A groundtruth/metadata
Editing and Visualizing Toolkit for OCR,"
in Proc. of SPIE Conf. on Document Recognition and Retrieval,"
San Jose, January 2001. Download gzipped ps
-
Tech Report: Lee, Kanungo, gzipped ps .
TrueViz was used at National Library of
Medicine, to create
Rover, which is used in the MEDLINE database creation process.
MVNTEST
My
multivariate hypothesis testing software for Gaussian Data.
A compressed shar file
available from Statlib.
A tech report
describing the tests and software.
PSET
PSET: A Page Segmentation Evaluation Toolkit. Software written by
my student Mao Song. (software/tar file)
The README file.
Papers:
-
S. Mao and T. Kanungo, "Software Architecture of PSET:
A Page Segmentation Evaluation Toolkit, Int. Journal on Document Analysis
and Recognition," vol. 4, no. 3, pp. 205-217, 2002.
Download pdf
-
S. Mao and T. Kanungo,
"PSET: A Page Segmentation Evaluation Toolkit,"
Proc. of Document Analysis Systems,
Rio de Janeiro, Brazil, 2000. Download
ps.
- A technical report describing the
software System. (The journal version above has more current information.)
- A journal paper that uses the PSET package to evaluate various segmentation algorithms
S. Mao and T. Kanungo,
"Empirical Performance Evaluation Methodology and its Application to
Page Segmentation Algorithms,"
IEEE Transactions on Pattern Analysis and Machine Intelligence,
vol. 23, no. 3, pp. 242-256, March 2001.
Download pdf
-
Here is a technical report
that describes our experimental results. (The journal paper above is more current.)
K-Means
The K-Means Custering Software was written by David Mount. You can download the code from his
software page .
Papers associated with this work are:
-
T. Kanungo, D. M. Mount, N. S. Netanyahu, C. Piatko, R. Silverman and A. Y. Wu,
An Efficient k-Means Clustering Algorithm: Analysis and Implementation,
IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 881-892, 2002.
Download pdf
-
T. Kanungo, D. M. Mount, N. S. Netanyahu, C. Piatko, R. Silverman and A. Y. Wu,
"A Local Search Approximation Algorithm for k-Means Clustering,"
Computational Geometry: Theory and Aplications, vol. 28, pp. 89-112, 2004,
Download pdf
More coming up. (DDM, DVI2TIF, ESTIM, VALID, etc.)
Publications
My OCR Page
Multilingual OCR Workshop
Back to my home page
Back to my IBM home page