Introduction to the Zen toolkit methodology


Abstract
 
The Zen toolkit is a public-source library in the Objective Caml functional language, designed for computational linguistics applications.

It defines a set of data structures and algorithms that organize linguistic computations under a uniform methodology of mixed automata. This allows efficient manipulation of generative lexicons, of databases of inflected forms storing the results of morpho-phonetic processes, and of transducers performing lemmatization, segmentation, and further shallow parsing. The course surveys this methodology and demonstrates its application to a Sanskrit computerisation platform. The software is freely downloadable from the Zen site, http://sanskrit.inria.fr/ZEN/, where lecture notes for the course are available as a PDF document: http://sanskrit.inria.fr/ZEN/zen.pdf.
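
As a rough illustration of the kind of structure involved, a lexicon can be stored as a trie, that is, a deterministic automaton whose states are the trie nodes and whose accepting states mark complete words. The following is only a minimal sketch in standard OCaml syntax: the names trie, insert, mem, explode and lexicon_of_words are chosen here for illustration and are not taken from the Zen library, whose actual definitions are given in the lecture notes.

```ocaml
(* Illustrative sketch only: names and representation are assumptions,
   not the Zen library's own definitions. *)

(* A node carries an acceptance flag and its outgoing arcs.
   The flag is true when the path from the root spells a word. *)
type trie = Node of bool * (char * trie) list

let empty = Node (false, [])

(* Convert a string to the list of its characters. *)
let explode (s : string) : char list =
  List.init (String.length s) (String.get s)

(* Insert a word, sharing any prefix already present in the trie. *)
let rec insert (chars : char list) (Node (accept, arcs)) : trie =
  match chars with
  | [] -> Node (true, arcs)
  | c :: rest ->
      let sub = try List.assoc c arcs with Not_found -> empty in
      Node (accept, (c, insert rest sub) :: List.remove_assoc c arcs)

(* Membership test: follow one arc per character, then check the flag. *)
let rec mem (chars : char list) (Node (accept, arcs)) : bool =
  match chars with
  | [] -> accept
  | c :: rest ->
      (match List.assoc_opt c arcs with
       | Some sub -> mem rest sub
       | None -> false)

(* Build a lexicon from a list of words. *)
let lexicon_of_words (words : string list) : trie =
  List.fold_left (fun t w -> insert (explode w) t) empty words
```

With such a structure, lexicon_of_words ["deva"; "devi"] stores the shared prefix "dev" only once, and mem (explode "deva") on the result walks four arcs before checking the acceptance flag. Arcs are kept in an unsorted association list here for brevity; a real lexicon would likely keep them ordered.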


