Rosetta Stone

Main research areas:

Multimodal Interaction
Technologies to deal with a recent paradigm shift in the design of Pattern Recognition, where the traditional concept of full-automation is being changed to systems where the decision process is conditioned by human feedback. Problems and applications considered within this area include: Relevance-based (image) information retrieval and Interactive-Predictive processing for Computer Assited Machine Translation, as well as for the Interactive Transcription of speech audio streems and text images.
Machine Translation
Speech-to-speech translation or text-to-text translation for limited domains. Finite-state and statistical transducers are used as the basis of the machine translation systems. These models can be learnt automatically from real examples of translation. Applications: translation of technical reports, hotel services, etc.
Handwritten Cursive Text Recognition (HTR)
Both off-line (document images) and on-line HTR (tablet or e-pen signals) are considered. No prior character or word segmentation is needed. Technology, borrowed from Speech Recognition, relies on character Hidden Markov Models, Finite State word models, and syntactic N-Grams. After model training, for each given text line image, a holistic ("Viterbi") search provides both an optimal transcription and the corresponding word and character segmentations. Applications: Transcription of ancient and legacy documents, transcription of unconstrained handwritten text in survey forms, etc.
Automatic Speech Recognition and Understanding
The speech utterances are decoded into strings of words or into strings of semantic units. Finite-state grammars are used as the basis of such systems. These finite-state grammars are learnt automatically from real examples of utterances or text. Applications: telephone exchange services, device control by voice, information queries, etc.
Image Analysis and Computer Vision
Identification of the objects in an image. Statistical and Syntactic Pattern recognition techniques are used. Applications: OCR and document analysis, medical diagnosis, fingerprint identification, classification of chromosomes, aids for the handicapped, manufacturing quality control, etc.
Activities and results in these areas can be found in: Projects, Demos, Software and Publications,