NIST logo
*

Multimodal Information Group

Welcome

The Multimodal Information Group (774.01) researches and develops measurement and evaluation methods to advance and promote the use of technologies that provide more effective access to multimedia and multi-lingual information and that improve human-computer interface modalities. These technologies include recognizing and/or transforming information in speech, text, images, video, and other multimedia modalities, and the fusion of heterogeneous media content through speech recognition, speaker recognition, language recognition, machine translation, image processing, image understanding, video processing, visual recognition, 2-D and 3D shape analysis, image quality assessment, and interoperable digital media access.

Programs/Projects

Machine Translation Program—The Multimodal Information Group's machine translation (MT) program includes several activities contributing to machine translation technology and metrology advancements, primarily through …

TRECVid Multimedia Event Detection Evaluation Track—The Multimedia Event Detection (MED) evaluation track is part of the TRECVid Evaluation. The goal of MED is to assemble core detection technologies into a system that can search multimedia …

i-vector Challenge—NIST will coordinate a special i-vector challenge in late 2013 and 2014 based on data used in previous NIST Speaker Recognition Evaluations (SREs). This challenge is intended to foster interest in …

TRECVid Multimedia Event Recounting Evaluation Track—The Multimedia Event Recounting (MER) evaluation track is part of the TRECVid Evaluation. MER participants are the participants in the MED evaluation whose MED submission also includes a recounting …

TRECVid Surveillance Event Detection Evaluation Track—The Surveillance Event Detection (SED) evaluation track is part of the TRECVid Evaluation and is intended to help promote technology development for event detection in video surveillance. The goal …

Speaker and Language Recognition Projects—The Multimodal Information Group's speaker and language recognition program includes several activities contributing to speaker and language recognition technology and metrology advancements, …

Video Surveillance Technologies for Retail Security—NIST has brought together major stakeholders from the retail and security industries, computer vision technologists/developers, the research community, law enforcement, and government agencies in a …

Advanced Video and Signal Based Surveillance—The Second Multiple Camera Single Person Tracking Challenge Evaluation (MCSPT) will be held in conjunction with the 7th Advanced Video and Signal Based Surveillance (AVSS) IEEE Conference.  The …

Rich Transcription Evaluation—The Rich Transcription evaluation series promotes and gauges advances in the state-of-the-art in several automatic speech recognition technologies. The goal of the evaluation series is to create …

 
Contact

General Information:
301-975-2944 Telephone
301-670-0939 Facsimile

100 Bureau Drive, M/S 8940
Gaithersburg, MD 20899-8940