Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Summary

The Multimodal Information Group's speech analytics program has a long history of activities supporting the development of technologies that extract content from language-based recordings and of metrology advancements, primarily through systematic and targeted annual evaluations.

Since 1987, the Multimodal Information Group has coordinated speech transcription technology evaluations that explored several aspects of language production including the domain of discourse, source language, transcription, keyword search, speech/non-speech segmentation (speech activity detection), and disfluency detection, to name a few.

Description

Current speech analytics work:

  • OpenSAT is a new speech analytic evaluation series designed to include developers from multiple speech analytic technologies where common datasets are used in NIST evaluations. The goal is to bring together developers focused on different speech analytic tasks to enable knowledge sharing and leveraging for advancing these technologies. Speech analytic tasks from past NIST evaluations are included in OpenSAT. 

Past speech analytics work:

  • OpenSAD: The purpose of a Speech Activity Detection (SAD) system is to find regions of speech in an audio file. The NIST Open Speech-Activity-Detection evaluation (OpenSAD) is intended to provide Speech-Activity-Detection system developers with an independent evaluation of performance on a variety of audio data. The OpenSAD evaluation is a counterpart of the DARPA RATS SAD evaluations, but is open to all interested participants.
  • OpenKWS: An annual evaluation of technologies that perform keyword search in a new language each year. The evaluation is an outgrowth of the 2006 Spoken Term Detection evaluation.
  • Rich Transcription: The Rich Transcription evaluation series promotes and gauges advances in the state-of-the-art in several automatic speech recognition technologies. The goal of the evaluation series is to create recognition technologies that will produce transcriptions which are more readable by humans and more useful for machines.
Created January 15, 2013, Updated July 5, 2018