rover

WORK IN PROGRESS

NAME

rover - Recognition Output Voting Error Reduction


SYNOPSIS

rover [ -sT -a alpha -c Nconf -f level -l width ] ( -h hypfile ctm )+ -o outfile -m meth

Input Options

Output Options

Voting Options

Alignment Options

DESCRIPTION

Rover is a tool combine hypothesized word outputs of multiple recognition systems and select the best scoring word sequence. Rover is part of the NIST SCTK Scoring Tookit. A number of different output formats can be generated and different scoring functions can be specified. A more complete description of the rover system can be found in the paper A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER).

The ROVER system is implemented in two modules. First, the system outputs from two or more ASR systems are combined into a single word transition network. The network is created using a modification of the dynamic programming alignment protocol traditionally used by NIST to evaluate ASR technology. Once the network is generated, the second module evaluates each branching point using a voting scheme, which selects the best scoring word (with the highest number of votes) for the new transcription. The following figure depicts the the overall system architecture.

The heart of the Rover program is the ability to combine system outputs of mulitple recognition systems using an iterative Dynamic Programming alignment protocol into a single, composite Word Transition Network (WTN). The protocol is fully described in the Section 2.1. RECOGNITION OUTPUT ALIGNMENT MODULE of the paper.

Once the composite WTN is produced, each correspondence set (CS) is evaluated using the selected scoring function. Section 2.2. WTN VOTING SEARCH MODULE: describes the voting process in detail. There are three voting schemes described in the paper:

REVISION HISTORY

BUGS/COMMENTS