Blame view

tools/sctk-2.4.10/src/md-eval/test/md_test4.output.md-eval-v6.txt 8.34 KB
8dcb6dfcb   Yannick Estève   first commit
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
  md-eval run on 2004 Apr 14 at 19:48:32
  command line:  /data/data1/greg/md-eval-v6.pl -D -W -w -t 1.0 -l 2 -u md_test4.uem -r md_test4.ref.rttm -s md_test4.sys.rttm
  
  Word-based metadata alignment, max gap between matching words = 1.0 sec
  
  Metadata evaluation parameters:
      word-optimized metadata mapping
          max gap between matching metadata events = 0.1 words
          max extent to match for SU's = 2 words
  
  Speaker Diarization evaluation parameters:
      The max time to extend no-score zones for NON-LEX exclusions is 0.5 sec
      The no-score collar at SPEAKER boundaries is 0 sec
  
  Exclusion zones for evaluation and scoring are:
                               -----MetaData-----        -----SpkrData-----
       exclusion set name:     DEFAULT    DEFAULT        DEFAULT    DEFAULT
       token type/subtype      no-eval   no-score        no-eval   no-score
               (UEM)              X                         X
           LEXEME/frag                      X                          
           LEXEME/un-lex                    X                          
          NON-LEX/breath                                              X
          NON-LEX/cough                                               X
          NON-LEX/laugh                                               X
          NON-LEX/lipsmack                                            X
          NON-LEX/other                                               X
          NON-LEX/sneeze                                              X
          NOSCORE/<na>            X         X               X         X
   NO_RT_METADATA/<na>            X                                    
               SU/unannotated               X                          
  
  FILLER alignment and scoring details for channel 1 of file md_test4
   ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       Rtbeg   Rtend     tbeg    tend sys-speaker 
     1   -   -   0     filled_pause FILLER    20.00   21.00 spkr=<sp3r>      filled_pause FILLER  (  20.00   21.00)   20.00   21.00 spkr=<sp3s> 
     1   -   -   0     filled_pause FILLER    25.00   26.00 spkr=<sp3r>      filled_pause FILLER  (  25.00   26.00)   25.00   26.00 spkr=<sp3s> 
  
  IP alignment and scoring details for channel 1 of file md_test4
   ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       Rtbeg   Rtend     tbeg    tend sys-speaker 
     1   -   -   0           filler IP        20.00   20.00 spkr=<sp3r>            filler IP      (  20.00   20.00)   20.00   20.00 spkr=<sp3s> 
     1   -   -   0           filler IP        25.00   25.00 spkr=<sp3r>            filler IP      (  25.00   25.00)   25.00   25.00 spkr=<sp3s> 
  
  SU alignment and scoring details for channel 1 of file md_test4
   ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       Rtbeg   Rtend     tbeg    tend sys-speaker 
     1   -   -   0        statement SU        10.00   13.00 spkr1r              statement SU      (  10.00   13.00)   10.00   13.00 spkr1s      
     1   -   -   0         question SU        13.00   16.00 spkr1r               question SU      (  13.00   16.00)   13.00   16.00 spkr1s      
     1   -   -   0      backchannel SU        16.00   19.00 name="<nar>"      backchannel SU      (  16.00   19.00)   16.00   19.00 name="<nas>"
     1   -   -   0         question SU        19.00   24.00 spkr=<sp3r>          question SU      (  19.00   22.00)   19.00   22.00 spkr=<sp3s> 
     1   -   -   0         question SU        24.00   29.00 spkr=<sp3r>          question SU      (  24.00   27.00)   24.00   27.00 spkr=<sp3s> 
     1   -   -   0         question SU        29.00   30.00 spkr=<sp3r>          question SU      (  29.00   30.00)   29.00   30.00 spkr=<sp3s> 
  
  *** Performance analysis for FILLERs ***
  
  FILLER word coverage statistics -- in terms of reference words
                Nref  Ndel  Nins  Nsub  Nerr     %Del   %Ins   %Sub   %Err
         ALL       2     0     0  <ns>     0     0.00   0.00   <ns>   0.00
  
  FILLER detection statistics -- in terms of # of FILLERs
                Nref  Ndel  Nins  Nsub  Nerr     %Del   %Ins   %Sub   %Err
         ALL       2     0     0     0     0     0.00   0.00   0.00   0.00
  
  FILLER detection confusion matrix -- in terms of # of FILLERs
             ALL - ref\sys  filled_p        {Miss}
              filled_pause       2             0  
  
                      {FA}       0  
  
  FILLER word offset statistics for ALL data
    word offsets:  <-3     -3   -2   -1    0    1    2    3     >3
             BEG:    0      -    -    -    2    -    -    -      0
             END:    0      -    -    -    2    -    -    -      0
  
  *** Performance analysis for IPs ***
  
  IP (exact) detection statistics -- in terms of reference words
                Nref  Ndel  Nins  Nsub  Nerr     %Del   %Ins   %Sub   %Err
         ALL       2     0     0  <ns>     0     0.00   0.00   <ns>   0.00
  
  IP detection statistics -- in terms of # of IPs
                Nref  Ndel  Nins  Nsub  Nerr     %Del   %Ins   %Sub   %Err
         ALL       2     0     0     0     0     0.00   0.00   0.00   0.00
  
  IP detection confusion matrix -- in terms of # of IPs
             ALL - ref\sys    filler        {Miss}
                    filler       2             0  
  
                      {FA}       0  
  
  IP word offset statistics for ALL data
    word offsets:  <-3     -3   -2   -1    0    1    2    3     >3
             BEG:    0      -    -    -    2    -    -    -      0
             END:    0      -    -    -    2    -    -    -      0
  
  *** Performance analysis for SUs ***
  
  SU (exact) end detection statistics -- in terms of reference words
                Nref  Ndel  Nins  Nsub  Nerr     %Del   %Ins   %Sub   %Err
         ALL       6     1     1  <ns>     2    16.67  16.67   <ns>  33.33
  
  SU detection statistics -- in terms of # of SUs
                Nref  Ndel  Nins  Nsub  Nerr     %Del   %Ins   %Sub   %Err
         ALL       6     0     0     0     0     0.00   0.00   0.00   0.00
  
  SU detection confusion matrix -- in terms of # of SUs
             ALL - ref\sys  backchan  question  statemen        {Miss}
               backchannel       1         0         0             0  
                  question       0         4         0             0  
                 statement       0         0         1             0  
  
                      {FA}       0         0         0  
  
  SU word offset statistics for ALL data
    word offsets:  <-3     -3   -2   -1    0    1    2    3     >3
             BEG:    0      -    -    -    6    -    -    -      0
             END:    0      -    -    1    5    -    -    -      0
  
  *** Performance analysis for Speaker Diarization ***
  
     TOTAL TIME =     16.99 secs
   TOTAL SPEECH =     16.99 secs (100.0 percent of total time)
    SCORED TIME =     16.99 secs (100.0 percent of total time)
  SCORED SPEECH =     16.99 secs (100.0 percent of scored time)
    TOTAL WORDS =     17        
   SCORED WORDS =     17         (100.0 percent of total words)
  ---------------------------------------------
  MISSED SPEECH =      0.00 secs (  0.0 percent of scored time)
  FALARM SPEECH =      0.00 secs (  0.0 percent of scored time)
   MISSED WORDS =      0         (  0.0 percent of scored words)
  ---------------------------------------------
  SCORED SPEAKER TIME =     16.99 secs (100.0 percent of scored speech)
  MISSED SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
  FALARM SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
   SPEAKER ERROR TIME =      0.00 secs (  0.0 percent of scored speaker time)
  SPEAKER ERROR WORDS =      0         (  0.0 percent of scored speaker words)
  ---------------------------------------------
   OVERALL SPEAKER DIARIZATION ERROR SCORE = 0.00 percent of scored speaker time
  ---------------------------------------------
   Speaker type confusion matrix -- speaker weighted
    REF\SYS (count)      child               unknown               MISS              
  child                     1 /  33.3%          0 /   0.0%          0 /   0.0%
  unknown                   0 /   0.0%          2 /  66.7%          0 /   0.0%
    FALSE ALARM             0 /   0.0%          0 /   0.0%
  ---------------------------------------------
   Speaker type confusion matrix -- time weighted
    REF\SYS (seconds)    child               unknown               MISS              
  child                  6.00 /  35.3%       0.00 /   0.0%       0.00 /   0.0%
  unknown                0.00 /   0.0%      10.99 /  64.7%       0.00 /   0.0%
    FALSE ALARM          0.00 /   0.0%       0.00 /   0.0%
  ---------------------------------------------