md_test8.output.saved 18.3 KB
edit raw blame history



1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

64

65

66

67

68

69

70

71

72

73

74

75

76

77

78

79

80

81

82

83

84

85

86

87

88

89

90

91

92

93

94

95

96

97

98

99

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

229

230

231

232

233

234

235

236

237

238

239

240

241

242

243

244

245

246

247

248

249

250

251

252

253

254

255

256

257

258

259

260

261

262

263


command line (run on 2004 Oct 29 at 14:18:17):  ../src/md-eval-v19a.pl -af -e -D -d -W -w -t 1.0 -l 2 -u md_test8.uem -r md_test8.ref.rttm -s md_test8.sys.rttm

Word-based metadata alignment, max gap between matching words = 1.0 sec

Metadata evaluation parameters:
    word-optimized metadata mapping
        max gap between matching metadata events = 0.1 words
        max extent to match for SU's = 2 words

Speaker Diarization evaluation parameters:
    The max time to extend no-score zones for NON-LEX exclusions is 0.5 sec
    The no-score collar at SPEAKER boundaries is 0 sec

Exclusion zones for evaluation and scoring are:
                             -----MetaData-----        -----SpkrData-----
     exclusion set name:     DEFAULT    DEFAULT        DEFAULT    DEFAULT
     token type/subtype      no-eval   no-score        no-eval   no-score
             (UEM)              X                         X
         LEXEME/un-lex                    X                          
        NON-LEX/breath                                              X
        NON-LEX/cough                                               X
        NON-LEX/laugh                                               X
        NON-LEX/lipsmack                                            X
        NON-LEX/other                                               X
        NON-LEX/sneeze                                              X
        NOSCORE/<na>            X         X               X         X
 NO_RT_METADATA/<na>            X                                    
             SU/unannotated               X                          

Word alignment and scoring details for channel 1 of file md_test8
 ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       Rtbeg   Rtend     tbeg    tend sys-speaker 
   1   -   -   0        firstword lex       10.00   11.00 spkr1r              firstword lex     (  10.00   11.00)   10.00   11.00 spkr1s      
   1   -   -   0       secondword lex       11.00   12.00 spkr1r             secondword lex     (  11.00   12.00)   11.00   12.00 spkr1s      
   1   -   -   0        thirdword lex       12.00   13.00 spkr1r              thirdword lex     (  12.00   13.00)   12.00   13.00 spkr1s      
   1   -   -   0       fourthword lex       13.00   14.00 spkr1r             fourthword lex     (  13.00   14.00)   13.00   14.00 spkr1s      
   1   -   -   0        fifthword lex       14.00   15.00 spkr1r              fifthword lex     (  14.00   15.00)   14.00   15.00 spkr1s      
   1   -   -   0        sixthword lex       15.00   16.00 spkr2r              sixthword lex     (  15.00   16.00)   15.00   16.00 spkr2s      
   1   -   -   0      seventhword lex       16.00   17.00 spkr2r            seventhword lex     (  16.00   17.00)   16.00   17.00 spkr2s      
   1   -   -   1       eighthword lex       17.00   18.00 spkr2r             eighthword fp      (  17.00   18.00)   17.00   18.00 spkr2s      
   1   -   -   0        ninthword lex       18.00   19.00 spkr2r              ninthword lex     (  18.00   19.00)   18.00   19.00 spkr2s      
   1   -   -   0        tenthword lex       19.00   20.00 spkr2r              tenthword lex     (  19.00   20.00)   19.00   20.00 spkr2s      
   1   -   -   0     eleventhword lex       20.00   21.00 spkr2r           eleventhword lex     (  20.00   21.00)   20.00   21.00 spkr2s      

EDIT alignment and scoring details for channel 1 of file md_test8
 ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       Rtbeg   Rtend     tbeg    tend sys-speaker 
   1   -   -   0         revision EDIT      11.00   14.00 spkr1r               revision EDIT    (  11.00   14.00)   11.00   14.00 spkr1s      
   1   -   -   1          restart EDIT      16.00   19.00 spkr2r               revision EDIT    (  16.00   19.00)   16.00   19.00 spkr2s      

FILLER alignment and scoring details for channel 1 of file md_test8
 ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       Rtbeg   Rtend     tbeg    tend sys-speaker 
   1   -   -   0 explicit_editing FILLER    12.00   13.00 spkr1r       explicit_editing FILLER  (  12.00   13.00)   12.00   13.00 spkr1s      
   1   -   -   1 discourse_marker FILLER    17.00   18.00 spkr2r           filled_pause FILLER  (  17.00   18.00)   17.00   18.00 spkr2s      

IP alignment and scoring details for channel 1 of file md_test8
 ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       Rtbeg   Rtend     tbeg    tend sys-speaker 
   1   -   -   0      edit&filler IP        12.00   12.00 spkr1r            edit&filler IP      (  12.00   12.00)   12.00   12.00 spkr1s      
   1   -   -   0      edit&filler IP        17.00   17.00 spkr2r            edit&filler IP      (  17.00   17.00)   17.00   17.00 spkr2s      

SU alignment and scoring details for channel 1 of file md_test8
 ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       Rtbeg   Rtend     tbeg    tend sys-speaker 
   1   -   -   0        statement SU        10.00   15.00 spkr1r              statement SU      (  10.00   15.00)   10.00   15.00 spkr1s      
   1   -   -   0         question SU        15.00   21.00 spkr2r               question SU      (  15.00   21.00)   15.00   21.00 spkr2s      

Chronological display of sys data aligned with ref data for file 'md_test8', channel '1'
----------------------- reference ----------------------- | mapped | --------------------- system output ---------------------
    --type-- -subtyp- -----word/spkr-----  -tbeg-  -tend- | ref_ID |     --type-- -subtyp- -----word/spkr-----  -tbeg-  -tend-
beg SEGMENT  <na>     spkr1r                10.00         |        |
beg SPEAKER  child    spkr1r                10.00         |        |
                                            10.00         |        |beg SPEAKER  child    spkr1s=>spkr1r        10.00        
beg SU       statemen spkr1r                10.00         | SU1    |beg SU       statemen spkr1s=>spkr1r        10.00        
    LEXEME   lex      FIRSTWORD             10.00   11.00 | LX1    |    LEXEME   lex      FIRSTWORD             10.00   11.00
beg EDIT     revision spkr1r                11.00         | ED1    |beg EDIT     revision spkr1s=>spkr1r        11.00        
    LEXEME   lex      SECONDWORD             11.00   12.00 | LX2    |    LEXEME   lex      SECONDWORD             11.00   12.00
beg FILLER   explicit spkr1r                12.00         | FL1    |beg FILLER   explicit spkr1s=>spkr1r        12.00        
    IP       edit&fil spkr1r                12.00         | IP1    |    IP       edit&fil spkr1s=>spkr1r        12.00        
    LEXEME   lex      THIRDWORD             12.00   13.00 | LX3    |    LEXEME   lex      THIRDWORD             12.00   13.00
end FILLER   explicit spkr1r                        13.00 | FL1    |end FILLER   explicit spkr1s=>spkr1r                13.00
    LEXEME   lex      FOURTHWORD             13.00   14.00 | LX4    |    LEXEME   lex      FOURTHWORD             13.00   14.00
end EDIT     revision spkr1r                        14.00 | ED1    |end EDIT     revision spkr1s=>spkr1r                14.00
    LEXEME   lex      FIFTHWORD             14.00   15.00 | LX5    |    LEXEME   lex      FIFTHWORD             14.00   15.00
end SU       statemen spkr1r                        15.00 | SU1    |end SU       statemen spkr1s=>spkr1r                15.00
end SPEAKER  child    spkr1r                        15.00 |        |
                                                    15.00 |        |end SPEAKER  child    spkr1s=>spkr1r                15.00
end SEGMENT  <na>     spkr1r                        15.00 |        |
beg SEGMENT  <na>     spkr2r                15.00         |        |
beg SPEAKER  adult_fe spkr2r                15.00         |        |
                                            15.00         |        |beg SPEAKER  adult_fe spkr2s=>spkr2r        15.00        
beg SU       question spkr2r                15.00         | SU2    |beg SU       question spkr2s=>spkr2r        15.00        
    LEXEME   lex      SIXTHWORD             15.00   16.00 | LX6    |    LEXEME   lex      SIXTHWORD             15.00   16.00
beg EDIT     restart  spkr2r                16.00         | ED2    |beg EDIT     revision spkr2s=>spkr2r        16.00        
    LEXEME   lex      SEVENTHWORD             16.00   17.00 | LX7    |    LEXEME   lex      SEVENTHWORD             16.00   17.00
beg FILLER   discours spkr2r                17.00         | FL2    |beg FILLER   filled_p spkr2s=>spkr2r        17.00        
    IP       edit&fil spkr2r                17.00         | IP2    |    IP       edit&fil spkr2s=>spkr2r        17.00        
    LEXEME   lex      EIGHTHWORD             17.00   18.00 | LX8    |    LEXEME   fp       EIGHTHWORD             17.00   18.00
end FILLER   discours spkr2r                        18.00 | FL2    |end FILLER   filled_p spkr2s=>spkr2r                18.00
    LEXEME   lex      NINTHWORD             18.00   19.00 | LX9    |    LEXEME   lex      NINTHWORD             18.00   19.00
end EDIT     restart  spkr2r                        19.00 | ED2    |end EDIT     revision spkr2s=>spkr2r                19.00
    LEXEME   lex      TENTHWORD             19.00   20.00 | LX10   |    LEXEME   lex      TENTHWORD             19.00   20.00
    LEXEME   lex      ELEVENTHWORD             20.00   21.00 | LX11   |    LEXEME   lex      ELEVENTHWORD             20.00   21.00
end SU       question spkr2r                        21.00 | SU2    |end SU       question spkr2s=>spkr2r                21.00
end SPEAKER  adult_fe spkr2r                        21.00 |        |
                                                    21.00 |        |end SPEAKER  adult_fe spkr2s=>spkr2r                21.00
end SEGMENT  <na>     spkr2r                        21.00 |        |

*** Performance analysis for EDITs ***  overall error SCORE = 0.00%

EDIT word coverage statistics -- in terms of reference words
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               6       0     0     3     0.00   0.00  50.00     0.00  50.00

EDIT detection statistics -- in terms of # of EDITs
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               2       0     0     1     0.00   0.00  50.00     0.00  50.00
f=md_test8                         2       0     0     1     0.00   0.00  50.00     0.00  50.00

EDIT detection confusion matrix -- in terms of # of EDITs
           ALL - ref\sys   restart  revision        {Miss}
                 restart       0         1             0  
                revision       0         1             0  

                    {FA}       0         0  

EDIT word offset statistics for ALL data
  word offsets:  <-3     -3   -2   -1    0    1    2    3     >3
           BEG:    0      -    -    -    2    -    -    -      0
           END:    0      -    -    -    2    -    -    -      0

*** Performance analysis for FILLERs ***  overall error SCORE = 50.00%

FILLER word coverage statistics -- in terms of reference words
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               2       0     0     1     0.00   0.00  50.00     0.00  50.00

FILLER detection statistics -- in terms of # of FILLERs
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               2       0     0     1     0.00   0.00  50.00     0.00  50.00
f=md_test8                         2       0     0     1     0.00   0.00  50.00     0.00  50.00

FILLER detection confusion matrix -- in terms of # of FILLERs
           ALL - ref\sys  discours  explicit  filled_p        {Miss}
        discourse_marker       0         0         1             0  
   explicit_editing_term       0         1         0             0  
            filled_pause       0         0         0             0  

                    {FA}       0         0         0  

FILLER word offset statistics for ALL data
  word offsets:  <-3     -3   -2   -1    0    1    2    3     >3
           BEG:    0      -    -    -    2    -    -    -      0
           END:    0      -    -    -    2    -    -    -      0

*** Performance analysis for IPs ***  overall error SCORE = 0.00%

IP (exact) detection statistics -- in terms of reference words
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               2       0     0     0     0.00   0.00   0.00     0.00   0.00

IP detection statistics -- in terms of # of IPs
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               2       0     0     0     0.00   0.00   0.00     0.00   0.00
f=md_test8                         2       0     0     0     0.00   0.00   0.00     0.00   0.00

IP detection confusion matrix -- in terms of # of IPs
           ALL - ref\sys  edit&fil        {Miss}
             edit&filler       2             0  

                    {FA}       0  

IP word offset statistics for ALL data
  word offsets:  <-3     -3   -2   -1    0    1    2    3     >3
           BEG:    0      -    -    -    2    -    -    -      0
           END:    0      -    -    -    2    -    -    -      0

*** Performance analysis for SUs ***  overall error SCORE = 0.00%

SU (exact) end detection statistics -- in terms of reference words
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               2       0     0     0     0.00   0.00   0.00     0.00   0.00

SU detection statistics -- in terms of # of SUs
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               2       0     0     0     0.00   0.00   0.00     0.00   0.00
f=md_test8                         2       0     0     0     0.00   0.00   0.00     0.00   0.00

SU detection confusion matrix -- in terms of # of SUs
           ALL - ref\sys  question  statemen        {Miss}
                question       1         0             0  
               statement       0         1             0  

                    {FA}       0         0  

SU word offset statistics for ALL data
  word offsets:  <-3     -3   -2   -1    0    1    2    3     >3
           BEG:    0      -    -    -    2    -    -    -      0
           END:    0      -    -    -    2    -    -    -      0

*** Performance analysis for Speaker Diarization for f=md_test8 ***

    EVAL TIME =     11.00 secs
  EVAL SPEECH =     11.00 secs (100.0 percent of evaluated time)
  SCORED TIME =     11.00 secs (100.0 percent of evaluated time)
SCORED SPEECH =     11.00 secs (100.0 percent of scored time)
   EVAL WORDS =     11        
 SCORED WORDS =     11         (100.0 percent of evaluated words)
---------------------------------------------
MISSED SPEECH =      0.00 secs (  0.0 percent of scored time)
FALARM SPEECH =      0.00 secs (  0.0 percent of scored time)
 MISSED WORDS =      0         (  0.0 percent of scored words)
---------------------------------------------
SCORED SPEAKER TIME =     11.00 secs (100.0 percent of scored speech)
MISSED SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
FALARM SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
 SPEAKER ERROR TIME =      0.00 secs (  0.0 percent of scored speaker time)
SPEAKER ERROR WORDS =      0         (  0.0 percent of scored speaker words)
---------------------------------------------
 OVERALL SPEAKER DIARIZATION ERROR = 0.00 percent of scored speaker time  `(f=md_test8)
---------------------------------------------
 Speaker type confusion matrix -- speaker weighted
  REF\SYS (count)      adult_female        child                 MISS              
adult_female              1 /  50.0%          0 /   0.0%          0 /   0.0%
child                     0 /   0.0%          1 /  50.0%          0 /   0.0%
  FALSE ALARM             0 /   0.0%          0 /   0.0%
---------------------------------------------
 Speaker type confusion matrix -- time weighted
  REF\SYS (seconds)    adult_female        child                 MISS              
adult_female           6.00 /  54.5%       0.00 /   0.0%       0.00 /   0.0%
child                  0.00 /   0.0%       5.00 /  45.5%       0.00 /   0.0%
  FALSE ALARM          0.00 /   0.0%       0.00 /   0.0%
---------------------------------------------

*** Performance analysis for Speaker Diarization for ALL ***

    EVAL TIME =     11.00 secs
  EVAL SPEECH =     11.00 secs (100.0 percent of evaluated time)
  SCORED TIME =     11.00 secs (100.0 percent of evaluated time)
SCORED SPEECH =     11.00 secs (100.0 percent of scored time)
   EVAL WORDS =     11        
 SCORED WORDS =     11         (100.0 percent of evaluated words)
---------------------------------------------
MISSED SPEECH =      0.00 secs (  0.0 percent of scored time)
FALARM SPEECH =      0.00 secs (  0.0 percent of scored time)
 MISSED WORDS =      0         (  0.0 percent of scored words)
---------------------------------------------
SCORED SPEAKER TIME =     11.00 secs (100.0 percent of scored speech)
MISSED SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
FALARM SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
 SPEAKER ERROR TIME =      0.00 secs (  0.0 percent of scored speaker time)
SPEAKER ERROR WORDS =      0         (  0.0 percent of scored speaker words)
---------------------------------------------
 OVERALL SPEAKER DIARIZATION ERROR = 0.00 percent of scored speaker time  `(ALL)
---------------------------------------------
 Speaker type confusion matrix -- speaker weighted
  REF\SYS (count)      adult_female        child                 MISS              
adult_female              1 /  50.0%          0 /   0.0%          0 /   0.0%
child                     0 /   0.0%          1 /  50.0%          0 /   0.0%
  FALSE ALARM             0 /   0.0%          0 /   0.0%
---------------------------------------------
 Speaker type confusion matrix -- time weighted
  REF\SYS (seconds)    adult_female        child                 MISS              
adult_female           6.00 /  54.5%       0.00 /   0.0%       0.00 /   0.0%
child                  0.00 /   0.0%       5.00 /  45.5%       0.00 /   0.0%
  FALSE ALARM          0.00 /   0.0%       0.00 /   0.0%
---------------------------------------------