sd_test2.output.saved 15.8 KB
edit raw blame history



1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

64

65

66

67

68

69

70

71

72

73

74

75

76

77

78

79

80

81

82

83

84

85

86

87

88

89

90

91

92

93

94

95

96

97

98

99

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211


command line (run on 2004 Oct 29 at 14:13:44):  ../src/md-eval-v19a.pl -1 -v -e -d -D -m -af -c 0.0 -T 0.0 -u sd_test2.uem -r sd_test2.ref.rttm -s sd_test2.sys.rttm

Time-based metadata alignment

Metadata evaluation parameters:
    time-optimized metadata mapping
        max gap between matching metadata events = 0.0 sec
        max extent to match for SU's = 0.5 sec

Speaker Diarization evaluation parameters:
    The max time to extend no-score zones for NON-LEX exclusions is 0.5 sec
    The no-score collar at SPEAKER boundaries is 0.0 sec

Exclusion zones for evaluation and scoring are:
                             -----MetaData-----        -----SpkrData-----
     exclusion set name:     DEFAULT    DEFAULT        DEFAULT    DEFAULT
     token type/subtype      no-eval   no-score        no-eval   no-score
             (UEM)              X                         X
         LEXEME/un-lex                    X                          
        NON-LEX/breath                                              X
        NON-LEX/cough                                               X
        NON-LEX/laugh                                               X
        NON-LEX/lipsmack                                            X
        NON-LEX/other                                               X
        NON-LEX/sneeze                                              X
        NOSCORE/<na>            X         X               X         X
 NO_RT_METADATA/<na>            X                                    
             SU/unannotated               X                          

Word alignment and scoring details for channel 1 of file sd_test2
 ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       tbeg    tend speaker     
   1   -   -   0        firstword lex       10.00   11.00 spkr1r              firstword lex       10.00   11.00 spkr1s      
   1   -   -   0       secondword lex       11.00   11.80 spkr1r             secondword lex       11.00   11.80 spkr1s      
   1   -   -   0        thirdword lex       13.20   14.00 spkr2r              thirdword lex       13.20   14.00 spkr2s      
   1   -   -   0       fourthword lex       14.00   15.00 spkr2r             fourthword lex       14.00   15.00 spkr2s      
   1   -   -   0        fifthword lex       15.00   16.00 spkr3r              fifthword lex       15.00   16.00 spkr3s      
   1   -   -   0        sixthword lex       19.00   20.00 spkr3r              sixthword lex       19.00   20.00 spkr3s      
   1   -   -   0      seventhword lex       20.00   20.60 spkr4r            seventhword lex       20.00   20.60 spkr4s      
   1   -   -   0       eighthword lex       22.40   23.00 spkr4r             eighthword lex       22.40   23.00 spkr4s      
   1   -   -   0        ninthword lex       23.00   24.00 spkr4r              ninthword lex       23.00   24.00 spkr4s      

SU alignment and scoring details for channel 1 of file sd_test2
 ref del ins sub      REF:  token type       tbeg    tend speaker           SYS:  token type       tbeg    tend speaker     
   1   -   -   0        statement SU        10.00   14.00 spkr1r              statement SU        10.00   14.00 spkr1s      
   1   -   -   0        statement SU        11.90   14.00 spkr2r              statement SU        11.90   14.00 spkr2s      
   1   -   -   0        statement SU        14.00   15.00 spkr2r              statement SU        14.00   15.00 spkr2s      
   1   -   -   0        statement SU        15.00   20.00 spkr3r              statement SU        15.00   20.00 spkr3s      
   1   -   -   0      backchannel SU        20.00   24.00 spkr4r            backchannel SU        20.00   24.00 spkr4s      
'spkr1r' => 'spkr1s'
     1.90 secs matched to 'spkr1s'
'spkr2r' => 'spkr2s'
     3.10 secs matched to 'spkr2s'
'spkr3r' => 'spkr3s'
     5.00 secs matched to 'spkr3s'
'spkr4r' => 'spkr4s'
     4.00 secs matched to 'spkr4s'
beg/dur/end =  10.000/  1.900/ 11.900; REF = (spkr1r); SYS = (spkr1s)
beg/dur/end =  13.200/  1.800/ 15.000; REF = (spkr2r); SYS = (spkr2s)
beg/dur/end =  15.000/  1.500/ 16.500; REF = (spkr3r); SYS = (spkr3s)
beg/dur/end =  18.500/  1.500/ 20.000; REF = (spkr3r); SYS = (spkr3s)
beg/dur/end =  20.000/  0.600/ 20.600; REF = (spkr4r); SYS = (spkr4s)
beg/dur/end =  22.400/  1.600/ 24.000; REF = (spkr4r); SYS = (spkr4s)

Chronological display of sys data aligned with ref data for file 'sd_test2', channel '1'
----------------------- reference ----------------------- | mapped | --------------------- system output ---------------------
    --type-- -subtyp- -----word/spkr-----  -tbeg-  -tend- | ref_ID |     --type-- -subtyp- -----word/spkr-----  -tbeg-  -tend-
beg SEGMENT  <na>     spkr1r                10.00         |        |
beg SPEAKER  child    spkr1r                10.00         |        |
                                            10.00         |        |beg SPEAKER  child    spkr1s=>spkr1r        10.00        
beg SU       statemen spkr1r                10.00         | SU1    |beg SU       statemen spkr1s=>spkr1r        10.00        
    LEXEME   lex      FIRSTWORD             10.00   11.00 | LX1    |    LEXEME   lex      FIRSTWORD             10.00   11.00
    LEXEME   lex      SECONDWORD             11.00   11.80 | LX2    |    LEXEME   lex      SECONDWORD             11.00   11.80
end SPEAKER  child    spkr1r                        11.90 |        |
                                                    11.90 |        |end SPEAKER  child    spkr1s=>spkr1r                11.90
beg SPEAKER  adult_ma spkr2r                11.90         |        |
                                            11.90         |        |beg SPEAKER  adult_ma spkr2s=>spkr2r        11.90        
beg SU       statemen spkr2r                11.90         | SU2    |beg SU       statemen spkr2s=>spkr2r        11.90        
    NON-LEX  sneeze   AHTCHOO             12.00   13.00 |        |
    LEXEME   lex      THIRDWORD             13.20   14.00 | LX3    |    LEXEME   lex      THIRDWORD             13.20   14.00
end SU       statemen spkr1r                        14.00 | SU1    |
end SU       statemen spkr2r                        14.00 | SU2    |
                                                    14.00 | SU1    |end SU       statemen spkr1s=>spkr1r                14.00
                                                    14.00 | SU2    |end SU       statemen spkr2s=>spkr2r                14.00
beg SU       statemen spkr2r                14.00         | SU3    |beg SU       statemen spkr2s=>spkr2r        14.00        
    LEXEME   lex      FOURTHWORD             14.00   15.00 | LX4    |    LEXEME   lex      FOURTHWORD             14.00   15.00
end SU       statemen spkr2r                        15.00 | SU3    |end SU       statemen spkr2s=>spkr2r                15.00
end SPEAKER  adult_ma spkr2r                        15.00 |        |
                                                    15.00 |        |end SPEAKER  adult_ma spkr2s=>spkr2r                15.00
end SEGMENT  <na>     spkr1r                        15.00 |        |
beg SEGMENT  <na>     spkr3r                15.00         |        |
beg SPEAKER  unknown  spkr3r                15.00         |        |
                                            15.00         |        |beg SPEAKER  unknown  spkr3s=>spkr3r        15.00        
beg SU       statemen spkr3r                15.00         | SU4    |beg SU       statemen spkr3s=>spkr3r        15.00        
    LEXEME   lex      FIFTHWORD             15.00   16.00 | LX5    |    LEXEME   lex      FIFTHWORD             15.00   16.00
    NON-LEX  breath   HHHHHHH             17.00   18.00 |        |
    LEXEME   other    SIXTHWORD             19.00   20.00 | LX6    |    LEXEME   other    SIXTHWORD             19.00   20.00
end SU       statemen spkr3r                        20.00 | SU4    |end SU       statemen spkr3s=>spkr3r                20.00
end SPEAKER  unknown  spkr3r                        20.00 |        |
                                                    20.00 |        |end SPEAKER  unknown  spkr3s=>spkr3r                20.00
end SEGMENT  <na>     spkr3r                        20.00 |        |
beg SEGMENT  <na>     spkr4r                20.00         |        |
beg SPEAKER  adult_fe spkr4r                20.00         |        |
                                            20.00         |        |beg SPEAKER  adult_fe spkr4s=>spkr4r        20.00        
beg SU       backchan spkr4r                20.00         | SU5    |beg SU       backchan spkr4s=>spkr4r        20.00        
    LEXEME   lex      SEVENTHWORD             20.00   20.60 | LX7    |    LEXEME   lex      SEVENTHWORD             20.00   20.60
    NON-LEX  laugh    HAHHAH             21.00   22.00 |        |
    LEXEME   lex      EIGHTHWORD             22.40   23.00 | LX8    |    LEXEME   lex      EIGHTHWORD             22.40   23.00
    LEXEME   lex      NINTHWORD             23.00   24.00 | LX9    |    LEXEME   lex      NINTHWORD             23.00   24.00
end SU       backchan spkr4r                        24.00 | SU5    |end SU       backchan spkr4s=>spkr4r                24.00
end SPEAKER  adult_fe spkr4r                        24.00 |        |
                                                    24.00 |        |end SPEAKER  adult_fe spkr4s=>spkr4r                24.00
end SEGMENT  <na>     spkr4r                        24.00 |        |

*** Performance analysis for SUs ***  overall error SCORE = 0.00%

SU (exact) end detection statistics -- in terms of reference words
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               5       0     0     0     0.00   0.00   0.00     0.00   0.00

SU detection statistics -- in terms of # of SUs
                                Nref    Ndel  Nins  Nsub     %Del   %Ins   %Sub     %D+I   %Tot
                 ALL               5       0     0     0     0.00   0.00   0.00     0.00   0.00
f=sd_test2                         5       0     0     0     0.00   0.00   0.00     0.00   0.00

SU detection confusion matrix -- in terms of # of SUs
           ALL - ref\sys  backchan  statemen        {Miss}
             backchannel       1         0             0  
               statement       0         4             0  

                    {FA}       0         0  

SU word offset statistics for ALL data
  word offsets:  <-3     -3   -2   -1    0    1    2    3     >3
           BEG:    0      -    -    -    5    -    -    -      0
           END:    0      -    -    -    5    -    -    -      0

*** Performance analysis for Speaker Diarization for f=sd_test2 ***

    EVAL TIME =     14.00 secs
  EVAL SPEECH =     14.00 secs (100.0 percent of evaluated time)
  SCORED TIME =      8.90 secs ( 63.6 percent of evaluated time)
SCORED SPEECH =      8.90 secs (100.0 percent of scored time)
   EVAL WORDS =      9        
 SCORED WORDS =      9         (100.0 percent of evaluated words)
---------------------------------------------
MISSED SPEECH =      0.00 secs (  0.0 percent of scored time)
FALARM SPEECH =      0.00 secs (  0.0 percent of scored time)
 MISSED WORDS =      0         (  0.0 percent of scored words)
---------------------------------------------
SCORED SPEAKER TIME =      8.90 secs (100.0 percent of scored speech)
MISSED SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
FALARM SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
 SPEAKER ERROR TIME =      0.00 secs (  0.0 percent of scored speaker time)
SPEAKER ERROR WORDS =      0         (  0.0 percent of scored speaker words)
---------------------------------------------
 OVERALL SPEAKER DIARIZATION ERROR = 0.00 percent of scored speaker time  `(f=sd_test2)
---------------------------------------------
 Speaker type confusion matrix -- speaker weighted
  REF\SYS (count)      adult_female        adult_male          child               unknown               MISS              
adult_female              1 /  25.0%          0 /   0.0%          0 /   0.0%          0 /   0.0%          0 /   0.0%
adult_male                0 /   0.0%          1 /  25.0%          0 /   0.0%          0 /   0.0%          0 /   0.0%
child                     0 /   0.0%          0 /   0.0%          1 /  25.0%          0 /   0.0%          0 /   0.0%
unknown                   0 /   0.0%          0 /   0.0%          0 /   0.0%          1 /  25.0%          0 /   0.0%
  FALSE ALARM             0 /   0.0%          0 /   0.0%          0 /   0.0%          0 /   0.0%
---------------------------------------------
 Speaker type confusion matrix -- time weighted
  REF\SYS (seconds)    adult_female        adult_male          child               unknown               MISS              
adult_female           2.20 /  24.7%       0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%
adult_male             0.00 /   0.0%       1.80 /  20.2%       0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%
child                  0.00 /   0.0%       0.00 /   0.0%       1.90 /  21.3%       0.00 /   0.0%       0.00 /   0.0%
unknown                0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%       3.00 /  33.7%       0.00 /   0.0%
  FALSE ALARM          0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%
---------------------------------------------

*** Performance analysis for Speaker Diarization for ALL ***

    EVAL TIME =     14.00 secs
  EVAL SPEECH =     14.00 secs (100.0 percent of evaluated time)
  SCORED TIME =      8.90 secs ( 63.6 percent of evaluated time)
SCORED SPEECH =      8.90 secs (100.0 percent of scored time)
   EVAL WORDS =      9        
 SCORED WORDS =      9         (100.0 percent of evaluated words)
---------------------------------------------
MISSED SPEECH =      0.00 secs (  0.0 percent of scored time)
FALARM SPEECH =      0.00 secs (  0.0 percent of scored time)
 MISSED WORDS =      0         (  0.0 percent of scored words)
---------------------------------------------
SCORED SPEAKER TIME =      8.90 secs (100.0 percent of scored speech)
MISSED SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
FALARM SPEAKER TIME =      0.00 secs (  0.0 percent of scored speaker time)
 SPEAKER ERROR TIME =      0.00 secs (  0.0 percent of scored speaker time)
SPEAKER ERROR WORDS =      0         (  0.0 percent of scored speaker words)
---------------------------------------------
 OVERALL SPEAKER DIARIZATION ERROR = 0.00 percent of scored speaker time  `(ALL)
---------------------------------------------
 Speaker type confusion matrix -- speaker weighted
  REF\SYS (count)      adult_female        adult_male          child               unknown               MISS              
adult_female              1 /  25.0%          0 /   0.0%          0 /   0.0%          0 /   0.0%          0 /   0.0%
adult_male                0 /   0.0%          1 /  25.0%          0 /   0.0%          0 /   0.0%          0 /   0.0%
child                     0 /   0.0%          0 /   0.0%          1 /  25.0%          0 /   0.0%          0 /   0.0%
unknown                   0 /   0.0%          0 /   0.0%          0 /   0.0%          1 /  25.0%          0 /   0.0%
  FALSE ALARM             0 /   0.0%          0 /   0.0%          0 /   0.0%          0 /   0.0%
---------------------------------------------
 Speaker type confusion matrix -- time weighted
  REF\SYS (seconds)    adult_female        adult_male          child               unknown               MISS              
adult_female           2.20 /  24.7%       0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%
adult_male             0.00 /   0.0%       1.80 /  20.2%       0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%
child                  0.00 /   0.0%       0.00 /   0.0%       1.90 /  21.3%       0.00 /   0.0%       0.00 /   0.0%
unknown                0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%       3.00 /  33.7%       0.00 /   0.0%
  FALSE ALARM          0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%       0.00 /   0.0%
---------------------------------------------