Blame view

tools/sctk-2.4.10/CHANGELOG 16 KB
8dcb6dfcb   Yannick Estève   first commit
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
  SCTK Version 2.4.10 - Release October 15, 2014
    [*] Added the case conversion for Babel's UTF-8 of Mongolian and Guarani data.  Changes for 
        Guarani include handling of two-UTF characters representing G~.
  
  SCTK Version 2.4.9 - Release October 15, 2014
    [*] Removed the use of getopt in acomp.pl and def_art.pl
    [*] rfilter typdef reassignment to char to an unsigned char replaced with a new typedef 'Char'
        to support new GCC.
    [*] Added the sclite_tolower utility to handle case conversion via sclite's library.
    [*] Added the case conversion for Babel's UTF-8 of Kazakh, Cebuano, Kurmanji, and Lithuanian
  
  SCTK Version 2.4.8 - Release March 29, 2013
    [*] Added the case conversion for Babel's UTF-8 of Vietnamese and Turkisoh.
  
  SCTK Version 2.4.1 - Release March 12, 2012
    [*] Fixed compilation issue
  
  SCTK Version 2.4.0 - Release November 10, 2009
    [*] Fixed asclite to handle rttm properly when _ are present in the ref
    [/] The transcript filter, csrfilt.sh, now only modifies the transcript during
         the filter process rather than up-casing all ASCII characters.
    [/] The scoring routines that relied on up-cased text were modified to accept
         the change.
    [/] Hubscr uem filtering is applied to asclite and mdeval 
    [-] removed unwanted gif images for align2html
  
  SCTK Version 2.3.11 - Release April 8, 2009
    [+] rttmValidator can handle RTTM slat and RTTM without SPKR-INFO
  
  SCTK Version 2.3.10 - Release March 6, 2009
    [*] Fixed tests
  
  SCTK Version 2.3.9 - Release March 5, 2009
    [*] Compile with gcc 4.3.
  
  SCTK Version 2.3.8 - Release March 4, 2009
    [*] Fix slat test when gnuplot is not installed.
  
  SCTK Version 2.3.7 - Release March 3, 2009
    [*] Revisite slatreport.
  
  SCTK Version 2.3.6 - Release February 26, 2009
    [*] Fixed install and makefile.
  
  SCTK Version 2.3.5 - Release February 26, 2009
    [*] Fixed slatreport makefile behavior.
  
  SCTK Version 2.3.4 - Release February 24, 2009
    [/] Added SLAT report computation and doc updates.
  
  SCTK Version 2.3.3 - Release February 20, 2009
    [+] Added SLAT report and handling by asclite.
  
  SCTK Version 2.3.2 - Release January 12, 2009
    [/] Changed BZip2 to LZMA.
  
  SCTK Version 2.3.1 - Release January 6, 2009
    [/] Changed miniLZO to BZip2.
  
  SCTK Version 2.3 - Release September 5, 2008
    [+] Code optimization.
    [+] Added label and category information in asclite SGML for STM files. 
  
  SCTK Version 2.3-rc2c - Release May 8, 2008
    [+] Check if the character " is not in a RTTM file.
    [/] Changed the sclite compilation flags.
  
  SCTK Version 2.3-rc2b - Release May 6, 2008
    [*] Fixed an asclite bug went more than 32768 segment are added to the structure.
    [/] Changed the asclite compilation flags.
  
  SCTK Version 2.3-rc2 - Release November 26, 2007
    [*] Fixed the exit code, for a fatal error, it exists with a non 0 value.
    [/] Change the error/Info message when filtering.
    [+] Added information in the documentation for asclite regarding the 
        UEM filtering.
  
  SCTK Version 2.3-rc1 - Release September 21, 2007
    [*] Fixed asclite when no input given
    [*] Fixed mapsswe segmentation code 
    [+] Added UEM inplementation directly to asclite
    [+] -wordalign-cost options for asclite adding some cost variance
        depending in the word syntactic distance
    [+] Added Time base cost model trigger with '-time-base-cost' in asclite
    [+] Generic/Rover full implementation in asclite
    [+] Added cumulative table to HTML reports
    [+] Added man and HTML docs for tools
    [/] Changed the speaker auto-overlap behavior from internal to command-
        line options for asclite
    [/] Inter Segment Gaps by default for every file type in asclite
    [/] Update headers disclaimer and file format
    [+] Added rttmValidator
    [+] Added stmValidator
    [+] Added ctmValidator
    
  SCTK Version 2.2.4 - Release May 1, 2007
    [+] Added block size and force compression in hubscr.pl
    [+] Added block size limit to 1GB
    [+] Added block computation less aggressive to allow bigger block size
    [*] Fixed linking issue for RTTM and STM files in cause of empty segment
    [*] Fixed display issue with align2html
  
  SCTK Version 2.2.3 - Release April 18, 2007
    [*] Fix the time computation in html report when ISG vs. empty segment
    [/] Fix computation percentage for this table of html display
    [/] Better handling on exceptions to drop segment groups from alignments
    [/] Change some outputs to debug level instead of info level
    [/] Better handling of the memory threshold
    [/] Changed from ucl library to lzo library for compression
    [+] Safer memory deallocation in the LCM
    [+] Minimum command line option to drop the small alignments
    
  SCTK Version 2.2.2 - Release March 23, 2007
    [/] Speed up the use of Compression Matrix by not using System Timer
    [/] New implementation of the compressed Levenshtein Matrix to avoid
        overhead creation
    [*] Fix the output report file generation for asclite
  
  SCTK Version 2.2.1 - Release March 13, 2007
    [*] Fixed the sequence number in SGML when using RTTM Refs
    [+] Added Difficulty limit
  
  SCTK Version 2.2.0 - Release March 12, 2007
    [*] test: fixed some tests
    [*] core: fixed align2html display
    [+] core: added compression to align bigger matrices
    [+] core: added pruning using time
  
  SCTK Version 2.1.7 - Release February 22, 2007
    [*] test: fixed some tests
    [*] core: fixed floating point issue
  
  SCTK Version 2.1.6 - Release February 21, 2007
    [+] tools: new display for align2html
    [*] test: fixed some tests
    [*] core: fixed sorting issues
  
  SCTK Version 2.1.5 - Release February 9, 2007
    [+] core: asclite handles speakers preferences for alignment
    [+] core: asclite handles rttm as references and hypotheses
    [+] core: asclite has a new option '-adaptive-cost' that restricts the
              alignment with the time
    [+] core: asclite has a new option '-spkr-align' that handle the new
              SASTT alignment requirement for speaker mapping
    [+] core: asclite output a new level of log displaying the alignment 
              and the group of segments used for the alignment
    [+] core: asclite has a new error output: speaker error
    [+] tools: mergectm2rttm script to merged ctm into rttm file
    [+] tools: md-eval for speaker alignment
    [+] tools: align2html to display graphical alignment in HTML/Javascript
    [*] core: fix bugs that ignore tokens and segments in the alignments
    [*] core: fix the pruning time based algorithm
    [*] core: fix the floating point bug in asclite
    [/] core: asclite options '-time-prune' and '-word-time-align' require 
              milliseconds instead of seconds in the commandline
  
  SCTK Version 2.1.4 - Release May 19, 2006
    [+] Updated asclite to handle 64-bits
    [+] Updated asclite with caching information (increase speed +5-10%)
    
  SCTK Version 2.1.3 - Release April 21, 2006
    [+] Updated toolkit to handle rttm file format
  
  SCTK Version 2.1.2 - Release April 4, 2006
    [*] Updated asclite with bugfix and memory usage
    
  SCTK Version 2.1.1 - Release December 5, 2005
    [*] Hotfix 64bits and Alpha platforms (Thanks to David Huggins-Daines).
  
  SCTK Version 2.1 - Release June 3, 2005
    [*] Fixed csrfilt.sh to respect hyphens in optionally deletable fragments.	
   
  SCTK Version 2.0 Beta 6 - Release May 2005
    [+] Added asclite
    
  SCTK Version 2.0 Beta 5 - Release October 22, 2004
    [+] Added Mandarin hesitations to chfilt.pl and utf_filt.pl
  
  SCTK Version 2.0 Beta 4 - Release October 18, 2004
    [/] Modified tanweenFilt to remove all tanween characters.
    [*] Fixed a bug in tanweenFilt and hamzaNorm to also change optionally
        deletable words.   
  
  SCTK Version 2.0 Beta 3 - Release October 5, 2004
    [/] Fixed the install for rfilter1
    [+] Added an option to chfilt.pl to map channel id's A or B to 1 and 2 respectively.
    [+] Added a new program tanweenFilt.pl
  
  SCTK Version 2.0 Beta 1 - Release August 30, 2004
    [/] Restructured release directories.  'src' now includes many programs
    [+] Integrated in all programs in the 'tranfilt' package
    [*] make install now makes move programs to 'bin'
    [+] hamzaNorm is a new program.
    [/] 2004 Arabic transcripts are now handled by hubscr and chfilt.
  
  SCTK Version 1.3 - Release July 30, 2004
    [*] Minor bug fixes for core dumps
    [+] Added the ability to pass two tags attached to each word through the
        scorer.   The tags are attached to the words by appending ';<string>'
        to the word's text.  There can be up to two tags, and they may be empty.
    [+] Added a '#' after NCE values in the .sys reports to indicate the
        abscence of reference lexemes for a speaker.
    [/] Expanded the buffers in the rpg.c suite of routines for report generation.
    [/] Expanded the maximum alternation size to 10000 characters.
    [+] Added a "Lattice" error rate calculation in the .prn reports.  It's the
        percent of reference tokens not correct in any systems transcript.
  
  SCTK Version 1.2c - Release October 11, 2000
    [/] Improved installation targets in makefile
  
  SCTK Version 1.2b - Release October 1, 2000
    [/] Improved testing code to not report errors under Linux
     
  SCTK Version 1.2a
    [*] Fixed an installation problem for Linux involving scfp.
  
  SCTK Version 1.2
    [+] added the prn report to sc_stats.   Prints N-system alignments together.
    [+] Added option alignment by word-weighted-mediated alignments.
    [/] Weight inputs include wwl file (-w) and LM file (-L).
    [+] Added testing scripts and documentation examples.
    [+] Added the .wws output format.
    [/] Update .prf output to include word weights and other information.
    [+] Add SLM toolkit v2 into the sctk package.
    [/] modified config.in, makefile.in and the installation process
    [/] Various internal structures modified to handle word weights.
    [/] Compiles under Linux using gmake.
    [/] Documetation changes, including additional comments concerning the 
        waveform id in the STM and CTM file formats.
  
  SCTK Version 1.1 - Release November 13, 1997
    [/] Utility versions in this release: sclite V2.1, sc_stats V1.1
    [+] added the Executive and Raw Executive Summaries to sc_stats.
    [+] added the det curve to sc_stats so that combined plots are produced.
    [/] modified mapsswe test to handle arbitrary number of segments
    [*] Correct a bug in mtchprs.c which was free-ing a the test confindence
        array prematurely.
  
  SCTK Version 1.0
    [/] Release under new name SCTK
    [/] Modified the label extraction function 'parse_input_comment_line'
        to ignore duplicate LABEL and CATEGORY lines.
    [+] Added a sequence number to each PATH in alignment sequence so
        that the input sequence of alignments can be reconstructed.
    [+] Added the capability to keep track of reference confidence scores
        when aligning ref ctm's against hyp ctm's.
    [*] Corrected the .pre dump of the alignment structure when the case
        sensitive flag is set.  The error was introduced by modifications.
    [*] Fixed a problem in TEXT_strcasecmp().  It failed to handle the
        case where str1 was shorter than srt2.
    [*] Fixed a problem in 'align.c/extract_speaker()' a NULL was not
        terminating each newly extracted speaker id.
    [/] Revised the reports lut, sum, snt, spkr,ovr to handle speakers W/o
        any reference tokens, In the sum report, the speakers W/o	
        reference tokens are ignored when computing the speaker
        mean, sd, and median.
    [*] fixed a  bug in tcslite.sh which output an error when test 5 was
        run and the use of gnudiff was not compiled in to sclite.
    [*] fixed a bug in config.in which was propagated to config.sh.  The 
        problem was a missing backquote on "uname -s".
    [+] Added error checking to the ctm2ctm alignment module.  No checking
        had been performed to make sure the ref and hyp files had the 
        same conversations and channels.
    [/] Fixed a problem in 'expand_words_to_chars()' it was not deleting
        hyphens from single character words do to an incorrect conditional.
    [+] Added a new way to score, 'Optionally Deletable'.  This required a
        major set of modifications and generalizations.
    [/] Modified the character scoring proceedure so that confidence scores
        are imputed to the sub-characters making up the word.
    [*] Corrected a bug in Compute_ROC:det.c which incorrectly incremented
        pointers.
  
  SCLITE Version 1.4a - Release May 29, 1997
    [/] Cleaned the distribution to be ISO-9669 compatable
  
  SCLITE Version 1.4 - Release October 18, 1996
    [/] Forced confidence values to flow through the entire data pipeline.
    [+] Added the '-C' option to include 'normalized cross-entropy'
        statistics in all output files.
    [+] Added algo2 for the inferred segmentation option '-S'
    [+] Added "IGNORE_TIME_SEGMENT_IN_SCORING" as an allowable 
        transcript for an stm record.  See the stm file documentation for
        it's use.
  
  SCLITE Version 1.3 - Release April 22, 1996
    [*] Corrected a minor makefile inconsistency. (One file was compiled 
        twice).
    [/] Changed Network_dp_align to optionally include NULLS in the output.
    [/] Changed the -m option to now reduce either the reference or 
        hypothesis file, or both before alignment takes place.
    [*] fixed an uninitialized variable in alex.c which became apparent
        in the 'dtl' and 'spk' reports.
    [*] Corrected a argument passed to fill_STM_structure() in stm2ctm.c
        which caused a warning on some compilers.
    [+] Added a bug report proceedures.
  
  SCLITE Version 1.2a - Release March 15, 1996
    [*] Forgot one minor file in the distrubution, "sclite.c".
  
  SCLITE Version 1.2 - Release March 8, 1996
    [*] Corrected a bug in the lur report that was activated if a speaker
        had no reference words, but had errorneously hypothesized words.
    [+] Added the sent, spk, and ovrdtl reports to sclite.
    [+] Added the option to score CTM to CTM files.  This is essentially
        the same code used for the first SWB LVCSR evaluation, however, since
        the new network alignment routines were used unifying the alignment
    	   into a single step, alignments will differ slightly from those 
        generated with the old scoring package.
    [+] Added the "-T" option to do time-mediated alignments.
    [-] Removed the size limitations in the report generation software,
        'rpg.c'.  There is still are hard limit on the length of characters
        for each cell of 200.
    [/] Standardize program exit codes to be 0 for successfull execution
        and 1 for failed execution.
    [*] Correct the handling of NULL alternatives in the hypothesis file.
        Scoring reference to hypothesis yields the same error rates as
        scoring hypothesis to reference.  The only difference is insertions
        are swapped with deletions.
    [+] The installer now has the option to enable or disable alignments
        via GNU's diff.
    [+] Added informative error messages when label definitions, which are
        used by the 'lur' report, have been improperly specified.
  
  SCLITE Version 1.1 - Release September 27, 1995
    [+] New/modified output options:
        * Added options to '-o':  'none' to not make any reports,
  	     'sgml' to create an sgml file for alignments, 'lur' for the
  	     labeled utterance report.
        * '-p'.  Pipes output of alignments to other sclite utilities.
         in the sgml format.
    [+] New Input options:
        * '-P' accepts piped sgml format input from other sclite utilities.
        * '-e' identifies the input character encoding.
    [+] New alignment options:
        * '-S' performs an inferred word segmentation algorithm rather
          than using the word segmentation of the reference and hyp files.
        * '-F' aligns fragments to words with matching substrings and scores
          them as correct.
        * Changed the -c option to include the optional flag "ASCIITOO"
          which also splits ascii words when doing a character alignment.
          Also added another flag, "DH", to delete hyphens from the ref and
          hyp transcripts before alingment.
    [*] Fixes and Changes:
        * Modified the '-n' option to handle multiple hyp files.
        * Fixed a bug in 'parse_stm_line' to handle empty texts.
        * Modified the read function for a CTM file so that any length
          file will be properly read in.
    [/] Compiled and tested using the HP-UX and DEC OSF1 native cc
        compilers.
  
  SCLITE Version 1.0 - Release July 27, 1995
    [+] Initial Release