Blame view

README 4.22 KB
c8737cbd9   Jean-François Rey   bugfix info on re...
1
2
3
4
5
6
7
               ___ _____ __  __ _____ ____ ___    _      _     ___    _    
              / _ \_   _|  \/  | ____|  _ \_ _|  / \    | |   |_ _|  / \   
             | | | || | | |\/| |  _| | | | | |  / _ \   | |    | |  / _ \  
             | |_| || | | |  | | |___| |_| | | / ___ \  | |___ | | / ___ \ 
              \___/ |_| |_|  |_|_____|____/___/_/   \_\ |_____|___/_/   \_\
  
                                                
665a8dac3   Jean-François Rey   ! follow the whit...
8
9
10
11
12
  #---------------#   
  #  OTMEDIA LIA  #
  #    README     #
  #  version 1.0  #
  #---------------#
e6be5137b   Jean-François Rey   reinitialized pro...
13

c8737cbd9   Jean-François Rey   bugfix info on re...
14
15
16
17
18
19
20
21
22
23
24
  DESCRIPTION
  -----------
  
      OTMEDIA means "Observatoire Transmedia", its main objective is to study the evolution and transformation of the media world.
      The scientific objective of the project is the creation of a new generation of media observatory
      based on an interactive automatic analysis system (semi-automatic) transmedia to understand
      the world of information and developments.
  
      Web Site : http://www.otmedia.fr
  
      OTMEDIA LIA project is a set of tools to transcribe radio and TV shows.
665a8dac3   Jean-François Rey   ! follow the whit...
25
26
27
28
29
30
      It does multiple things :
      - First pass : default transcription with speeral and speaker diarization.
      - Second pass : speaker adaptation and a second transcription pass with speeral.
      - Confidence pass : calcul confidence measure from transcription output.
      - Exploit Confidence Measure : use SOLR DB data to extend the lexicon on low confidence measure and create trigg files.
      - Third pass : second pass using the new lexicon and trigg files.
5a14b8678   Jean-François Rey   update doc
31
     
ad76455b2   Jean-François Rey   up
32
      From GIT : http://gitlia.univ-avignon.fr/jean-francois.rey/otmedia
c8737cbd9   Jean-François Rey   bugfix info on re...
33
34
35
  
  DEPENDENCIES
  ------------
c8737cbd9   Jean-François Rey   bugfix info on re...
36

b9a54507e   Jean-François Rey   update bugfix + r...
37
38
39
40
41
  GNU Toolchain
          Available from : http://www.gnu.org
          and debian packages
  
          Compiling, linking, and building applications.
160656fdf   Jean-François Rey   update
42
          (g++ will be needed if you install scoring tools)
b9a54507e   Jean-François Rey   update bugfix + r...
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
  
  
  avconv (libav-tools >= 0.8)
      Available from : http://libav.org
      and debian package
  
      avconv is a very fast video and audio converter.
  
  JAVA JDK and JRE ( >= 6)
      Available from : http://www.oralce.com
      and debian packages
  
      JAVA Developpment kit and JAVA runtime environment.
  
  Python ( >= 2.7.0)
      Available from : http://http://www.python.org/
      and debian packages
  
      Python is a programming language.
  
  Perl ( >= 5.0.0)
      Available from : http://www.perl.org/
      and debian packages
  
      Perl is a programming language.
665a8dac3   Jean-François Rey   ! follow the whit...
68
  iconv ( >= 2.0.0)
b9a54507e   Jean-François Rey   update bugfix + r...
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
      Available from : http://www.gnu.org
      and debian package
  
      Character set conversion.
  
  csh shell (csh) 
      Available on debian packages.
  
      The C shell was originally written at UCB to overcome limitations in the
      Bourne shell.  Its flexibility and comfort (at that time) quickly made it
      the shell of choice until more advanced shells like ksh, bash, zsh or
      tcsh appeared.  Most of the latter incorporate features original to csh
  
  The SRI Language Modeling Toolkit (SRILM >= 1.6.0)
      Available from : http://www.speech.sri.com/projects/srilm/
  
      SRILM is a toolkit for building and applying statistical language models.
    
  Tomcat ( >= 7.0.0)
      Available from : http://tomcat.apache.org/
      and debian packages
  
      Apache Tomcat is an open source software implementation of the Java Servlet and JavaServer Pages technologies.    
c8737cbd9   Jean-François Rey   bugfix info on re...
92

160656fdf   Jean-François Rey   update
93
94
95
96
97
  libxml2-dev ( >= 2.7 ) [needed for scoring only]
      Available from http://www.xmlsoft.org/
      and debian packages
  
      Libxml2 is the XML C parser and toolkit.
1fa99e8a2   Jean-François Rey   add info in INSTA...
98
99
  INSTALL
  -------
c8737cbd9   Jean-François Rey   bugfix info on re...
100
101
102
103
104
  
      See the INSTALL file for the installation procedure.
  
      Quick install below.
   
665a8dac3   Jean-François Rey   ! follow the whit...
105
      Before launching installation :
c8737cbd9   Jean-François Rey   bugfix info on re...
106
107
   
        Be certain that all dependencies are satisfied.
665a8dac3   Jean-François Rey   ! follow the whit...
108
        Have 300 Go of free space for complet install.
c8737cbd9   Jean-François Rey   bugfix info on re...
109
110
111
112
       
      Issue the following commands to the shell :
      $> ./install.sh
      $> export OTMEDIA_HOME=path/to/OTMEDIA/directory
e6be5137b   Jean-François Rey   reinitialized pro...
113

665a8dac3   Jean-François Rey   ! follow the whit...
114
      Read SOLR.INSTALL part 3 to install SOLRDB.
b9a54507e   Jean-François Rey   update bugfix + r...
115
116
117
118
119
  
  RUNNING
  -------
  
      See HOWTO file.
e6be5137b   Jean-François Rey   reinitialized pro...
120

c8737cbd9   Jean-François Rey   bugfix info on re...
121
122
  ACKNOWLEDGEMENTS
  ----------------
b9a54507e   Jean-François Rey   update bugfix + r...
123
124
  
      Many thanks to Jean-François Rey for useful help and work done.
e6be5137b   Jean-François Rey   reinitialized pro...
125

c8737cbd9   Jean-François Rey   bugfix info on re...
126
127
  KNOWN BUGS
  ----------
b9a54507e   Jean-François Rey   update bugfix + r...
128

c8737cbd9   Jean-François Rey   bugfix info on re...
129
      Many.
665a8dac3   Jean-François Rey   ! follow the whit...
130
      For Bug report, please contact Pascal Nocera at pascal.nocera@univ-avignon.fr
e6be5137b   Jean-François Rey   reinitialized pro...
131

c8737cbd9   Jean-François Rey   bugfix info on re...
132
133
  COPYRIGHT
  ---------
b9a54507e   Jean-François Rey   update bugfix + r...
134

c8737cbd9   Jean-François Rey   bugfix info on re...
135
136
137
138
      See the COPYING file.
             
  AUTHORS
  -------
b9a54507e   Jean-François Rey   update bugfix + r...
139

c8737cbd9   Jean-François Rey   bugfix info on re...
140
141
142
      Jean-François Rey <jean-francois.rey@univ-avignon.fr>
      Hugo Mauchrétien <hugo.mauchretien@univ-avignon.fr>
      Emmanuel Ferreira <emmanuel.ferreira@univ-avignon.fr>
e6be5137b   Jean-François Rey   reinitialized pro...
143