lib/rir/corpus.rb 1.27 KB
b843bae6b   Romain Deveaud   new files + tests
  #!/usr/bin/env ruby
  
  # This file is a part of an Information Retrieval oriented Ruby library
  #
  # Copyright (C) 2010-2011 Romain Deveaud <romain.deveaud@gmail.com>
  #
  # This program is free software: you can redistribute it and/or modify
  # it under the terms of the GNU General Public License as published by
  # the Free Software Foundation, either version 3 of the License, or
  # (at your option) any later version.
  #
  # This program is distributed in the hope that it will be useful,
  # but WITHOUT ANY WARRANTY; without even the implied warranty of
  # MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
  # GNU General Public License for more details.
  #
  # You should have received a copy of the GNU General Public License
  # along with this program.  If not, see <http://www.gnu.org/licenses/>.
  
  # General module for many purposes related to Information Retrieval.
  module RIR
  
    class Corpus
      attr_accessor :path
  
      def initialize(path)
        @path = path.chomp "/"
      end
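      # Recursively lists the files under +self.path+ whose names contain
      # a dot (the glob pattern is "**/*.*"). Returned paths are prefixed
      # with +self.path+.
      # WARNING: this method may take a long time if the subdirectories
      # contain many files.
      #
      #   c = Corpus.new "my/path"
      #   c.files                  # => ["my/path/README.txt", "my/path/lib/code.rb"]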
      def files
        Dir["#{@path}/**/*.*"]
      end
    end
  
  end
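
The behaviour of `Corpus#files` may be easier to see on a small directory tree. The sketch below is not part of the library: it repeats the class as a stand-in so it can run on its own, and `corpus_demo`, the temporary directory, and the `Makefile` entry are hypothetical names introduced only for illustration. It suggests that the trailing slash is stripped from the path and that entries without a dot in their name are skipped by the `**/*.*` pattern.

```ruby
require "fileutils"
require "tmpdir"

# Stand-in copy of the class above, so this sketch is self-contained.
module RIR
  class Corpus
    attr_accessor :path

    def initialize(path)
      @path = path.chomp "/"
    end

    def files
      Dir["#{@path}/**/*.*"]
    end
  end
end

# Hypothetical helper (not in the library): builds a throwaway tree,
# runs Corpus#files on it, and returns the paths relative to the root.
def corpus_demo
  Dir.mktmpdir do |root|
    FileUtils.mkdir_p(File.join(root, "lib"))
    File.write(File.join(root, "README.txt"), "readme")
    File.write(File.join(root, "lib", "code.rb"), "puts 1")
    File.write(File.join(root, "Makefile"), "all:") # no dot in the name

    corpus = RIR::Corpus.new("#{root}/") # trailing slash is removed by chomp
    corpus.files.map { |f| f.sub("#{corpus.path}/", "") }.sort
  end
end

p corpus_demo
```

If every regular file is wanted regardless of its name, one possible variation, assuming that is the intent, would be `Dir.glob("#{@path}/**/*").select { |f| File.file?(f) }`.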