4 lines
256 B
Markdown
4 lines
256 B
Markdown
- Files in text/ are preprocessed by `rake consume` and serialized
|
|
- e.g. text/foo.tweets becomes consumed/foo.corpus
|
|
- `rake consume` looks at hashes to know which it needs to update
|
|
- Preprocessed corpus files are loaded at runtime by Corpus.load('foo')
|