Github time!
This commit is contained in:
commit
e87dc5862b
27 changed files with 20178 additions and 0 deletions
4
NOTES.md
Normal file
4
NOTES.md
Normal file
|
@ -0,0 +1,4 @@
|
|||
- Files in text/ are preprocessed by `rake consume` and serialized
|
||||
- e.g. text/foo.tweets becomes consumed/foo.corpus
|
||||
- `rake consume` looks at hashes to know which it needs to update
|
||||
- Preprocessed corpus files are loaded at runtime by Corpus.load('foo')
|
Loading…
Add table
Add a link
Reference in a new issue