twitter-ebooks/lib/twitter_ebooks
Joel McCoy be6ac9127f MODEL: Read in utf-8, only parse CSV once
Ran into `Encoding::CompatibilityError` issue trying to consume my corpus (tweets.csv) on Windows 7, but this likely affects other environments as well. 

Fix: force reading corpus file contents as utf-8.

Also a quick clean-up of the CSV flow to only parse the content once instead of double-dipping.
2014-06-27 18:42:51 -04:00
..
archive.rb Use ENV['HOME'] for config path 2013-12-03 02:10:54 -08:00
bot.rb 2.2.5 - encoding: utf-8 2014-05-07 16:45:17 +10:00
markov.rb Fix file permissions 2014-02-12 16:23:49 +01:00
model.rb MODEL: Read in utf-8, only parse CSV once 2014-06-27 18:42:51 -04:00
nlp.rb Fix file permissions 2014-02-12 16:23:49 +01:00
suffix.rb 2.2.5 - encoding: utf-8 2014-05-07 16:45:17 +10:00
version.rb csv import now looks for text column 2014-05-03 16:44:07 -06:00