Commit graph

14 commits

Author SHA1 Message Date
Jaiden Mispy
228e0caa65 More memory profiling 2014-10-18 22:21:50 -07:00
Jaiden Mispy
b7f67ec0a6 Memory optimization 2014-10-16 03:02:39 -07:00
Jaiden Mispy
d09d968915 rspec and memory_profiler 2014-10-14 01:02:08 -07:00
Joel McCoy
be6ac9127f MODEL: Read in utf-8, only parse CSV once
Ran into an `Encoding::CompatibilityError` when trying to consume my corpus (tweets.csv) on Windows 7, but this likely affects other environments as well.

Fix: force reading corpus file contents as utf-8.

Also a quick clean-up of the CSV flow to only parse the content once instead of double-dipping.
2014-06-27 18:42:51 -04:00
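
The fix described in the commit above can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the helper name `read_tweet_texts` and the `text` column name are assumptions made for the example. The idea is to read the file with an explicit UTF-8 encoding, so the platform's default encoding is not used, and to parse the CSV content a single time.

```ruby
require 'csv'

# Hypothetical helper, named for illustration only.
# Reads a tweets.csv-style file and returns the contents of its text column.
def read_tweet_texts(path)
  # Force UTF-8 instead of relying on the platform default external encoding,
  # which is where Encoding::CompatibilityError can come from on Windows.
  content = File.read(path, encoding: 'utf-8')

  # Parse the content once and reuse the resulting rows.
  rows = CSV.parse(content, headers: true)

  # Assumes the archive has a column named "text"; rows without it are skipped.
  rows.map { |row| row['text'] }.compact
end
```

Keeping the parsed rows in one variable means any later step (filtering, counting, tokenizing) works from the same parse instead of re-reading the file.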
Brett O'Connor
2aac54c7aa csv import now looks for text column 2014-05-03 16:44:07 -06:00
Joel McCoy
872dabdbf8 Support consuming tweets.csv from official twitter archives 2014-04-30 20:32:51 -04:00
Mispy
5d55d90f85 Be more paranoid about identifying mentions 2014-04-24 20:55:53 -07:00
Erik Michaels-Ober
7e033b7b3b Fix file permissions 2014-02-12 16:23:49 +01:00
Mispy
34b8c5d0a0 Use binary read/write mode for Windows 2014-01-28 16:36:23 -08:00
Mispy
306c9ab873 Allow consumption of json archives 2013-11-27 05:12:54 -08:00
Mispy
61c5caee4d Retry limit and mention separation 2013-11-20 12:07:24 -08:00
Mispy
95e96ceef9 2.0.9 - Whups, broke context 2013-11-14 10:19:48 -08:00
Mispy
00f0228dd4 2.0.8 -- different generation algorithm 2013-11-14 07:58:46 -08:00
Mispy
e87dc5862b Github time! 2013-11-08 06:02:05 +11:00