summaryrefslogtreecommitdiff
path: root/corpus
AgeCommit message (Expand)Author
2013-01-22russian abbrevsChris Dyer
2013-01-21tokenizer support for utf8 patternsChris Dyer
2013-01-21a little bit of cleanupChris Dyer
2013-01-20control max lenChris Dyer
2013-01-19updated version of boost.m4 and automatically build kenneth's LM builderChris Dyer
2013-01-15corpus filesChris Dyer
2012-12-05slight tokenization bug fixChris Dyer
2012-12-05remove logging, you should be using pvChris Dyer
2012-12-04more flexible corpus cuttingChris Dyer
2012-11-16fixChris Dyer
2012-11-16readmeChris Dyer
2012-11-14major mert clean up, stuff for simple system demoChris Dyer
2012-11-06Merge branch 'master' of github.com:redpony/cdecChris Dyer
2012-11-06add lowercase scriptChris Dyer
2012-11-05script to add sos/eosChris Dyer
2012-10-25add self translationChris Dyer
2012-07-28script to paste files together with the triple pipe separatorChris Dyer
2012-07-28a couple of tools for cleaning corporaChris Dyer