Age | Commit message (Collapse) | Author | |
---|---|---|---|
2017-12-14 | length-ratio | Patrick Simianer | |
2017-12-14 | bitext-filter-length | Patrick Simianer | |
2017-12-13 | filter-tokens | Patrick Simianer | |
2017-12-05 | bishuf: proper fixed source of randomness | Patrick Simianer | |
2017-12-05 | bishuf: simplistic synchronized shuffing of two files | Patrick Simianer | |
2017-12-05 | lang | Patrick Simianer | |
2017-12-03 | select-from: fix | Patrick Simianer | |
2017-12-03 | lang | Patrick Simianer | |
2017-12-03 | filter-len | Patrick Simianer | |
2017-12-03 | hist-tok: +x | Patrick Simianer | |
2017-12-03 | rm | Patrick Simianer | |
2017-11-28 | vocab2 | Patrick Simianer | |
2017-11-27 | fix select-from | Patrick Simianer | |
2017-11-11 | repetition rate | Patrick Simianer | |
2017-11-11 | cleanup | Patrick Simianer | |
2017-11-10 | rr: fix | Patrick Simianer | |
2017-11-10 | rr: fix | Patrick Simianer | |
2017-11-10 | +x | Patrick Simianer | |
2017-11-10 | rr | Patrick Simianer | |
2017-11-09 | rm | Patrick Simianer | |
2017-11-08 | mteval-14.pl | Patrick Simianer | |
2017-11-08 | de-sgm: use egrep instead of grep for compat. | Patrick Simianer | |
2017-08-04 | moses' multi-bleu.perl | Patrick Simianer | |
2017-08-04 | Merge branch 'master' of github.com:pks/nlp_scripts | Patrick Simianer | |
2017-08-04 | hist-tok | Patrick Simianer | |
2017-08-04 | de-bpe | Patrick Simianer | |
2017-08-04 | cumul | Patrick Simianer | |
2017-08-04 | cmp | Patrick Simianer | |
2017-08-04 | avg-seg-len | Patrick Simianer | |
2017-08-04 | per-sentence-bleu: fix | Patrick Simianer | |
2017-08-04 | de-sgm: fix | Patrick Simianer | |
2017-07-05 | overlap | Patrick Simianer | |
2017-06-21 | filter-illegal | Patrick Simianer | |
2016-11-04 | rename, remove non nlp stuff | Patrick Simianer | |
2016-08-18 | non-windowed RR | Patrick Simianer | |
2016-08-18 | take-mem | Patrick Simianer | |
2016-08-18 | non-windowed RR | Patrick Simianer | |
2016-07-05 | mv | Patrick Simianer | |
2015-12-23 | push_rules: push rule weights | Patrick Simianer | |
2015-12-23 | source_sides: get source side from translation rule | Patrick Simianer | |
2015-12-23 | make_rule_features: produce cdec's rule features (ids and bigrams) from a ↵ | Patrick Simianer | |
grammar | |||
2015-12-23 | hadoop_uniq: uniq with hadoop-streaming | Patrick Simianer | |
2015-12-23 | toks_per_line: # tokens per line | Patrick Simianer | |
2015-12-19 | corrected stddev | Patrick Simianer | |
2015-11-12 | Merge branch 'master' of github.com:pks/scripts | Patrick Simianer | |
2015-11-12 | README | Patrick Simianer | |
2015-11-12 | preprocessing without lowercasing | Patrick Simianer | |
2015-11-12 | normalize on char level | Patrick Simianer | |
2015-11-12 | map lines to number of token they contain | Patrick Simianer | |
2015-11-12 | script to normalize hyphens | Patrick Simianer | |