Age | Commit message (Expand) | Author |
---|---|---|
2017-12-14 | bitext-filter-length | Patrick Simianer |
2017-12-13 | filter-tokens | Patrick Simianer |
2017-12-05 | bishuf: proper fixed source of randomness | Patrick Simianer |
2017-12-05 | bishuf: simplistic synchronized shuffing of two files | Patrick Simianer |
2017-12-05 | lang | Patrick Simianer |
2017-12-03 | select-from: fix | Patrick Simianer |
2017-12-03 | lang | Patrick Simianer |
2017-12-03 | filter-len | Patrick Simianer |
2017-12-03 | hist-tok: +x | Patrick Simianer |
2017-12-03 | rm | Patrick Simianer |
2017-11-28 | vocab2 | Patrick Simianer |
2017-11-27 | fix select-from | Patrick Simianer |
2017-11-11 | repetition rate | Patrick Simianer |
2017-11-11 | cleanup | Patrick Simianer |
2017-11-10 | rr: fix | Patrick Simianer |
2017-11-10 | rr: fix | Patrick Simianer |
2017-11-10 | +x | Patrick Simianer |
2017-11-10 | rr | Patrick Simianer |
2017-11-09 | rm | Patrick Simianer |
2017-11-08 | mteval-14.pl | Patrick Simianer |
2017-11-08 | de-sgm: use egrep instead of grep for compat. | Patrick Simianer |
2017-08-04 | moses' multi-bleu.perl | Patrick Simianer |
2017-08-04 | Merge branch 'master' of github.com:pks/nlp_scripts | Patrick Simianer |
2017-08-04 | hist-tok | Patrick Simianer |
2017-08-04 | de-bpe | Patrick Simianer |
2017-08-04 | cumul | Patrick Simianer |
2017-08-04 | cmp | Patrick Simianer |
2017-08-04 | avg-seg-len | Patrick Simianer |
2017-08-04 | per-sentence-bleu: fix | Patrick Simianer |
2017-08-04 | de-sgm: fix | Patrick Simianer |
2017-07-05 | overlap | Patrick Simianer |
2017-06-21 | filter-illegal | Patrick Simianer |
2016-11-04 | rename, remove non nlp stuff | Patrick Simianer |
2016-08-18 | non-windowed RR | Patrick Simianer |
2016-08-18 | take-mem | Patrick Simianer |
2016-08-18 | non-windowed RR | Patrick Simianer |
2016-07-05 | mv | Patrick Simianer |
2015-12-23 | push_rules: push rule weights | Patrick Simianer |
2015-12-23 | source_sides: get source side from translation rule | Patrick Simianer |
2015-12-23 | make_rule_features: produce cdec's rule features (ids and bigrams) from a gra... | Patrick Simianer |
2015-12-23 | hadoop_uniq: uniq with hadoop-streaming | Patrick Simianer |
2015-12-23 | toks_per_line: # tokens per line | Patrick Simianer |
2015-12-19 | corrected stddev | Patrick Simianer |
2015-11-12 | Merge branch 'master' of github.com:pks/scripts | Patrick Simianer |
2015-11-12 | README | Patrick Simianer |
2015-11-12 | preprocessing without lowercasing | Patrick Simianer |
2015-11-12 | normalize on char level | Patrick Simianer |
2015-11-12 | map lines to number of token they contain | Patrick Simianer |
2015-11-12 | script to normalize hyphens | Patrick Simianer |
2015-11-12 | script to remove private use area chars | Patrick Simianer |