Age | Commit message (Collapse) | Author | |
---|---|---|---|
2017-11-28 | vocab2 | Patrick Simianer | |
2017-11-27 | fix select-from | Patrick Simianer | |
2017-11-11 | repetition rate | Patrick Simianer | |
2017-11-11 | cleanup | Patrick Simianer | |
2017-11-10 | rr: fix | Patrick Simianer | |
2017-11-10 | rr: fix | Patrick Simianer | |
2017-11-10 | +x | Patrick Simianer | |
2017-11-10 | rr | Patrick Simianer | |
2017-11-09 | rm | Patrick Simianer | |
2017-11-08 | mteval-14.pl | Patrick Simianer | |
2017-11-08 | de-sgm: use egrep instead of grep for compat. | Patrick Simianer | |
2017-08-04 | moses' multi-bleu.perl | Patrick Simianer | |
2017-08-04 | Merge branch 'master' of github.com:pks/nlp_scripts | Patrick Simianer | |
2017-08-04 | hist-tok | Patrick Simianer | |
2017-08-04 | de-bpe | Patrick Simianer | |
2017-08-04 | cumul | Patrick Simianer | |
2017-08-04 | cmp | Patrick Simianer | |
2017-08-04 | avg-seg-len | Patrick Simianer | |
2017-08-04 | per-sentence-bleu: fix | Patrick Simianer | |
2017-08-04 | de-sgm: fix | Patrick Simianer | |
2017-07-05 | overlap | Patrick Simianer | |
2017-06-21 | filter-illegal | Patrick Simianer | |
2016-11-04 | rename, remove non nlp stuff | Patrick Simianer | |
2016-08-18 | non-windowed RR | Patrick Simianer | |
2016-08-18 | take-mem | Patrick Simianer | |
2016-08-18 | non-windowed RR | Patrick Simianer | |
2016-07-05 | mv | Patrick Simianer | |
2015-12-23 | push_rules: push rule weights | Patrick Simianer | |
2015-12-23 | source_sides: get source side from translation rule | Patrick Simianer | |
2015-12-23 | make_rule_features: produce cdec's rule features (ids and bigrams) from a ↵ | Patrick Simianer | |
grammar | |||
2015-12-23 | hadoop_uniq: uniq with hadoop-streaming | Patrick Simianer | |
2015-12-23 | toks_per_line: # tokens per line | Patrick Simianer | |
2015-12-19 | corrected stddev | Patrick Simianer | |
2015-11-12 | Merge branch 'master' of github.com:pks/scripts | Patrick Simianer | |
2015-11-12 | README | Patrick Simianer | |
2015-11-12 | preprocessing without lowercasing | Patrick Simianer | |
2015-11-12 | normalize on char level | Patrick Simianer | |
2015-11-12 | map lines to number of token they contain | Patrick Simianer | |
2015-11-12 | script to normalize hyphens | Patrick Simianer | |
2015-11-12 | script to remove private use area chars | Patrick Simianer | |
2015-11-12 | add moses' truecaser | Patrick Simianer | |
2015-11-12 | sample: tab as separator | Patrick Simianer | |
2015-06-10 | undo unfortunate variable naming: cfg -> conf! | Patrick Simianer | |
2015-05-30 | fake_svm_light: invert data in svm light format | Patrick Simianer | |
2015-05-29 | feature_dict, convert_to_svmlight_format: stderr output | Patrick Simianer | |
2015-05-29 | tf-idf: glob handling | Patrick Simianer | |
2015-05-29 | add_ln: add line numbers, filter_features: filter text reps of sparse ↵ | Patrick Simianer | |
vectors, split_*: split kbest lists and by line | |||
2015-05-13 | norm | Patrick Simianer | |
2015-01-31 | tools | Patrick Simianer | |
2015-01-31 | kendalls_tau | Patrick Simianer | |