summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2015-12-23make_rule_features: produce cdec's rule features (ids and bigrams) from a ↵Patrick Simianer
grammar
2015-12-23hadoop_uniq: uniq with hadoop-streamingPatrick Simianer
2015-12-23toks_per_line: # tokens per linePatrick Simianer
2015-12-19corrected stddevPatrick Simianer
2015-11-12Merge branch 'master' of github.com:pks/scriptsPatrick Simianer
2015-11-12READMEPatrick Simianer
2015-11-12preprocessing without lowercasingPatrick Simianer
2015-11-12normalize on char levelPatrick Simianer
2015-11-12map lines to number of token they containPatrick Simianer
2015-11-12script to normalize hyphensPatrick Simianer
2015-11-12script to remove private use area charsPatrick Simianer
2015-11-12add moses' truecaserPatrick Simianer
2015-11-12sample: tab as separatorPatrick Simianer
2015-06-10undo unfortunate variable naming: cfg -> conf!Patrick Simianer
2015-05-30fake_svm_light: invert data in svm light formatPatrick Simianer
2015-05-29feature_dict, convert_to_svmlight_format: stderr outputPatrick Simianer
2015-05-29tf-idf: glob handlingPatrick Simianer
2015-05-29add_ln: add line numbers, filter_features: filter text reps of sparse ↵Patrick Simianer
vectors, split_*: split kbest lists and by line
2015-05-13normPatrick Simianer
2015-01-31toolsPatrick Simianer
2015-01-31kendalls_tauPatrick Simianer
2015-01-31add_seg: fixPatrick Simianer
2015-01-25zipf v1.2.2 compatPatrick Simianer
2015-01-25divPatrick Simianer
2015-01-15fixPatrick Simianer
2015-01-15split_pipes: to paramPatrick Simianer
2015-01-14select_from: invertPatrick Simianer
2015-01-07fixPatrick Simianer
2015-01-07select_from, max_lenPatrick Simianer
2014-10-09alles neu macht der maiPatrick Simianer
2014-10-03pot sqrtPatrick Simianer
2014-09-21add_seg: fixPatrick Simianer
2014-09-21add_seg: option to use pre-defined indexPatrick Simianer
2014-09-21add selectPatrick Simianer
2014-09-21rm sample_nPatrick Simianer
2014-09-21samplePatrick Simianer
2014-08-16memusg, to_asciiPatrick Simianer
2014-07-22compound-splitter.perl (taken from moses v2.1.1)Patrick Simianer
2014-07-22collapse_tags.rbPatrick Simianer
2014-06-18fixPatrick Simianer
2014-06-16nlp_ruby -> zipfPatrick Simianer
2014-06-14steal tokenizer from moses' scriptsPatrick Simianer
2014-06-03withdraw previous changePatrick Simianer
2014-06-01hg2json.py: add rule and span to json outputPatrick Simianer
2014-04-24parse-stanfordPatrick Simianer
2014-03-17fixPatrick Simianer
2014-03-17a lot of ... and --- cause moses' compound splitter to hangPatrick Simianer
2014-03-16better no_non_printablesPatrick Simianer
2014-03-16no non printables in preprocPatrick Simianer
2014-03-16filter by rule shapePatrick Simianer