Age | Commit message (Expand) | Author |
---|---|---|
2020-05-11 | Merge pull request #2 from lilt/feature/tmxPython3 | pks |
2020-05-11 | Update tmx-extract.py to use python3 | thomasZen |
2020-03-09 | de-sgm: match more stuff | Patrick Simianer |
2020-02-19 | TSV utils | Patrick Simianer |
2020-02-19 | misc. scripts | pks |
2020-02-19 | sentencepiece-decode | Patrick Simianer |
2020-02-03 | langid-polyglot | Patrick Simianer |
2020-02-03 | zh-ko-or-ja | Patrick Simianer |
2020-02-03 | NFC normalization in python | Patrick Simianer |
2020-02-03 | print out all chars | Patrick Simianer |
2020-02-03 | de-sgm | Patrick Simianer |
2020-01-14 | Merge pull request #1 from lilt/master | pks |
2020-01-13 | Remove paragraph opening and closing tag (#1) | thomasZen |
2019-12-24 | Merge branch 'master' of ssh://github.com/pks/nlp_scripts | Patrick Simianer |
2019-12-24 | de-sgm: tags w/ properties | Patrick Simianer |
2019-12-24 | mkidx | Patrick Simianer |
2019-12-24 | biuniq: uniquify a parallel corpus with a dictionary | Patrick Simianer |
2019-12-24 | formatting | Patrick Simianer |
2019-09-03 | cma: flush | Patrick Simianer |
2019-08-09 | Merge branch 'master' of ssh://github.com/pks/nlp_scripts | Patrick Simianer |
2019-08-09 | percentile | Patrick Simianer |
2019-08-05 | trollop -> optimist | Patrick Simianer |
2019-07-28 | infix for repeated | Patrick Simianer |
2019-07-28 | trollop -> optimist | Patrick Simianer |
2019-03-17 | toks-per-line: count | Patrick Simianer |
2019-03-17 | per-sentence-ter: fix | Patrick Simianer |
2019-03-16 | vocab | Patrick Simianer |
2018-10-19 | mv | Patrick Simianer |
2018-08-14 | moving-sum | Patrick Simianer |
2018-06-21 | mv | Patrick Simianer |
2018-04-27 | Merge branch 'master' of https://github.com/pks/nlp_scripts | Patrick Simianer |
2018-04-27 | filter-len: more params | Patrick Simianer |
2018-04-17 | inv | Patrick Simianer |
2018-04-17 | Merge branch 'master' of https://github.com/pks/nlp_scripts | Patrick Simianer |
2018-04-17 | bitext-filter-length improved | Patrick Simianer |
2018-04-11 | README | Patrick Simianer |
2018-04-11 | rm | Patrick Simianer |
2018-04-11 | substract | Patrick Simianer |
2018-04-11 | Merge branch 'master' of https://github.com/pks/nlp_scripts | Patrick Simianer |
2018-04-11 | first | Patrick Simianer |
2018-04-03 | fix | Patrick Simianer |
2018-03-29 | cma | Patrick Simianer |
2018-03-29 | bitext-filter-length | Patrick Simianer |
2018-01-30 | Merge branch 'master' of github.com:pks/nlp_scripts | Patrick Simianer |
2018-01-30 | tmx-extract.py | Patrick Simianer |
2017-12-14 | length-ratio | Patrick Simianer |
2017-12-14 | bitext-filter-length | Patrick Simianer |
2017-12-13 | filter-tokens | Patrick Simianer |
2017-12-05 | bishuf: proper fixed source of randomness | Patrick Simianer |
2017-12-05 | bishuf: simplistic synchronized shuffing of two files | Patrick Simianer |