author | Patrick Simianer <p@simianer.de> | 2014-06-14 16:46:27 +0200 |
---|---|---|
committer | Patrick Simianer <p@simianer.de> | 2014-06-14 16:46:27 +0200 |
commit | 26c490f404731d053a6205719b6246502c07b449 (patch) | |
tree | 3aa721098f1251dfbf2249ecd2736434c13b1d48 /overlapping_rules/README |
init
Diffstat (limited to 'overlapping_rules/README')
-rw-r--r-- | overlapping_rules/README | 15 |
1 files changed, 15 insertions, 0 deletions
diff --git a/overlapping_rules/README b/overlapping_rules/README
new file mode 100644
index 0000000..5dffd16
--- /dev/null
+++ b/overlapping_rules/README
@@ -0,0 +1,15 @@
+1. word_pair_keys
+  group rules by source/target word pairs
+  input is a cdec grammar (with int index), one rule per line
+
+2. rules_cross_product
+  build cross product of rules w/ same key
+  input is output of 1
+
+3. merge_rules
+  mapred version of merge_rules.rb
+
+NOTE
+  cross product doesn't even work with g120:
+  319078851 megabytes ~= 300 terabytes
+
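The NOTE in the README follows from how a per-key cross product grows: a key shared by n rules yields on the order of n^2 rule pairs. The sketch below is only an illustration under stated assumptions, not the code from this commit. The tab-separated "key, rule index" input, the helper names (`group_by_key`, `cross_product_size`, `cross_product`, `megabytes_to_tebibytes`), and the choice to pair each rule with itself are all hypothetical.

```ruby
# Hypothetical input format: "key<TAB>rule index", one rule per line.
# The real output format of word_pair_keys is not shown in the README.
def group_by_key(lines)
  groups = Hash.new { |hash, key| hash[key] = [] }
  lines.each do |line|
    key, rule = line.chomp.split("\t", 2)
    groups[key] << rule
  end
  groups
end

# Number of ordered pairs the per-key cross product would emit,
# assuming every rule is also paired with itself: sum of n^2 over keys.
def cross_product_size(groups)
  groups.values.sum { |rules| rules.size * rules.size }
end

# Materialize the pairs as [key, rule_a, rule_b]; only feasible for tiny inputs.
def cross_product(groups)
  groups.flat_map do |key, rules|
    rules.product(rules).map { |a, b| [key, a, b] }
  end
end

# Convert the size quoted in the NOTE (megabytes, taken as 2^20 bytes)
# to tebibytes.
def megabytes_to_tebibytes(megabytes)
  megabytes / (1024.0 * 1024.0)
end

sample = ["haus|house\t0", "haus|house\t1", "haus|house\t2", "katze|cat\t3"]
groups = group_by_key(sample)
puts cross_product_size(groups)   # 3*3 + 1*1 = 10 pairs
puts cross_product(groups).size   # same count, pairs actually built
puts megabytes_to_tebibytes(319_078_851).round(1)
```

Counting with `cross_product_size` before building anything gives a cheap estimate of whether the output would fit on disk. Under the 2^20-byte assumption the last line lands a little above 300, which is consistent with the "~= 300 terabytes" figure in the NOTE.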