diff options
Diffstat (limited to 'overlapping_rules/README')
-rw-r--r-- | overlapping_rules/README | 15 |
1 files changed, 15 insertions, 0 deletions
diff --git a/overlapping_rules/README b/overlapping_rules/README new file mode 100644 index 0000000..5dffd16 --- /dev/null +++ b/overlapping_rules/README @@ -0,0 +1,15 @@ +1. word_pair_keys + group rules by source/target word pairs + input is a cdec grammar (with int index), one rule per line + +2. rules_cross_product + build cross product of rules w/ same key + input is output of 1 + +3. merge_rules + mapred version of merge_rules.rb + +NOTE + cross product doesn't even work with g120: + 319078851 megabytes ~= 300 terabytes + |