From 26c490f404731d053a6205719b6246502c07b449 Mon Sep 17 00:00:00 2001
From: Patrick Simianer
Date: Sat, 14 Jun 2014 16:46:27 +0200
Subject: init

---
 overlapping_rules/README | 15 +++++++++++++++
 1 file changed, 15 insertions(+)
 create mode 100644 overlapping_rules/README

diff --git a/overlapping_rules/README b/overlapping_rules/README
new file mode 100644
index 0000000..5dffd16
--- /dev/null
+++ b/overlapping_rules/README
@@ -0,0 +1,15 @@
+1. word_pair_keys
+ group rules by source/target word pairs
+ input is a cdec grammar (with int index), one rule per line
+
+2. rules_cross_product
+ build cross product of rules w/ same key
+ input is output of 1
+
+3. merge_rules
+ mapred version of merge_rules.rb
+
+NOTE
+ cross product doesn't even work with g120:
+ 319078851 megabytes ~= 300 terabytes
+
--
cgit v1.2.3
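
The first two steps in the README, and the reason the NOTE reports such a large output, can be illustrated with a minimal Ruby sketch. It is not the code from this repository. The `Rule` struct, the key format `"src|tgt"`, and the choice of ordered pairs in the cross product are assumptions made for illustration; the actual scripts and the cdec grammar layout may differ.

```ruby
# Hypothetical rule records: id, source side, target side. The real input is a
# cdec grammar with an integer index, whose exact layout is not shown here.
Rule = Struct.new(:id, :source, :target)

# Assumed key: every (source word, target word) combination a rule contains.
def word_pair_keys(rule)
  rule.source.split.product(rule.target.split).map { |s, t| "#{s}|#{t}" }
end

# Step 1: group rule ids by key.
def group_by_word_pair(rules)
  groups = Hash.new { |h, k| h[k] = [] }
  rules.each do |rule|
    word_pair_keys(rule).uniq.each { |key| groups[key] << rule.id }
  end
  groups
end

# Step 2: count the pairs a cross product would emit, without building it.
# A group of n rules yields n * n ordered pairs (an assumption; unordered
# pairs would give roughly half as many).
def cross_product_size(groups)
  groups.values.sum { |ids| ids.size**2 }
end

rules = [
  Rule.new(0, "das haus", "the house"),
  Rule.new(1, "haus", "house"),
  Rule.new(2, "das", "the"),
]
groups = group_by_word_pair(rules)
puts groups["haus|house"].inspect   # [0, 1]
puts cross_product_size(groups)     # 10
```

Because the output grows with the square of each group's size, a key shared by many rules dominates the total. Counting pairs first, as `cross_product_size` does here, is one way to estimate the output size before materializing it.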