From 28454c98966fb8ff4e4d7a63c5ef773dd226d3bf Mon Sep 17 00:00:00 2001
From: Patrick Simianer
Date: Tue, 13 Mar 2012 09:37:10 +0100
Subject: readme

---
 dtrain/README.md | 10 +++-------
 1 file changed, 3 insertions(+), 7 deletions(-)

diff --git a/dtrain/README.md b/dtrain/README.md
index c39d94d2..0240a694 100644
--- a/dtrain/README.md
+++ b/dtrain/README.md
@@ -12,24 +12,20 @@ builds when building cdec, see ../BUILDING
 Running
 -------
 To run this on a dev set locally (default):
-
-#define DTRAIN_LOCAL
-
+ #define DTRAIN_LOCAL
 otherwise remove that line or undef. You need a single grammar file
 or per-sentence-grammars (psg) as you would use with cdec.
 Additionally you need to give dtrain a file with
 references (--refs).

 The input for use with hadoop streaming looks like this:
-
-\t\t\t
-
+ \t\t\t
 To convert a psg to this format you need to replace all "\n" by "\t".
 Make sure there are no tabs in your data.

 For an example of local usage (with 'distributed' format)
 the see test/example/ . This expects dtrain to be built without
-DTRAIN_LOCAL param.
+DTRAIN_LOCAL.

 Legal stuff
 -----------
--
cgit v1.2.3
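
Note on the conversion step the README text describes (replace all "\n" by "\t", and make sure the data contains no tabs): the following is a minimal sketch of that step only, not part of this patch or of dtrain. The function name `psg_to_tab_separated` is hypothetical, and dropping trailing newlines before the replacement is an assumption made here so the result does not end in a tab. The remaining fields of the hadoop streaming line are not shown in the patch, so the sketch does not try to build them.

```python
def psg_to_tab_separated(grammar_text: str) -> str:
    """Join the lines of one per-sentence grammar (psg) into a single line.

    Hypothetical helper: it only performs the "\\n" -> "\\t" replacement
    that the README describes, nothing else.
    """
    # The README warns that the data itself must not contain tabs,
    # because tabs become the separator after conversion.
    if "\t" in grammar_text:
        raise ValueError("grammar text already contains a tab character")
    # Assumption: trailing newlines are dropped so the output does not end in a tab.
    return grammar_text.rstrip("\n").replace("\n", "\t")


if __name__ == "__main__":
    # Placeholder strings, not real cdec grammar rules.
    example = "rule one\nrule two\nrule three\n"
    print(psg_to_tab_separated(example))
```

Rejecting input that already contains a tab, rather than silently rewriting it, keeps the check in line with the README's advice to verify the data before converting it.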