summaryrefslogtreecommitdiff
path: root/dtrain
diff options
context:
space:
mode:
authorPatrick Simianer <p@simianer.de>2012-03-13 09:41:31 +0100
committerPatrick Simianer <p@simianer.de>2012-03-13 09:41:31 +0100
commitfb714888562845a8ae10fd4411cf199961193833 (patch)
treed16d8cd34d4e78d283dcad9b65db21001040d830 /dtrain
parent1247da218ea87bea383faaf1f02fad9208bda60c (diff)
readme
Diffstat (limited to 'dtrain')
-rw-r--r--dtrain/README.md2
1 files changed, 2 insertions, 0 deletions
diff --git a/dtrain/README.md b/dtrain/README.md
index de6a726b..e28bebe7 100644
--- a/dtrain/README.md
+++ b/dtrain/README.md
@@ -21,7 +21,9 @@ Additionally you need to give dtrain a file with
references (--refs).
The input for use with hadoop streaming looks like this:
+```
<sid>\t<source>\t<ref>\t<grammar rules separated by \t>
+```
To convert a psg to this format you need to replace all "\n"
by "\t". Make sure there are no tabs in your data.