diff options
author | Patrick Simianer <p@simianer.de> | 2012-03-13 09:41:31 +0100 |
---|---|---|
committer | Patrick Simianer <p@simianer.de> | 2012-03-13 09:41:31 +0100 |
commit | fb714888562845a8ae10fd4411cf199961193833 (patch) | |
tree | d16d8cd34d4e78d283dcad9b65db21001040d830 | |
parent | 1247da218ea87bea383faaf1f02fad9208bda60c (diff) |
readme
-rw-r--r-- | dtrain/README.md | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/dtrain/README.md b/dtrain/README.md index de6a726b..e28bebe7 100644 --- a/dtrain/README.md +++ b/dtrain/README.md @@ -21,7 +21,9 @@ Additionally you need to give dtrain a file with references (--refs). The input for use with hadoop streaming looks like this: +``` <sid>\t<source>\t<ref>\t<grammar rules separated by \t> +``` To convert a psg to this format you need to replace all "\n" by "\t". Make sure there are no tabs in your data. |