diff options
author | Patrick Simianer <simianer@cl.uni-heidelberg.de> | 2012-11-05 15:29:46 +0100 |
---|---|---|
committer | Patrick Simianer <simianer@cl.uni-heidelberg.de> | 2012-11-05 15:29:46 +0100 |
commit | 1db70a45d59946560fbd5db6487b55a8674ef973 (patch) | |
tree | 172585dafe4d1462f22d8200e733d52dddb55b1e /python/README.md | |
parent | 4dd5216d3afa9ab72b150e250a3c30a5f223ce53 (diff) | |
parent | 6bbf03ac46bd57400aa9e65a321a304a234af935 (diff) |
merge upstream/master
Diffstat (limited to 'python/README.md')
-rw-r--r-- | python/README.md | 4 |
1 files changed, 4 insertions, 0 deletions
diff --git a/python/README.md b/python/README.md index da9f9387..bea6190a 100644 --- a/python/README.md +++ b/python/README.md @@ -12,6 +12,10 @@ Compile a parallel corpus and a word alignment into a suffix array representatio python -m cdec.sa.compile -f f.txt -e e.txt -a a.txt -o output/ -c extract.ini +Or, if your parallel corpus is in a single-file format (with source and target sentences on a single line, separated by a triple pipe `|||`), use: + + python -m cdec.sa.compile -b f-e.txt -a a.txt -o output/ -c extract.ini + Extract grammar rules from the compiled corpus: cat input.txt | python -m cdec.sa.extract -c extract.ini -g grammars/ |