field | value | date
---|---|---
author | Patrick Simianer <p@simianer.de> | 2011-10-14 15:52:39 +0200
committer | Patrick Simianer <p@simianer.de> | 2011-10-14 15:52:39 +0200
commit | c7d2162a213bd4b720cef28a6488970a2b2a3108 |
tree | 023cd93f46800dadc08c8c4867713bce2c53da1c |
parent | 0c3c534edae38f83079e5d45db9406c9bcc98926 |
README.md
-rw-r--r-- | dtrain/README.md | 19 |
1 files changed, 19 insertions, 0 deletions
diff --git a/dtrain/README.md b/dtrain/README.md
index d6699cb4..9ecfd26b 100644
--- a/dtrain/README.md
+++ b/dtrain/README.md
@@ -1,6 +1,25 @@
 dtrain
 ======
 
+Build & run
+-----------
+On a hadoop cluster do:
+```sh
+git clone git://github.com/qlt/cdec-dtrain.git
+cd cdec_dtrain
+autoreconf -ifv
+./configure
+make
+```
+then
+```sh
+cd dtrain/hstreaming/
+(edit ini files)
+edit hadoop-streaming-job.sh $IN and $OUT
+./hadoop-streaming-job.sh
+```
+
+
 Ideas
 -----
 * *MULTIPARTITE* ranking (1 vs all, cluster model/score)
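The build steps added by this commit can be sketched as one small shell function. This is only an illustration, not part of the repository: `run`, `build_dtrain`, and the `DRY_RUN` switch are hypothetical names introduced here. By default `git clone` names the checkout after the repository (`cdec-dtrain`), while the README then runs `cd cdec_dtrain`. The sketch therefore passes an explicit target directory to `git clone` and changes into that same directory. The clone URL and the build commands are copied unchanged from the diff.

```sh
#!/bin/sh
# Sketch only: wraps the README's build steps in one function.
# run, build_dtrain and DRY_RUN are hypothetical names, not part of cdec-dtrain.

# Print the command when DRY_RUN=1, otherwise execute it in the current shell.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Clone into an explicit directory and build there; stops at the first failure.
build_dtrain() {
    repo_dir="${1:-cdec-dtrain}"
    run git clone git://github.com/qlt/cdec-dtrain.git "$repo_dir" &&
    run cd "$repo_dir" &&
    run autoreconf -ifv &&
    run ./configure &&
    run make
}
```

With `DRY_RUN=1` set, calling `build_dtrain` only prints the five commands, which is a cheap way to check the sequence before running it on a cluster node. The Hadoop streaming steps are left out on purpose, since they involve hand-editing the ini files and `hadoop-streaming-job.sh`.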