diff options
author | Patrick Simianer <p@simianer.de> | 2011-10-14 15:52:39 +0200 |
---|---|---|
committer | Patrick Simianer <p@simianer.de> | 2011-10-14 15:52:39 +0200 |
commit | 69c3fa4e3ebfee9ca3fa38cc815ef18d39e3a44a (patch) | |
tree | ac921b6c9a8231093a10d23c58aa62b702ce8248 /dtrain | |
parent | a873c1d1ee19b81b85e4dcbd7f6dbf54ba49257c (diff) |
README.md
Diffstat (limited to 'dtrain')
-rw-r--r-- | dtrain/README.md | 19 |
1 files changed, 19 insertions, 0 deletions
diff --git a/dtrain/README.md b/dtrain/README.md index d6699cb4..9ecfd26b 100644 --- a/dtrain/README.md +++ b/dtrain/README.md @@ -1,6 +1,25 @@ dtrain ====== +Build & run +----------- +On a hadoop cluster do: +```sh +git clone git://github.com/qlt/cdec-dtrain.git +cd cdec_dtrain +autoreconf -ifv +./configure +make +``` +then +```sh +cd dtrain/hstreaming/ +(edit ini files) +edit hadoop-streaming-job.sh $IN and $OUT +./hadoop-streaming-job.sh +``` + + Ideas ----- * *MULTIPARTITE* ranking (1 vs all, cluster model/score) |