diff options
Diffstat (limited to 'dtrain/README.md')
-rw-r--r-- | dtrain/README.md | 19 |
1 files changed, 19 insertions, 0 deletions
diff --git a/dtrain/README.md b/dtrain/README.md index d6699cb4..9ecfd26b 100644 --- a/dtrain/README.md +++ b/dtrain/README.md @@ -1,6 +1,25 @@ dtrain ====== +Build & run +----------- +On a hadoop cluster do: +```sh +git clone git://github.com/qlt/cdec-dtrain.git +cd cdec_dtrain +autoreconf -ifv +./configure +make +``` +then +```sh +cd dtrain/hstreaming/ +(edit ini files) +edit hadoop-streaming-job.sh $IN and $OUT +./hadoop-streaming-job.sh +``` + + Ideas ----- * *MULTIPARTITE* ranking (1 vs all, cluster model/score) |