From 69c3fa4e3ebfee9ca3fa38cc815ef18d39e3a44a Mon Sep 17 00:00:00 2001
From: Patrick Simianer
Date: Fri, 14 Oct 2011 15:52:39 +0200
Subject: README.md

---
 dtrain/README.md | 19 +++++++++++++++++++
 1 file changed, 19 insertions(+)

(limited to 'dtrain')

diff --git a/dtrain/README.md b/dtrain/README.md
index d6699cb4..9ecfd26b 100644
--- a/dtrain/README.md
+++ b/dtrain/README.md
@@ -1,6 +1,25 @@
 dtrain
 ======
 
+Build & run
+-----------
+On a hadoop cluster do:
+```sh
+git clone git://github.com/qlt/cdec-dtrain.git
+cd cdec_dtrain
+autoreconf -ifv
+./configure
+make
+```
+then
+```sh
+cd dtrain/hstreaming/
+(edit ini files)
+edit hadoop-streaming-job.sh $IN and $OUT
+./hadoop-streaming-job.sh
+```
+
+
 Ideas
 -----
 * *MULTIPARTITE* ranking (1 vs all, cluster model/score)
-- 
cgit v1.2.3