summaryrefslogtreecommitdiff
path: root/dtrain
diff options
context:
space:
mode:
authorPatrick Simianer <p@simianer.de>2011-10-14 15:52:39 +0200
committerPatrick Simianer <p@simianer.de>2011-10-14 15:52:39 +0200
commitc7d2162a213bd4b720cef28a6488970a2b2a3108 (patch)
tree023cd93f46800dadc08c8c4867713bce2c53da1c /dtrain
parent0c3c534edae38f83079e5d45db9406c9bcc98926 (diff)
README.md
Diffstat (limited to 'dtrain')
-rw-r--r--dtrain/README.md19
1 files changed, 19 insertions, 0 deletions
diff --git a/dtrain/README.md b/dtrain/README.md
index d6699cb4..9ecfd26b 100644
--- a/dtrain/README.md
+++ b/dtrain/README.md
@@ -1,6 +1,25 @@
dtrain
======
+Build & run
+-----------
+On a hadoop cluster do:
+```sh
+git clone git://github.com/qlt/cdec-dtrain.git
+cd cdec_dtrain
+autoreconf -ifv
+./configure
+make
+```
+then
+```sh
+cd dtrain/hstreaming/
+(edit ini files)
+edit hadoop-streaming-job.sh $IN and $OUT
+./hadoop-streaming-job.sh
+```
+
+
Ideas
-----
* *MULTIPARTITE* ranking (1 vs all, cluster model/score)