summaryrefslogtreecommitdiff
path: root/dtrain/README.md
diff options
context:
space:
mode:
Diffstat (limited to 'dtrain/README.md')
-rw-r--r--dtrain/README.md19
1 files changed, 19 insertions, 0 deletions
diff --git a/dtrain/README.md b/dtrain/README.md
index d6699cb4..9ecfd26b 100644
--- a/dtrain/README.md
+++ b/dtrain/README.md
@@ -1,6 +1,25 @@
dtrain
======
+Build & run
+-----------
+On a hadoop cluster do:
+```sh
+git clone git://github.com/qlt/cdec-dtrain.git
+cd cdec_dtrain
+autoreconf -ifv
+./configure
+make
+```
+then
+```sh
+cd dtrain/hstreaming/
+(edit ini files)
+edit hadoop-streaming-job.sh $IN and $OUT
+./hadoop-streaming-job.sh
+```
+
+
Ideas
-----
* *MULTIPARTITE* ranking (1 vs all, cluster model/score)