author Patrick Simianer <p@simianer.de> 2011-10-14 15:52:39 +0200
committer Patrick Simianer <p@simianer.de> 2011-10-14 15:52:39 +0200
commit 69c3fa4e3ebfee9ca3fa38cc815ef18d39e3a44a
tree ac921b6c9a8231093a10d23c58aa62b702ce8248 /dtrain/README.md
parent a873c1d1ee19b81b85e4dcbd7f6dbf54ba49257c
README.md
Diffstat (limited to 'dtrain/README.md')
-rw-r--r-- dtrain/README.md 19
1 files changed, 19 insertions, 0 deletions
diff --git a/dtrain/README.md b/dtrain/README.md
index d6699cb4..9ecfd26b 100644
--- a/dtrain/README.md
+++ b/dtrain/README.md
@@ -1,6 +1,25 @@
dtrain
======
+Build & run
+-----------
+On a Hadoop cluster, run:
+```sh
+git clone git://github.com/qlt/cdec-dtrain.git
+cd cdec-dtrain
+autoreconf -ifv
+./configure
+make
+```
+Then configure and launch the streaming job:
+```sh
+cd dtrain/hstreaming/
+# edit the ini files
+# set $IN and $OUT in hadoop-streaming-job.sh
+./hadoop-streaming-job.sh
+```
+
+
Ideas
-----
* *MULTIPARTITE* ranking (1 vs all, cluster model/score)