summaryrefslogtreecommitdiff
path: root/README.md
blob: 5a83f0e754b9c8b8378e4db87d36a4f42f2d6498 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
Not quite finished machine translation decoder.
(For Linux only)

TODO
====
 * proper parsing (Rico Sennrich's [1][2]?)
 * k-best derivations [3]
 * serialization for sparse vectors
 * Rule-ChartItem-Node transition?
 * cube pruning [4] and integrate kenlm [5]
 * feature factory and observer patterns
 * map all strings to ints?
 * glue grammar [6] alright?
 * read/writed gzipped files [11]
 * integrate some BLAS lib for vector ops [12][13]

Dependencies:
 * MessagePack for object serialization [8]
 * Google's gperftools  [9]
 * json-cpp [10]


[1]  http://aclweb.org/anthology/W/W14/W14-4011.pdf
[2]  https://github.com/redpony/cdec/commit/448b451aa481b1509566ddb11abc3476466def6a
[3]  http://www.cis.upenn.edu/~lhuang3/huang-iwpt-correct.pdf
[4]  http://cui.unige.ch/~gesmundo/papers/gesmundo-iwslt10-fcp.pdf
[5]  http://kheafield.com/code/kenlm/developers/2
[6]  https://github.com/jweese/thrax/wiki/Glue-grammar
[7]  http://aclweb.org/aclwiki/index.php?title=Hypergraph_Format
[8]  http://msgpack.org
[9]  https://code.google.com/p/gperftools/
[10] https://github.com/ascheglov/json-cpp
[11] http://www.cs.unc.edu/Research/compgeom/gzstream/
[12] http://scicomp.stackexchange.com/questions/351/recommendations-for-a-usable-fast-c-matrix-library
[13] http://www.cvmlib.com/