blob: 5a83f0e754b9c8b8378e4db87d36a4f42f2d6498 (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
|
Not quite finished machine translation decoder.
(For Linux only)
TODO
====
* proper parsing (Rico Sennrich's [1][2]?)
* k-best derivations [3]
* serialization for sparse vectors
* Rule-ChartItem-Node transition?
* cube pruning [4] and integrate kenlm [5]
* feature factory and observer patterns
* map all strings to ints?
* glue grammar [6] alright?
* read/writed gzipped files [11]
* integrate some BLAS lib for vector ops [12][13]
Dependencies:
* MessagePack for object serialization [8]
* Google's gperftools [9]
* json-cpp [10]
[1] http://aclweb.org/anthology/W/W14/W14-4011.pdf
[2] https://github.com/redpony/cdec/commit/448b451aa481b1509566ddb11abc3476466def6a
[3] http://www.cis.upenn.edu/~lhuang3/huang-iwpt-correct.pdf
[4] http://cui.unige.ch/~gesmundo/papers/gesmundo-iwslt10-fcp.pdf
[5] http://kheafield.com/code/kenlm/developers/2
[6] https://github.com/jweese/thrax/wiki/Glue-grammar
[7] http://aclweb.org/aclwiki/index.php?title=Hypergraph_Format
[8] http://msgpack.org
[9] https://code.google.com/p/gperftools/
[10] https://github.com/ascheglov/json-cpp
[11] http://www.cs.unc.edu/Research/compgeom/gzstream/
[12] http://scicomp.stackexchange.com/questions/351/recommendations-for-a-usable-fast-c-matrix-library
[13] http://www.cvmlib.com/
|