summaryrefslogtreecommitdiff
path: root/extractor
diff options
context:
space:
mode:
authorPaul Baltescu <pauldb89@gmail.com>2014-10-02 12:56:15 +0100
committerPaul Baltescu <pauldb89@gmail.com>2014-10-02 12:56:15 +0100
commitd3560fafafb8e1e907d44c7b544f15eee22c8245 (patch)
treee4d8489db71d9386f496c3cbb6fc7a1d5cfed991 /extractor
parent6ead7d0e4e8de97d23c13989b3968cab873a0342 (diff)
Update C++ extractor ReadMe.
Diffstat (limited to 'extractor')
-rw-r--r--extractor/README.md2
1 files changed, 1 insertions, 1 deletions
diff --git a/extractor/README.md b/extractor/README.md
index 642fbd1d..11138007 100644
--- a/extractor/README.md
+++ b/extractor/README.md
@@ -1,4 +1,4 @@
-C++ implementation of the online grammar extractor originally developed by [Adam Lopez](http://www.cs.jhu.edu/~alopez/).
+A simple and fast C++ implementation of a SCFG grammar extractor using suffix arrays. The implementation is described in this [paper](https://ufal.mff.cuni.cz/pbml/102/art-baltescu-blunsom.pdf). The original cython extractor is described in [Adam Lopez](http://www.cs.jhu.edu/~alopez/)'s PhD [thesis](http://www.cs.jhu.edu/~alopez/papers/adam.lopez.dissertation.pdf).
The grammar extraction takes place in two steps: (a) precomputing a number of data structures and (b) actually extracting the grammars. All the flags below have the same meaning as in the cython implementation.