author | Paul Baltescu <pauldb89@gmail.com> | 2014-10-02 12:56:15 +0100
---|---|---
committer | Paul Baltescu <pauldb89@gmail.com> | 2014-10-02 12:56:15 +0100
commit | cf58b49ccf7fc6fc94eeafb9afdcc59744a0ea21 |
tree | 872a0ed8dd4855fd631a6d3530622bcd69c977f5 |
parent | 791d72e0ee0202c5495baeeb5eaa4232b7160fe5 |
Update C++ extractor ReadMe.
-rw-r--r-- | extractor/README.md | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/extractor/README.md b/extractor/README.md
index 642fbd1d..11138007 100644
--- a/extractor/README.md
+++ b/extractor/README.md
@@ -1,4 +1,4 @@
-C++ implementation of the online grammar extractor originally developed by [Adam Lopez](http://www.cs.jhu.edu/~alopez/).
+A simple and fast C++ implementation of a SCFG grammar extractor using suffix arrays. The implementation is described in this [paper](https://ufal.mff.cuni.cz/pbml/102/art-baltescu-blunsom.pdf). The original cython extractor is described in [Adam Lopez](http://www.cs.jhu.edu/~alopez/)'s PhD [thesis](http://www.cs.jhu.edu/~alopez/papers/adam.lopez.dissertation.pdf).
 
 The grammar extraction takes place in two steps: (a) precomputing a number of data structures and (b) actually extracting the grammars. All the flags below have the same meaning as in the cython implementation.