Age | Commit message | Author | |
---|---|---|---|
2013-08-26 | Improve the package structure of pycdec | Victor Chahuneau | |
This change should not break anything, but now you can run: `python setup.py build_ext --inplace` and use the cleaner: `PYTHONPATH=/path/to/cdec/python python -m ...` | |||
2013-06-02 | Possible fix for #13 | Victor Chahuneau | |
use IntList `.len` instead of `len()`, which can return long; a bit of code cleanup; upgrade to Cython 0.19 | |||
2013-04-23 | Replaced time consuming endl with \n. | Paul Baltescu | |
2013-03-07 | Added unit test for loose phrases. | Paul Baltescu | |
2013-03-06 | Fix memory leak in trie. | Paul Baltescu | |
2013-02-22 | Merge branch 'master' into experiment | Paul Baltescu | |
2013-02-22 | Memory analysis pointless code. | Paul Baltescu | |
2013-02-22 | Updated unit tests for data array. | Paul Baltescu | |
2013-02-21 | Merge branch 'master' of https://github.com/pauldb89/cdec | Paul Baltescu | |
2013-02-19 | Timing every part of the extractor. | Paul Baltescu | |
2013-01-28 | For now, don't use online bilex counts | Michael Denkowski | |
2013-01-28 | Bilexical scores for online rules | Michael Denkowski | |
2013-01-26 | Online grammars now diff with incremental suffix array (except lex, TODO) | Michael Denkowski | |
2013-01-24 | Scored grammars from online extraction. Don't trust them yet. | Michael Denkowski | |
2013-01-07 | Online rule extractor output diffs w/ sa extractor | Michael Denkowski | |
2013-01-04 | Track source span to keep accurate phrase counts | Michael Denkowski | |
2013-01-04 | Fixed issue with overlapping alignment links | Michael Denkowski | |
2013-01-03 | Michael remembers how hiero phrase extraction works. Not totally | Michael Denkowski | |
debugged, use -o with caution, etc. | |||
2012-12-28 | Collect/store stats from new training instances | Michael Denkowski | |
2012-12-27 | Online phrase extraction speaks rulefactory's language. | Michael Denkowski | |
2012-12-27 | Hooks for online grammar extraction | Michael Denkowski | |
2012-12-13 | Enable loose phrase extraction parameter | Victor Chahuneau | |
(default is still tight); use `--loose` when compiling the corpus or `tight_phrases = False` in the config | |||
2012-09-06 | add FeatureContext.input_span | Adam Lopez | |
2012-09-06 | [cdec.sa] Allow sentence annotation and initial configuration | Victor Chahuneau | |
2012-09-06 | [cdec.sa] Fix API to make everyone happy | Victor Chahuneau | |
2012-09-06 | [cdec.sa] Fix the list of matching training source sentence | Victor Chahuneau | |
2012-09-06 | [cdec.sa] Make list of word ids <-> sentence string mapping easy | Victor Chahuneau | |
2012-09-06 | Make Data_Array.data accessible via getter | Adam Lopez | |
2012-09-06 | Merge | Adam Lopez | |
2012-09-05 | Revert to the "old style" pair count... | Victor Chahuneau | |
API naming fixes; multiple feature definition files can be passed to the extractor | |||
2012-09-05 | Pass F, E texts to features | Adam Lopez | |
2012-09-05 | Change FeatureContext.input_span to return slice indices | Adam Lopez | |
2012-09-05 | Fix bug in initialization of FeatureContext.input_span | Adam Lopez | |
2012-09-05 | Expose new feature extraction API | Victor Chahuneau | |
2012-09-05 | Merge alopez/context-features | Victor Chahuneau | |
2012-09-03 | Support Python 2.6 | Victor Chahuneau | |
2012-08-14 | [cdec.sa] Explicit feature names in grammar extractor output | Victor Chahuneau | |
sparse features in extractor; `hg.intersect(string)`; `basestring = str\|unicode` | |||
2012-07-28 | [python] Suffix array compiler can read bitext (-b) | Victor Chahuneau | |
2012-07-27 | [python] Move python files to avoid pythonpath conflicts | Victor Chahuneau | |
2012-07-27 | [python] conversion from cdec.sa.Rule to cdec.TRule | Victor Chahuneau | |
remove configobj dependency; re-structure packages (no more top-level library); "const" stuff; use `__new__` instead of constructor for some objects