summaryrefslogtreecommitdiff
path: root/extools
AgeCommit message (Collapse)Author
2010-08-11major refactor, break bad circular depsredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@509 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-08-04get rid of unused code, remove static linkageredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@480 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-08-04allow for wider set of NT stringstrevor.cohn
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@477 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-26Fixed bug in mr_strip_rule_reducer regarding lines with duplicate contexts ↵trevor.cohn
(were overwritten) git-svn-id: https://ws10smt.googlecode.com/svn/trunk@424 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-25compilegraehl
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@404 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-24compilegraehl
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@401 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-23sparse_vector use google::dense_hash_map, fsa scan logginggraehl
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@383 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-21Fixing errors.olivia.buzek
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@361 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-21Little bug fix to EM clusteringtrevor.cohn
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@357 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-20tests compilegraehl
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@332 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-19use lexer instead of handwritten parserredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@319 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-19Added count pruning.trevor.cohn
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@318 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-19Fixed a couple of bugs in the monolingual context extractortrevor.cohn
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@314 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-18??trevor.cohn
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@312 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-18Changed to UTF8trevor.cohn
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@311 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-15fix potential namespace conflict:redpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@271 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-15Massacred the pipeline to support source language phrases and contexts.trevor.cohn
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@255 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-14Added facility to get source or both language phrases/contexts. Need to fix ↵trevor.cohn
the scripts first before using this new feature. git-svn-id: https://ws10smt.googlecode.com/svn/trunk@250 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-13revert xsing featuresredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@239 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-13singleton featuresredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@237 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-13major speed up using DFA parserredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@235 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-13start moving toward striped grammar lexerredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@233 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-13cdec now supports coarse-to-fine decoding (for SCFG only).adam.d.lopez
CTF has several options: -coarse_to_fine_beam_prune=<double> (required to activate CTF) assign an alpha parameter for pruning the coarse foreast -ctf_beam_widen=<double> (optional, defaults to 2.0): ratio to widen coarse pruning beam if fine parse fails. -ctf_num_widenings=<int> (optional, defaults to 2): number of times to widen coarse beam before defaulting to exhaustive source parsing -ctf_no_exhaustive (optional) do not attempt exhaustive parse if CTF fails to find a parse. Additionally, script extools/coarsen_grammar.pl will create a coarse-to-fine grammar (for X?? categories *only*). cdec will read CTF grammars in a format identical to the original, in which refinements of a rule immediately follow the coarse projection, preceded by an additional whitespace character. Not fully tested, but should be backwards compatible. Also not yet integrated into pipelines, but should work on the command line. git-svn-id: https://ws10smt.googlecode.com/svn/trunk@231 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-09Fixing backoff grammar.olivia.buzek
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@214 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-09Added a bunch of features.philblunsom
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@209 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-08Adding backoff grammar and BackoffRule feature.olivia.buzek
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@191 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-08Fixed up evaluation pipeline. Should work now ...philblunsom
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@183 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-07fix bugsredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@180 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-07In unfinished state. DO NOT USEphilblunsom
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@179 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-06moreredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@164 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-06warningsgraehl
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@157 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-06featurizerredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@154 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-05use better featuresredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@135 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-02filter and score in a single fileredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@118 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-02sort / filter rules by p(e|f), permit more featuresredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@100 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-01filt typo that makes filtering too permissiveredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@95 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-01commentgraehl
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@90 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-07-01use named featuresredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@89 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-06-29bug fixredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@65 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-06-29isolate warning messagesredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@56 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-06-25optionally use unlimited cacheredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@27 ec762483-ff6d-05da-a07a-a48fb63a330f
2010-06-22initial checkinredpony
git-svn-id: https://ws10smt.googlecode.com/svn/trunk@2 ec762483-ff6d-05da-a07a-a48fb63a330f