<feed xmlns='http://www.w3.org/2005/Atom'>
<title>cdec-dtrain/training, branch word-alignment</title>
<subtitle>Mirror of https://github.com/pks/cdec-dtrain.git
</subtitle>
<id>https://git.simianer.de/mirrored/cdec-dtrain/atom?h=word-alignment</id>
<link rel='self' href='https://git.simianer.de/mirrored/cdec-dtrain/atom?h=word-alignment'/>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/'/>
<updated>2010-02-19T03:34:17Z</updated>
<entry>
<title>check in modified ones too</title>
<updated>2010-02-19T03:34:17Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2010-02-19T03:34:17Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=3a7bca942d838f945c1cd0cbe5977e20c61ebc2d'/>
<id>urn:sha1:3a7bca942d838f945c1cd0cbe5977e20c61ebc2d</id>
<content type='text'>
</content>
</entry>
<entry>
<title>add generative word alignment model and primitive EM trainer. Model 1 and HMM are supported, without NULL source words</title>
<updated>2010-02-18T22:06:59Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2010-02-18T22:06:59Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=4d47dbd7da0434de67ac619392d516c678e1f2ca'/>
<id>urn:sha1:4d47dbd7da0434de67ac619392d516c678e1f2ca</id>
<content type='text'>
</content>
</entry>
<entry>
<title>word aligner cleanup, new features</title>
<updated>2010-02-01T22:38:39Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2010-02-01T22:38:39Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=c97b8a8b58f7385fb48b74e2cf1ea9610cd1202f'/>
<id>urn:sha1:c97b8a8b58f7385fb48b74e2cf1ea9610cd1202f</id>
<content type='text'>
</content>
</entry>
<entry>
<title>fix min</title>
<updated>2010-01-25T18:44:58Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2010-01-25T18:44:58Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=9e7d60da4421074d279a91cb6e4e67438add4645'/>
<id>urn:sha1:9e7d60da4421074d279a91cb6e4e67438add4645</id>
<content type='text'>
</content>
</entry>
<entry>
<title>more autoconf fixes- use version of boost m4 macros which are much, much, much better</title>
<updated>2010-01-24T09:28:43Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2010-01-24T09:28:43Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=9670fcbc15419c04f9a03d2118c3c697ed1423a8'/>
<id>urn:sha1:9670fcbc15419c04f9a03d2118c3c697ed1423a8</id>
<content type='text'>
</content>
</entry>
<entry>
<title>Support building without gtest</title>
<updated>2010-01-24T08:36:21Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2010-01-24T08:36:21Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=613ccc6ea2401e76099a97d3698f13b8ea283497'/>
<id>urn:sha1:613ccc6ea2401e76099a97d3698f13b8ea283497</id>
<content type='text'>
Now the only dependence is boost, which most modern linux distros have.
</content>
</entry>
<entry>
<title>add alignment visualization tool</title>
<updated>2010-01-19T02:57:23Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2010-01-19T02:57:23Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=a216521744cd4bf1c9935d99c5e53e4198301b57'/>
<id>urn:sha1:a216521744cd4bf1c9935d99c5e53e4198301b57</id>
<content type='text'>
</content>
</entry>
<entry>
<title>cool new alignment feature</title>
<updated>2009-12-19T19:32:28Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2009-12-19T19:32:28Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=27db9d8c05188f64c17d61c394d3dafe8b8e93d8'/>
<id>urn:sha1:27db9d8c05188f64c17d61c394d3dafe8b8e93d8</id>
<content type='text'>
</content>
</entry>
<entry>
<title>add symmetrization heuristics to atools, add null word configuration</title>
<updated>2009-12-19T03:51:11Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2009-12-19T03:51:11Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=544da4d8e42858b19e6229936df56d44d61b1f38'/>
<id>urn:sha1:544da4d8e42858b19e6229936df56d44d61b1f38</id>
<content type='text'>
</content>
</entry>
<entry>
<title>added non-pruning intersection and a CRF tagger</title>
<updated>2009-12-17T18:57:54Z</updated>
<author>
<name>Chris Dyer</name>
<email>redpony@gmail.com</email>
</author>
<published>2009-12-17T18:57:54Z</published>
<link rel='alternate' type='text/html' href='https://git.simianer.de/mirrored/cdec-dtrain/commit/?id=bba4ff830c8722cdcaf29e36c1ff5821a912ae5d'/>
<id>urn:sha1:bba4ff830c8722cdcaf29e36c1ff5821a912ae5d</id>
<content type='text'>
- the linear-chain tagger is more of a proof of concept than a real tagger-- the context-free assumptions made in a number of places mean that the algorithms used may not be as efficient as they could be, but the model is as powerful as any CRF
- it would be easy to add latent variables or semi-CRF support (or both!)
- i've added a couple basic features that are often used for POS tagging
- non-pruning intersection is useful for lexical word alignment models and the tagger
- a sample POS tagger model will be committed later
</content>
</entry>
</feed>
