From 9f35941087e1f4d54a9275bf943c98fe6f444b22 Mon Sep 17 00:00:00 2001 From: "ccb@cs.jhu.edu" Date: Fri, 13 Aug 2010 20:42:41 +0000 Subject: Editing intro git-svn-id: https://ws10smt.googlecode.com/svn/trunk@545 ec762483-ff6d-05da-a07a-a48fb63a330f --- report/intro_slides/ws10-graehl.pdf | Bin 0 -> 611890 bytes report/intro_slides/ws10-graehl.pptx | Bin 0 -> 261421 bytes report/introduction.tex | 8 ++++---- report/ws10-graehl.pdf | Bin 611890 -> 0 bytes report/ws10-graehl.pptx | Bin 261421 -> 0 bytes 5 files changed, 4 insertions(+), 4 deletions(-) create mode 100755 report/intro_slides/ws10-graehl.pdf create mode 100755 report/intro_slides/ws10-graehl.pptx delete mode 100755 report/ws10-graehl.pdf delete mode 100755 report/ws10-graehl.pptx (limited to 'report') diff --git a/report/intro_slides/ws10-graehl.pdf b/report/intro_slides/ws10-graehl.pdf new file mode 100755 index 00000000..0c5448a3 Binary files /dev/null and b/report/intro_slides/ws10-graehl.pdf differ diff --git a/report/intro_slides/ws10-graehl.pptx b/report/intro_slides/ws10-graehl.pptx new file mode 100755 index 00000000..4906b47a Binary files /dev/null and b/report/intro_slides/ws10-graehl.pptx differ diff --git a/report/introduction.tex b/report/introduction.tex index 21e0e907..3b673c8e 100644 --- a/report/introduction.tex +++ b/report/introduction.tex @@ -1,12 +1,12 @@ \chapter{Introduction} -Automatically generating high quality translations for foreign texts remains a central challenge for Natural Language Processing research. -Recent advances in Statistical Machine Translation (SMT) has enabled these technologies to move out of research labs an become viable commercial products and ubiquitous online tools. \footnote{e.g., translate.google.com, www.systran.co.uk, www.languageweaver.com} +Automatically generating high quality translations for foreign texts remains a central challenge for natural language processing research. +Recent advances in statistical machine translation (SMT) has enabled these technologies to move out of research labs an become viable commercial products and useful online tools. \footnote{e.g., translate.google.com, www.systran.co.uk, www.languageweaver.com} However these successes have not been uniform; current state-of-the-art translation output varies markedly in quality depending on the languages being translated. -Those language pairs that are closely related language pairs (e.g., English and French) can be translated with a high degree of precision, while for distant pairs (e.g., English and Chinese) the result is far from acceptable. +Those language pairs that are closely related language pairs (e.g., English and French) can be translated with high quality, while for distant pairs (e.g., English and Chinese) the result tends to be much lower quality. It is tempting to argue that SMT's current limitations can be overcome simply by increasing the amount of data on which the systems are trained. -However, large scale evaluation campaigns for Chinese~$\rightarrow$~English translation have not yielded the expected gains despite the increasing size of the models. +However, large scale evaluation campaigns for Chinese~$\rightarrow$~English translation, such as the DARPA GALE program, have not yielded high quality translation despite providing hundreds of millions of words worth of training data. \begin{figure}[t] \centering \includegraphics[scale=0.55]{urdu_example_translation.pdf} diff --git a/report/ws10-graehl.pdf b/report/ws10-graehl.pdf deleted file mode 100755 index 0c5448a3..00000000 Binary files a/report/ws10-graehl.pdf and /dev/null differ diff --git a/report/ws10-graehl.pptx b/report/ws10-graehl.pptx deleted file mode 100755 index 4906b47a..00000000 Binary files a/report/ws10-graehl.pptx and /dev/null differ -- cgit v1.2.3