Papers are invited for the 5th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA. The event will be organized by ACL SIGWAC (Special Interest Group on Web as Corpus) and FLaReNet (Fostering Language Resources Network).
Topics related to the special theme:
- Comparable corpora use in MT
- Comparable corpora processing tools/kits for MT
- Parallel corpora usage
- Parallel corpora processing tools/platforms
- MT for less-resourced languages
- MT for less-resourced domains
- Open source SMT systems (Moses, etc.)
- Publicly available SMT
Building Comparable Corpora:
- Human translations
- Automatic and semi-automatic methods
- Methods to mine parallel and non-parallel corpora from the Web
- Tools and criteria to evaluate the comparability of corpora
- Parallel vs non-parallel corpora, monolingual corpora
- Rare and minority languages
- Across language families
- Multi-media/multi-modal comparable corpora
Applications of comparable corpora:
- Human translations
- Language learning
- Cross-language information retrieval & document categorization
- Bilingual projections
- Machine translation
- Writing assistance
Mining from Comparable Corpora:
- Extraction of parallel segments or paraphrases from comparable corpora
- Extraction of bilingual and multilingual translations of single words and multi-word expressions, proper names, and named entities
IMPORTANT DATES
- 15 February 2012: Deadline for submission of full papers
- 10 March 2012: Notification of acceptance
- 20 March 2012: Camera-ready papers due
- 26 May 2012: Workshop date
DEADLINE FOR PAPERS: 15 February 2012