Language Engineering for Syntactic Knowledge Transfer
Ericsson R&D
Frankfurt, Germany
tom.dinkelaker@ericsson.com
Department of Computer Science
A.I.Cuza street 13, 200585 Craiova, Romania
mcolhon@inf.ucv.ro
Abstract
In this paper we present a method for constructing an English-Romanian treebank, together with the evaluation results obtained. The treebank is built upon a parallel English-Romanian corpus that is word-aligned and annotated at the morphological and syntactic levels. The syntactic trees of the Romanian texts are generated from the syntactic phrases of the parallel English texts, which are produced automatically by syntactic parsing. The method reuses and adapts existing tools and algorithms for the cross-lingual transfer of syntactic constituents and for syntactic tree alignment.
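The idea of transferring syntactic phrases from the English side to the Romanian side through word alignments can be illustrated with a minimal sketch. It is not the paper's algorithm, only a generic span-projection heuristic under stated assumptions: phrases are given as labelled token spans, word alignments are pairs of token indices, and a phrase is transferred only when its aligned target tokens are not also linked to source tokens outside the phrase. The function name `project_spans`, the example sentence, and the hand-made alignment are all hypothetical.

```python
from typing import List, Set, Tuple

# (label, first token index, last token index), both indices inclusive
Span = Tuple[str, int, int]
# (source token index, target token index)
Alignment = Set[Tuple[int, int]]


def project_spans(source_spans: List[Span], alignment: Alignment) -> List[Span]:
    """Project labelled source spans onto the target sentence (illustrative heuristic)."""
    projected = []
    for label, start, end in source_spans:
        # Target tokens linked to any source token inside the phrase.
        targets = [t for s, t in alignment if start <= s <= end]
        if not targets:
            continue  # nothing to project, e.g. a dropped subject pronoun
        lo, hi = min(targets), max(targets)
        # Keep the span only if no target token inside it is linked
        # to a source token outside the original phrase.
        consistent = all(start <= s <= end for s, t in alignment if lo <= t <= hi)
        if consistent:
            projected.append((label, lo, hi))
    return projected


# Hypothetical example, aligned by hand for illustration only.
english = ["I", "see", "the", "red", "car"]
romanian = ["Văd", "mașina", "roșie"]
# "the" and "car" both link to "mașina" (enclitic definite article); "I" is unaligned.
alignment = {(1, 0), (2, 1), (4, 1), (3, 2)}
english_spans = [("NP", 2, 4), ("VP", 1, 4), ("S", 0, 4)]

romanian_spans = project_spans(english_spans, alignment)
```

In this toy case the noun phrase "the red car" maps to the Romanian tokens 1 to 2 ("mașina roșie"), while a span such as "the red" alone would be rejected because "mașina" is also linked to "car".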
Key words
parallel treebank, syntactic phrase alignment, bilingual corpus, word-alignments
Digital Object Identifier (DOI)
https://doi.org/10.2298/CSIS120130032C
Publication information
Volume 9, Issue 3 (September 2012)
Special Issue on Advances in Computer Languages, Modeling and Agents
Year of Publication: 2012
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium
How to cite
Colhon, M.: Language Engineering for Syntactic Knowledge Transfer. Computer Science and Information Systems, Vol. 9, No. 3, 1231-1248. (2012), https://doi.org/10.2298/CSIS120130032C