Language Engineering for Syntactic Knowledge Transfer

Mihaela Colhon1

  1. Ericsson R&D
    Frankfurt, Germany
    tom.dinkelaker@ericsson.com
  2. Department of Computer Science
    A.I. Cuza Street 13, 200585 Craiova, Romania
    mcolhon@inf.ucv.ro

Abstract

In this paper we present a method for constructing an English-Romanian treebank, together with the evaluation results obtained. The treebank is built upon a parallel English-Romanian corpus that is word-aligned and annotated at the morphological and syntactic levels. The syntactic trees of the Romanian texts are generated from the syntactic phrases of the parallel English texts, which are obtained automatically by syntactic parsing. The method reuses and adjusts existing tools and algorithms for cross-lingual transfer of syntactic constituents and for syntactic tree alignment.
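To make the idea of cross-lingual constituent transfer concrete, the following is a minimal sketch of projecting phrase spans from an English parse onto Romanian tokens through word alignments. It is an illustration under stated assumptions, not the algorithm of the paper: the function name `project_constituents`, the span representation and the smallest-covering-span rule are all hypothetical choices made for this example.

```python
from typing import Dict, List, Set, Tuple

# A constituent is (label, start, end) with an inclusive token span.
# This representation is an assumption made for the sketch.
Constituent = Tuple[str, int, int]


def project_constituents(
    source_constituents: List[Constituent],
    alignments: Set[Tuple[int, int]],
) -> List[Constituent]:
    """Project source-side spans onto the target side via word alignments.

    Each source span is mapped to the smallest target span covering all
    target tokens aligned to the source tokens inside it.
    """
    # Map each source token index to the target indices it is aligned with.
    src_to_tgt: Dict[int, List[int]] = {}
    for src, tgt in alignments:
        src_to_tgt.setdefault(src, []).append(tgt)

    projected: List[Constituent] = []
    for label, start, end in source_constituents:
        targets = [
            t for s in range(start, end + 1) for t in src_to_tgt.get(s, [])
        ]
        if not targets:
            continue  # no aligned word inside the span: nothing to project
        projected.append((label, min(targets), max(targets)))
    return projected


# Hypothetical example: English "the red car" vs. Romanian "mașina roșie".
# English tokens: the(0) red(1) car(2); Romanian tokens: mașina(0) roșie(1).
english_phrases = [("NP", 0, 2), ("ADJP", 1, 1)]
word_alignments = {(0, 0), (1, 1), (2, 0)}
romanian_phrases = project_constituents(english_phrases, word_alignments)
```

Taking the smallest covering span keeps the sketch short, but it can produce overlapping or crossing target phrases when the word order of the two languages differs, so a real pipeline would need an additional consistency step.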

Key words

parallel treebank, syntactic phrase alignment, bilingual corpus, word-alignments

Digital Object Identifier (DOI)

https://doi.org/10.2298/CSIS120130032C

Publication information

Volume 9, Issue 3 (September 2012)
Special Issue on Advances in Computer Languages, Modeling and Agents
Year of Publication: 2012
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium

How to cite

Colhon, M.: Language Engineering for Syntactic Knowledge Transfer. Computer Science and Information Systems, Vol. 9, No. 3, 1231-1248. (2012), https://doi.org/10.2298/CSIS120130032C