A multidimensional approach to aligned sentences in translated text


  • Gard Buen Jenset University of Oxford
  • Lidun Hareide University of Bergen




Using unsupervised clustering techniques this study explores sentence alignment patterns in a parallel corpus of Norwegian source texts and Spanish translations, the NSPC (Hareide and Hofland 2012). The results show that three strategies with respect to sentence alignment dominate: one to one correspondence, merging two sentences into one, and removing sentences altogether (omission). The strategies are intricately correlated with the variables translator, author, and genre. However, we show how visualization techniques for cluster analyses offer a possibility for teasing apart these interactions as well as their relative importance. Our results indicate that non-fiction texts allow translators more freedom with respect to the treatment of sentences than do texts that are written by professional authors of fiction. The style of the author appears to play only a secondary role, but is especially important in fiction.


Keywords: corpus based translation, cluster analysis, parallel corpora, corpus alignment, unidirectional bilingual corpus




How to Cite

Jenset, Gard Buen, and Lidun Hareide. 2013. “A Multidimensional Approach to Aligned Sentences in Translated Text”. Bergen Language and Linguistics Studies 3 (1). https://doi.org/10.15845/bells.v3i1.356.