Forty years of working with corpora: from Ibsen to Twitter, and beyond

Knut Hofland; Paul Meurer; Andrew Salway

doi:10.15845/bells.v3i1.371

Forty years of working with corpora: from Ibsen to Twitter, and beyond

Authors

Knut Hofland University of Bergen
Paul Meurer University of Bergen
Andrew Salway University of Bergen

DOI:

https://doi.org/10.15845/bells.v3i1.371

Abstract

We provide an overview of forty years of work with language corpora by the research group that started in 1972 as the Norwegian Computing Centre for the Humanities. A brief history highlights major corpora and tools that have been developed in numerous collaborations, including corpora of literature, dialect recordings, learner language, parallel texts, newspaper articles, blog posts and tweets. Current activities are also described, with a focus on corpus analysis tools, treebanks and social media analysis.

Keywords: corpus building; corpus analysis tools; treebanks; social media analysis

Downloads

Published

2013-04-10

How to Cite

Hofland, Knut, Paul Meurer, and Andrew Salway. 2013. “Forty Years of Working With Corpora: From Ibsen to Twitter, and Beyond”. Bergen Language and Linguistics Studies 3 (1). https://doi.org/10.15845/bells.v3i1.371.