Download PDFOpen PDF in browser

Supracorpora Databases as Corpus-Based Superstructure for Manual Annotation of Parallel Corpora

13 pagesPublished: November 28, 2016

Abstract

This paper presents a new type on corpus-based information resource: supracorpora databases (SCDBs). SCDBs are designed to enhance functionality of linguistic corpora by supporting customizable manual annotation of linguistic items, including multi-word items. This is similar to query result categorization functions available in some corpora and to functions provided by some of the standalone corpus annotation tools, although many features supported by SCDBs are more sophisticated (e.g. they allow for detailed annotation of multi-word linguistic items, including specification of main words and immediate context). More importantly still, SCDBs allow researchers to create annotated translation correspondences (TCs) in parallel corpora. Aggregation of searchable TCs in a SCDB represents a unique information resource that facilitates creation of new explicit knowledge about cross-linguistic correspondences and translation models. An overview of four SCDBs developed up to date is also included in this paper.

Keyphrases: corpus design, parallel corpora, supracorpora databases, translation correspondence, translation studies

In: Antonio Moreno Ortiz and Chantal Pérez-Hernández (editors). CILC2016. 8th International Conference on Corpus Linguistics, vol 1, pages 236--248

Links:
BibTeX entry
@inproceedings{CILC2016:Supracorpora_Databases_as_Corpus_Based,
  author    = {Mikhail Kruzhkov},
  title     = {Supracorpora Databases as Corpus-Based Superstructure for Manual Annotation of Parallel Corpora},
  booktitle = {CILC2016. 8th International Conference on Corpus Linguistics},
  editor    = {Antonio Moreno Ortiz and Chantal P\textbackslash{}'erez-Hern\textbackslash{}'andez},
  series    = {EPiC Series in Language and Linguistics},
  volume    = {1},
  pages     = {236--248},
  year      = {2016},
  publisher = {EasyChair},
  bibsource = {EasyChair, https://easychair.org},
  issn      = {2398-5283},
  url       = {https://easychair.org/publications/paper/jFjs},
  doi       = {10.29007/fxqj}}
Download PDFOpen PDF in browser