Sustaining a Corpus for Spoken Turkish Discourse: Accessibility and Corpus Management Issues


Ruhi S., ERÖZ TUĞA B., HATİPOĞLU Ç., IŞIK GÜLER H., Acar M. G. C., Eryilmaz K., ...More

7th International Conference on Language Resources and Evaluation (LREC), Valletta, Malta, 17 - 23 May 2010 identifier

  • Publication Type: Conference Paper / Full Text
  • Volume:
  • City: Valletta
  • Country: Malta
  • Hacettepe University Affiliated: Yes

Abstract

This paper addresses the issues of the long-term availability of language resources and the financing of resource maintenance in the context of the web-based corpus management system employed in the Spoken Turkish Corpus (STC), which operates with EXMARaLDA. Section 2 overviews the capacities of the corpus management system with respect to its software infrastructure, online presentation, metadata management, and interoperability. Section 3 describes the plan foreseen in STC for sustaining the resource, and dwells on the ethical issues surrounding the conflicting demands of free resources for non-commercial research and resource maintenance.