Sustaining a Corpus for Spoken Turkish Discourse: Accessibility and Corpus Management Issues


Ruhi S., ERÖZ TUĞA B., HATİPOĞLU Ç., IŞIK GÜLER H., Acar M. G. C., Eryilmaz K., ...Daha Fazla

7th International Conference on Language Resources and Evaluation (LREC), Valletta, Malta, 17 - 23 Mayıs 2010 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası:
  • Basıldığı Şehir: Valletta
  • Basıldığı Ülke: Malta
  • Hacettepe Üniversitesi Adresli: Evet

Özet

This paper addresses the issues of the long-term availability of language resources and the financing of resource maintenance in the context of the web-based corpus management system employed in the Spoken Turkish Corpus (STC), which operates with EXMARaLDA. Section 2 overviews the capacities of the corpus management system with respect to its software infrastructure, online presentation, metadata management, and interoperability. Section 3 describes the plan foreseen in STC for sustaining the resource, and dwells on the ethical issues surrounding the conflicting demands of free resources for non-commercial research and resource maintenance.