University of Oulu

Jouste, M., Mettovaara, J., Morottaja, P., & Partanen, N. (2022). Archive infrastructure and spoken language corpora for Saami languages in Finland. In K. Berglund, M. La Mela, I. Zwart (Eds.), Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), Uppsala, Sweden, March 15-18, 2022 (pp. 269-278). RWTH Aachen University.

Archive infrastructure and spoken language corpora for Saami languages in Finland

Author: Jouste, Marko1; Mettovaara, Jukka1; Morottaja, Petter1; Partanen, N.2
Organizations: 1University of Oulu, Finland
2University of Helsinki, Finland
Format: article
Version: published version
Access: open
Online Access: PDF Full Text (PDF, 1.3 MB)
Persistent link: http://urn.fi/urn:nbn:fi-fe2022102062652
Language: English
Published: RWTH Aachen University, 2022
Publish Date: 2022-10-20

Abstract

This study presents the results of an Aanaar Saami pilot project in the Saami Culture Archive, University of Oulu. The project has established a set of conventions for transcribing and annotating Aanaar Saami recordings in the archive’s collection and created a mechanism through which grammatically annotated but anonymous versions can be imported to the Korp search interface in the Language Bank of Finland. The practices make wide use of Saami language technology and Finnish computational research infrastructure, and they can later be extended to other Saami languages in the archive.

Series: CEUR workshop proceedings
ISSN: 1613-0073
ISSN-E: 1613-0073
ISSN-L: 1613-0073
Volume: 3232
Pages: 269 - 278
Host publication: Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022)
Host publication editor: Berglund, Karl
La Mela, Matti
Zwart, Inge
Conference: Digital Humanities in the Nordic and Baltic Countries
Type of Publication: A4 Article in conference proceedings
Field of Science: 6121 Languages
Copyright information: © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
  https://creativecommons.org/licenses/by/4.0/