Skip to content

Latest commit

 

History

History
112 lines (86 loc) · 5.26 KB

README.md

File metadata and controls

112 lines (86 loc) · 5.26 KB

Digitised comparative word list derived from von Rosenberg’s “Der Malayische Archipel: Land und Leute in Schilderungen, gesammelt während eines driessig-jährigen Aufenhaltes in den Kolonien” from 1878.

Gede Primahadi Wijaya Rajeg ORCID iD icon
University of Oxford, UK & Universitas Udayana, Indonesia

The University of Oxford Faculty of Linguistics, Philology and Phonetics, the University of Oxford Arts and Humanities Research Council (AHRC)
This work is part of the AHRC-funded project on the lexical resources for Enggano, led by the Faculty of Linguistics, Philology and Phonetics at the University of Oxford, UK. Visit the central webpage of the Enggano project.

This work is licensed under Creative Commons Attribution-NonCommercial 4.0 International

Overview

The current work in this repository (Rajeg 2024) is XML-tagging the relevant words (with their respective languages and German gloss) in the unstructured OCR output. The tagging is used to processed the OCR into a tibble/table. The comparative word list in von Rosenberg (1878) includes words from the Enggano language and they are included in the Shiny app of the EnoLEX database (Krauße et al. 2024; Rajeg, Krauße & Pramartha 2024).

The column OldFormOrig in the table contains the original form/spelling in the source text while the OldFormChange contains the changes made (e.g., typo correction, adjustment, OCR error fixing) on the original form/spelling.

The English and Indonesian columns are translations in the two languages of the original German glosses of the forms. The translation was performed using the DeepL web translator.

Contributors

Name GitHub user Description Role
Rajeg, Gede Primahadi Wijaya gederajeg Data Curator, Digitisation, Software, Archiving Author
Krauße, Daniel Data source gathering other

References

Krauße, Daniel, Gede Primahadi Wijaya Rajeg, Cokorda Pramartha, Erik Zobel, Charlotte Hemmings, I Wayan Arka & Mary Dalrymple. 2024. EnoLEX: A diachronic lexical database of the Enggano language. Online database. GitHub & Shiny web application: https://github.com/engganolang/enolex. https://enggano.shinyapps.io/enolex/.

Rajeg, Gede Primahadi Wijaya. 2024. Digitised comparative word list derived from von Rosenberg’s “Der Malayische Archipel: Land und Leute in Schilderungen, gesammelt während eines driessig-jährigen Aufenhaltes in den Kolonien” from 1878. https://github.com/complexico/vrosenberg1878.

Rajeg, Gede Primahadi Wijaya, Daniel Krauße & Cokorda Pramartha. 2024. EnoLEX: A diachronic lexical database for the Enggano language. In Ai Inoue, Naho Kawamoto & Makoto Sumiyoshi (eds.), AsiaLex 2024 proceedings: Asian Lexicography - Merging cutting-edge and established approaches, 123–132. Toyo University, Tokyo, Japan. https://doi.org/10.25446/oxford.27013864.

Rosenberg, Carl Benjamin Hermann von. 1878. Der malayische archipel: Land und leute in schilderungen, gesammelt während eines driessig-jährigen aufenhaltes in den kolonien. Leipzig: Gustav Weigel. https://hdl.handle.net/2027/mdp.39015065356076.