-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathREADME.Rmd
46 lines (31 loc) · 3.81 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
---
output: github_document
author: '[Gede Primahadi Wijaya Rajeg](https://www.ling-phil.ox.ac.uk/people/gede-rajeg) <a itemprop="sameAs" content="https://orcid.org/0000-0002-2047-8621" href="https://orcid.org/0000-0002-2047-8621" target="orcid.widget" rel="noopener noreferrer" style="vertical-align:top;"><img src="https://orcid.org/sites/default/files/images/orcid_16x16.png" style="width:1em;margin-right:.5em;" alt="ORCID iD icon"></a><br>University of Oxford, UK & Universitas Udayana, Indonesia'
title: "Digitised comparative word list derived from von Rosenberg's \"Der Malayische Archipel: Land und Leute in Schilderungen, gesammelt während eines driessig-jährigen Aufenhaltes in den Kolonien\" from 1878."
bibliography: references.bib
csl: "https://raw.githubusercontent.com/engganolang/kahler-1987/refs/heads/main/unified-style-sheet-for-linguistics.csl"
link-citations: true
---
<!-- README.md is generated from README.Rmd. Please edit that file -->
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>"
)
```
<!-- badges: start -->
[![The University of Oxford](file-oxweb-logo.gif){width="84"}](https://www.ox.ac.uk/) [![Faculty of Linguistics, Philology and Phonetics, the University of Oxford](file-lingphil.png){width="83"}](https://www.ling-phil.ox.ac.uk/) [![Arts and Humanities Research Council (AHRC)](file-ahrc.png){width="325"}](https://www.ukri.org/councils/ahrc/) </br>*This work is part of the [AHRC-funded project](https://app.dimensions.ai/details/grant/grant.12915105) on the lexical resources for Enggano, led by the Faculty of Linguistics, Philology and Phonetics at the University of Oxford, UK. Visit the [central webpage of the Enggano project](https://enggano.ling-phil.ox.ac.uk/)*.
<p xmlns:cc="http://creativecommons.org/ns#">
This work is licensed under <a href="https://creativecommons.org/licenses/by-nc/4.0/?ref=chooser-v1" target="_blank" rel="license noopener noreferrer" style="display:inline-block;">Creative Commons Attribution-NonCommercial 4.0 International <img src="https://mirrors.creativecommons.org/presskit/icons/cc.svg?ref=chooser-v1" style="height:22px!important;margin-left:3px;vertical-align:text-bottom;"/><img src="https://mirrors.creativecommons.org/presskit/icons/by.svg?ref=chooser-v1" style="height:22px!important;margin-left:3px;vertical-align:text-bottom;"/><img src="https://mirrors.creativecommons.org/presskit/icons/nc.svg?ref=chooser-v1" style="height:22px!important;margin-left:3px;vertical-align:text-bottom;"/></a>
</p>
<!-- badges: end -->
## Overview
The work in this repository [@Rajeg_Digitised_comparative_word_2024] involves XML-tagging the relevant words (with their respective languages and German gloss) in the unstructured OCR output. The tagging is used to processed the OCR into a [tibble/table](https://github.com/complexico/vrosenberg1878/blob/main/data/vrosenberg1878.tsv). The comparative word list in von Rosenberg [-@vonrosenberg1878] includes words from the Enggano language and they are included in the Shiny app of the [*EnoLEX*](https://enggano.shinyapps.io/enolex/) database [@krausse_enolex_2024; @rajeg_enolex_2024].
The column `OldFormOrig` in the [table](https://github.com/complexico/vrosenberg1878/blob/main/data/vrosenberg1878.tsv) contains the original form/spelling in the source text while the `OldFormChange` contains the changes made (e.g., typo correction, adjustment, OCR error fixing) on the original form/spelling.
The `English` and `Indonesian` columns are translations in the two languages of the original German glosses of the forms. The translation was performed using the DeepL web translator.
## Contributors
Name | GitHub user | Description | Role
--- | --- | --- | ---
Rajeg, Gede Primahadi Wijaya | gederajeg | Data Curator, Digitisation, Software, Archiving | Author
Krauße, Daniel | | Data source gathering | other
## References