Skip to content

This repository contains a curated dataset of Ottoman Turkish words with IJMES-style transliterations. Currently it has only words with Arabic origin consisting of 75,189 words.

Notifications You must be signed in to change notification settings

Koras0ff/ottoman-lexicon

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

ottoman-lexicon

This repository contains a curated dataset of Ottoman Turkish words with IJMES-style transliterations. Currently it has only words with Arabic origin consisting of 97,073 words.

The data sources are Vankulu Lügâtı (Türkiye Yazma Eserler Kurumu Başkanlığı, 2014) and Kâmûsu'l-Muhît written by Âsım Efendi (Türkiye Yazma Eserler Kurumu Başkanlığı, 2013). These two works were structured for the entries without the word's definition part as JSON files and TXT files which enables users to make direct searches and use the data for their specific studies without an exhaustive preprocessing step.

Unlike various Ottoman Turkish lexicons on the Internet that can be freely accessed, this lexicon utilizes IJMES Transliteration System for Ottoman Turkish.

References:

Türkiye Yazma Eserler Kurumu Başkanlığı. (2013). el-Okyânûsu’l-Basît fî Tercemeti’l-Kâmûsi’l-Muhît: Kâmûsu’l-Muhît Tercümesi, 1. Cilt (Prof. Dr. Mustafa Koç, Ed.). Türkiye Yazma Eserler Kurumu Başkanlığı Yayınları. ISBN: 978-975-17-3658-1.

Türkiye Yazma Eserler Kurumu Başkanlığı. (2014). Vankulu Lügati, 1. Cilt (Mustafa Koç, Ed.). Türkiye Yazma Eserler Kurumu Başkanlığı Yayınları. ISBN: 978-975-17-3761-8.

About

This repository contains a curated dataset of Ottoman Turkish words with IJMES-style transliterations. Currently it has only words with Arabic origin consisting of 75,189 words.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published