Cologne standard IAST extension #392

Open
funderburkjim opened this issue Feb 17, 2022 · 5 comments
Labels
Documentation How TXT , XML work

Comments

@funderburkjim
Contributor

Recent work on the IAST in the MD dictionary led to several improvements to the Cologne extension of the IAST standard.
Because transcoding between slp1 and iast is relevant to many dictionaries, this work is documented in the iast directory of this repository.

funderburkjim added a commit that referenced this issue Feb 17, 2022
@funderburkjim
Contributor Author

  • slp1_roman.xml contains the rules for transcoding from slp1 to roman (iast).
  • roman_slp1.xml contains the rules for transcoding from roman (iast) to slp1.
  • slp1_iast.txt contains a more readable form of the two transliterations.
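
To illustrate how the two rule files relate, here is a minimal Python sketch of table-driven transcoding in both directions. It is not the Cologne implementation and does not read the XML files: the `SLP1_TO_IAST` table is a small hand-picked subset of standard slp1 letters, and the function names `slp1_to_iast` and `iast_to_slp1` are hypothetical. The reverse table is derived from the forward one, which is one simple way to keep the two directions consistent.

```python
import unicodedata

# Illustrative subset only (an assumption for this sketch); the authoritative
# rules are in slp1_roman.xml and roman_slp1.xml.
SLP1_TO_IAST = {
    "A": "\u0101",  # a with macron
    "I": "\u012b",  # i with macron
    "U": "\u016b",  # u with macron
    "f": "\u1e5b",  # r with dot below
    "M": "\u1e43",  # m with dot below (anusvara)
    "H": "\u1e25",  # h with dot below (visarga)
    "S": "\u015b",  # s with acute
    "z": "\u1e63",  # s with dot below
    "R": "\u1e47",  # n with dot below
    "K": "kh",
    "E": "ai",
    "O": "au",
}

# Reverse table built from the forward one so the two cannot drift apart.
IAST_TO_SLP1 = {iast: slp1 for slp1, iast in SLP1_TO_IAST.items()}
_MAX_KEY = max(len(key) for key in IAST_TO_SLP1)


def slp1_to_iast(text):
    """Map each slp1 character through the table; unknown characters pass through."""
    return "".join(SLP1_TO_IAST.get(ch, ch) for ch in text)


def iast_to_slp1(text):
    """Longest-match transcoding from IAST to slp1 after NFC normalization."""
    text = unicodedata.normalize("NFC", text)
    out = []
    i = 0
    while i < len(text):
        for size in range(_MAX_KEY, 0, -1):
            chunk = text[i:i + size]
            if chunk in IAST_TO_SLP1:
                out.append(IAST_TO_SLP1[chunk])
                i += len(chunk)
                break
        else:
            # No rule matched: copy the character unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)
```

Longest-match is needed in the IAST to slp1 direction because digraphs such as `ai` and `kh` would otherwise be split into single letters. The sketch always prefers the digraph, so a genuine sequence of separate `a` and `i` would be mis-transcoded; a real rule set has to handle such cases explicitly.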

@funderburkjim
Contributor Author

consistency

These two xml transcoding rule files should be kept consistent with similarly named files in:

@funderburkjim
Contributor Author

accent copy-paste

There are sometimes multiple, visually indistinguishable Unicode representations of a Latin letter with diacritics.
For example, the letter a with acute (udatta) accent can be represented by either

  • the precomposed Unicode code point \u00e1 LATIN SMALL LETTER A WITH ACUTE
  • the two-character sequence \u0061\u0301 LATIN SMALL LETTER A + COMBINING ACUTE ACCENT

When correcting IAST in a text file, copy and paste the characters from
the slp1_iast.txt file. This practice helps keep the coding consistent
across the various dictionaries.
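
The difference between the two representations can be made visible with Python's standard `unicodedata` module. The sketch below compares the two forms of accented a and flags lines of text that are not in NFC form. The helper names `to_nfc` and `non_nfc_lines` are hypothetical, and whether slp1_iast.txt follows NFC throughout is an assumption, not something stated in this thread.

```python
import unicodedata

PRECOMPOSED = "\u00e1"       # LATIN SMALL LETTER A WITH ACUTE
DECOMPOSED = "\u0061\u0301"  # LATIN SMALL LETTER A + COMBINING ACUTE ACCENT


def to_nfc(text):
    """Return the NFC (canonical composed) form of text."""
    return unicodedata.normalize("NFC", text)


def non_nfc_lines(lines):
    """Return the 1-based numbers of lines that change under NFC normalization."""
    return [n for n, line in enumerate(lines, start=1) if to_nfc(line) != line]


# The two strings render identically but compare unequal until normalized.
print(PRECOMPOSED == DECOMPOSED)          # False
print(to_nfc(DECOMPOSED) == PRECOMPOSED)  # True
```

A check like `non_nfc_lines` could be run over a dictionary file to find places where a visually correct character was typed with a different code point sequence.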

funderburkjim added a commit to sanskrit-lexicon/csl-apidev that referenced this issue Feb 17, 2022
funderburkjim added a commit to sanskrit-lexicon/csl-websanlexicon that referenced this issue Feb 17, 2022
@gasyoun gasyoun added the Documentation How TXT , XML work label Feb 17, 2022
@gasyoun
Member

gasyoun commented Feb 17, 2022

Cologne extension to IAST standard

Is there a page dedicated to it on the Cologne website itself, @funderburkjim?

@Andhrabharati

H\ ḥ̀ ( Ḥ̀ ) \u1e25\u0300 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING GRAVE ACCENT
H/ ḥ́ ( Ḥ́ ) \u1e25\u0301 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING ACUTE ACCENT
H^ ḥ̂ ( Ḥ̂ ) \u1e25\u0302 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING CIRCUMFLEX ACCENT
M\ ṃ̀ ( Ṃ̀ ) \u1e43\u0300 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING GRAVE ACCENT
M/ ṃ́ ( Ṃ́ ) \u1e43\u0301 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING ACUTE ACCENT
M^ ṃ̂ ( Ṃ̂ ) \u1e43\u0302 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING CIRCUMFLEX ACCENT
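
The six sequences above can be generated and checked programmatically. The following Python sketch builds them from the slp1 accent marks as they appear in the table and uses the standard `unicodedata` module to print the character names. The table names `BASES`, `ACCENTS`, `ACCENTED` and the helper `describe` are hypothetical and only illustrate the listing.

```python
import unicodedata

# slp1 letter -> IAST base character (visarga and anusvara).
BASES = {"H": "\u1e25", "M": "\u1e43"}
# slp1 accent mark -> combining accent, as listed in the table above.
ACCENTS = {"\\": "\u0300", "/": "\u0301", "^": "\u0302"}

# Accent written AFTER the base letter, matching the order shown in the table.
ACCENTED = {
    slp1 + mark: base + accent
    for slp1, base in BASES.items()
    for mark, accent in ACCENTS.items()
}


def describe(seq):
    """Join the Unicode names of the code points in seq with ' + '."""
    return " + ".join(unicodedata.name(ch) for ch in seq)


for slp1, iast in ACCENTED.items():
    print(slp1, iast, iast.upper(), describe(iast))
```

None of these six combinations has a single precomposed code point, so each remains a two-code-point sequence even after NFC normalization. NFC also reorders the marks canonically, so `m` + acute + dot below normalizes to the same sequence as `\u1e43\u0301`.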

@funderburkjim

After some prolonged discussions about six months ago,
sanskrit-lexicon/PWG#5 (comment)
sanskrit-lexicon/PWG#5 (comment)
I thought you had zeroed in on placing the accents before visarga and anusvara.

Did you subsequently change your mind and decide to keep the accents after them?
