Cologne standard IAST extension #392

funderburkjim · 2022-02-17T03:23:50Z

In recent work on the MD dictionary IAST, several improvements were made to the Cologne extension of the IAST standard.
Since the transcoding between slp1 and iast is relevant to many dictionaries, this work is documented in the iast directory of this repository.

Ref #392

funderburkjim · 2022-02-17T03:32:55Z

slp1_roman.xml contains the rules for transcoding from slp1 to roman (iast).
roman_slp1.xml contains the rules for transcoding from roman (iast) to slp1.
slp1_iast.txt contains a more readable form of the two transcodings.

funderburkjim · 2022-02-17T03:35:54Z

consistency

These two xml transcoding rule files should be kept consistent with similarly named files in:

funderburkjim · 2022-02-17T03:45:48Z

accent copy-paste

Latin letters with diacritics sometimes have multiple Unicode representations that are visually indistinguishable.
For example, the letter a with acute (udatta) accent can be represented by either

the precomposed Unicode code point \u00e1 LATIN SMALL LETTER A WITH ACUTE
the two-character sequence \u0061\u0301 LATIN SMALL LETTER A + COMBINING ACUTE ACCENT

When any iast in a text file needs correcting, it is advisable to copy-paste
the characters from the slp1_iast.txt file. This practice helps keep the coding consistent
across the various dictionaries.
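
The difference between the two representations can be checked programmatically. The following is a minimal sketch using only Python's standard unicodedata module; the helper names code_points and same_text are made up for this illustration and are not part of any Cologne tooling.

```python
import unicodedata

precomposed = "\u00e1"       # LATIN SMALL LETTER A WITH ACUTE
decomposed = "\u0061\u0301"  # LATIN SMALL LETTER A + COMBINING ACUTE ACCENT


def code_points(text):
    """Return the code points of text as U+XXXX strings (illustrative helper)."""
    return [f"U+{ord(ch):04X}" for ch in text]


def same_text(a, b):
    """Compare two strings after NFC normalization (illustrative helper)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# The two strings look identical on screen but are different code point sequences.
print(code_points(precomposed), code_points(decomposed))
print("raw comparison:", precomposed == decomposed)
print("NFC comparison:", same_text(precomposed, decomposed))
```

A plain string comparison treats the two forms as different, which is why a search or a diff can miss text that looks correct. Comparing after normalization is one way to detect such cases; copying from slp1_iast.txt avoids introducing them in the first place.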

Ref: sanskrit-lexicon/COLOGNE#392

gasyoun · 2022-02-17T05:59:35Z

Cologne extension to IAST standard

Is there a page dedicated to in on Cologne website itself, @funderburkjim ?

Andhrabharati · 2022-02-17T07:05:56Z

H\ ḥ̀ ( Ḥ̀ ) \u1e25\u0300 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING GRAVE ACCENT
H/ ḥ́ ( Ḥ́ ) \u1e25\u0301 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING ACUTE ACCENT
H^ ḥ̂ ( Ḥ̂ ) \u1e25\u0302 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING CIRCUMFLEX ACCENT
M\ ṃ̀ ( Ṃ̀ ) \u1e43\u0300 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING GRAVE ACCENT
M/ ṃ́ ( Ṃ́ ) \u1e43\u0301 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING ACUTE ACCENT
M^ ṃ̂ ( Ṃ̂ ) \u1e43\u0302 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING CIRCUMFLEX ACCENT

@funderburkjim

After some prolonged discussions [just about 6 months back],
sanskrit-lexicon/PWG#5 (comment)
sanskrit-lexicon/PWG#5 (comment)
I thought you'd zeroed in on having accents before visarga & anusvara.

Did you subsequently change your mind, to keep the accents after them?
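
For reference, the six rows quoted at the top of this comment can be written out as a small lookup table. This is only a sketch of those rows, not the actual transcoder (which uses the xml rule files); the names ACCENTED and describe are assumptions made for the illustration.

```python
import unicodedata

GRAVE, ACUTE, CIRCUMFLEX = "\u0300", "\u0301", "\u0302"  # combining accents
VISARGA, ANUSVARA = "\u1e25", "\u1e43"                   # h / m with dot below

# slp1 sequence -> iast sequence, copied from the six rows listed above.
ACCENTED = {
    "H\\": VISARGA + GRAVE,
    "H/": VISARGA + ACUTE,
    "H^": VISARGA + CIRCUMFLEX,
    "M\\": ANUSVARA + GRAVE,
    "M/": ANUSVARA + ACUTE,
    "M^": ANUSVARA + CIRCUMFLEX,
}


def describe(text):
    """Join the Unicode names of each code point in text (illustrative helper)."""
    return " + ".join(unicodedata.name(ch) for ch in text)


for slp1, iast in ACCENTED.items():
    # Each iast value is two code points: the dotted letter, then the accent.
    print(slp1, iast, describe(iast))
```

In each of these rows the combining accent follows the dotted letter, matching the \u1e25\u03xx and \u1e43\u03xx sequences listed above. These sequences remain two code points long under NFC normalization.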

funderburkjim added a commit that referenced this issue Feb 17, 2022

Revise iast directory.

b3f223a

Ref #392

funderburkjim added a commit to sanskrit-lexicon/csl-apidev that referenced this issue Feb 17, 2022

Revise iast-slp1 transcoding rules.

82ed737

Ref: sanskrit-lexicon/COLOGNE#392

funderburkjim added a commit to sanskrit-lexicon/csl-websanlexicon that referenced this issue Feb 17, 2022

Revise iast-slp1 transcoding rules.

dad92c2

Ref: sanskrit-lexicon/COLOGNE#392

gasyoun added the Documentation How TXT , XML work label Feb 17, 2022
