Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pwg:7452 #1173

Open
funderburkjim opened this issue Jun 5, 2023 · 25 comments
Open

pwg:7452 #1173

funderburkjim opened this issue Jun 5, 2023 · 25 comments
Labels
question Further information is requested

Comments

@funderburkjim
Copy link
Contributor

date: 04/26/2023 07:06:50
dict: pwg
Lnum: 7452
hw: asūy
old: a̱sū̱yannabhyàcākaśa̱ṃ tasmā̀ aspṛhaya̱ṃ punàḥ
new: a̱sū̱yannabhyàcākaśa̱ṃ tasmā́ aspṛhaya̱ṃ punáḥ
comm: Typo LB (It is one of the examples with an accent typo. Other words also have to be corrected.)

@funderburkjim
Copy link
Contributor Author

image

As coded in pwg.txt (with slp1 transliteration)

{#a\sU\yannaBya^cAkaSa\M tasmA^ aspfhaya\M puna^H#}

display Devanagari:

display IAST:
a̱sū̱yannabhyàcākaśa̱ṃ tasmā̀ aspṛhaya̱ṃ punàḥ

display Devanagari:
अ॒सू॒यन्नभ्य॑चाकशं॒ तस्मा॑ अस्पृहयं॒ पुनः॑

@funderburkjim
Copy link
Contributor Author

The user suggests changing accents on two words from svarita to udAtta (have I got the names right?):
tasmā̀ -> tasmā́ and punàḥ -> punáḥ

But udAtta accent in Devanagari uses the special 'superscript u' in PWG
e.g. in previous word:

image

Conclusion

No change required. The user's two changes are wrong, as the accent is not
udAtta but svarita.

@funderburkjim
Copy link
Contributor Author

@Andhrabharati Do you agree with my analysis of no change?

@Andhrabharati
Copy link

Yes, you are right.

@funderburkjim
Copy link
Contributor Author

@Andhrabharati Thank you for checking!

@funderburkjim
Copy link
Contributor Author

The user (Yuzuki T.) provided additional feedback:

I still believe there are errors in the accents.
In PWG, the special symbols for udātta should only be applied to the headwords, while the examples should use regular accent marks.
Initially, I suggested a faulty proposal by simply replacing "svarita" with "udātta,"
but I believe a more accurate transliteration would be
"a̱sū̱yánnabhyàcākaśa̱ṃ tásmā aspṛhaya̱ṃ púnaḥ"
(see https://vedaweb.uni-koeln.de/rigveda/view/id/10.135.2).
Please review the accent rules again (Vedic Grammar for Students pp.448ff https://archive.org/details/vedicgrammarfor00macduoft/page/448/mode/2up).

I am not competent to evaluate this suggestion. Maybe others (@drdhaval2785 , @Andhrabharati , @SergeA , @gasyoun or others) may wish to comment.

Is there a larger issue here with regard to accent representation in the CDSL version of pwg?

@funderburkjim
Copy link
Contributor Author

Also, perhaps @fxru or @maltenth may have opinions or insights into pwg accents in cdsl version.

@funderburkjim funderburkjim reopened this Jun 9, 2023
@Andhrabharati
Copy link

Andhrabharati commented Jun 9, 2023

Here are the standard versions, in Devanagari script (Max Müller) & Roman transliteration (Aufrecht) accepted universally by one and all:

Devanagari
Max Müller: अ॒सू॒यन्नभ्य॑चाकशं॒ तस्मा॑ अस्पृहयं॒ पुनः॑
Vedaweb: अ॒सू॒यन्नभ्य॑चाकशं॒ तस्मा॑ अस्पृहयं॒ पुनः॑
PWG (CDSL): अ॒सू॒यन्नभ्य॑चाकशं॒ तस्मा॑ अस्पृहयं॒ पुनः॑

Transliteration
Aufrecht: asūyánn abhy àcākaśaṁ tásmā aspṛhayam púnaḥ
Vedaweb: asūyánn abhy àcākaśaṁ tásmā aspr̥hayam púnaḥ
PWG (CDSL): a̱sū̱yann abhy àcākaśa̱ṃ tasmā̀ aspṛhaya̱ṃ punàḥ

As we can see, the Devanagari text matched in all three places, as it should be (being the original form)!

Now, coming to Roman transliteration of accents, it needs some special attention because the 'notation norms' are different in it as compared to Devanagari. I understand that CDSL has adopted a simple conversion process of Devanagari script to slp1 and then to IAST (CDSL extension) forms.

As my primary attention and interest is with Devanagari alone, I did not pursue (which I had clearly mentioned then & there itself) the matter on other transliterations with @funderburkjim, when a long debate/discussion took place about PWG/pwk accent marking.

@Andhrabharati
Copy link

We may note that Vedaweb has resorted to the ISO transliteration (which has a wide and full coverage of all Indic scripts), and not to IAST (which has some limitations)

@Andhrabharati
Copy link

Is there a larger issue here with regard to accent representation in the CDSL version of pwg?

Yes, there is an issue where we had a prolonged discussion about two years back, for changing the Devanagari accent notations in PWG/pwk.

@fxru
Copy link

fxru commented Jun 11, 2023

Sorry, I don't have any helpful insight. I would agree with @Andhrabharati that replicating the Devanagari of the print faithfully is important. Else, I would suggest to pick one system and stick with it. I think pwg transferred all examples to the Rigveda style accent marking even if the examples come from other Vedas, so the result should be mostly good. However, there were a few exceptions – from the Śatapatha Brāhmaṇa, if I remember correctly – where the correspondence between Devanagari and Latin would probably be different.

@Yuzki
Copy link

Yuzki commented Jun 12, 2023

I am the CDSD user who provided this feedback.

I think that it is desirable to use Latin transliteration, although, at the current stage, it seems acceptable to use underlining and vertical lines above as accent marks (rather than acute and grave). As @fxru said, there are different accent markings, such as in Śatapatha Brāhmaṇa, and the marking rule of Śatapatha would involve interpretations of the text. Therefore it would be best to stick with the same accent marking system as Devanagari when transliterating into Latin script.

@Andhrabharati
Copy link

Andhrabharati commented Jun 12, 2023

Vertical line in Roman transliteration is a specific (and less frequent, comparatively) variety of svarita accent, so it cannot be used to represent the 'regular' svarita (as in Devanagari).

@fxru
Copy link

fxru commented Jun 12, 2023

Could someone help me with understanding the issue better?

  1. Does the issue involve only examples or also headwords.
  2. Is the problem general to all examples or an issue with specific examples?
  3. Is the issue following a pattern (certain source text? Certain prosodic constellations?) that we can identify (semi-) automatically?
  4. What is the pattern?
  5. Is there disagreement on the nature of the issue or “only” on the solution?

I might not have understood the issue at hand completely.

@Andhrabharati
Copy link

Andhrabharati commented Jun 12, 2023

1 & 2. As I understand, it is in the examples and mostly in a limited number of them.

3 & 4. Need to look at the examples little closer, to get any idea/conclusion.

  1. The issue is quite valid; and the FULL solution, though not impossible, is "involved" to implement [and probably not worth Jim's time; he is the ONLY person that has to do all the 'work' here]. A simpler solution is to avoid 'transliterating' the examples and leave them in Devanagari alone; if someone would be looking at PWG prints, there is no other way than to 'read' the Devanagari script! Anyway, vast majority (barring a very few that need to be 'properly' tagged yet) of RV and AV citations themselves are available as hyperlinks, in transliteration along with some translations as well, in case someone is interested to closely look at.

@Yuzki
Copy link

Yuzki commented Jun 12, 2023

-1. It involves only examples.
-2. I think it is common if the examples are from the Rigveda and other Vedas with the same accent markings.
-3 & 4. Following the Rigveda convention, transliteration into Latin script should be uniquely represented.
-5. We share a common understanding of the nature of the issue.

By the way, I looked at the Max Müller version and found न्न underlined.
スクリーンショット 2023-06-12 211314
Even though I am unfamiliar with Devanagari, I have noticed some discrepancies in the example sentence of the printed version of PWG. However, it is possible that these discrepancies are actually errors. If that is the case, it seems that transliteration could be done automatically based on the rule that a syllable with an acute accent continues from the underline to the next underline or the vertical line above.

@Andhrabharati
Copy link

Yes, this one is having an error in PWG.
[I had presumed earlier that the links avl. at CDSL for RV are with the text of Max Müller's edition; did not check the actual M. M. edition or other standard works, like from the Vedic Samsodhan Mandal.]

Does this mean that we can not rely on the digitised texts, as much as on the original prints' scans?
----------------------------
Here is the snippet from the VSM, Pune--
image

@Andhrabharati
Copy link

But wonder, how and why Aufrecht rendered it without the anudAtta accent!??

image

@Yuzki
Copy link

Yuzki commented Jun 12, 2023

The accent error in PWG is a flaw on this part, and at least for RV, the reliability of digitized texts should be high.

But wonder, how and why Aufrecht rendered it without the anudAtta accent!??

Aufrecht represents high-pitched accents with acute and falling accents with grave. He does not indicate low-pitched (=anudātta) syllables, as they are unmarked in the system.

@gasyoun
Copy link
Member

gasyoun commented Jun 13, 2023

Vedaweb has resorted to the ISO transliteration (which has a wide and full coverage of all Indic scripts), and not to IAST (which has some limitations)

Yes, but uses combined characters for example for ṛ and it becomes a mess in searching or copypasting.

to implement [and probably not worth Jim's time; he is the ONLY person that has to do all the 'work' here]. A simpler solution is to avoid 'transliterating' the examples and leave them in Devanagari alone

Agree, not worth Jim's time. As per leave them in Devanagari alone - disagree. As it is it is totally ok. No need for any update now. Yes, there are issues. No, none of them are critical. It works as it is and we should not care for Śatapatha for next 20 years.

@Yuzki
Copy link

Yuzki commented Jun 13, 2023

I have tried writing code (I can only write in Python, so please forgive me) to determine where to place acute and grave marks in the SLP1 string. It worked well in the example where the specific PWG error is corrected: {#a\sU\yanna\Bya^cAkaSa\M tasmA^ aspfhaya\M puna^H#}.

def add_accent_mark():
    # (modified) example sentence
    characters = list('{#a\sU\yanna\Bya^cAkaSa\M tasmA^ aspfhaya\M puna^H#}')

    # determine the position of the vowel
    vowel_position = [False for _ in range(len(characters))]
    vowel_set =  ('a', 'A', 'i', 'I', 'u', 'U', 'f', 'F', 'x', 'X', 'e', 'E', 'o', 'O')
    for i in range(len(characters)):
        if characters[i] in vowel_set:
            vowel_position[i] = True

    # vowel indexes
    vowel_index_list = [i for i in range(len(vowel_position)) if vowel_position[i] is True]

    # set acute and grave marks (They are provisional symbols and don't follow any specific rule!)
    GRAVE_MARK = '>'
    ACUTE_MARK = '<'

    # set anudAtta flag
    flag_anudatta = False
    flag_first_anudatta = False

    for i in range(len(vowel_index_list) - 1):
        vowel_index = vowel_index_list[i]

        # ascertain the character next to the following vowel
        if characters[vowel_index + 1] == '\\':
            flag_first_anudatta = True

            # if the vowel is between underlining and vertical stroke, it has a falling accent
            if characters[vowel_index_list[i + 1] + 1] == '^':
                characters[vowel_index_list[i + 1]] += GRAVE_MARK
                continue
                
            flag_anudatta = True
            continue
        
        else:
            # until the first underlining appears, vowels without any accent mark have a high-pitch accent
            if flag_first_anudatta is True:
                pass
            else:
                characters[vowel_index] += ACUTE_MARK
        

        if flag_anudatta is True:

            # if the character next to the vowel is backslash (= underlining), continue to the next step
            if characters[vowel_index + 1] == '\\':
                continue
            
            # if the character next to the vowel is caret (= vertical stroke), set flag FALSE
            elif characters[vowel_index + 1] == '^':
                flag_anudatta = False
            
            # if the character next to the vowel is neither backslash nor caret (= if it is a consonant character), the vowel has a high-pitch accent
            else:
                characters[vowel_index] += ACUTE_MARK


    return ''.join(characters)
    # OUTPUT: {#a<sU\ya<nna\Bya>^cAkaSa\M ta<smA^ aspfhaya\M pu<na^H#}

@gasyoun
Copy link
Member

gasyoun commented Jun 13, 2023

I can only write in Python

@Yuzki would you be willing to help Cologne and write more Python code? We need to add additional features, but none of us knows enough Python, other than Jim who already is busy for next 10 years.

@Yuzki
Copy link

Yuzki commented Jun 14, 2023

@Yuzki would you be willing to help Cologne and write more Python code? We need to add additional features, but none of us knows enough Python, other than Jim who already is busy for next 10 years.

@gasyoun
Of course, if I can be of any help, I will help to the best of my ability within the limits of my available time.

@Yuzki
Copy link

Yuzki commented Jun 14, 2023

Regarding accent marking in formats other than the Rigveda (RV) style:

  • Śatapatha Brāhmaṇa (ŚB): No accent marks in the printed version, although the original sentence in the Weber edition has accent marks.
  • Maitrāyaṇī Saṃhitā (MS): PWG does not have examples from MS because the Schroeder edition did not exist at that time.

Conclusion: It is unnecessary to consider accent marking styles of the ŚB or MS in PWG, and it is sufficient to focus on the RV style, which is currently under discussion.

@drdhaval2785
Copy link
Collaborator

I have not gone through this issue in detail.
The issue has been flagged quite often.
sanskrit-lexicon/csl-ldev#7 and sanskrit-lexicon/csl-ldev#9 may also provide some clue.

@gasyoun gasyoun added the question Further information is requested label Oct 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

6 participants