-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
vcp-skd comparison #9
Comments
report on unmatched verbsThere are about 170 entries of vcp recognized as verbs, and having not correspondent entry in skd. And about 100 verbs of skd without correspondents in vcp. Here are links to HTML reports:
|
The links above are to github.io. If you click on one of those, you'll get an 'accordion' display, as shown above. |
Hariom. Mark and Jim! vcp-skd-verb2-deva Same, with Devanagari spelling of Sanskrit. They are HTMLs. Will be looking through them. Shall I copy paste whole thing or right click and open it will Notepad++ ? I think any of it will work. Guide me please |
I must say- it's excellent!! I really like it! I can see the whole content of the dictionary here on the html sheet itself!! For me it's magic. https://sanskrit-lexicon.github.io/verbs/vcp_skd/verb2_nomatch_deva.html Eg- The first case of cross checking- अड्ड [Cologne record ID=808] [Printed book page 0095,b] So now, my question is- where to note them, if more come in ? Here like this? Or make any excel sheet or something? |
Shall I tell you something interesting? |
Keep HTMLs open in Chrome for looking. You can make a copy of them and open in Notepad++, but the code is not easy to read in that mode I guess and there is no need actually to do so.
Yes, a good choise.
Is enough, no need for Excel.
Seems clear enough. A list or few smaller lists here as a comment would do. |
Hariom, Namaste |
@funderburkjim Usha said there are cases she can't understand, because they are identical as per eye. |
Hi, @Shalu411 Your example of 'अड्ड' is good and will be useful. The https://sanskrit-lexicon.github.io/verbs/vcp_skd/verb2_nomatch_deva.html report was You have found अड्ड also in VCP as a verb -- this is a useful piece of information. Similarly, your finding It would be good to have a complete list of such missed correspondences. How to present these correspondences? Maybe as a text file
Do you see where the 808 and 582 numbers come from? So this would be one file: call the file 'VCP-SKD_unmatched_matched.txt'. The objective would be to examine all the cases in the '... nomatch_deva.html' file and |
The other example is interesting, but is different. For now, such examples should be in a different file.
|
I prefer this. Excel files are hard for me to work with. Just a nice '.txt' file is easier |
Did not suspect that, hope @Shalu411 understands. |
Hariom. This is totally in place. |
Hope it can be finalized in May. |
Namaste |
This particular case is like the अत्टङ् which is equal to अट्ट- Both the varieties are given in SKD as separate dhatus, whereas they happen to be same dhatu's alternate form. And this is known by the dhAtvartha that is one and the same. VCP उज्झ 8722 = SKD उद्झ 4713 SKD उज्झ 4431 In this case- dhAtvartha is tyAge. |
Hariom ऊष [Cologne record ID=10344] [Printed book page 1393,b] |
If the method of editing this HTML is opening it in Notepad++ and doing the correction and saving it, then I am aware of it. Shall I try? |
Your notation
is good. But a minor addition would be useful.
Now I can see that the fourth match (अम्भ vs. अभ ) was missed because VCP and SKD used To correct these problems, I have to change things in two different places:
|
Yes, I think you should make comments here when you feel confident that a verb in one dictionary |
The issue here is that- it is present in VCP list in the Cologne page- (I mean it is found in digitized version) as well as the printed book, but it is not picked in the automated list. When I search on VCP cologne page, it is found instantly.
Yes- But to be added to our automated No-Match list HTML generated file only for further use.
Yes! Perfetct. :) |
Ah, Sure- These things matter in the beginning of the work. Later re-work can be avoided. Thanks. Will make a comment in he same file with "; note" as you advised. Ok? |
Yes, you should mention. As I need to modify the root selection program for SKD to exclude these.
Also,
|
Perfect!! Done!! Ok. |
Another typo change in UzaCurrent digitization:
Reasons:
@Shalu411 Agree with this spelling change? |
vcp changes re Uza: ref sanskrit-lexicon/csl-orig#272 |
@Shalu411 Have revised vcp-skd-verb2-nomatch-deva Devanagari spelling (other reports The revisions include:
|
not yet handled:The new reports do not yet handle VCP उज्झ 8722 = SKD उद्झ 4713 SKD उज्झ 4431 This is different, because one VCP verb entry corresponds to two SKD entries. Not sure |
Time to introduce the references, as we spoke before? A root can be just a link in a different dictionary. |
Sure!! It's correct. No long U.. |
Hariom. Jim, are you going to share this revised one? Am continuing with the old one however. |
Namaste I have been looking for things seriously differing, but it is ending up in a non-specific one everytime- almost everytime. |
I think that the underlying reason relates to the patterns used to identify verbs in VCP. Will work on this problem in next day or two. |
Thanks. |
@Shalu411 Have revised vcp-skd-verb2-nomatch-deva Devanagari spelling (other reports There are now 71 verbs in SKD that are unmatched.
Adding the 62 cases (no VCP headword) and the 9 cases (VCP headword, but not a verb), Could any of those 71 SKD verbs still be verbs in VCP?Yes, it is possible. However, the VCP spelling would have to be different in some detail One example may be द्रै in SKD (currently not matched, among the 71). As mentioned above, the reports don't know how to handle such cases where one VCP |
Should not we develop a solution for that by now? Otherwise it will come back again and again, same issue. |
Hariom Namaste
It says 404 Error- Page not found. |
additional SKD verbs foundThe following 29 headwords of SKD were determined to be verbs: (L-number , SLP1 spelling):
MethodologyThere were previously 176 VCP headwords which had no matches as SKD verbs, using the A search was made for each of these 176 headword spellings in the SKD dictionary. There were 2 headwords (KelA and goDA) which were found in SKD, but which were judged to The various reports were regenerated using the additions to SKD verbs. |
Now I know how it was made.
And still had
And the list can be seen at? |
Hariom, Namaste
That's wonderful!
Would love to see them and start working. Can't wait more. Please complete whatever is needed, and share Jim.. So that we all see our dream realized. :) |
This is the corrected link: click the linkThis is like looking at an html file as text. I To get an 'html' view of this file, step 1First, click on the 'Raw' button. to get an 'html' view of this file, step 2In the above, RIGHT-CLICK save-as in the above window. You will then see a dialog: to get an 'html' view of this file, step 3Click the 'Save' button. to get an 'html' view of this file, step 4On your local computer, navigate to the downloaded file, then double click. |
Note on prior comment:In the very third comment of this issue, I gave the link: vcp-skd-verb2-nomatch-deva , which gives Note that the file is dated 'March 4, 2020', while the latest display (that you downloaded above) The reason is that I have not been copying the latest versions of the reports to the 'https://sanskrit-lexicon.github.io/' repository. I plan to give links to the latest versions of the reports in this https://github.com/sanskrit-lexicon/SKD/ repository, so you will need to go through the downloading process to view the files |
Not quite. Let me try to say it differently. Please have open the skd verb list skd_verb_filter.txt. For skd, the MAIN rule for verbs is Not every entry with 'iti kavi..' is a verb. For example, entries that have a have non-verb marker are excluded by the program that makes the verb list (specifically, Now look at the 'code=4' lines (there are 33 of these). Entries in the verb list with this code have been added 'manually'. Some of these have no
Note that But there are some of those additional SKD verbs that actually do have some form of
Note that there is not a space in Now as to the question regarding godA and KelA. Here we must also look at the
Being VCP verbs, goDA and KelA were considered as additional candidates for the SKD verb list: and https://www.sanskrit-lexicon.uni-koeln.de/simple/skd/KelA You'll see that both are 'strI' (feminine nominals) -- so that's why they are not in the SKD verb list. Hope this helps clear up some details about how the skd verb list was chosen. P.S. -- @gasyoun -- play around with the 'simple' link form above. Isn't this close to what |
That would make more sense, no? But the downloading process can go with Usha as well, guess.
What's the name of the Python script or part of the code? Want to know how things work
Typo error to be fixed?
I'm speechless. Thanks!!! Have said in the past, let me repeat:
Hope I make myself clear enough. If done so - not questions asked and what has been started 6 years ago will be put to an end, thanks! |
Namaste |
A report has been developed to facilitate comparison for verbs between VCP and SKD dictionaries.
This is in response to an interest expressed by @Shalu411 .
The reports are in the form of HTML documents, located here. To see the reports as rendered html,
click one of the links in the readme.
One report is in slp1 spelling, and one is in Devanagari spelling.
The text was updated successfully, but these errors were encountered: