-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BOP study-2 #40
Comments
BTW, when enquired with @gasyoun , he said he would help @funderburkjim to see that the Avestan strings are properly rendered using a suitable font. There is one Avestan string in the preface (front.txt), and 42 strings are in the main.txt. (just for info.) |
Here is the first installment of BOP_main study, @funderburkjim -- BOP metaline pc corrections.pdf |
Here is BOP_main text with the tagged greek portions handled-- Next to look out for other places where it is present in the print. Will post the revised file once I finish these two tasks. |
bop_main.txt file looks useable for Greek text. Will continue discussion in the issue above. |
I can only suggest that you use other language/script strings as well, from this! |
Here is the presently 'closed' version of BOP text from my side,-- As this has some good amount of corrections and updates, probably @funderburkjim might consider re-running his programs and update the files that were generated yesterday. Even otherwise, he needs to consider taking the addenda file posted above, to have the complete data. [I would have listed the salient points/corrections in the whole file (in my both versions) wrt the cologne text, but don't think it is of any worth doing.] |
I've sent the work to Jahr for proof-reading based on the first Bop_main.txt file. (ref: sanskrit-lexicon/BOP#1). |
As there seemed to be no response from this Jahr (for a long enough period), we got this BOP greek words proofing done by Anna in April 2023, and you had integrated the same into the CDSL text. Now, you may help @drdhaval2785 , by informing him how you had 'worked' with my earlier BOP file. |
I think I have figured out (mostly). In case I need any assistance, will let you know. |
Good to hear this, Dhaval! [I just thought if Jim could be involved (JIC something could be grasped from him), it might make your effort fruitful faster; hence, messaged him..] |
BTW, you might've noticed that this issue has the other 'parts' of BOP text at the top. |
And once you could finish adapting my BOP_main file, 'integrating' the BOP_addenda data into it could be thought of, in the same lines as done in GRA recently (by Jim); thus establishing the process (to be implemented in other CDSL works as well). |
From the above comments, @drdhaval2785 is taking up AB's revision of BOP. |
@funderburkjim |
It is not about going through the file, but checking for invertibility computationally which takes more time. |
@Andhrabharati There is one Avestan string in the preface (front.txt), and 42 strings are in the main.txt. (just for info.) -- can you copypaste them here, please? |
Look for them yourself, they are already there in the files posted. |
@Andhrabharati When we (dhaval or I) integrate one of your dictionary revisions (such as BOP in this case), we need to understand what you did. Normally, we have to discover I fondly recall one case (with your revisions to PW), where you provided a summary of your changes which was very helpful to me at the time see this comment from AB. If you provide a similar guide for your revision of BOP, this might be helpful for Dhaval's work. Hope you and Dhaval find this comment constructive. |
@gasyoun Here are the Avestan strings marked in main.txt
and here is the one instance in front.txt
|
Regarding the line-break situation in bop.txt and BOP_main.txt The csl-orig/v02/bop/bop.txt has line breaks as in printed text. BOP_main.txt seems to have 'removed' all line-break info. In particular, it does not have the It might have been preferable to use Incidentally, AB appears to have 'resolved' the end of line '-' cases (hyphenated word cases). This is a good improvement. @Andhrabharati what is your procedure for resolving these? This comment duplicated at sanskrit-lexicon/BOP#6 |
I presume that @gasyoun had asked for the Avestan strings, for he has committed with me that he would 'help' you render them properly in CDSL displays [when I had mentioned to him that these Avestan characters wont be 'normally' present in any general/common font], and not for any other reason.
You may see the very first two strings at the L-334 entry, being displayed as BOX characters |
I had already mentioned about my resolving of hyphenation(s) at the line-ending(s) sometime before (somewhere!). |
https://fonts.google.com/noto/specimen/Noto+Sans+Avestan Should this font be acceptable? |
Mostly, but not fully!! Just look at the first word in the list, for example. |
My error, in typing (rather, filling the Avestan strings)! Pl. replace the string as |
Though the look of Noto Sans Avestan is not so pleasing (to the eyes), it sure can be an option to render these strings at CDSL. |
Here is comparison for that first word, of
Incidental question: how does one enter into a text editor a unicode code point |
There are various 'keyboard' utilities that make this possible (for bulk text). But, for a minimal text insertion such as this at BOP, the easiest way out is to use the copy/paste from the (font) 'charmap' utility in Windows OS. Speaking of fonts, I just wonder if we can update the indologic font being used at CDSL, to include these Avestan glyphs and also the 'hom' numbers (with a dot) and Roman numerals? What say you, @gasyoun ? |
As done in my other works so far, this CDSL BOP.txt file is also split into parts-
BOP_front.txt
BOP_abbr.txt
BOP_addenda.txt
The main text portion (BOP_Main.txt) is not done fully yet.
The text was updated successfully, but these errors were encountered: