-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
vac2a - picture data in TE #19
Comments
For the lines so identified, two actions were taken:
|
@Andhrabharati do I understand right that those pages with images should be rescanned or just cutting out as they are would be enough? |
just cutting is enough. |
probably the issues I talked about in meta2 file could be done first as they are kind of "identified"-
And now the abbr. place corrections. |
I guess the tabular data could simply be rendered as / marked, instead of just with a space (or with varying Cn tagging). And I was thinking of taking it up now. |
Here is the list of tables and pictures in the Vacaspatya, made wrt the print pages (whatever worth it has). |
We can use the github markup language there, right. |
I've reorganized the file names a bit in the visible part of the repository.
Thus far, I've implemented changes pertaining to:
And am looking for other low-hanging fruit |
Are you sure we are ready to go deeper as headword issues for now? |
I think the API issue is closed. Regarding Sanskrit-Sanskrit dictionaries, actually they are the ones many people like me use exclusively. If I get no hits in SKD and VCP, then only I turn to MW. |
one might compare how MW and VCP grew up, based on the same roots- WIL, SKD and the German one (PWG).I guess VCP would be with many corrections to PWG notified in PWK, as Taranatha had the full set of manuscripts belonging to Vedic branch in his possession (or accessible to him). |
MonierWilliams had to "wait" for PWK to come and other works been published. MW99 also has couple of entries taken from Apte90 (by Cappeller, as mentioned in the front pages of it). |
This reminds me of the work I started those days in 2016; I had finished the HW correction for the vowels part. It had treated double (multiple) HWs much better than the exercise at cologne (Usha and Jim). |
The Meld exercise is to look (mainly) at the differing lines in vcp2 and vac2 (and refer to scans to decide the corrections). but I strongly feel the other lines also need to be read once with the scans, as both the digitisations (TPT and Koeln) have erred at many places. |
I might have found one. I need your understanding of the tasks in a more detailed way.
Got it. What was next priority in your Dec 2020 list?
Widely used and yet only 2 people from India interested in cleaning this wast ocean. Because all of time and energy can go here and all the other tasks will stop, because there is no end if we go that deep inside these two oceans.
Guess not, sounds just like some fantasy.
Than indeed differs him, but has he used his advantage in full?
Exactly, around 10 years.
Not sure I understand what you mean. Can you give a sample, please?
I mean it was not in the priority list of 2021 and it can stop all tasks, every other task in the list for just this one. There are some minor tasks, that only Jim can give an answer to, but solving and integrating VCP corrections would swallow everything. Even MW is huge, but there is no way back. @drdhaval2785 are you personally eager to put your koshas aside and work with the VCP as intensive as it is proposed above? @Andhrabharati is yet in MW pond, VCP ocean might not what he is interested in, do not know. He works like a bull, but is that something you both can concentrate without disabling Jim for the priorities set 3 months ago? |
I am not at MW99; no fulltime works for past some weeks my side. |
how many people did you get for MW and PWG, to clean/correct worldwide? (forget about occassional feedbacks) |
Agree. In recent comparison, I have also found differences between TE and the |
good to see someone getting my intent correctly. I was just thinking to start full proofing of vacaspatyam on my own. for me neither of the digitisations are satisfactory enough, and I see no point spending time in just keep on comparing them and correcting both. and the way things are moving here to remove the Bengal flavor (dialect), is quite against our (AB) principles of handling the texts. one might recall how the great Panini never took to normalising the words by taking any one school as a standard, but just had them stay side by side. one can just compare different schools, but never let one school override another. so I would better be off from this exercise. |
Are you still?
|
So isn't 2 far better than 0? yes, I might start the work sometime soon; probably after finishing the mw99 annexure portion. |
@Andhrabharati and @funderburkjim , I know that both the digitizations are bad. But strictly speaking in mathematical terms and assuming both digitizations to be completely independent. Let us assume 1/100 letters are wrong in both digitizations. So the probability of both digitizations being wrong at the same letter is 1/10000. Do we want to spend precious time on this miniscule? Maybe once we have corrected differring errors, it may be taken up. Not before. |
Agree. |
In the process of preparing hiatus-corrections (#18),
I discovered that there are some (about 200) awkward lines in the vac2.txt (Tirupati data).
These lines were selected from vac2 based on one of two criteria:
<Picture>
orAbout half the 200 lines thus selected actually satisfy both criteria.
The text was updated successfully, but these errors were encountered: