Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request for digitized frontmatter and endmatter #391

Open
drdhaval2785 opened this issue Feb 15, 2022 · 2 comments
Open

Request for digitized frontmatter and endmatter #391

drdhaval2785 opened this issue Feb 15, 2022 · 2 comments
Labels
Documentation How TXT , XML work

Comments

@drdhaval2785
Copy link
Contributor

Currently the csl-doc gives PDF / PNG files of prefaces. @funderburkjim Do we have digitized prefaces ? If yes, we should put them in github. Maybe in csl-orig/v02/mw/mw_frontmatter.txt and csl-orig/v02/mw/mw_endmatter.txt.

@funderburkjim
Copy link
Contributor

I think the relevant part of csl-doc is this directory: https://github.com/sanskrit-lexicon/csl-doc/tree/master/source/dictionaries/prefaces

In this directory, there appears to be one or more .rst files for each dictionary.
For some dictionaries (for example accpref.rst, there is digitized material, in the '.rst' format.
For other dictionaries (for example bhspref.rst) there are references to images in subfolders -- thus bhs and similar dictionaries don't have digitized front/end matter.

It would be useful to separate the dictionary codes into two groups based on whether csl-doc has digitized front/endmatter. For those with digitized material, this could be copied to csl-orig as you suggest. Those without digitized material (the majority) could be taken as candidates for digitization.

It is also possible (I am not sure) that dictionary xxx has digitized front/back matter within the csl-orig/v02/xxx/xxx.txt digitization but that csl-doc only has images.

@Andhrabharati
Copy link

Andhrabharati commented Feb 15, 2022

Many of the CDSL 2nd phase digitised works have the full cover-to-cover texts, as a single file in csl-orig repo.

As I was looking at them, I had split them to relevant 'section' parts; and some of them have already been posted. Few of these (not all!) were 'handled' by @funderburkjim too, when he worked on them subsequently.

I had posted the bhs front material from the vol.1 of the work (Grammar) already; the vol.2 (Dictionary) having been considered as a continuum, doesn't contain its own front matter.

The major works remaining to have the front matter digitised are SKD, PWG, VCP, PWK, MW; and I had already done a majority work in the SKD and VCP cases.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Documentation How TXT , XML work
Projects
None yet
Development

No branches or pull requests

4 participants