-
Notifications
You must be signed in to change notification settings - Fork 120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create a few small LBX files #1374
Comments
The Chinese translations look strange to me as a native Chinese speaker.
The following are my suggested translations.
|
Many thanks for this. I appreciate it. One question: The LBX files use "p." to refer to an specifig page and "pp." for the total number of pages in a book (source). So from what you saying we should use: page = {{頁}{頁}}, that is, for this term, use the initial proposal. Correct? Another question is: What is the difference between:
You seems to be interested in Biblatex development, why not give a hand to get thiese two LBX files out? |
I am done with Latin, Vietnamese and Korean. Need to figure out how to place a a token (number) before the LBX string. |
No, the page = {{頁}{頁}},
pages = {{頁}{頁}},
pagetotal = {{頁}{頁}},
pagetotals = {{頁}{頁}},
The former has an additional preposition "在" meaning "on" (page 15). Both phrases make sense in Chinese but the latter is preferred in formal writing language style, especially as a notation of a citation rather than in plain sentence. There are more formal phrases with same meaning: |
Changing the order of field content and localisation string is painful, but doable. We had to do it for Hungarian for example. Have a look at https://github.com/plk/biblatex/blob/dev/tex/latex/biblatex/lbx/magyar.lbx (and if you like to dig through the history, also #717 and #780 as well as linked discussions). See https://tex.stackexchange.com/q/724962. If you do end up asking about issues you brought up here elsewhere (e.g. on a forum) or vice versa, please add links to the relevant discussions on both sides so that people interested in the issue can follow all discussions. |
BTW: We're always interested in more localisation files, so if you can help us make That said, I'd really rather prefer to work together directly with a native speaker (or a near-native speaker) on this, since we might have to discuss subtle language details and typographical issues. Having to go through several people or having to rely on colleagues, translation tools etc. is probably not going to cut it for that. (I think I remember a case where we accepted machine-translated versions of a couple of new language strings and they turned out to be not that good in some languages. So I really want things vetted by someone who speaks the language.) Additionally, I think it only makes sense to add localisations if they are complete enough and if we can support good-looking output in the target language. At the moment I don't think we can really do RTL typesetting (#1139), so Arabic and Hebrew localisations are still a long way away. |
BTW: Good that you bring this subject up. Our coverage in terms of
localization is pretty uneven, we have several languages with less than 1m
speakers (basque, estonian, icelandic ...) and no coverage for cjk, arabic,
hindi, bengali, vietnamese, bahasa,... On the list of the 10 largest
languages we only have four (english, russian, spanish and portuguese). I
have been thinking of doing something about that for quite a while now.
Bridging the gap between the groups of native speakers, latex programmers
and interested users has not been easy. Most of the people that have the
skills in a particular language have never heard of an LBX file.
Working with an LBX file is not easy either. Most of the work is done by
picking up ONE file (like english) and replacing the strings, while the
ideal would be to pick up english (as a reference) and several nearby
language files for comparison. So, for example, someone working on Hindi,
Varhadi, Kannada,Telugu, Gujarati, Konkani, ... would pick up the Marathi
list as a helper because many of the letters and words are the same.
Also, authors of the language translation should have immediate access to
the use of each string because of considerations of gender, plural, etc ...
and that is hard, if you are not versed in TeX.
So, I have been thinking about writing a web-tool that would help with this
process. Allowing one to pull out some two or three languages together in a
spreadsheet format list, read them from the standard LBX files, and write
them back as the user progresses. It can easily be shared with others
working on the same language and provide examples on the go. Results can be
fed back to biblatex, text2bib, babel and polyglossia. LBX files are
starting to be used outside (text2bib.org for example) and this can
probably help us acquire more files.
I imagine that you do not want any part of that ... and just would want a
ready file with the strings -- is that correct? If so, I will start a
separate project on GitHub.
And speaking about that, consider that there are 4k written languages, in
293 scripts, 154 of them on Unicode, out of which 150 are supported by the
NoTO fonts. Can we make Biblatex BCP-47 compliant before we walk into a
mess?
…On Fri, Aug 23, 2024 at 2:02 AM moewew ***@***.***> wrote:
BTW: We're always interested in more localisation files, so if you can
help us make biblatex available in more languages, do let us know. We'll
try to help where we can (it sometimes turns out that the usual .lbx
framework is not enough and that we need to do additional stuff to get good
output).
That said, I'd really rather prefer to work together directly with a
native speaker (or a near-native speaker) on this, since we might have to
discuss subtle language details and typographical issues. Having to go
through several people or having to rely on colleagues, translation tools
etc. is probably not going to cut it for that. (I think I remember a case
where we accepted machine-translated versions of a couple of new language
strings and they turned out to be not that good in some languages. So I
really want things vetted by someone who speaks the language.)
Additionally, I think it only makes sense to add localisations if they are
complete enough and if we can support good-looking output in the target
language. At the moment I don't think we can really do RTL typesetting (
#1139 <#1139>), so Arabic and
Hebrew localisations are still a long way away.
—
Reply to this email directly, view it on GitHub
<#1374 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAR7WYU4T6DDKK2NC3ZLGY3ZS33D5AVCNFSM6AAAAABM4VTJJSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMBWGYZTMOBSGI>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
With babel I’ve followed the following strategy. On the one hand, for
Furthermore, besides the (more or less) 300 locales distributed with |
This is great Javier. Editing an lbx file can be intimidating for the
non-initiated.
My suggestion for improvements would be:
1- The ability to pull the list in English and another "nearby" language.
For example, someone that would like to work on Kannada or Telugu, could
pull out the lists of English and Marathi, so that he will have a nearby
language as a reference.
2- A mouse-over with examples of use would also be a nice touch. It is not
easy to understand "cc" or "encl" if you are on a far out language.
It is also nice to see Bebel moving more aggressively to BCP-47. We need
that to be able to build nice spellers, etc ...
…On Thu, Aug 29, 2024 at 11:59 PM Javier Bezos ***@***.***> wrote:
With babel I’ve followed the following strategy. On the one hand, for ldf
files I have created an html form available here:
https://github.com/latex3/babel/tree/main/tools (can be seen in action in Language
incubator for babel <https://www.texnia.com/incubator.html>). On the
other hand, with ini files when a value does not exist a message like the
following is displayed:
Package babel Warning: \chaptername not set for 'mylang'. Please,
(babel) define it after the language has been loaded
(babel) (typically in the preamble) with:
(babel) \setlocalecaption{mylang}{chapter}{..}
(babel) Feel free to contribute on github.com/latex3/babel.
(babel) Reported on input line 26.
Furthermore, besides the (more or less) 300 locales distributed with babel
there are about 400 ‘templates’ here
<https://github.com/latex3/babel/tree/main/locale-templates>. This (I
think) makes the language easily useable and encourages contributions, and
saves contributors from having to think about things like what is the
correct BCP-47 code, or the recommended language name, or the OpenType
script code. And of course, it helps in one of my main goals – language
diversity.
—
Reply to this email directly, view it on GitHub
<#1374 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAR7WYT5BPO45UJMKMBKTUTZUAJ4VAVCNFSM6AAAAABM4VTJJSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMRQGI3DSMJXGY>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
But with caution. BCP-47 is not just about tags, but also about lookups and fallbacks, and also about how they are applied in practice (particularly in the Unicode CLDR).
If made, I’d recommend not to use the same command/macro/... for the locale name and the tag. They should be clearly separated. Well, in @moewew With relation with RTL scripts (actually any script, including CJK or Devanagari), please, feel to to ask for help with relation to |
I have to create some LBX files (Latin, Arabic, Hebrew, Vietnamese, Chinese, Japanese and Korean). They will be fairly small, I only need the snippets:
and I sort of have that worked out by a few native speakers:
I plan to write these small files and place them on my working directory. I got a few piece of it working already and I see the challenges being:
1- The different comma of Arabic, Chinese and Japanese.
2- The different dot of Chinese.
3- Position of the enumeration within the sentence (or snippet).
Are there any information on how to build the LBX files that I can follow?
The text was updated successfully, but these errors were encountered: