Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Overlapping subjoined Myanmar consonants between narrow base and medial ra #17

Open
dscorbett opened this issue Apr 15, 2020 · 9 comments

Comments

@dscorbett
Copy link

Font

NotoSansMyanmar-Regular.otf
NotoSerifMyanmar-Regular.ttf

Where the font came from, and when

Site: https://github.com/googlefonts/noto-fonts/blob/115d38430d957d38307457c036302b7bdbe0bbc4/phaseIII_only/unhinted/otf/NotoSansMyanmar/NotoSansMyanmar-Regular.otf
Site: https://github.com/googlefonts/noto-fonts/blob/115d38430d957d38307457c036302b7bdbe0bbc4/unhinted/NotoSerifMyanmar/NotoSerifMyanmar-Regular.ttf
Date: 2020-04-14

Font version

Version 2.001

Issue

When a narrow consonant has two subjoined consonants and a medial ra, the second subjoined consonant consonant is drawn over the first. If the base consonant is wide or there is no medial ra, the subjoined consonants are stacked correctly.

Character data

ဂ္တ္ဏြ
U+1002 MYANMAR LETTER GA
U+1039 MYANMAR SIGN VIRAMA
U+1010 MYANMAR LETTER TA
U+1039 MYANMAR SIGN VIRAMA
U+100F MYANMAR LETTER NNA
U+103C MYANMAR CONSONANT SIGN MEDIAL RA

Screenshot

ဂ္တ္ဏြ
ဂ္တ္ဏြ

@nizarsq
Copy link

nizarsq commented Sep 21, 2020

@dscorbett I have tried to render the sequence you provided using multiple 3rd parties fonts and all of them show this kind of overlap. I'm not an expert in Myanmar script, but based on little research, the combination you provided does not exist anywhere. I'm wondering if this is a valid combination, in other words, is there a real word that has this combination? Thanks

@patchew
Copy link

patchew commented Sep 21, 2020 via email

@dscorbett
Copy link
Author

In the Sanskrit word matupstriyā, the cluster pstri is written ⟨ပ္သ္တြိ⟩ <U+1015, U+1039, U+101E, U+1039, U+1010, U+103C, U+102D> in Myanmar script. In Noto Sans Myanmar, that cluster is subject to this bug.
ပ္သ္တြိ
(I still think bugs should be fixed even without attestations: they may be low priority, but they are still bugs.)

@ohbendy
Copy link

ohbendy commented Sep 29, 2020

@dscorbett How did you find these combinations? My Burmese Sanskrit conjunct tables don't have any four-letter consonant clusters.

I'd agree that this should be rectified. Although there are only a handful of Burmese Sanskritists who would ever have a use for clusters like this, and considering the need to somewhere draw a line on what's practical to implement, this case is generalisable to any cluster with three consonants and a medial Ra, so a fix would be systematic enough to cover all cases.

@dscorbett
Copy link
Author

I found the original cluster, ⟨ဂ္တ္ဏြ⟩, by analyzing the GSUB table. It was intended to be a minimal reproducible test case for a bug, not a claim of linguistic attestation for that specific cluster. I found the second cluster, ⟨ပ္သ္တြိ⟩, by searching Sanskrit corpora until I found a cluster that when transliterated would reproduce the same bug.

@ohbendy
Copy link

ohbendy commented Sep 29, 2020

Right. Normally we don't aim to support sequences that don't really occur, especially in a script like Burmese where the effort required quickly becomes exponential. If the Sanskrit corpora are public, could you post a link? I'd like to scrape any other clusters from it.

@dscorbett
Copy link
Author

I found it in GRETIL’s Sanskrit corpus. I also looked in OliverHellwig/sanskrit but didn’t find any clusters relevant to this issue.

@ohbendy
Copy link

ohbendy commented Sep 29, 2020

Thank you :)

@ohbendy
Copy link

ohbendy commented Jan 1, 2021

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants