Simon Cozens 6e0f6d2dfc Merge commit '7d71975ee699566498f4e4aa7f99e0d4794bfbcb' into lang-v0.7.1 3 days ago
..
.github ca36943762 Merge commit 'e75e874ca3efc62d8389e8ba47d001d7e1314ec7' into lang-v0.6.0 6 months ago
Lib 6e0f6d2dfc Merge commit '7d71975ee699566498f4e4aa7f99e0d4794bfbcb' into lang-v0.7.1 3 days ago
data ae08ef115b Delete axisregistry/ and lang/ and rename *_st/ 2 years ago
snippets 35f3303cd6 Merge commit '51880d3ea86d15388c3379c9409757041de115ef' into lang-v0.6.5 1 month ago
src 6e0f6d2dfc Merge commit '7d71975ee699566498f4e4aa7f99e0d4794bfbcb' into lang-v0.7.1 3 days ago
tests 6e0f6d2dfc Merge commit '7d71975ee699566498f4e4aa7f99e0d4794bfbcb' into lang-v0.7.1 3 days ago
.gitignore d1f21388a0 Merge commit '7237c81744f78b3cbf9999f6f2467d76b1025b3e' into lang-v051 1 year ago
AUTHORS.txt ae08ef115b Delete axisregistry/ and lang/ and rename *_st/ 2 years ago
CHANGELOG.md 0fead20849 Merge commit 'b180e0b52a4b357d1d55edf6165894ba7429b30d' into lang-v050 1 year ago
CONTRIBUTORS.txt 65a7586f9e Merge commit '9b4d8d28810488fde8c03ae1f37be1628b6b764a' into lang-v0.6.1 6 months ago
Cargo.toml 6e0f6d2dfc Merge commit '7d71975ee699566498f4e4aa7f99e0d4794bfbcb' into lang-v0.7.1 3 days ago
LICENSE.txt ae08ef115b Delete axisregistry/ and lang/ and rename *_st/ 2 years ago
README.md dbb63139ab Merge commit 'a495359cdf3c536688f79f712554d55afb902615' into lang-0.7.0 2 weeks ago
build.rs 6e0f6d2dfc Merge commit '7d71975ee699566498f4e4aa7f99e0d4794bfbcb' into lang-v0.7.1 3 days ago
dev-requirements.txt 00f12e7673 Lang update (#7962) 4 months ago
pyproject.toml 6e0f6d2dfc Merge commit '7d71975ee699566498f4e4aa7f99e0d4794bfbcb' into lang-v0.7.1 3 days ago
tox.ini 6e0f6d2dfc Merge commit '7d71975ee699566498f4e4aa7f99e0d4794bfbcb' into lang-v0.7.1 3 days ago

README.md

This python module provides an API with data about languages/regions/scripts for use in the language-support categorization of the font families in the Google Fonts collection.

You can also directly access the raw textproto files on the Lib/gflanguages/data directory:

Most of the code in this project was copied from the gftools repository (https://github.com/googlefonts/gftools/) so that language/region/script data can be easily available to all our tools without having to also get the large dependency tree of gftools. The most immediate user of this module is Font Bakery, which needs to validate language support on font binaries being checked. (see https://github.com/googlefonts/fontbakery/issues/3605)

The second obvious user of this gflanguages module is gftools itself.

Language/region/script definitions and the gflanguages modules are used as a subtree in the google/fonts repo, on its lang/ directory (https://github.com/google/fonts/tree/main/lang).

This module is the main place to update these definitions, avoiding data duplication and guaranteeing uniformity across tools.

To learn more about how lang metadata affects downstream, see gf-guide/lang.

Sample text rules

If there is a sample_text field for a language, it should contain all of the following fields:

  • masthead_full: show off four glyphs
  • masthead_partial: show off two glyphs
  • styles: a phrase of 40-60 characters
  • tester: a phrase of 60-90 characters
  • poster_sm: a word or phrase of 10-17 characters
  • poster_md: a word or phrase of 6-12 characters
  • poster_lg: a word or phrase of 3-8 characters
  • specimen_48: a sentence of 50-80 characters
  • specimen_36: a paragraph of 100-120 characters
  • specimen_32: a paragraph of 140-180 characters
  • specimen_21: one or more paragraphs totalling 300-500 characters
  • specimen_16: one or more paragraphs totalling 550-750 characters

Generally the sample text should be taken from the UN Declaration of Human Rights; if using Eric Muller's XML translations, snippets/lang_sample_text.py will convert the XML into textproto.

If the UDHR is not available in the language, the sample text should be a "neutral" text (not political or religious) - folk tales are generally good sources. (We recognise that for some liturgical languages, religious texts may be the only extant samples.) In these cases, please add a note: field with the source of the sample text.