Bug 20944 - African orthographies
Summary: African orthographies
Status: RESOLVED MOVED
Alias: None
Product: fontconfig
Classification: Unclassified
Component: orth
Version: 2_1
Hardware: Other All
Importance: medium normal
Assignee: fontconfig-bugs
QA Contact: Behdad Esfahbod
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-03-30 04:15 UTC by Denis Jacquerye
Modified: 2018-08-20 21:50 UTC



Attachments
orth files (9.18 KB, application/x-bzip)
2009-03-30 04:15 UTC, Denis Jacquerye
Bomu (bmq) in Mali (1.46 KB, application/octet-stream)
2010-12-03 05:40 UTC, Denis Jacquerye
Tiéyaxo Bozo (boz) in Mali (1.46 KB, application/octet-stream)
2010-12-03 05:41 UTC, Denis Jacquerye
Toro So Dogon (dts) in Mali (1.46 KB, application/octet-stream)
2010-12-03 05:42 UTC, Denis Jacquerye
Koyraboro Senni Songhai (ses) (1.44 KB, application/octet-stream)
2010-12-03 05:44 UTC, Denis Jacquerye
Zarma (dje) in Niger (1.44 KB, application/octet-stream)
2010-12-03 05:55 UTC, Denis Jacquerye
Manga Kanuri (kby) in Niger (1.41 KB, application/octet-stream)
2010-12-03 06:08 UTC, Denis Jacquerye
Balanta-Ganja (bjt) in Senegal (1.49 KB, application/octet-stream)
2010-12-03 06:40 UTC, Denis Jacquerye
Bassari (bsc) in Senegal (1.49 KB, application/octet-stream)
2010-12-03 08:52 UTC, Denis Jacquerye
Syenara Senoufo (shz) in Mali (1.48 KB, application/octet-stream)
2010-12-03 10:19 UTC, Denis Jacquerye
Mamara Senoufo (myk) in Mali (1.48 KB, application/octet-stream)
2010-12-03 10:22 UTC, Denis Jacquerye
Koyraboro Senni Songhai (ses) (1.47 KB, application/octet-stream)
2010-12-09 08:46 UTC, Denis Jacquerye

Description Denis Jacquerye 2009-03-30 04:15:37 UTC
Created attachment 24364 [details]
orth files

I have collected several African orthographies as part of the ANLoc Font sub project. Could they be included in fontconfig orthographies?
Comment 1 Behdad Esfahbod 2009-03-30 09:12:47 UTC
Roozbeh, can you review these?
Comment 2 Roozbeh Pournader 2009-03-30 10:53:07 UTC
Wow! I can't start to review these this week, but will definitely find some time to do it as soon as possible. Behdad, you don't have a target date for a new fontconfig release, do you?
Comment 3 Behdad Esfahbod 2009-03-30 11:11:20 UTC
No timeline right now.  Take your time :)
Comment 4 Dwayne Bailey 2009-05-18 08:27:03 UTC
@Denis: are these autogenerated?

I picked up these problems:

1) The header says:
# fontconfig/fc-lang/aa.orth
in all files

2) The copyright header says Keith Packard instead of Denis Jacquerye - check in paragraph that starts 'Permission to use, copy...'

3) Some duplication: ven.orth already exists, although it might be named ve.orth.  It might be worth considering including some that don't have any specific font deviations.
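
Problem 1 above (every file carrying the aa.orth header) is the kind of thing a small script can catch before submission. The following is a minimal sketch, assuming each file has a comment line of the form "# fontconfig/fc-lang/NAME.orth"; the function names header_name and mismatched_headers are hypothetical and are not part of fontconfig.

```python
import re
from pathlib import Path

# Assumed header form: "# fontconfig/fc-lang/<name>.orth" on a line of its own.
HEADER_RE = re.compile(r"^#\s*fontconfig/fc-lang/(\S+\.orth)\s*$")


def header_name(text):
    """Return the file name claimed by the first matching header line, or None."""
    for line in text.splitlines():
        match = HEADER_RE.match(line)
        if match:
            return match.group(1)
    return None


def mismatched_headers(directory):
    """List (file name, claimed name) pairs where the header disagrees with the file name."""
    bad = []
    for path in sorted(Path(directory).glob("*.orth")):
        claimed = header_name(path.read_text(encoding="utf-8"))
        if claimed != path.name:
            bad.append((path.name, claimed))
    return bad
```

Calling mismatched_headers on the directory holding the unpacked attachment would list every file whose header still names another file.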
Comment 5 Denis Jacquerye 2010-12-03 05:05:12 UTC
Hmm. Sorry for the long silence.

I'm breaking down the patch in parts by country or data source. I guess that will make it easier for you to process.

I have one question about orthographies of one macrolanguage that change across borders.
Do you prefer to have ff_ml.orth, ff_ne.orth and ff_sn.orth with the ISO 639-2 lang code and country code. Or do you prefer to have their ISO 639-3 codes, ffm.orth, fuq.orth and fuc.orth?
Comment 6 Denis Jacquerye 2010-12-03 05:40:52 UTC
Created attachment 40765 [details]
Bomu (bmq) in Mali

from 
Direction Nationale de l'Alphabétisation Fonctionnelle et de la Linguistique Appliquée (Bamako). 1993. Alphabets et règles d'orthographe des langues nationales. Bamako : D.N.A.F.L.A. (in English: Alphabets and spelling rules of national languages).
Comment 7 Denis Jacquerye 2010-12-03 05:41:29 UTC
Created attachment 40766 [details]
Tiéyaxo Bozo (boz) in Mali

from DNAFLA 1993
Comment 8 Denis Jacquerye 2010-12-03 05:42:07 UTC
Created attachment 40767 [details]
Toro So Dogon (dts) in Mali

from DNAFLA 1993
Comment 9 Denis Jacquerye 2010-12-03 05:44:09 UTC
Created attachment 40768 [details]
Koyraboro Senni Songhai (ses)

from DNAFLA 1993
Comment 10 Denis Jacquerye 2010-12-03 05:45:55 UTC
DNAFLA 1993 also defines the alphabet of ff_ml (ISO 639-3 ffm), which is a subset of ff.orth already in fc-lang.
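
A subset claim like this one can be checked mechanically. The sketch below assumes a simplified reading of the .orth format: one hexadecimal code point or one START-END range per line, with "#" starting a comment. It skips "include" lines rather than resolving them and does not handle any other syntax. parse_orth is a hypothetical helper, and the sample data is illustrative only, not the real content of ff.orth.

```python
def parse_orth(text):
    """Return the set of code points listed in simplified .orth text."""
    points = set()
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or line.startswith("include"):
            continue  # blank line, or an include this sketch does not resolve
        first = line.split()[0]
        start, _, end = first.partition("-")
        lo = int(start, 16)
        hi = int(end, 16) if end else lo
        points.update(range(lo, hi + 1))
    return points


# Illustrative data only (assumption): a larger alphabet and a candidate subset.
larger = parse_orth("0041-005A\n0061-007A\n0181 # B with hook\n0253\n")
smaller = parse_orth("0041-005A\n0253\n")
is_subset = smaller <= larger
```

If is_subset is true for the real files, a separate ff_ml orthography would add no coverage requirement beyond what ff.orth already imposes.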
Comment 11 Denis Jacquerye 2010-12-03 05:55:15 UTC
Created attachment 40769 [details]
Zarma (dje) in Niger

from Arrêté 215-99 de la République du Niger
http://pagesperso.lina.univ-nantes.fr/info/perso/permanents/enguehard/recherche/Afrique/alphabet_zarma.htm
Comment 12 Denis Jacquerye 2010-12-03 06:08:08 UTC
Created attachment 40770 [details]
Manga Kanuri (kby) in Niger

from Arrêté 213-99 de la République du Niger
http://pagesperso.lina.univ-nantes.fr/info/perso/permanents/enguehard/recherche/Afrique/alphabet_kanuri.htm
Comment 13 Denis Jacquerye 2010-12-03 06:40:20 UTC
Created attachment 40775 [details]
Balanta-Ganja (bjt) in Senegal

from Décret n° 2005-979 du 21 octobre 2005 relatif à l’orthographe et la séparation des mots en balant
http://www.jo.gouv.sn/spip.php?article4789

note: the digital copy of the decree is poorly done, the ɓ is not properly encoded.
Comment 14 Behdad Esfahbod 2010-12-03 08:23:34 UTC
(In reply to comment #5)
> Hmm. Sorry for the long silence.
> 
> I'm breaking down the patch in parts by country or data source. I guess that
> will make it easier for you to process.
> 
> I have one question about orthographies of one macrolanguage that change across
> borders.
> Do you prefer to have ff_ml.orth, ff_ne.orth and ff_sn.orth with the ISO 639-2
> lang code and country code. Or do you prefer to have their ISO 639-3 codes,
> ffm.orth, fuq.orth and fuc.orth?

Donno.  I assume they don't have Linux locales?

Roozbeh?
Comment 15 Denis Jacquerye 2010-12-03 08:52:56 UTC
Created attachment 40782 [details]
Bassari (bsc) in Senegal

from
Décret n° 2005-987 du 21 octobre 2005 relatif à l’orthographe et la séparation des mots en oniyan
http://www.jo.gouv.sn/spip.php?article4797

note: again the website doesn't have all the right special characters.
Comment 16 Denis Jacquerye 2010-12-03 10:19:37 UTC
Created attachment 40785 [details]
Syenara Senoufo (shz) in Mali

also from DNAFLA 1993
Comment 17 Denis Jacquerye 2010-12-03 10:22:07 UTC
Created attachment 40786 [details]
Mamara Senoufo (myk) in Mali

from DNAFLA 1993
Comment 18 Denis Jacquerye 2010-12-03 10:32:05 UTC
Comment on attachment 40768 [details]
Koyraboro Senni Songhai (ses)

ses is an ISO 639-3 code for Koyraboro Senni Songhai. The ISO 639-2 code for Songhai is son; however, DNAFLA 1993 specifies that the alphabet is designed for the Koyraboro Senni variant of Songhai, one of several in Mali. You might want to use the code 'son_ml' instead of 'ses'.
Comment 19 Denis Jacquerye 2010-12-03 10:51:31 UTC
(In reply to comment #14)
> (In reply to comment #5)
> > I have one question about orthographies of one macrolanguage that change across
> > borders.
> > Do you prefer to have ff_ml.orth, ff_ne.orth and ff_sn.orth with the ISO 639-2
> > lang code and country code. Or do you prefer to have their ISO 639-3 codes,
> > ffm.orth, fuq.orth and fuc.orth?
> 
> Donno.  I assume they don't have Linux locales?
> 
> Roozbeh?

CLDR has ff_SN instead of fuc.
Comment 20 Behdad Esfahbod 2010-12-03 10:52:51 UTC
(In reply to comment #19)
> (In reply to comment #14)
> > (In reply to comment #5)
> > > I have one question about orthographies of one macrolanguage that change across
> > > borders.
> > > Do you prefer to have ff_ml.orth, ff_ne.orth and ff_sn.orth with the ISO 639-2
> > > lang code and country code. Or do you prefer to have their ISO 639-3 codes,
> > > ffm.orth, fuq.orth and fuc.orth?
> > 
> > Donno.  I assume they don't have Linux locales?
> > 
> > Roozbeh?
> 
> CLDR has ff_SN instead of fuc.

Then let's stick to that for now.
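
For illustration, the naming choice settled on here (follow the CLDR-style locale, e.g. ff_SN, rather than a separate ISO 639-3 code such as fuc) could be sketched as below. The lowercase language_region.orth pattern matches the ff_ml.orth, ff_ne.orth and ff_sn.orth names proposed in comment #5; orth_filename is a hypothetical helper, not a fontconfig function.

```python
def orth_filename(locale_id):
    """Map a locale ID such as 'ff_SN' or 'ff-SN' to a lowercase .orth file name."""
    lang, _, region = locale_id.replace("-", "_").partition("_")
    name = lang.lower()
    if region:
        name += "_" + region.lower()
    return name + ".orth"


# Example: the CLDR locale mentioned above and a plain ISO 639-3 code.
names = [orth_filename("ff_SN"), orth_filename("ses")]
```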
Comment 21 Denis Jacquerye 2010-12-09 08:46:23 UTC
Created attachment 40956 [details]
Koyraboro Senni Songhai (ses)

Sorry, I made a mistake in the data for Koyraboro Senni Songhai ('ses') from Mali.
The DNAFLA paper has š, ž and ɲ; they can also be found in Songhai texts.

By the way, there seems to be some confusion between the tags 'son' and 'ses', even though 'son' is a 'Collective' language tag, neither a 'Macrolanguage' nor an 'Individual' language.
You can see the tag 'son' being used at http://pootle.locamotion.org/son/ where 'ses' should be used.
Comment 22 Khaled Hosny 2011-01-24 06:13:53 UTC
Sorry if this is not the proper place to ask. I'm interested in African orthographies using Arabic script, what is the policy for languages written in different scripts, file naming, orthography naming etc.?
Comment 23 Akira TAGOH 2012-02-20 21:14:26 UTC
(In reply to comment #22)
> Sorry if this is not the proper place to ask. I'm interested in African
> orthographies using Arabic script, what is the policy for languages written in
> different scripts, file naming, orthography naming etc.?

I think the script modifier mentioned in other bugs is what you want, but unfortunately fontconfig doesn't support it so far. We may need to think about that first. Moving to language tags, as suggested in Bug#16869, is another idea. FYI.
Comment 24 Akira TAGOH 2012-02-28 19:07:59 UTC
Reporting multiple issues in one bug makes it hard to keep track of them. It may be a good idea to file a separate bug per orth file, so we can review each one and close it once it's done.
Comment 25 GitLab Migration User 2018-08-20 21:50:12 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/fontconfig/fontconfig/issues/74.

