Appendix 4. ISO 639-1, ISO 639-2 and ISO 639-3 implementation examples
ISO 639-1 and ISO 639-2
Language property in the Data Catalog Vocabulary (DCAT) – Version 2 (W3C Recommendation 04 February 2020):
Range: | Resources defined by the Library of Congress (ISO 639-1, ISO 639-2) SHOULD be used.If a ISO 639-1 (two-letter) code is defined for language, then its corresponding IRI SHOULD be used; if no ISO 639-1 code is defined, then IRI corresponding to the ISO 639-2 (three-letter) code SHOULD be used. |
Usage note: | Repeat this property if the resource is available in multiple languages. |
The same wording is included in the Data Catalog Vocabulary (DCAT) – Version 3
W3C Working Draft 10 May 2022.
Codes for the Representation of Names of Languages arranged alphabetically by alpha-3/ISO 639-2 Code: http://www.loc.gov/standards/iso639-2/php/code_list.php.
ISO 639-3
ISO 639-3 extends the ISO 639-2 alpha-3 codes with an aim to cover all known natural languages and works better for such languages as Cebuano, Montenegrin, Quechua (which has variations by region of the country) languages. For example, it’s recommended in the ALICIA repository Guide (also a video guide, Peru).
Metadata recommendations for text material stored in Finnish publication repositories recommend the ISO 639-X standard for dc.language.iso. It is preferable to use the 3-character language codes of ISO 639-2 or ISO 639-3, as appropriate.
There are still some implementation issues for a three-letter code as not all repositories could support this now (software and XML language issues) and there might be similar issues with aggregators (for example, OpenAIRE follows https://www.w3.org/TR/xml/ and https://www.w3.org/TR/xml/#RFC1766).
More about language tags
A useful and more descriptive article on “Language tags in HTML and XML” (2014) from W3 with examples:
and a proposal to use
For many lesser-known languages spoken by minorities and also for historical stages of languages, language codes, the basis of language tags, are simply not available, see “The Shortcomings of Language Tags for Linked Data When Modeling Lesser-Known Languages” with recommendations to improve or develop ISO language codes.
Recent Comments in this Document
June 30, 2023 at 5:22 am
Sometimes names of Indigenous Australians will include their traditional country and that needs to be captured
See in context
June 30, 2023 at 5:20 am
The issue of controlled vocabularies not being comprehensive, is especially the case in relation to First Nations people in Australia
See in context
June 30, 2023 at 5:18 am
Presumably this is an area where AI could actually usefully be employed, though it is not my area of expertise
See in context
June 30, 2023 at 5:13 am
We are particularly keen to note the importance of inclusion of Indigenous languages in these discussions. Alongside the use of text in a specific language is also the need to identify the Indigenous country that the work is associated with. There is work ongoing for this in Australia.
See in context
June 30, 2023 at 5:02 am
We are concerned that this will never happen at any useful scale – below the very high level. For example, it is already hard enough to get people to change US English to Australian English in a familiar word processing tool such as Word.
See in context
June 30, 2023 at 1:26 am
(This is actually a comment on Appendix 3.)There is a correction regarding the URL as follows.JPCOAR Metadata Schema 2.0 Drafthttps://schema.irdb.nii.ac.jp/ja/schema/2.0-draft/14https://schema.irdb.nii.ac.jp/ja/schema/2.0-draft/1 –>JPCOAR Metadata Schema 2.0https://schema.irdb.nii.ac.jp/en/schema/2.0/14https://schema.irdb.nii.ac.jp/en/schema/2.0/1
See in context
June 30, 2023 at 1:16 am
I corrected the descriptions about WEKO3.before: WEKO3 is a cloud-based repository system supported by JPCOAR (Japan Consortium for Open Access Repositories). It is developed based on INVENIO by CERN.↓after: WEKO3 is a repository software developed by NII (National Institute of Informatics, Japan) based on INVENIO by CERN. This software operates JAIRO Cloud, a cloud-based repository system, which is supported by JPCOAR (Japan Consortium for Open Access Repositories) and NII.before: Specifically, ISO-639-3 is acceptable as the language of the text and for a language attribute of other metadata elements, ISO-639-1 is acceptable.↓after: Specifically, ISO-639-3 is acceptable as the language of the text, and for a language attribute of other metadata elements, ISO-639-1 is acceptable.before: With each field, you can add a language tag in the form of a two-character ISO using the dropdown menu.↓after: With WEKO3, you can add the language tag in the form of a two-character ISO using the dropdown menu, checkbox, and radio button.
See in context
June 30, 2023 at 1:08 am
It would be recommended that the introduction and recommendations be translated in multiple languages and shared from the COAR website to promote and encourage the multilingualism.
See in context
June 28, 2023 at 6:04 pm
Besides the recognition of the translator we encourage to recognize all the professionals involved in the editorial process(I added) mentioned in the resource, it takes a lot of time but it is worthy. This is an example:
.contributor.assistanttotheeditorinchief
Cruz Salas, Minerva
dc.contributor.businessmanager
Zempoalteca Quintana, Mario
dc.contributor.copyeditorandtranslator
Dashner Monk, Heather
dc.contributor.designer
Pérez Ramírez, Patricia
dc.contributor.editorinchief
Jiménez, Teresa Andreu
dc.contributor.layout
Álvarez Sotelo, María Elena
dc.contributor.salesandcirculationmanager
Creamer Tejeda, Cynthia
dc.contributor.translator
Fernández Hall, María Cristina
In fact, RECOLECTA evaluation for repositories considers this point: “4.5.- Existe un campo específico para indicar la descripción de la colaboración. En este campo se registra la entidad o persona responsable de coordinar, corregir, comentar o, en general, contribuir de alguna otra manera al desarrollo del recurso” (4.5.- There is a specific field to indicate the description of the collaboration. This field records the entity or person responsible for coordinating, correcting, commenting or, in general, contributing in some other way to the development of the resource.
https://calidadrevistas.fecyt.es/sites/default/files/informes/2021guiaevaluacionrecolecta_vf.pdf
See in context
June 28, 2023 at 5:29 pm
ORCID or ISNE help to identify authors or creators nowadays, but the use of authorities catalogues should be included.
See in context