“It would be helpful if CLDR could provide some locale-specific information on the structuring and use of personal names.”
The content and structure of personal names can vary widely from region to region and locale to locale. This session will cover the current standards that exist in LDAP, hcard, and HTML as well as various commercial implementations. Then a look at real word name examples, ranging from mononyms in Indonesia to patronymic and matronymic names in Iceland, Spain and Portugal to Arabic ancestral naming practices. We will look at name usage in different legal, business, familial and formality contexts. Finally, we will present the current state of the evolving effort to add locale-specific name structures to CLDR.
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
CLDR: What’s in a Personal Name?
1. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Yahoo! Confidential
Mike McKenna mgm.globalization(at)gmail.com
v.0.1
October 2020
CLDR: What’s in a Personal Name?
Internationalization and Unicode Conference 44 – (on-line) – October 2020 2
Address Details
How are names used in applications
• “Hello! My name is: ….”
• Mail merge
• Human Resources and
personnel data
• Onboarding and KYC
2. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 3
Address Details
Common web standards
Maybe standards can help us?
• LDAP
• hCard
• HTML
. . .
Name: {
honorific-prefix,
given-name,
additional-name,
family-name,
honorific-suffix,
nickname
}
W3 HTML
autofill fields
Internationalization and Unicode Conference 44 – (on-line) – October 2020 4
Address Details
Problem with names
安倍 晋三
Jane Doe
A
AZ
Elizabeth == Liz ?
Call me “Boomer”
• First name last?
• Mr, Mrs, Ms, Mx?
• Which initials?
• How many initials?
• Middle name?
• Legal name?
• Informal name?
• Nickname?
3. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 5
Address Details
Real word names - length
Let’s look at some real names
European names:
Mr. Jonas Edward Salk, M.D.
Jonas E. Salk
Jonas Salk
Mr. Salk
Jonas
JS
Length: Which fields to use?
• Full
• Medium
• Short
• Initials
length
Internationalization and Unicode Conference 44 – (on-line) – October 2020 6
Address Details
Real word names – how many names
Let’s look at some real names
Spanish names:
Señor Diego María Rivera
Barrientos
Diego Rivera Barrientos
Diego Rivera
Diego | Panzòn
DR
Length:
• TWO surnames
(usually) father mother
length
4. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 7
Address Details
Real word names – order of fields
Let’s look at some real names
Japanese names:
晋三安倍様
晋三安倍
晋三
安倍
His Excellency Shinzo ABE
Shinzo ABE
ABE
Shinzo
What order to use?
family name
given name
given name
family name
length
Internationalization and Unicode Conference 44 – (on-line) – October 2020 8
Address Details
Real word names - formality
Let’s look at some real names
Japanese
松居慶子
松居様
松居先生
松居さん
慶子ちゃん
Keiko Matsui
Matsui-sama
Matsui-sensei
Matsui-san
Keiko-chan
Herr Dr. Albert Schweitzer, MD
Herr Dr. Schweitzer
Dr. Schweitzer
Albert
length
formality
less formal
more formal German
5. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 9
Address Details
Name usage
Nominative
en-GB: Robert Jones
ru-RU: Александр
Исаевич
Солженицын
ja-JP: 松居慶子
ro-RO: Petre Țuțea
Sorting
• surname, given
• given, surname
Hello
<name>!
length
formality
u
s
a
g
e
Vocative / case
en-GB: Hi Bob!
ru-RU: Привет Саша!
ja-JP: 松居さん
こんにちは!
ro-RO: Salut domnule
Țuțea!
Internationalization and Unicode Conference 44 – (on-line) – October 2020 10
Address Details
Mononyms – only one name
• Indonesia
• Tamil Nadu
• Celebrities
“Full name”
6. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 11
Address Details
Lineage names – Father, Mother
• Patronymic – ex: ru_RU: <given> <patronymic> <surname>
hisname father-ovich familyname
Владимир Антонович Иванов
• Matronymic names
• Iceland – hisname father-son
Jón Einarsson
hername mother-dóttir
Guðrún Mínervudóttir
Internationalization and Unicode Conference 44 – (on-line) – October 2020 12
Address Details
Ancestral naming
• Example Saudi Arabia
Farah bint Fadi bin Sajid Al-Tikriti
ﻓ
ر
ح
ﺑ
ﻧ
ت
ﻓ
ﺎ
د
ي
ﺑ
ن
ﺳ
ﺎ
ﺟ
د
ا
ﻟ
ﺗ
ﻛ
ر
ﯾ
ﺗ
ﻲ
7. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 13
Address Details
More standards Industry
• LDAP
• hCard
• HTML
. . .
• OASIS xNL
• UPU S42
• Apple
• Google
• Microsoft
• Facebook
. . .
• Trulio
• Workday
more
complex
Internationalization and Unicode Conference 44 – (on-line) – October 2020 14
Address Details
Status of current standards
• Give you a ”standard” envelope to move data
• No information:
– What order fields are in for each region and language
– What to choose for different lengths
– Formality presentation forms
– Usage, sorting information
8. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 15
Address Details
CLDR – Proposal
• Fields
– Concise set of standard
fields
– Flexible to/from W3C
• Three axis for context
– Length
– Formality
– Usage
• Syntax & Structure
• Skeletons for most uses
• Fall-back mechanism
• Use locales to dictate script-
specific behavior
• Does NOT specify
– Field validation
– Linguistic inflections
Internationalization and Unicode Conference 44 – (on-line) – October 2020 16
Address Details
CLDR – Proposal: Fields
• prefix Mr., Ms., Mx., Dr., Prof., …
• given first name, given name
• middle middle name(s), patronymic
• surname last name, family name
• surname-extended second last name, ancestral name
• suffix Jr, III, MD, PhD, …
• nickname Bob, Jack, Venki, Misha, …
• monogram-override JR, LBJ, MGM, …
9. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 17
Address Details
CLDR – Proposal: Name Attributes
Context-Axis
• Length
– full
– long
– medium
– short
– monogram
– monogram-narrow
• Formality
– informal
– formal
• Usage
– nominative
– vocative
– sorting
length
formality
u
s
a
g
e
Internationalization and Unicode Conference 44 – (on-line) – October 2020 18
Address Details
CLDR – Proposal: LDML Structure
• LDML = Locale Data Markup Language (UTS #35)
• Structure
– Default layouts by context
• Skeletons
<personNames>
<personName length=”full" usage="nominative" style="formal">
{prefix} {given} {middle} {surname} {suffix}
</personName>
…
10. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 19
Address Details
CLDR – Proposal: LDML Structure
<personNames>
<personName length="full" usage="nominative" style="formal">{prefix} {given} {middle} {surname} {suffix}</personName>
<personName length="long" usage="nominative" style="formal">{prefix} {given} {middle-initial} {surname} {suffix}</personName>
<personName length="medium" usage="nominative" style="formal">{prefix} {given} {surname} {suffix}</personName>
<personName length="short" usage="nominative" style="formal">{prefix} {surname} {suffix}</personName>
<personName length="monogram" usage="nominative" style="formal">{given-initial}{middle-initial}{surname-initial}</personName>
<personName length="monogram-narrow" usage="nominative" style="formal">{surname-initial}</personName>
<personName length="long" usage="nominative" style="informal">{given} {middle-initial} {surname}</personName>
<personName length="medium" usage="nominative" style="informal">{given} ({nickname}) {surname}</personName>
<personName length="short" usage="nominative" style="informal">{given}</personName>
<personName length="monogram" usage="nominative" style="informal">{given-initial}{surname-initial}</personName>
<personName length="monogram-narrow" usage="nominative" style="informal">{given-initial}</personName>
<personName length="long" usage="sorting" style="informal">{surname}, {given} {middle}</personName>
...
<personName length="short" usage="vocative" style="informal">{nickname}</personName>
</personNames>
Internationalization and Unicode Conference 44 – (on-line) – October 2020 20
Address Details
CLDR – Proposal: Fall back
What if a skeleton can’t be found?
• length falls back to medium
full à long à medium ß short ß monogram
• usage falls back to nominative
vocative à nominative ß sorting
• style falls back to informal
length
formality
u
s
a
g
e
<personName length="medium" usage="nominative" style="informal">
11. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 21
Address Details
CLDR – Proposal: phonetics and name types
Japanese expects names
• Japanese Kanji and kana
• Phonetic kana
• Romajii (Latin) “international”
…
Styles by language and script
Internationalization and Unicode Conference 44 – (on-line) – October 2020 22
Address Details
CLDR – use case Names in Person object
personNames
Kanji
Kana
(phonetic)
Romaji
(Latin)
12. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 23
Address Details
CLDR – personName: Current status
• Preparing for submission to Unicode Consortium
• Hope to submit PRI for review during CLDR v39 cycle
• Need to finalize use cases and edge cases
Internationalization and Unicode Conference 44 – (on-line) – October 2020 24
Address Details
More to consider…
• Name types: birth, legal, preferred, alias, aka, cc, phonetic, …
• Casing: e.g. ”ABE Shinzu”
• Generation vs honorific: e.g. “J. Jones Jr.” vs “J. Jones Jr. PhD”
• Name prefixes: e.g. “van de”, “de la”, “bin”, “bint”
van der Poel de la Salle bin Abdallah
• Joint names: e.g. “Sue and John Jones JTWROS”
• Arabic ancestral names?
given middle surname-extended surname
given father grandfather family
?
13. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Yahoo! Confidential
Thank you!
CLDR Personal Name - Core Team
Mark Davis @ Google
Peter Edberg @ Apple
Rich Gillam @ Apple
Alex Kolisnychenko @ Microsoft
Kristi Lee @ Microsoft
Mike McKenna @ PayPal
<personName length="long" usage="sorting" style="informal">
{surname}, {given} {middle}
</personName>
Internationalization and Unicode Conference 44 – (on-line) – October 2020 26
Address Details
CLDR – Name Fields compared
Standard Prefix Given
name
Middle
name
Surname Second
surname
Suffix Nickname Monogram Other
CLDR prefix given middle surname surname-
extended
suffix nickname monogram-
override
LDAP
person obj
title givenName givenName
{1,n}
sn,
surname
generation
Qualifier {0,n}
initials initials,
cn,
commonName
hCard honorific-
prefix
given-
name
additional-
name
family-
name
honorific-
suffix
nickname fn,
“abbr”
attribute
HTML honorific-
prefix
given-
name,
cc-given-
name
additional-
name,
cc-
additional-
name
family-
name,
cc-family-
name
honorific-
suffix
nickname name,
cc-name
OASIS xNL PrecedingTitle
{0,n}
Title {0,n}
FirstName {0,n} MiddleName
{0,n}
NamePrefix
LastName
LastName {1,n} Generation
Identifier {0,n}
Suffix {0,n}
GeneralSuffix
Othername {0,n} NameLine
Alias
14. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 27
Address Details
CLDR – Name Fields compared
Standard Prefix Given
name
Middle
name
Surname Second
surname
Suffix Nickname Monogram Other
CLDR prefix given middle surname surname-
extended
suffix nickname monogram-
override
OECD PrecedingTitle,
Title{0,n}
FirstName * MiddleName
{0,n}
NamePrefix,
LastName *
Generation
Identifier {0,n}
Suffix {0,n},
GeneralSuffix
PersonType
UPU S42 form of address
prec qual
given name pt 1 given name pt 2
given name pt 3
interm qual
surname prefix
surname [pt 1]
surname [pt 2] name qualifier
qualification
patronymic
ADIS honorific middle name
2nd middle name
last name prefix
last name
suffix
generation
post honorific
preferred name full name
Google
person
honorificPrefix
phonetic
HonorificPrefix
givenName
phonetic
GivenName
middleName
phonetic
MiddleName
familyName
phonetic
FamilyName
honorificSuffix
phonetic
HonorificSuffix
Nickname
DEFAULT
(nickname) or
ALTERNATE_N
AME
displayName
displayName
LastFirst
Unstructured
Name
phonetic
FullName
Facebook
PersonInfo
first_name middle_name last_name name
name_format
Internationalization and Unicode Conference 44 – (on-line) – October 2020 28
Address Details
CLDR – Name Fields compared
Standard Prefix Given
name
Middle
name
Surname Second
surname
Suffix Nickname Monogram Other
CLDR prefix given middle surname surname-
extended
suffix nickname monogram-
override
Apple
Contact
namePrefix givenName
phonetic
GivenName
middleName
phonetic
MiddleName
familyName
phonetic
FamilName
nameSuffix nickname previousFamily
Name
Microsoft
personal
contact
title givenName
yomiGivenName
middleName surname
yomiSurname
generation nickName initials spouseName
Trulio FirstGivenName MiddleName FirstSurname SecondSurname ISOLatin1Name,
AdditionalFields.
FullName
Workday Prefix_Data {
Title,
Salutation
}
First_Name Middle_Name Last_Name Secondary_Last
_Name
Tertiary_Last
_Name
Suffix_Data {
Social,
Academic,
Hereditary,
Honorary,
Professional,
Religious,
Royal
}
Local_Name
_Detail_Data
15. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 29
Address Details
References - standards
Standards
• LDAP – Lightweight Directory Access Protocol + RFC4519
https://www.iana.org/assignments/ldap-parameters/ldap-parameters.xhtml
https://docs.ldap.com/specs/rfc4519.txt
ITU X.20 https://www.itu.int/itu-t/recommendations/rec.aspx?rec=X.520
• hCard
http://microformats.org/wiki/hcard
• W3C HTML autofill fields
https://www.w3.org/TR/html52/sec-forms.html#autofill-field
• OASIS xNL
http://docs.oasis-open.org/ciq/v3.0/cs02/specs/ciq-specs-v3-cs2.html#_Toc213384944
• UPU S42, ADIS, ISO 19160
http://xml.coverpages.org/ISO-FocusPlus-AddressingStandards-2010-06.pdf
http://xml.coverpages.org/namesAndAddresses.html
http://xml.coverpages.org/ADIS-Address-2001-1.pdf
• Unicode Technical Standard #35 Unicode Locale Data Markup Language (LDML)
http://unicode.org/reports/tr35/
Internationalization and Unicode Conference 44 – (on-line) – October 2020 30
Address Details
References - regulatory
Regulatory Standards
• OECD Standard for automatic exchange of financial account information in tax matters
http://www.oecd.org/tax/exchange-of-tax-information/standard-for-automatic-exchange-of-financial-account-information-in-tax-matters-second-edition-
9789264267992-en.htm
– OECD Common Reporting Standard XML Schema
http://www.oecd.org/tax/automatic-exchange/common-reporting-standard/schema-and-user-guide/#d.en.345315
• ICAO Travel Document Standard – Doc 9303
https://www.icao.int/publications/pages/publication.aspx?docnum=9303
• UK Deed Poll name change requirements
https://www.gov.uk/government/publications/change-of-name-guidance/use-and-change-of-names#grounds-for-refusing-to-change-a-name-on-a-home-office-
issued-document
16. CLDR Personal Names Proposal
44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020
Internationalization and Unicode Conference 44 – (on-line) – October 2020 31
Address Details
References - industry
Industry Standards
• Google Person / Name Object
https://developers.google.com/people/api/rest/v1/people#name
• Apple CNMutableContact person object
https://developer.apple.com/documentation/contacts/cnmutablecontact
• Facebook User Object
https://developers.facebook.com/docs/graph-api/reference/user/#fields
• Microsft Personal Contact Graph REST API
https://docs.microsoft.com/en-us/graph/api/resources/contact?view=graph-rest-1.0#properties
• Trulio Identity Schema
https://developer.trulioo.com/reference#identity-verification-verify
• Workday Person_Name_Detail_Data
https://community.workday.com/sites/default/files/file-
hosting/productionapi/Human_Resources/v35.0/Change_Legal_Name.html#Person_Name_Detail_DataType
Internationalization and Unicode Conference 44 – (on-line) – October 2020 32
Address Details
References - other
Other References
• Wikipedia, Personal Name
https://en.wikipedia.org/wiki/Personal_name
• Wikipedia, Naming Conventions
https://en.wikipedia.org/wiki/Wikipedia:Naming_conventions_(people)
Wikipedia Manual of Style, Biographical names
https://en.wikipedia.org/wiki/Wikipedia:Manual_of_Style/Biography#Names
• Wikidata personal name object
https://www.wikidata.org/wiki/Q1071027
• Graham Rhind, Global Sourcebook for International Data Management
https://www.grcdi.nl/gsb/global%20sourcebook.html
• A Guide to Names and Naming Practices
https://www.fbiic.gov/public/2008/nov/Naming_practice_guide_UK_2006.pdf
• Law Enforcement Guide to International Names
https://info.publicintelligence.net/ROCICInternationalNames.pdf
• Pan-Data: Names
https://github.com/pan-i18n/pan-data/blob/master/specs/Names.md