SlideShare a Scribd company logo
1
Diakritika in Unicode
Reinhold Heuvelmann
Ç↔C+◌̧
Code Charts
http://www.unicode.org/charts/ , Stichwort "combining"
– http://www.unicode.org/charts/PDF/U0300.pdf
– http://www.unicode.org/charts/PDF/U1AB0.pdf
– http://www.unicode.org/charts/PDF/U1DC0.pdf
– http://www.unicode.org/charts/PDF/U20D0.pdf
– http://www.unicode.org/charts/PDF/UFE20.pdf
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20172
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20173
Stacking Sequences, Beispiel 1
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20174
http://www.unicode.org/versions/Unicode9.0.0/ch02.pdf
Stacking Sequences, Beispiel 2
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20175
Aus den FAQ zu
Characters and Combining Marks
Q: Why are new combinations of Latin letters with
diacritical marks not suitable for addition to Unicode?
A: There are several reasons. First, Unicode encodes many
diacritical marks, and the combinations can already be
produced, as noted in the answers to some questions
above. If precomposed equivalents were added, the
number of multiple spellings would be increased, and
decompositions would need to be defined and
maintained for them, adding to the complexity of
existing decomposition tables in implementations.
...
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20176
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20177
Aus den FAQ zu
Characters and Combining Marks
...
Finally, normalization form NFC (the composed form
favored for use on the Web) is frozen—no new letter
combinations can be added to it. Therefore, the normalized
NFC representation of any new precomposed letters would
still use decomposed sequences, which can already be
expressed by combining character sequences in Unicode.
Nothing would be gained by adding the letter with
diacritical mark as a precomposed character; on the
contrary, adding such a letter would add one or more
multiple spellings to be reckoned with, incrementally
complicating all Unicode implementations for no net gain.
Kombinationen, Kombinationen
https://en.wikipedia.org/wiki/List_of_precomposed_Latin_characters_in_Unicode
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20178
In der Anwendung durch die DNB
– aktuell keine Zeichensatz-Konversion bei der Erstellung
von bibliografischen Daten
– "garbage in - garbage out"
– MARC 21 ist neutral in Bezug auf Unicode composed
vs. decomposed
– Wo beginnen, wo enden?
– Tools sind vorhanden:
"uconv -f utf-8 -t utf-8 -x NFC [Datei]"
(mit Dank an Johann Rolschewski)
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20179
Danke
r.heuvelmann@dnb.de
| 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 201710
MARC 21 Specifications for Record Structure, Character Sets, and
Exchange Media, CHARACTER SETS AND ENCODING OPTIONS:
Part 3: Unicode Encoding Environment
https://www.loc.gov/marc/specifications/speccharucs.html
Part 4: Conversion Between Environments
https://www.loc.gov/marc/specifications/speccharconversion.html
Assessment of Options for Handling Full Unicode Character
Encodings in MARC21
https://www.loc.gov/marc/marbi/2004/2004-report01.pdf
https://www.loc.gov/marc/marbi/2005/2005-report01.pdf

More Related Content

More from Reinhold Heuvelmann

Open Access und Lizenzangaben in MARC 21
Open Access und Lizenzangaben in MARC 21Open Access und Lizenzangaben in MARC 21
Open Access und Lizenzangaben in MARC 21
Reinhold Heuvelmann
 
Overview of Format Activities in Die Deutsche Bibliothek
Overview of Format Activities in Die Deutsche BibliothekOverview of Format Activities in Die Deutsche Bibliothek
Overview of Format Activities in Die Deutsche Bibliothek
Reinhold Heuvelmann
 
Provenance in MARC 21
Provenance in MARC 21Provenance in MARC 21
Provenance in MARC 21
Reinhold Heuvelmann
 
Linked Data at the German National Library
Linked Data at the German National LibraryLinked Data at the German National Library
Linked Data at the German National Library
Reinhold Heuvelmann
 
Entitäten, Relationen und mehr - Erweiterungen in MARC 21 Authority durch di...
Entitäten, Relationen und mehr - Erweiterungen in MARC 21 Authority durch di...Entitäten, Relationen und mehr - Erweiterungen in MARC 21 Authority durch di...
Entitäten, Relationen und mehr - Erweiterungen in MARC 21 Authority durch di...
Reinhold Heuvelmann
 
Some requirements for a future metadata format
Some requirements for a future metadata formatSome requirements for a future metadata format
Some requirements for a future metadata format
Reinhold Heuvelmann
 
BIBFRAME
BIBFRAMEBIBFRAME
GND and URIs: Integration and Identification
GND and URIs: Integration and IdentificationGND and URIs: Integration and Identification
GND and URIs: Integration and Identification
Reinhold Heuvelmann
 
BIBFRAME
BIBFRAMEBIBFRAME
Warum ausgerechnet BIBFRAME?
Warum ausgerechnet BIBFRAME?Warum ausgerechnet BIBFRAME?
Warum ausgerechnet BIBFRAME?
Reinhold Heuvelmann
 
BIBFRAME: Wie geht es weiter?
BIBFRAME: Wie geht es weiter?BIBFRAME: Wie geht es weiter?
BIBFRAME: Wie geht es weiter?
Reinhold Heuvelmann
 
2015 02-24 dnb-linking_data
2015 02-24 dnb-linking_data2015 02-24 dnb-linking_data
2015 02-24 dnb-linking_data
Reinhold Heuvelmann
 
Typen von Publikationen nach RDA
Typen von Publikationen nach RDATypen von Publikationen nach RDA
Typen von Publikationen nach RDA
Reinhold Heuvelmann
 
BIBFRAME on its way
BIBFRAME on its wayBIBFRAME on its way
BIBFRAME on its way
Reinhold Heuvelmann
 
BIBFRAME Report from the German National Library
BIBFRAME Report from the German National LibraryBIBFRAME Report from the German National Library
BIBFRAME Report from the German National Library
Reinhold Heuvelmann
 
Die Implementierung von Content Type, Media Type und Carrier Type
Die Implementierung von Content Type, Media Type und Carrier TypeDie Implementierung von Content Type, Media Type und Carrier Type
Die Implementierung von Content Type, Media Type und Carrier Type
Reinhold Heuvelmann
 
Das MARC-Feld 924 "Bestandsinformationen"
Das MARC-Feld 924 "Bestandsinformationen"Das MARC-Feld 924 "Bestandsinformationen"
Das MARC-Feld 924 "Bestandsinformationen"
Reinhold Heuvelmann
 
Content type, Media type, Carrier type und ihre Implementierung
Content type, Media type, Carrier type und ihre ImplementierungContent type, Media type, Carrier type und ihre Implementierung
Content type, Media type, Carrier type und ihre Implementierung
Reinhold Heuvelmann
 

More from Reinhold Heuvelmann (20)

Open Access und Lizenzangaben in MARC 21
Open Access und Lizenzangaben in MARC 21Open Access und Lizenzangaben in MARC 21
Open Access und Lizenzangaben in MARC 21
 
Overview of Format Activities in Die Deutsche Bibliothek
Overview of Format Activities in Die Deutsche BibliothekOverview of Format Activities in Die Deutsche Bibliothek
Overview of Format Activities in Die Deutsche Bibliothek
 
Provenance in MARC 21
Provenance in MARC 21Provenance in MARC 21
Provenance in MARC 21
 
Linked Data at the German National Library
Linked Data at the German National LibraryLinked Data at the German National Library
Linked Data at the German National Library
 
Entitäten, Relationen und mehr - Erweiterungen in MARC 21 Authority durch di...
Entitäten, Relationen und mehr - Erweiterungen in MARC 21 Authority durch di...Entitäten, Relationen und mehr - Erweiterungen in MARC 21 Authority durch di...
Entitäten, Relationen und mehr - Erweiterungen in MARC 21 Authority durch di...
 
Some requirements for a future metadata format
Some requirements for a future metadata formatSome requirements for a future metadata format
Some requirements for a future metadata format
 
BIBFRAME
BIBFRAMEBIBFRAME
BIBFRAME
 
GND and URIs: Integration and Identification
GND and URIs: Integration and IdentificationGND and URIs: Integration and Identification
GND and URIs: Integration and Identification
 
BIBFRAME
BIBFRAMEBIBFRAME
BIBFRAME
 
Warum ausgerechnet BIBFRAME?
Warum ausgerechnet BIBFRAME?Warum ausgerechnet BIBFRAME?
Warum ausgerechnet BIBFRAME?
 
BIBFRAME: Wie geht es weiter?
BIBFRAME: Wie geht es weiter?BIBFRAME: Wie geht es weiter?
BIBFRAME: Wie geht es weiter?
 
2015 02-24 dnb-linking_data
2015 02-24 dnb-linking_data2015 02-24 dnb-linking_data
2015 02-24 dnb-linking_data
 
Typen von Publikationen nach RDA
Typen von Publikationen nach RDATypen von Publikationen nach RDA
Typen von Publikationen nach RDA
 
BIBFRAME on its way
BIBFRAME on its wayBIBFRAME on its way
BIBFRAME on its way
 
BIBFRAME: Potential und Risiko
BIBFRAME: Potential und RisikoBIBFRAME: Potential und Risiko
BIBFRAME: Potential und Risiko
 
BIBFRAME Report from the German National Library
BIBFRAME Report from the German National LibraryBIBFRAME Report from the German National Library
BIBFRAME Report from the German National Library
 
Die Implementierung von Content Type, Media Type und Carrier Type
Die Implementierung von Content Type, Media Type und Carrier TypeDie Implementierung von Content Type, Media Type und Carrier Type
Die Implementierung von Content Type, Media Type und Carrier Type
 
MODS und MADS
MODS und MADSMODS und MADS
MODS und MADS
 
Das MARC-Feld 924 "Bestandsinformationen"
Das MARC-Feld 924 "Bestandsinformationen"Das MARC-Feld 924 "Bestandsinformationen"
Das MARC-Feld 924 "Bestandsinformationen"
 
Content type, Media type, Carrier type und ihre Implementierung
Content type, Media type, Carrier type und ihre ImplementierungContent type, Media type, Carrier type und ihre Implementierung
Content type, Media type, Carrier type und ihre Implementierung
 

Recently uploaded

High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
bhumivarma35300
 
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdfAcumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
BrainSell Technologies
 
Integrating Kafka with MuleSoft 4 and usecase
Integrating Kafka with MuleSoft 4 and usecaseIntegrating Kafka with MuleSoft 4 and usecase
Integrating Kafka with MuleSoft 4 and usecase
shyamraj55
 
WhatsApp Spy Online Trackers and Monitoring Apps
WhatsApp Spy Online Trackers and Monitoring AppsWhatsApp Spy Online Trackers and Monitoring Apps
WhatsApp Spy Online Trackers and Monitoring Apps
HackersList
 
Feature sql server terbaru performance.pptx
Feature sql server terbaru performance.pptxFeature sql server terbaru performance.pptx
Feature sql server terbaru performance.pptx
ssuser1915fe1
 
Patch Tuesday de julio
Patch Tuesday de julioPatch Tuesday de julio
Patch Tuesday de julio
Ivanti
 
Tailored CRM Software Development for Enhanced Customer Insights
Tailored CRM Software Development for Enhanced Customer InsightsTailored CRM Software Development for Enhanced Customer Insights
Tailored CRM Software Development for Enhanced Customer Insights
SynapseIndia
 
Vertex AI Agent Builder - GDG Alicante - Julio 2024
Vertex AI Agent Builder - GDG Alicante - Julio 2024Vertex AI Agent Builder - GDG Alicante - Julio 2024
Vertex AI Agent Builder - GDG Alicante - Julio 2024
Nicolás Lopéz
 
How to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptxHow to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptx
Adam Dunkels
 
Girls call Kolkata 👀 XXXXXXXXXXX 👀 Rs.9.5 K Cash Payment With Room Delivery
Girls call Kolkata 👀 XXXXXXXXXXX 👀 Rs.9.5 K Cash Payment With Room Delivery Girls call Kolkata 👀 XXXXXXXXXXX 👀 Rs.9.5 K Cash Payment With Room Delivery
Girls call Kolkata 👀 XXXXXXXXXXX 👀 Rs.9.5 K Cash Payment With Room Delivery
sunilverma7884
 
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfBT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
Neo4j
 
Semantic-Aware Code Model: Elevating the Future of Software Development
Semantic-Aware Code Model: Elevating the Future of Software DevelopmentSemantic-Aware Code Model: Elevating the Future of Software Development
Semantic-Aware Code Model: Elevating the Future of Software Development
Baishakhi Ray
 
IPLOOK Remote-Sensing Satellite Solution
IPLOOK Remote-Sensing Satellite SolutionIPLOOK Remote-Sensing Satellite Solution
IPLOOK Remote-Sensing Satellite Solution
IPLOOK Networks
 
Litestack talk at Brighton 2024 (Unleashing the power of SQLite for Ruby apps)
Litestack talk at Brighton 2024 (Unleashing the power of SQLite for Ruby apps)Litestack talk at Brighton 2024 (Unleashing the power of SQLite for Ruby apps)
Litestack talk at Brighton 2024 (Unleashing the power of SQLite for Ruby apps)
Muhammad Ali
 
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
Priyanka Aash
 
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
Priyanka Aash
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
aslasdfmkhan4750
 
Acumatica vs. Sage Intacct _Construction_July (1).pptx
Acumatica vs. Sage Intacct _Construction_July (1).pptxAcumatica vs. Sage Intacct _Construction_July (1).pptx
Acumatica vs. Sage Intacct _Construction_July (1).pptx
BrainSell Technologies
 
Sonkoloniya documentation - ONEprojukti.pdf
Sonkoloniya documentation - ONEprojukti.pdfSonkoloniya documentation - ONEprojukti.pdf
Sonkoloniya documentation - ONEprojukti.pdf
SubhamMandal40
 
Premium Girls Call Mumbai 9920725232 Unlimited Short Providing Girls Service ...
Premium Girls Call Mumbai 9920725232 Unlimited Short Providing Girls Service ...Premium Girls Call Mumbai 9920725232 Unlimited Short Providing Girls Service ...
Premium Girls Call Mumbai 9920725232 Unlimited Short Providing Girls Service ...
shanihomely
 

Recently uploaded (20)

High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
 
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdfAcumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
 
Integrating Kafka with MuleSoft 4 and usecase
Integrating Kafka with MuleSoft 4 and usecaseIntegrating Kafka with MuleSoft 4 and usecase
Integrating Kafka with MuleSoft 4 and usecase
 
WhatsApp Spy Online Trackers and Monitoring Apps
WhatsApp Spy Online Trackers and Monitoring AppsWhatsApp Spy Online Trackers and Monitoring Apps
WhatsApp Spy Online Trackers and Monitoring Apps
 
Feature sql server terbaru performance.pptx
Feature sql server terbaru performance.pptxFeature sql server terbaru performance.pptx
Feature sql server terbaru performance.pptx
 
Patch Tuesday de julio
Patch Tuesday de julioPatch Tuesday de julio
Patch Tuesday de julio
 
Tailored CRM Software Development for Enhanced Customer Insights
Tailored CRM Software Development for Enhanced Customer InsightsTailored CRM Software Development for Enhanced Customer Insights
Tailored CRM Software Development for Enhanced Customer Insights
 
Vertex AI Agent Builder - GDG Alicante - Julio 2024
Vertex AI Agent Builder - GDG Alicante - Julio 2024Vertex AI Agent Builder - GDG Alicante - Julio 2024
Vertex AI Agent Builder - GDG Alicante - Julio 2024
 
How to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptxHow to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptx
 
Girls call Kolkata 👀 XXXXXXXXXXX 👀 Rs.9.5 K Cash Payment With Room Delivery
Girls call Kolkata 👀 XXXXXXXXXXX 👀 Rs.9.5 K Cash Payment With Room Delivery Girls call Kolkata 👀 XXXXXXXXXXX 👀 Rs.9.5 K Cash Payment With Room Delivery
Girls call Kolkata 👀 XXXXXXXXXXX 👀 Rs.9.5 K Cash Payment With Room Delivery
 
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfBT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
 
Semantic-Aware Code Model: Elevating the Future of Software Development
Semantic-Aware Code Model: Elevating the Future of Software DevelopmentSemantic-Aware Code Model: Elevating the Future of Software Development
Semantic-Aware Code Model: Elevating the Future of Software Development
 
IPLOOK Remote-Sensing Satellite Solution
IPLOOK Remote-Sensing Satellite SolutionIPLOOK Remote-Sensing Satellite Solution
IPLOOK Remote-Sensing Satellite Solution
 
Litestack talk at Brighton 2024 (Unleashing the power of SQLite for Ruby apps)
Litestack talk at Brighton 2024 (Unleashing the power of SQLite for Ruby apps)Litestack talk at Brighton 2024 (Unleashing the power of SQLite for Ruby apps)
Litestack talk at Brighton 2024 (Unleashing the power of SQLite for Ruby apps)
 
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
 
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
 
Acumatica vs. Sage Intacct _Construction_July (1).pptx
Acumatica vs. Sage Intacct _Construction_July (1).pptxAcumatica vs. Sage Intacct _Construction_July (1).pptx
Acumatica vs. Sage Intacct _Construction_July (1).pptx
 
Sonkoloniya documentation - ONEprojukti.pdf
Sonkoloniya documentation - ONEprojukti.pdfSonkoloniya documentation - ONEprojukti.pdf
Sonkoloniya documentation - ONEprojukti.pdf
 
Premium Girls Call Mumbai 9920725232 Unlimited Short Providing Girls Service ...
Premium Girls Call Mumbai 9920725232 Unlimited Short Providing Girls Service ...Premium Girls Call Mumbai 9920725232 Unlimited Short Providing Girls Service ...
Premium Girls Call Mumbai 9920725232 Unlimited Short Providing Girls Service ...
 

Diakritika in Unicode

  • 1. 1 Diakritika in Unicode Reinhold Heuvelmann Ç↔C+◌̧
  • 2. Code Charts http://www.unicode.org/charts/ , Stichwort "combining" – http://www.unicode.org/charts/PDF/U0300.pdf – http://www.unicode.org/charts/PDF/U1AB0.pdf – http://www.unicode.org/charts/PDF/U1DC0.pdf – http://www.unicode.org/charts/PDF/U20D0.pdf – http://www.unicode.org/charts/PDF/UFE20.pdf | 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20172
  • 3. | 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20173
  • 4. Stacking Sequences, Beispiel 1 | 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20174 http://www.unicode.org/versions/Unicode9.0.0/ch02.pdf
  • 5. Stacking Sequences, Beispiel 2 | 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20175
  • 6. Aus den FAQ zu Characters and Combining Marks Q: Why are new combinations of Latin letters with diacritical marks not suitable for addition to Unicode? A: There are several reasons. First, Unicode encodes many diacritical marks, and the combinations can already be produced, as noted in the answers to some questions above. If precomposed equivalents were added, the number of multiple spellings would be increased, and decompositions would need to be defined and maintained for them, adding to the complexity of existing decomposition tables in implementations. ... | 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20176
  • 7. | 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20177 Aus den FAQ zu Characters and Combining Marks ... Finally, normalization form NFC (the composed form favored for use on the Web) is frozen—no new letter combinations can be added to it. Therefore, the normalized NFC representation of any new precomposed letters would still use decomposed sequences, which can already be expressed by combining character sequences in Unicode. Nothing would be gained by adding the letter with diacritical mark as a precomposed character; on the contrary, adding such a letter would add one or more multiple spellings to be reckoned with, incrementally complicating all Unicode implementations for no net gain.
  • 9. In der Anwendung durch die DNB – aktuell keine Zeichensatz-Konversion bei der Erstellung von bibliografischen Daten – "garbage in - garbage out" – MARC 21 ist neutral in Bezug auf Unicode composed vs. decomposed – Wo beginnen, wo enden? – Tools sind vorhanden: "uconv -f utf-8 -t utf-8 -x NFC [Datei]" (mit Dank an Johann Rolschewski) | 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 20179
  • 10. Danke r.heuvelmann@dnb.de | 10 | Diakritika in Unicode | Datenbezieher-Workshop 30. Mai 201710 MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media, CHARACTER SETS AND ENCODING OPTIONS: Part 3: Unicode Encoding Environment https://www.loc.gov/marc/specifications/speccharucs.html Part 4: Conversion Between Environments https://www.loc.gov/marc/specifications/speccharconversion.html Assessment of Options for Handling Full Unicode Character Encodings in MARC21 https://www.loc.gov/marc/marbi/2004/2004-report01.pdf https://www.loc.gov/marc/marbi/2005/2005-report01.pdf