xref: /linux/fs/unicode/README.utf8data (revision 955405d1174eebcd1b89ab335f720adc27d52b67)
1*955405d1SGabriel Krisman BertaziThe utf8data.h file in this directory is generated from the Unicode
2*955405d1SGabriel Krisman BertaziCharacter Database for version 11.0.0 of the Unicode standard.
3*955405d1SGabriel Krisman Bertazi
4*955405d1SGabriel Krisman BertaziThe full set of files can be found here:
5*955405d1SGabriel Krisman Bertazi
6*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/11.0.0/ucd/
7*955405d1SGabriel Krisman Bertazi
8*955405d1SGabriel Krisman BertaziIndividual source links:
9*955405d1SGabriel Krisman Bertazi
10*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/11.0.0/ucd/CaseFolding.txt
11*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/11.0.0/ucd/DerivedAge.txt
12*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/11.0.0/ucd/extracted/DerivedCombiningClass.txt
13*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/11.0.0/ucd/DerivedCoreProperties.txt
14*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/11.0.0/ucd/NormalizationCorrections.txt
15*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/11.0.0/ucd/NormalizationTest.txt
16*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/11.0.0/ucd/UnicodeData.txt
17*955405d1SGabriel Krisman Bertazi
18*955405d1SGabriel Krisman Bertazimd5sums (verify by running "md5sum -c README.utf8data"):
19*955405d1SGabriel Krisman Bertazi
20*955405d1SGabriel Krisman Bertazi  414436796cf097df55f798e1585448ee  CaseFolding.txt
21*955405d1SGabriel Krisman Bertazi  6032a595fbb782694456491d86eecfac  DerivedAge.txt
22*955405d1SGabriel Krisman Bertazi  3240997d671297ac754ab0d27577acf7  DerivedCombiningClass.txt
23*955405d1SGabriel Krisman Bertazi  2a4fe257d9d8184518e036194d2248ec  DerivedCoreProperties.txt
24*955405d1SGabriel Krisman Bertazi  4e7d383fa0dd3cd9d49d64e5b7b7c9e0  NormalizationCorrections.txt
25*955405d1SGabriel Krisman Bertazi  c9500c5b8b88e584469f056023ecc3f2  NormalizationTest.txt
26*955405d1SGabriel Krisman Bertazi  acc291106c3758d2025f8d7bd5518bee  UnicodeData.txt
27*955405d1SGabriel Krisman Bertazi
28*955405d1SGabriel Krisman Bertazisha1sums (verify by running "sha1sum -c README.utf8data"):
29*955405d1SGabriel Krisman Bertazi
30*955405d1SGabriel Krisman Bertazi  9184727adf7bd20e36312a68581d12ba3ffb9854  CaseFolding.txt
31*955405d1SGabriel Krisman Bertazi  86c55b3eb89de61704da16af9c3f22854f61b57d  DerivedAge.txt
32*955405d1SGabriel Krisman Bertazi  b615703f62b1dbc5110e91acc3ff8b3789a067cf  DerivedCombiningClass.txt
33*955405d1SGabriel Krisman Bertazi  f8b07ef116d7dc21a94f26e70178ed2acf8713e9  DerivedCoreProperties.txt
34*955405d1SGabriel Krisman Bertazi  a5fafb8998c0b8153a2a58430b8a35c811db0abc  NormalizationCorrections.txt
35*955405d1SGabriel Krisman Bertazi  070cdcb00cd4f0860e476750e404c59c2ebe9b25  NormalizationTest.txt
36*955405d1SGabriel Krisman Bertazi  0e060fafb08d6722fbec56d9f9ebe8509f01d0ee  UnicodeData.txt
37*955405d1SGabriel Krisman Bertazi
38*955405d1SGabriel Krisman BertaziTo update to the newer version of the Unicode standard, the latest
39*955405d1SGabriel Krisman Bertazireleased version of the UCD can be found here:
40*955405d1SGabriel Krisman Bertazi
41*955405d1SGabriel Krisman Bertazi  http://www.unicode.org/Public/UCD/latest/
42*955405d1SGabriel Krisman Bertazi
43*955405d1SGabriel Krisman BertaziTo build the utf8data.h file, from a kernel tree that has been built,
44*955405d1SGabriel Krisman Bertazicd to this directory (fs/unicode) and run this command:
45*955405d1SGabriel Krisman Bertazi
46*955405d1SGabriel Krisman Bertazi	make C=../.. objdir=../.. utf8data.h.new
47*955405d1SGabriel Krisman Bertazi
48*955405d1SGabriel Krisman BertaziAfter sanity checking the newly generated utf8data.h.new file (the
49*955405d1SGabriel Krisman Bertaziversion generated from the 11.0.0 UCD should be 13,834 lines long, and
50*955405d1SGabriel Krisman Bertazihave a total size of 1104k) and/or comparing it with the older version
51*955405d1SGabriel Krisman Bertaziof utf8data.h, rename it to utf8data.h.
52*955405d1SGabriel Krisman Bertazi
53*955405d1SGabriel Krisman BertaziIf you are a kernel developer updating to a newer version of the
54*955405d1SGabriel Krisman BertaziUnicode Character Database, please update this README.utf8data file
55*955405d1SGabriel Krisman Bertaziwith the version of the UCD that was used, the md5sum and sha1sums of
56*955405d1SGabriel Krisman Bertazithe *.txt files, before checking in the new versions of the utf8data.h
57*955405d1SGabriel Krisman Bertaziand README.utf8data files.
58