Missing Character Set Encodings?
Dear IANA:
 
I have been examining the Character Set name specification you have
listed at the following URL on your Web site
 
 
It seems that a number of character sets are not listed in the above
online specification.  I was wondering if you could provide some insight
into why these are missing, as well as comment on the possibility of
their addition.
 
The character sets I found missing are based on an examination of the
encodings currently supported by Java.  My company is trying to link the
Java-based encodings to the IANA-based character set encodings used in
XML, HTML, etc.  This link will help us provide more coherent
international support for our internet products.
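
The kind of link described above can be sketched roughly as follows. This is only an illustration, assuming a Java runtime that provides the java.nio.charset API. The class name CharsetNameCheck and the helper canonicalName are hypothetical, and the names tested are a small sample of the Java encoding names discussed in this message. Which names resolve, and to what, depends on the runtime.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;

public class CharsetNameCheck {

    /**
     * Returns the runtime's canonical name for a Java encoding name,
     * or null if the runtime does not know the name.
     */
    static String canonicalName(String javaName) {
        try {
            return Charset.isSupported(javaName)
                    ? Charset.forName(javaName).name()
                    : null;
        } catch (IllegalCharsetNameException e) {
            // The name contains characters that are not legal in a charset name.
            return null;
        }
    }

    public static void main(String[] args) {
        // A small sample of the Java encoding names discussed in this message.
        String[] names = {"Cp874", "Cp949", "Cp950", "Cp1252", "GBK", "MS932", "MS936"};
        for (String n : names) {
            String canonical = canonicalName(n);
            if (canonical == null) {
                System.out.println(n + ": not supported by this runtime");
            } else {
                Charset cs = Charset.forName(n);
                // isRegistered() reports whether the runtime considers the
                // charset to be registered in the IANA charset registry.
                System.out.println(n + " -> " + canonical
                        + " (registered: " + cs.isRegistered()
                        + ", aliases: " + cs.aliases() + ")");
            }
        }
    }
}
```

A name that comes back as unsupported or unregistered would be a candidate for the kind of list given in this message; the output is not shown here because it varies between runtimes.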
 
The missing character sets for Microsoft's GBK, Thai (874), and
multibyte code pages (932, 936, 949 and 950) are of the greatest
concern.  The list of character sets I found missing is as follows (I
don't claim this to be comprehensive):
 
  No IANA definition for "Cp737"
  No IANA definition for "Cp838"
  No IANA definition for "Cp874"
  No IANA definition for "Cp875"
  No IANA definition for "Cp921"
  No IANA definition for "Cp922"
  No IANA definition for "Cp930"
  No IANA definition for "Cp933"
  No IANA definition for "Cp935"
  No IANA definition for "Cp937"
  No IANA definition for "Cp939"
  No IANA definition for "Cp942"
  No IANA definition for "Cp948"
  No IANA definition for "Cp949"
  No IANA definition for "Cp950"
  No IANA definition for "Cp964"
  No IANA definition for "Cp970"
  No IANA definition for "Cp1006"
  No IANA definition for "Cp1025"
  No IANA definition for "Cp1046"
  No IANA definition for "Cp1097"
  No IANA definition for "Cp1098"
  No IANA definition for "Cp1112"
  No IANA definition for "Cp1122"
  No IANA definition for "Cp1123"
  No IANA definition for "Cp1124"
  No IANA definition for "Cp1252"
  No IANA definition for "Cp1381"
  No IANA definition for "Cp1383"
  No IANA definition for "Cp33722"
  No IANA definition for "EUC-TW"
  No IANA definition for "GBK"
  No IANA definition for "ISO2022CN-CNS"
  No IANA definition for "ISO2022CN-GB"
  No IANA definition for "MS874"       // Though IBM-THAI might be the alias
  No IANA definition for "MS932"       // Though SHIFT-JIS might be the alias
  No IANA definition for "MS936"       // Though BIG5 might be the alias
  No IANA definition for "MS949"       // Though EUC-KR might be the alias
  No IANA definition for "MS950"
 
 
Your comments on how we can obtain a more complete IANA character set
specification would be appreciated.  If a later specification exists
that includes some (or all) of the missing character sets listed above,
please let me know how I might access it.
 
Thank you,
 
 
Craig R. Cummings
Product Internationalization Manager
Tools Division
NLS Group
Oracle Corporation
500 Oracle Parkway
M/S 2op11
Redwood Shores, CA  94065  USA
(email) crcummin@us.oracle.com
(tel) +1-650-506-4273
(fax) +1-650-506-7432 
(intranet) http://toolsnls.us.oracle.com