[Cif2-encoding] Splitting of imgCIF and other sub-topics. .. .. .

Bollinger, John C John.Bollinger at STJUDE.ORG
Thu Sep 16 16:48:06 BST 2010


James,

On Thursday, September 16, 2010 8:54 AM, James Hester wrote:
[...]
>For my part, I think the IUCr could handle manuscript submissions as follows:

For the most part I think your suggestions are reasonable (and so I omit them), but I hope you will clarify one of them:

>(iii) UTF8 introduction can be staged relatively slowly, starting from
>allowing it in a few non-essential datanames (e.g. defining
>_author_name_native_script or somesuch).  Let's remember that on day 1
>everything can still be ASCII as the dictionaries will be able to
>restrict character sets to ASCII

Are you suggesting that the character encoding of individual data values be configurable in the dictionary?  I suspect and hope that where you wrote "UTF8", you meant something more like "Unicode" -- i.e. the set of allowed (literal) characters, not their encoding.  Is that right?

If UTF-8 emerges as the only permitted encoding for CIF2, then the distinction between character set and encoding will be mainly a semantic one, but it nevertheless has implications for software design and behavior.  If UTF-8 does *not* emerge as the only permitted encoding for CIF2, then the distinction will make a tremendous difference.
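
To illustrate the distinction, here is a minimal Python sketch. The function name and the sample values are hypothetical and are not taken from any CIF dictionary or CIF software; the sketch only assumes that a dictionary-level restriction would apply to the characters of a value, while the encoding is a property of how the file's bytes are written.

```python
def within_ascii_repertoire(value: str) -> bool:
    """Hypothetical dictionary-style check: are all characters ASCII?

    This inspects characters (code points), not bytes, so the answer
    does not depend on how the file happens to be encoded.
    """
    return all(ord(ch) < 128 for ch in value)


# Hypothetical data values, already decoded to text.
plain_name = "Hester"
native_name = "J\u00fcrgen"  # contains U+00FC, outside ASCII

# The same characters can be serialized with different encodings.
utf8_bytes = native_name.encode("utf-8")
utf16_bytes = native_name.encode("utf-16-le")

# Different byte sequences, identical characters after decoding.
same_text = utf8_bytes.decode("utf-8") == utf16_bytes.decode("utf-16-le")

# For ASCII-only text, UTF-8 bytes are identical to ASCII bytes.
ascii_compatible = plain_name.encode("utf-8") == plain_name.encode("ascii")
```

In this sketch the repertoire check gives the same answer whichever encoding the bytes came from, which is why a character-set restriction and an encoding choice are separate questions for software.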


Best,

John
--
John C. Bollinger, Ph.D.
Department of Structural Biology
St. Jude Children's Research Hospital


Email Disclaimer:  www.stjude.org/emaildisclaimer