_publ_author_name is not a good key for _publ_author

James Hester jamesrhester at gmail.com
Tue Mar 26 06:28:37 GMT 2019


Dear Core CIF group,

The publ_author category has _publ_author.name as the category key, meaning
that _publ_author.name can be used to select a unique row of the loop.
However, it has been pointed out that _publ_author.name is insufficient as
a key for the _publ_author loop, as some authors have the same name (there
are apparently 40 entries in the COD with this feature). One way to fix
this is to add a further disambiguating data name to _publ_author. The
suggestion is that something like "_publ_author.id" could be defined, which
would contain an arbitrary code and which would be added to the category
key. The lack of this data name in historical CIFs can be worked around by
assuming it has a constant value, and CIF curators can auto-generate it
when a situation is encountered with multiple identically-named authors.
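
To make the first option concrete, here is a minimal Python sketch. It
assumes each packet of the loop is held as a dict, that the category key
becomes the pair (_publ_author.name, _publ_author.id), and that a missing
id takes a constant default. The function name, the default value and the
"1", "2", ... codes are hypothetical, chosen only for illustration.

```python
from collections import Counter

# Hypothetical constant assumed for historical CIFs with no _publ_author.id.
DEFAULT_AUTHOR_ID = "default"


def composite_author_keys(rows):
    """Return one (name, id) key per packet of the _publ_author loop.

    Packets without an id get DEFAULT_AUTHOR_ID, unless their name is
    repeated in the loop, in which case an arbitrary code is generated.
    Assumption: generated codes do not clash with ids already in the file.
    """
    name_counts = Counter(row["_publ_author.name"] for row in rows)
    generated = Counter()
    keys = []
    for row in rows:
        name = row["_publ_author.name"]
        author_id = row.get("_publ_author.id")
        if author_id is None:
            if name_counts[name] > 1:
                # Curator step: disambiguate identically-named authors.
                generated[name] += 1
                author_id = str(generated[name])
            else:
                author_id = DEFAULT_AUTHOR_ID
        keys.append((name, author_id))
    return keys


example_rows = [
    {"_publ_author.name": "Smith, J."},
    {"_publ_author.name": "Smith, J."},
    {"_publ_author.name": "Jones, A."},
]
example_keys = composite_author_keys(example_rows)
```

With this scheme the two "Smith, J." packets receive distinct keys, while
the uniquely named author keeps the constant default, so a file without
duplicate names needs no change.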

An alternative approach would be to define "_publ_author.id" as the new key
of the category, with values that can be auto-generated for each packet.
While in a formal sense this makes all current CIFs non-conformant, I can't
imagine that it would affect most software, which will continue to work
with _publ_author.name.
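
A minimal Python sketch of this second option follows. It assumes packets
are held as dicts and that a reader fills in any missing _publ_author.id
from the packet's position in the loop. The function name and the
numbering scheme are hypothetical; any arbitrary unique code would serve.

```python
def fill_author_ids(rows):
    """Return copies of the packets, each with a unique _publ_author.id.

    Ids already present are kept. Missing ids are generated as "1", "2",
    ... in loop order, skipping any value already used in the loop.
    """
    used = {row["_publ_author.id"] for row in rows if "_publ_author.id" in row}
    counter = 0
    result = []
    for row in rows:
        packet = dict(row)  # leave the caller's data untouched
        if "_publ_author.id" not in packet:
            counter += 1
            while str(counter) in used:
                counter += 1
            packet["_publ_author.id"] = str(counter)
            used.add(str(counter))
        result.append(packet)
    return result


legacy_rows = [
    {"_publ_author.name": "Smith, J."},
    {"_publ_author.name": "Smith, J."},
    {"_publ_author.name": "Jones, A."},
]

# The id alone now selects a unique packet; the name remains available to
# software that still looks authors up by _publ_author.name.
authors_by_id = {row["_publ_author.id"]: row for row in fill_author_ids(legacy_rows)}
```

Because the id is derived on reading, a historical file needs no edits on
disk, and software that ignores the new data name sees the same packets as
before.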

Please indicate your preference or alternative solutions. Particularly
important is the perspective of software authors who may be impacted.

Thanks,
James.

-- 
T +61 (02) 9717 9907
F +61 (02) 9717 3145
M +61 (04) 0249 4148