Hi Paul,

That's fixed in the GitHub master trunk of the RDKit, while it doesn't work 
with the released version (I know because I fixed that bug myself!). Most SDF 
files in the RCSB feature right-aligned elements, which is indeed allowed by 
the CT file specs.

Cheers,
p.

> On 10 Sep 2016, at 21:17, Paul Emsley <[email protected]> wrote:
> 
> 
> Thank you Paolo, that's very useful.
> 
> But does that file work for you?
> 
> >>> import rdkit
> >>> from rdkit import Chem
> >>> ms = [x for x in Chem.SDMolSupplier('example.sdf')]
> [21:04:24]
> 
> ****
> Post-condition Violation
> Element '' not found
> Violation occurred on line 91 in file 
> /Users/pemsley/autobuild/b-e-l/rdkit-Release_2015_03_1/Code/GraphMol/PeriodicTable.h
> Failed Expression: anum>-1
> ****
> 
> [21:04:24] Unexpected error hit on line 5
> 
> Comparing the heads:
> 
> % head Ligands_noHydrogens.sdf
> 1CX2_S58_A_701
>  RCSB PDB01141507103D
> Coordinates from PDB:1CX2:A:701 Model:1 without hydrogens
> 26 28  0  0  0  0            999 V2000
>   27.5570   21.7340   16.8580   C 0  0  0  0  0  0  0  0  0  0  0  0
>   26.3220   22.0290   16.3660   C 0  0  0  0  0  0  0  0  0  0  0  0
>   27.3340   21.1490   18.2120   C 0  0  0  0  0  0  0  0  0  0  0  0
>   28.3530   20.7370   19.2320   C 0  0  0  0  0  0  0  0  0  0  0  0
>   24.0280   21.4680   17.1130   C 0  0  0  0  0  0  0  0  0  0  0  0
>   23.4160   20.4550   17.8620   C 0  0  0  0  0  0  0  0  0  0  0  0
> 
> % head S58_ideal.sdf
> S58
>  -ISIS-            3D
> 
> 37 39  0  0  0  0  0  0  0  0  0
>    1.3830    0.0050   -2.9870 C   0  0  0  0  0
>    0.8620    0.0130   -1.7240 C   0  0  0  0  0
>    0.3190   -0.0060   -3.8900 C   0  0  0  0  0
>    0.4400   -0.0140   -5.3920 C   0  0  0  0  0
>   -1.4220   -0.0020   -0.7940 C   0  0  0  0  0
>   -2.5780    0.7620   -0.8830 C   0  0  0  0  0
> 
> The element column has shifted rightwards 2 spaces in 
> Ligands_noHydrogens.sdf.  Weird? From this output and presuming that if RDKit 
> can't read it then it's wrong, it seems to me that RCSB are distributing 
> non-conformant sdf files.
> 
> Do you agree?
> 
> Paul.
> 
> 
>> On 10/09/2016 18:58, Paolo Tosco wrote:
>> Hi Paul,
>> 
>> SDF files in the RCSB database do store PDB atom names, have a look at
>> 
>> http://www.rcsb.org/pdb/download/downloadLigandFiles.do?ligandIdList=S58&structIdList=1CX2&instanceType=all&excludeUnobserved=false&includeHydrogens=false
>> 
>> 
>> for an example.
>> 
>> Atom names stored in this fashion are read in by the RDKit and can be 
>> accessed as the
>> "molFileAlias" atom property; they can also be stored accordingly. See
>> 
>> https://gist.github.com/ptosco/6e4468350f0fff183e4507ef24f092a1#file-pdb_atom_names-ipynb
>> 
>> for an example.
>> 
>> Cheers,
>> p.
>> 
>>> On 10/09/2016 18:33, Paul Emsley wrote:
>>> Hi RDKiters,
>>> 
>>> Is it possible to store (PDB) atom names in sdf files?  If so, how is this 
>>> done? (It was not
>>> clear to me after reading CTFile.pdf.)
>>> 
>>> (I hope that this question is allowed.)
>>> 
>>> Thanks,
>>> 
>>> Paul.
>>> 
>>> ------------------------------------------------------------------------------
>>> _______________________________________________
>>> Rdkit-discuss mailing list
>>> [email protected]
>>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>> 
> 

------------------------------------------------------------------------------
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to