Hi Anthony,

On Fri, Jul 26, 2013 at 4:56 AM, Anthony Bradley <
[email protected]> wrote:

>  Hi all, ****
>
> ** **
>
> This is my first post on the rdkit mailing list, but I’ve been using it
> for a few months now (and think it’s awesome by the way).****
>
> **
>

Welcome! and thanks!


>  **
>
> I’ve found a slightly quirky behaviour.****
>
> ** **
>
> Rdkit can read in the below mol block but then the smiles it produces
> cannot be read in again.
>

I can reproduce this. Thanks for reporting it.


> ****
>
> ** **
>
> I think the problem is the lack of explicit hydrogen on the aromatic
> sulphur, leading to an inability to kekulize.****
>
> ** **
>
> I was wondering why this might occur?****
>
> **
>

I think there's actually an error in the input structure. It looks like
there should be a charge somewhere in the 5 ring.

In any case, what the RDKit is currently doing is wrong. It should either
fail on the input structure or produce a SMILES that can be read back in.
I'll file a bug for it.

-greg



>  **
>
> Thanks,****
>
> ** **
>
> Anthony****
>
> ** **
>
> Sim_1 = 'C=CCn1c2ccc(S(N)(=O)=O)cc2sc1NS(=O)(=O)c1ccc(Cl)s1' # None mol ->
> what rdkit outputs****
>
> ** **
>
> Smi_2 = 'C=CCn1c2ccc(S(N)(=O)=O)cc2sc1=NS(=O)(=O)c1ccc(Cl)s1' # Not None *
> ***
>
> ** **
>
> Smi_3 = ‘C=CCn1c2ccc(S(N)(=O)=O)cc2[sH]c1NS(=O)(=O)c1ccc(Cl)s1' # Not None
> ****
>
> ** **
>
> # Smi_2 and Smi_3 both hold a different oxidation state for the sulphur.
> According to the SDF it should be Smi_3.****
>
> ** **
>
> from rdkit import Chem****
>
> ** **
>
> sdf= """probmol****
>
>      RDKit          3D****
>
> ** **
>
> 26 28  0  0  0  0  0  0  0  0999 V2000****
>
>    40.2640  -47.5920   65.9800 N   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    41.1750  -46.7850   66.5750 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    41.4340  -46.7010   67.9480 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    42.4150  -45.8090   68.3990 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    43.1240  -45.0120   67.4860 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    42.8540  -45.1070   66.1140 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    41.8740  -45.9990   65.6750 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    41.4130  -46.2370   64.0380 S   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    40.2720  -47.4090   64.6280 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    39.4590  -48.0810   63.8510 N   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    39.3560  -48.5030   66.6990 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    39.9550  -49.8630   66.8630 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    40.2440  -50.3500   68.0660 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    39.3310  -47.9450   62.1280 S   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    40.7120  -48.0440   61.5180 O   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    38.4830  -49.0830   61.6020 O   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    38.5560  -46.4150   61.7050 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    39.4690  -45.0310   61.2360 S   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    37.9620  -44.2200   61.0510 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    36.8440  -44.9740   61.3330 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    37.8570  -42.5080   60.5230 Cl  0  0  0  0  0  0  0  0  0  0  0  0****
>
>    37.1890  -46.2470   61.7120 C   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    44.3650  -43.8910   68.0560 S   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    45.0700  -44.4650   69.2670 O   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    45.3810  -43.6550   66.9600 O   0  0  0  0  0  0  0  0  0  0  0  0****
>
>    43.6310  -42.3950   68.5100 N   0  0  0  0  0  0  0  0  0  0  0  0****
>
>   1  2  1  0****
>
>   1 11  1  0****
>
>   2  3  2  0****
>
>   3  4  1  0****
>
>   5 23  1  0****
>
>   5  4  2  0****
>
>   6  5  1  0****
>
>   7  6  2  0****
>
>   7  2  1  0****
>
>   8  9  2  0****
>
>   8  7  1  0****
>
>   9  1  1  0****
>
> 10  9  1  0****
>
> 11 12  1  0****
>
> 12 13  2  0****
>
> 14 10  1  0****
>
> 15 14  2  0****
>
> 16 14  2  0****
>
> 17 22  2  0****
>
> 17 14  1  0****
>
> 18 17  1  0****
>
> 19 18  1  0****
>
> 19 20  2  0****
>
> 20 22  1  0****
>
> 21 19  1  0****
>
> 23 26  1  0****
>
> 23 24  2  0****
>
> 25 23  2  0****
>
> M  END"""****
>
> ** **
>
> mol = Chem.MolFromMolBlock(sdf)****
>
> ** **
>
> mol is None****
>
> ** **
>
> # Gives false ****
>
> ** **
>
> # Then convert to smiles and back****
>
> smimol = Chem.MolFromSmiles(Chem.MolToSmiles(mol))****
>
> ** **
>
> smimol is None****
>
> ** **
>
> # Gives true****
>
> ** **
>
> ** **
>
>
> ------------------------------------------------------------------------------
> See everything from the browser to the database with AppDynamics
> Get end-to-end visibility with application monitoring from AppDynamics
> Isolate bottlenecks and diagnose root cause in seconds.
> Start your free trial of AppDynamics Pro today!
> http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
> _______________________________________________
> Rdkit-discuss mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>
>
------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to