Hi Christos, I recently did something like this for a fingerprint, I can't say whether or not the way you proposed works (though if it does it's a shortcut on what I did!)
What I did was
cfac =
ChemicalFeatures.BuildFeatureFactory('/Users/nfirth/Desktop/PythonScripts/ImportantText/featDef.txt')
allRDKitFeats = cfac.GetFeaturesForMol(mol)
for i in allRDKitFeats:
if(i.GetFamily() == 'HBondAcceptor'):
hBondAcceptors.append(i.GetAtomIds()[0])
elif(i.GetFamily() == 'HBondDonor'):
hBondDonors.append(i.GetAtomIds()[0])
elif(i.GetFamily() == 'Aromatic'):
aroAtoms.append(i.GetAtomIds()[0])
Which as you can see is a bit more complicated, but definitely gives the right
results. The contents of the file featDef.txt is as follows:
DefineFeature HDonor1 [N,O;!H0]
Family HBondDonor
Weights 1.0
EndFeature
DefineFeature HAcceptor1 [N,O]
Family HBondAcceptor
Weights 1.0
EndFeature
DefineFeature AromaticAtom [a]
Family Aromatic
Weights 1.0
EndFeature
Hope that helps if your solution doesn't turn out. If it does then thanks for
the shortcut!
Best,
Nick
Nicholas C. Firth | PhD Student | Cancer Therapeutics
The Institute of Cancer Research | 15 Cotswold Road | Belmont | Sutton | Surrey
| SM2 5NG
T 020 8722 4033 | E [email protected]<mailto:[email protected]> |
W www.icr.ac.uk<http://www.icr.ac.uk/> | Twitter
@ICRnews<https://twitter.com/ICRnews>
Facebook
www.facebook.com/theinstituteofcancerresearch<http://www.facebook.com/theinstituteofcancerresearch>
Making the discoveries that defeat cancer
[cid:[email protected]]
On 26 Jul 2013, at 13:54, Christos Kannas
<[email protected]<mailto:[email protected]>> wrote:
Dear RDKiters,
I'm creating a descriptor for estimating water solubility (clogSw) base on the
following article of Delaney (doi:10.1021/ci034243x).
J. S. Delaney, “ESOL: Estimating Aqueous Solubility Directly from Molecular
Structure,” Journal of Chemical Information and Modeling, vol. 44, no. 3, pp.
1000–1005, May 2004.
In this paper he proposes an equation to calculate an estimation of the water
solubility of molecules based on physio-chemical descriptors.
One of the descriptors used is Aromatic Proportion, that is the proportion of
heavy atoms of the molecule that are in aromatic ring.
So in order to find the aromatic heavy atoms I use GetSubstructMatches(...)
with query SMARTS '[a]'. Is that the correct way to find all the aromatic atoms
of a molecule? If not what is the correct SMARTS to use?
@Greg: When I complete this, can we look into adding it as a new descriptor,
clogSw (like clogP), within the RDKit distribution?
Kind Regards,
Christos
--
Christos Kannas
Researcher
Ph.D Student
e-Health Laboratory<http://www.medinfo.cs.ucy.ac.cy/>
[email protected]<mailto:[email protected]>
[email protected]<mailto:[email protected]>
[email protected]<mailto:[email protected]>
Mob: (+357) 99530608
------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
The Institute of Cancer Research: Royal Cancer Hospital, a charitable Company
Limited by Guarantee, Registered in England under Company No. 534147 with its
Registered Office at 123 Old Brompton Road, London SW7 3RP.
This e-mail message is confidential and for use by the addressee only. If the
message is received by anyone other than the addressee, please return the
message to the sender by replying to it and then delete the message from your
computer and network.<<inline: image001.gif>>
------------------------------------------------------------------------------ See everything from the browser to the database with AppDynamics Get end-to-end visibility with application monitoring from AppDynamics Isolate bottlenecks and diagnose root cause in seconds. Start your free trial of AppDynamics Pro today! http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________ Rdkit-discuss mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

