Hi
I have numerous multimolecule sd files of unknown origin. Each file reads OK
and a few initial tests in pybel work (e.g. number of molecules etc). I now
want to do molecular weight distributions, similarity and identity searching
etc across them.
This is not a problem but since each file may well be generated from different
sources, they might have different charge assignments, hydrogens added/not
added. Will pybel fp_1 | fp_2 type similarity and canonical smiles identity
deal with this kind of issue automagically or is manual standardisation
required before the searches? I was thinking along the lines of stripping any
existing hydrogens then adding them by calling OB to ensure consistency. Is
this the best thing to do and how best to deal with formal charges and possibly
salts?
thanks,
Andy
------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
OpenBabel-discuss mailing list
OpenBabel-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/openbabel-discuss