Oops, I didn't change the subject on this... Wrong thread. Sorry RDKitters. Best, Nick
Nicholas C. Firth | PhD Student | Cancer Therapeutics The Institute of Cancer Research | 15 Cotswold Road | Belmont | Sutton | Surrey | SM2 5NG T 020 8722 4033 | E [email protected] | W www.icr.ac.uk | Twitter @ICRnews ________________________________________ From: Nicholas Firth Sent: 03 October 2013 16:13 To: Greg Landrum Cc: [email protected] Subject: RE: [Rdkit-discuss] ambiguous detail Hi Greg, As you were so worried about about the size of the RDKit dist due to the addition of the SA score, I thought I might be able to help you out. I've done a bit of optimisation to Peter's code, and changed the format of the .pkl.gz file. This has meant a reduction in the size of the file from 4.76 MB to 3.85 MB (almost a 20% drop!) and also it has weirdly sped up the function readFragmentScores. I have attached the new .pkl.gz and the I have copied and pasted my version of the function below. I hope this helps you sleep better at night. def readFragmentScoresFirth(name='myZip'): import cPickle,gzip global _fscores _fscores = cPickle.load(gzip.open('%s.pkl.gz'%name)) outDict = {} for i in _fscores: for j in range(1,len(i)): outDict[i[j]] = float(i[0]) _fscores = outDict Best, Nick Nicholas C. Firth | PhD Student | Cancer Therapeutics The Institute of Cancer Research | 15 Cotswold Road | Belmont | Sutton | Surrey | SM2 5NG T 020 8722 4033 | E [email protected] | W www.icr.ac.uk | Twitter @ICRnews The Institute of Cancer Research: Royal Cancer Hospital, a charitable Company Limited by Guarantee, Registered in England under Company No. 534147 with its Registered Office at 123 Old Brompton Road, London SW7 3RP. This e-mail message is confidential and for use by the addressee only. If the message is received by anyone other than the addressee, please return the message to the sender by replying to it and then delete the message from your computer and network. ------------------------------------------------------------------------------ October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register > http://pubads.g.doubleclick.net/gampad/clk?id=60134791&iu=/4140/ostg.clktrk _______________________________________________ Rdkit-discuss mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

