Re: [Open Babel] Generating fs from sdf

2012-03-03 Thread Andrew Dalke
Hi Jochen, On Mar 3, 2012, at 2:59 AM, Jochen Schreiber wrote: > My problem now is that babel only one file expected and i have the complete > pubchem compound sdf files with over 2000 sdf files. > > If i concat all sdf files to one file it has over 187 GB and the index will > be over 2GB. You

Re: [Open Babel] Generating fs from sdf

2012-03-03 Thread Björn Grüning
Hi Jochen, one possibility is to convert all sdf-files to one SMILES file, that will be 1.9 GB. From that you can try to build a fs-index. Maybe split the SMILES file create different fs-indices and combine the results later. I would go with a database solution -> pgchem for the win! Ciao, Bjoern

Re: [Open Babel] Generating fs from sdf

2012-03-03 Thread Jochen Schreiber
Hello Bjoern, My usechase is to do a similarity search on local sdf files with babel and cdk. My problem now is that babel only one file expected and i have the complete pubchem compound sdf files with over 2000 sdf files. If i concat all sdf files to one file it has over 187 GB and the index w