"Robin Becker" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED]
Gerald Klix wrote:
Map the file into RAM by using the mmap module. The file's contents than is availabel as a seachable string.
that's a good idea, but I wonder if it actually saves on memory? I just tried regexing through a 25Mb file and end up with 40Mb as working set (it rose linearly as the loop progessed through the file). Am I actually saving anything by not letting normal vm do its thing?
You aren't saving memory in that sense, no. If you have any RAM spare the file will end up in it. However, if you are short on memory though, mmaping the file gives the VM the opportunity to discard pages from the file, instead of paging them out. Try again with a 25Gb file and watch the difference ;) YMMV.
:)
So we avoid dirty page writes etc etc. However, I still think I could get away with a small window into the file which would be more efficient.
--
Robin Becker
--
http://mail.python.org/mailman/listinfo/python-list