https://github.com/Sepero/SearchBin
   - compiles file in regex beforehand
   - python
=> ignored: naive
https://github.com/rsharo/bgrep
   - overcomplicated
   - no need for regex support, only offsets
   - no effective string match algs inside
   - no alg for mmap() usage
=> ignored: inadequate

Alg for mmap() is the most crucial part for performance after
algorithm itself. As alg+mmap() must be optimized to SDD/RAM/LCache
limitations and access type.


On Fri, Mar 24, 2017 at 11:37:15PM +0300, Alexander Krotov wrote:
On Fri, Mar 24, 2017 at 08:03:38PM +0200, Amer wrote:
Does anybody knows any cli tools which allows to search offset
of one binary file inside another ?

Find if ./chunk.bin contained and its offset inside ./dump.bin
    * simple case: 200kB inside 100MB
    * hard case: 2GB inside 100GB

I thought this idea is as old as PC itself, but I couldn't find any
tool in arch distro, nor by google, nor at github.
However, maybe my requests simply couldn't their descriptions.

I acknowledge that implementation isn't as simple as it seems,
when taking into account performance for hard case.

But I even couldn't find straight clean implementation in C / mmap().
I appreciate anything related, even links to sophisticated algorithms,
if there is really no tool / code-snippet in existence yet.
Not C, but Python: https://github.com/Sepero/SearchBin

For C, you can modify this one to make it read pattern from file or
mmap it: https://github.com/rsharo/bgrep



Reply via email to