The patch of "bio: modify __bio_add_page() to accept pages that
don't start a new segment" changes the way for adding one page
to bio:

        - previously by adding page after checking successfully
        - now by trying to add page and recover if it fails

Unfortunately the patch forgets to update bio->bi_iter.bi_size
before trying to add page, then the last vector for holding
the added page may not be covered if recouning segments is needed,
so bio->bi_phys_segments may become not consistent with the
actual bio page buffers after the page is added successfully
to the bio(after bi_iter.bi_size is added by 'len')

Suppose the page in the last vector can't be merged to bio, tragedy
will happen when __bio_add_page() is called to add another page:

        - blk_recount_segments() is called and the actual segments get
        figured out correctly

        - the actual segments may become queue_max_segments(q) plus one
        in failure path

        - driver will find the segment count is too big to handle.

The patch fixes the virtio-blk oops bug reported from Jet Chen in
below link:

        http://marc.info/?l=linux-kernel&m=140113053817095&w=2

Cc: Jens Axboe <ax...@kernel.dk>
Cc: Maurizio Lombardi <mlomb...@redhat.com>
Cc: Dongsu Park <dongsu.p...@profitbricks.com>
Cc: Christoph Hellwig <h...@lst.de>
Cc: Kent Overstreet <k...@daterainc.com>
Cc: Andrew Morton <a...@linux-foundation.org>
Reported-by: Jet Chen <jet.c...@intel.com>
Tested-by: Jet Chen <jet.c...@intel.com>
Signed-off-by: Ming Lei <ming....@canonical.com>
---
Andrew, could you put the patch in your -mm tree
because the previous two patches were routed from
your tree?

 block/bio.c |    4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/block/bio.c b/block/bio.c
index 0443694..f9bae56 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -744,6 +744,7 @@ static int __bio_add_page(struct request_queue *q, struct 
bio *bio, struct page
                                }
                        }
 
+                       bio->bi_iter.bi_size += len;
                        goto done;
                }
        }
@@ -761,6 +762,7 @@ static int __bio_add_page(struct request_queue *q, struct 
bio *bio, struct page
        bvec->bv_offset = offset;
        bio->bi_vcnt++;
        bio->bi_phys_segments++;
+       bio->bi_iter.bi_size += len;
 
        /*
         * Perform a recount if the number of segments is greater
@@ -802,7 +804,6 @@ static int __bio_add_page(struct request_queue *q, struct 
bio *bio, struct page
                bio->bi_flags &= ~(1 << BIO_SEG_VALID);
 
  done:
-       bio->bi_iter.bi_size += len;
        return len;
 
  failed:
@@ -810,6 +811,7 @@ static int __bio_add_page(struct request_queue *q, struct 
bio *bio, struct page
        bvec->bv_len = 0;
        bvec->bv_offset = 0;
        bio->bi_vcnt--;
+       bio->bi_iter.bi_size -= len;
        blk_recount_segments(q, bio);
        return 0;
 }
-- 
1.7.9.5

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to