The blkverify block driver makes investigating image format data corruption much easier. A raw image initialized with the same contents as the test image (e.g. qcow2 file) must be provided. The raw image mirrors read/write operations and is used to verify that data read from the test image is correct. See docs/blkverify.txt for more information. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>
		
			
				
	
	
		
			70 lines
		
	
	
		
			3.0 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			70 lines
		
	
	
		
			3.0 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
| = Block driver correctness testing with blkverify =
 | |
| 
 | |
| == Introduction ==
 | |
| 
 | |
| This document describes how to use the blkverify protocol to test that a block
 | |
| driver is operating correctly.
 | |
| 
 | |
| It is difficult to test and debug block drivers against real guests.  Often
 | |
| processes inside the guest will crash because corrupt sectors were read as part
 | |
| of the executable.  Other times obscure errors are raised by a program inside
 | |
| the guest.  These issues are extremely hard to trace back to bugs in the block
 | |
| driver.
 | |
| 
 | |
| Blkverify solves this problem by catching data corruption inside QEMU the first
 | |
| time bad data is read and reporting the disk sector that is corrupted.
 | |
| 
 | |
| == How it works ==
 | |
| 
 | |
| The blkverify protocol has two child block devices, the "test" device and the
 | |
| "raw" device.  Read/write operations are mirrored to both devices so their
 | |
| state should always be in sync.
 | |
| 
 | |
| The "raw" device is a raw image, a flat file, that has identical starting
 | |
| contents to the "test" image.  The idea is that the "raw" device will handle
 | |
| read/write operations correctly and not corrupt data.  It can be used as a
 | |
| reference for comparison against the "test" device.
 | |
| 
 | |
| After a mirrored read operation completes, blkverify will compare the data and
 | |
| raise an error if it is not identical.  This makes it possible to catch the
 | |
| first instance where corrupt data is read.
 | |
| 
 | |
| == Example ==
 | |
| 
 | |
| Imagine raw.img has 0xcd repeated throughout its first sector:
 | |
| 
 | |
|     $ ./qemu-io -c 'read -v 0 512' raw.img
 | |
|     00000000:  cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd  ................
 | |
|     00000010:  cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd  ................
 | |
|     [...]
 | |
|     000001e0:  cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd  ................
 | |
|     000001f0:  cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd  ................
 | |
|     read 512/512 bytes at offset 0
 | |
|     512.000000 bytes, 1 ops; 0.0000 sec (97.656 MiB/sec and 200000.0000 ops/sec)
 | |
| 
 | |
| And test.img is corrupt, its first sector is zeroed when it shouldn't be:
 | |
| 
 | |
|     $ ./qemu-io -c 'read -v 0 512' test.img
 | |
|     00000000:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
 | |
|     00000010:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
 | |
|     [...]
 | |
|     000001e0:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
 | |
|     000001f0:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
 | |
|     read 512/512 bytes at offset 0
 | |
|     512.000000 bytes, 1 ops; 0.0000 sec (81.380 MiB/sec and 166666.6667 ops/sec)
 | |
| 
 | |
| This error is caught by blkverify:
 | |
| 
 | |
|     $ ./qemu-io -c 'read 0 512' blkverify:a.img:b.img
 | |
|     blkverify: read sector_num=0 nb_sectors=4 contents mismatch in sector 0
 | |
| 
 | |
| A more realistic scenario is verifying the installation of a guest OS:
 | |
| 
 | |
|     $ ./qemu-img create raw.img 16G
 | |
|     $ ./qemu-img create -f qcow2 test.qcow2 16G
 | |
|     $ x86_64-softmmu/qemu-system-x86_64 -cdrom debian.iso \
 | |
|                                         -drive file=blkverify:raw.img:test.qcow2
 | |
| 
 | |
| If the installation is aborted when blkverify detects corruption, use qemu-io
 | |
| to explore the contents of the disk image at the sector in question.
 |