Jean-Pierre André wrote: > Richard W.M. Jones wrote: >> A user of my program 'virt-v2v' is unable to convert their Windows >> Server 2003 guest. Apparently the Windows guest contains a malformed >> filename in C:\Windows\System32 which causes ntfs-3g to error when >> reading this directory. Anything that accesses this directory >> (eg. 'ls' or my program that uses readdir) fails with: >> >> ls: reading directory /sysroot/WINDOWS/system32: Invalid or >> incomplete multibyte or wide character >> >> (This is errno EILSEQ). >> >> Under Windows itself, the directory appears normal -- it is able to be >> listed and so on. There are two files with non-ASCII characters, but >> deleting both of them (using Windows) did not change the problem with >> ntfs-3g. >> >> I am not able to get a copy of the broken disk image, because it's >> 130GB in size. >> >> But I did manage to create a broken filesystem that behaves in a >> similar manner. I did that by hexediting a NTFS disk image to add an >> illegal UCS-2 character (U+DF00) to a filename. You can get that disk >> image by downloading the attachment here: > > By default ntfs-3g treats ntfs file names as utf16-le, > which means Unicode points beyond U+ffff are expected > to be encoded as surrogate pairs (U+d800..U+dfff). > > Windows makes apparently no special case for these > codes... But they are not valid Unicode points and > cannot be translated to valid utf8. > >> >> https://bugzilla.redhat.com/show_bug.cgi?id=1301593#c6
After reading the original report, I do not think the error is caused by the use of an illegal Unicode point. File names like "Chaînes" are perfectly valid... provided that an utf8 locale is used. Also, when hexediting a file name you may change the collating sequence leading to failures in directory index searches. If you need to hexedit, use chkdsk to fix the directory indexes. Can you get an image of the original partition, restricted to metadata, hopefully leading to a manageable size ? ntfsclone -mst -O - /dev/something | gzip > metadata.gz Regards Jean-Pierre [...] ------------------------------------------------------------------------------ Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311&iu=/4140 _______________________________________________ ntfs-3g-devel mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/ntfs-3g-devel
