On 08/26/2012 06:49 PM, P. Michaud wrote:
> Hello,
>
> I used the command
>
> "uniq -dc myfile.txt"
>
> Here are some lines of the output:
>
>  2 ☼ turvy
>  2 ☼ with gay abandon
>  2 ☼ with reckless abandon
> 10 ☼ yyⅰ
>  9 ☼ yyⅹⅲ
>  2 ☼ yyⅺ
> 12 ☼ zzⅰ
>
> The first three lines above are correct and correspond to real duplicate
> lines in the file, but the counts on the last four are erroneous: each of
> them corresponds to a single line in the file.
>
> Yours faithfully,
>
> Pierre Michaud

What system are you on?
What version of uniq?
What is the input exactly?

I suspect your locale is equating roman numerals (though that is surprising),
but I can't reproduce with the following on coreutils-8.10-2.fc15.x86_64 at least.

  locale -a | while read locale; do
    LC_ALL=$locale uniq -dc t.in
  done | grep -v " *2"

cheers,
Pádraig.
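
One way to narrow this down might be to compare the C locale against the current one on a small input. The sketch below is only an illustration: the sample lines are made up from the output quoted above (the real myfile.txt was not posted), and it assumes `mktemp` is available. In the C locale lines are compared byte by byte, so only the genuinely repeated line should be reported; a different result under the current locale would point at the locale's collation rules.

```shell
#!/bin/sh
# Hypothetical sample input: 'turvy' appears twice, each roman-numeral
# line appears exactly once.
tmp=$(mktemp)
printf '%s\n' 'turvy' 'turvy' 'yyⅰ' 'yyⅹⅲ' 'yyⅺ' > "$tmp"

# C locale: byte-wise comparison, so only 'turvy' should be listed.
c_out=$(LC_ALL=C uniq -dc "$tmp")

# Current locale: may differ if its collation treats some lines as equal.
cur_out=$(uniq -dc "$tmp")

printf 'C locale:\n%s\n' "$c_out"
printf 'Current locale:\n%s\n' "$cur_out"

if [ "$c_out" = "$cur_out" ]; then
  echo "same result in both locales"
else
  echo "results differ: the locale's collation is the likely cause"
fi

rm -f "$tmp"
```

If the two results differ, the output of `locale` and `uniq --version` would be useful to include in a follow-up.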