Hi
Omer Efraim wrote:
> Ira Abramov wrote:
> >
> > On Thu, 2 Mar 2000, Boris Kreitchman wrote:
> > > http://www.microsoft.com/presspass/features/2000/02-28w2k.asp
> >
> > During the next 1-1/2 years, Bolosky, a researcher in Microsoft
> > Research's Systems and Networking Group, and three of his researchers
> > worked full time with the Windows 2000 team to build the technology,
> > now known as the Single Instance Store.
> >
> > man, when you have that much money, I guess you don't mind looking
> > rediculous once in a while :-)
>
> I don't like Microsoft any more than the next guy on this list,
> but if you must talk about these things, at least make sure you
> get it right (and I don't mean anyone in particular).
> When you act like this, you become no better than MS. This
> specific technology actually has almost nothing in common
> with symbolic links (except for the fact that several files can
> point to the same actual data). This technology actually seeks
> out duplicate files and saves space by storing them only
> once - it's completely transparent (unless you could your
> system crashing while it's looking for duplicate files not
> transparent - but with MS it usually is, you can never
> know why NT crashes :), and it's actually a darn good idea.
I will only consider it an innovation if it has much better features,
e.g. copy on write if it made a single copy without letting the user
know or something, doing it in real time and not every day/week/whatever
(and I have no idea how can that be done), safely keep the differences
of close files (not at all close to soft links, more like CVS or something)
without admin intervension, stuff like that.
If all you need is a program that finds duplicate files to run in crontab,
a quick search in debian's Packages file reveals (didn't I already say I
love debian?):
perforate - Utilities to save disk space
fdupes - Identifies duplicate files residing within given directories.
>
> I'm not too sure it's very useful though, these days when everyone
> has their one workstation, duplicate files on the servers are not
> very common any more (and even if they are, I'd imagine they wouldn't
> be very big - you'd only have one ISO image of Slack).
Once, when the disk of the home directories at work got full, I made
a list of big files, sorted by size to find dups, and find something
like 5 copies of the dancing baby animation (I guess many saw it), a
more than 10MB AVI. And it was a disk for ~70 users.
I know of a place that made a file server dedicated to all people's MP3s,
to prevent duplicates. Disks are cheap today, but not meaningless; you
have more GBs to backup, to restore (often when time is critical), ...
>
> All of this (except for the crashing part) is mentioned inside this
> article (although I personally heard about it from other more
> reliable publications).
>
> --
> "You will now die. Make whatever rituals are necessary for your species."
> - Ur-Quan, Kohr-Ah
>
> =================================================================
> To unsubscribe, send mail to [EMAIL PROTECTED] with
> the word "unsubscribe" in the message body, e.g., run the command
> echo unsubscribe | mail [EMAIL PROTECTED]
>
didi
=================================================================
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]