https://bugs.kde.org/show_bug.cgi?id=356357
k...@web.de changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |k...@web.de --- Comment #5 from k...@web.de --- > Vishesh, Baloo is a worthy attempt at an indexing system, and I commend your > work. It uses a quality database backend in the form of LMDB. But any which > way you and I might spin it, Baloo has a serious problem with I/O: it simply > causes too much of it, too frequently. Numerous users have complained about > this, and several currently open and closed bugs are traceable directly to > this behaviour. Several users' impressions of Baloo, and KDE writ large, are > tainted by Baloo's abusive disk activity. https://linux.die.net/man/1/ionice [...] Idle A program running with idle io priority will only get disk time when no other program has asked for disk io for a defined grace period. The impact of idle io processes on normal system activity should be zero. This scheduling class does not take a priority argument. Presently, this scheduling class is permitted for an ordinary user (since kernel 2.6.25). Best effort [...] Baloo's priority is set to Idle, which means it should not cause "abusive" disk activity. Unless your IO scheduler does not support ionice. Could this be your problem? https://blogs.kde.org/2014/10/15/ubuntus-linux-scheduler-or-why-baloo-might-be-slowing-your-system-1404 > Here's a relatively simple proposal: The indexer operates on a configurable > *duty cycle* D of 1%-50% and a time period T of 1s-3600s. For (1-D)*T seconds > per period, Baloo sleeps. For D*T seconds per period, Baloo *exclusively* > performs data/metadata reads from the filesystem, keeping an eye on > wall-clock time. Once D*T seconds of work have elapsed, make a *single > transaction* containing all of the stuff that the indexer read in the > previous duty cycle. Then go back to sleep again. In this way, exactly one > mdb_txn_commit() and fdatasync()/msync() occurs per time period, they are > likely to have accumulated far more than 40 files worth of information, and > 50-99% of I/O bandwidth is available for other uses, such as satisfying the > desktop UI's needs. You are suggesting rate limiting the IO. Rate limiting is inferior to scheduling (which is already being done) because: 1. The rate limit is wasting (1-D) of the available bandwidth in idle situations. (When baloo is the only application using IO.) 2. If Bmin < 1 is needed to satisfy the user requirements, (1-D) might still be smaller than Bmin. Scheduling with Idle priority will leave 1 instead of of (1-D) to the user, which is enough in any case. Also, now we don't need to find Bmin anymore. Anyway, minimizing the caused IO is still useful. -- You are receiving this mail because: You are watching all bug changes.