https://bugs.kde.org/show_bug.cgi?id=456108
Bug ID: 456108 Summary: baloo reindexes files with date of 0 or prior to UNIX epoch on every restart Product: frameworks-baloo Version: 5.94.0 Platform: Fedora RPMs OS: Linux Status: REPORTED Severity: normal Priority: NOR Component: general Assignee: baloo-bugs-n...@kde.org Reporter: skierp...@gmail.com Target Milestone: --- SUMMARY Baloo kept reindexing 12 of my files every time I enable/disable it or run `balooctl check`. All 12 files had weird modification times according to `ls -l` and `stat`, either: > Modify: 1969-12-31 16:00:00.000000000 -0800 > Modify: 1903-08-23 02:36:30.000000000 -0800 `balooshow` reports these times as either: > Mtime: 0 1969-12-31T16:00:00 > Mtime: 2200842286 2039-09-28T10:04:46 STEPS TO REPRODUCE 0. Enable baloo from System Settings > File Search 1. Run `balooctl monitor &` 2. Make a file with mtime of 0 in the Unix epoch e.g. `touch -m --date=1970-01-01T00:00:00Z $HOME/file_from_epoch.txt`; make a file with a mtime prior to that, e.g. `touch -m --date=19420102 $HOME/file_from_1942.txt` 3. Run `balooctl check` 4. Run `balooctl check` 5. Run `balooshow $HOME/file_from_epoch.txt $HOME/file_from_1942.txt` OBSERVED RESULT `balooctl monitor` reports > Started search for unindexed files > Checking for unindexed files > Indexing file content > Indexing: /home/spage//file_from_epoch.txt: Ok > Indexing: /home/spage//file_from_1942.txt: Ok `balooshow` reports > c8d800000028 40 51416 /home/spage/file_from_epoch.txt > Mtime: 0 1969-12-31T16:00:00 > Ctime: 1656472422 2022-06-28T20:13:42 > c8ce00000028 40 51406 /home/spage/file_from_1942.txt > Mtime: 3411469696 2078-02-07T06:28:16 > Ctime: 1656466455 2022-06-28T18:34:15 EXPECTED RESULT These files have arguably weird modification times, but they aren't changing, so baloorunner/baloo_file should not reindex them. Maybe baloo should handle file modification dates earlier than the start of the UNIX epoch, since my NTFS and BTRFS filesystems seem to allow them. SOFTWARE/OS VERSIONS Linux/KDE Plasma: KDE Plasma Version: 5.24.5 KDE Frameworks Version: 5.94.0 Qt Version: 5.13.3 on Wayland ADDITIONAL INFORMATION This sounds like bug 438074 but according to its reporter "mtime and ctime match in both files". My first file is 0 seconds in the Unix epoch on January 1 1970 (shown in my timezone) and `balooshow` agrees with the file system. But for modification times before that, `balooshow` thinks the date is in the future. Maybe baloorunner thinks all these odd modification times are in the future so hands the file off for reindexing; and/or something in baloo_file_extractor or kfilemetadata thinks its representation of the modification time is different than the modification time it gets from the file system, so it keeps reindexing. Perhaps if the file information from baloo_file_extractor is the same as the entry already in baloo's DB, baloo should not perform any update. Workaround: I used, for example, `touch -m --date=20210709 path/to/file` to set normal modification dates to these files within the UNIX epoch, and after one final reindex baloo no longer reindexes them. -- You are receiving this mail because: You are watching all bug changes.