How to cleanup tmp folder safely on Linux
Solution 1
You might want to try something like that:
find /tmp -mtime +7 -and -not -exec fuser -s {} ';' -and -exec echo {} ';'
find is used to find files that match certain criteria.
-
-mtime +7
only selects files that are older than 7 days (you may use any other value) -
-exec fuser -s {} ';'
calls fuser in silent mode for every file that matches the oldness criteria. fuser returns 0 (=true) for every file that's been accessed right now and 1 (= false) for the unaccessed ones. As we are only interested in the unaccessed ones, we put a-not
in front of this-exec
-
-exec echo {} ';'
just prints all file names matching the criteria. you might want use-exec rm {} ';'
instead here, but as this may delete some still-in-use files, I think it's safer to do a simple echo first. -
edit: You might want to add something like
-name 'foo*.bar'
or-uid 123
to limit the effects of the cleanup to specific file patterns or user IDs to avoid accidental effects.
To the last point: Consider that there might be files that are only written once (e.g. at system boot) but read frequently (e.g. any X-session-cookie). Therefore I recommend adding some name checks to only affect files created by your faulty programs.
edit2: To your last question: A file won't get deleted from disk until no process has an open handle to it (at least for native linux filesystems). The problem is that the directory entry is removed immediately which means that from the time you remove the file no new processes can open the file anymore (as there's no filename attached to it).
For details see: https://stackoverflow.com/questions/3181641/how-can-i-delete-a-file-upon-its-close-in-c-on-linux
edit3: But what if I wanted to automate the whole process?
As I said, there might be files that are written once and then read every once in a while (e.g. X session cookies, PID files, etc.). Those won't be excluded by this little removal script (which is the reason why you might wanna do a test run with echo
first before actually deleting files).
One way to implement a safe solution is to use atime
.
atime
stores the time each file was last accessed. But that file system option often is disabled because it has quite some performance impact (according to this blog somewhere in the 20-30% region). There's relatime
, but that one only writes the access time if mtime
has changed, so this one won't help us.
If you want to use atime
, I'd recommend to have /tmp
on a separate partition (ideally a ramdisk) so that the performance impact on the whole system isn't too big.
Once atime
is enabled, all you have to do is to replace the -mtime
parameter in the above command line with -atime
.
You might be able to remove the -not -exec fuser -s {} ';'
, but I'd keep it there just to be sure (in case applications keep files open for a long period of time).
But keep in mind to test the command using echo
before you end up removing stuff your system still needs!
Solution 2
Don't roll your own.
Debian/Ubuntu have tmpreaper, it's probably available in other dists as well.
# tmpreaper - cleans up files in directories based on their age
sudo apt-get install tmpreaper
cat /etc/tmpreaper.conf
Solution 3
Regarding the last part of your question:
While I don’t think that an ‘delete-this-if-I-die’ open/creation mode exists, a process can safely delete a file directly after creating it, as long as it keeps a handle to said file open. The kernel will then keep the file on disk and as soon as the last process who had opened the file exits (be it by crash or normally), the space occupied by the file will be freed.
For a general way around the problem that some processes sometimes don’t clean up /tmp, I would suggest having a look at mount namespaces, described, for example here or here. If the process in question is a system daemon, systemd and its native feature to allow private /tmp filesystems might be of interest.
Related videos on Youtube
Syncopated
Updated on September 18, 2022Comments
-
Syncopated over 1 year
I use RAM for my tmpfs /tmp, 2GB, to be exact. Normally, this is enough but sometimes, processes create files in there and fail to cleanup after themselves. This can happen if they crash. I need to delete these orphaned tmp files or else future process will run out of space on /tmp.
How can I safely garbage collect /tmp? Some people do it by checking last modification timestamp, but this approach is unsafe because there can be long-running processes that still need those files. A safer approach is to combine the last modification timestamp condition with the condition that no process has a file handle for the file. Is there a program/script/etc that embodies this approach or some other approach that is also safe?
Incidentally, does Linux/Unix allow a mode of file opening with creation wherein the created file is deleted when the creating process terminates, even if it's from a crash?
-
ott-- over 11 yearsCheck if you could use tmpfs instead of /tmp: kernel.org/doc/Documentation/filesystems/tmpfs.txt
-
Ciro Santilli Путлер Капут 六四事 about 5 years
-
-
nik over 11 yearsnice. What about files closed by a long-running process while it is not updating them? If they are context files, you could loose process context (admittedly, not a very smart process this; but one needs to know expected side-effects of a 'lateral'
/tmp/
cleanup) . -
mreithub over 11 yearsThat is the problem of this approach (as I was pointing out in the last paragraph). The best approach here would afaik be to add uid/gid or file pattern checks (edited the answer accordingly)
-
CMCDragonkai almost 10 yearsShould this be placed into a cron script...?
-
mreithub almost 10 years@CMCDragonkai Of course you can put this into crontab. But as I mentioned there might be files that are accessed but not written and therefore might not be filtered out by this little script. That's why it's safer to print the list of affected files first and then decide yourself whether to delete them or not. If your
/tmp
is on a separate partition (e.g. a ramdisk), you could enableatime
for it and use the-atime
parameter offind
. -
CMCDragonkai almost 10 yearsI'm planning to do this on a server. Therefore I can't be there to count all the files in tmp all the time. Would there be any issues? Also I thought we were meant to use relatime not atime?
-
mreithub almost 10 years@CMCDragonkai I can't say I'm an expert on
atime
. It was just meant as a hint to another possible solution for your problem. -
CMCDragonkai almost 10 yearsI guess atime isn't too much of an issue if you're using tmpfs or ramfs. I'm using tmpfs of course. Although I wish there would be some benchmarks showing the trade offs.
-
CMCDragonkai almost 10 yearsWhen you shutdown with files inside tmpfs/ramfs, does this mean the file exists on disk?
-
mreithub almost 10 years@CMCDragonkai No, those filesystems are RAM-only. As soon as it's unmounted, all the data is lost
-
CMCDragonkai almost 10 yearsRight and it gets unmounted during a shutdown!
-
Thalys over 9 yearsSU randomly arranges posts - so there's no above or below. Which post are you referring to?
-
Xiaodong Qi about 7 yearsIn the
/etc/tmpreaper.conf
file, if I set both/tmp
and/var/tmp
as the cleanup directories, can long do you recommend for theTMPREAPER_TIME
parameter or the max ago of the tmp files to be removed? I have heard it's better to keep a longer age for/var/tmp
files than/tmp
files. But if they can only be set up with the same max age, I have no clue. -
bertieb almost 6 yearsSuggesting to do something "and hope nothing breaks" doesn't really answer OP's "is there are a safe way to do this. Perhaps you could elaborate as to why your suggestion is safe?
-
Solomon Ucko almost 6 years@bertieb Good point. I guess it's probably safe if it's not run as root, but...