12 KiB
Known Issues and Bugs
mergerfs
FreeBSD version
- FreeBSD doesn't have per thread credentials meaning threads must block to change credentials as required by numerous filesystem functions. This impacts performance.
- FreeBSD's FUSE implementation is lacking many features of Linux.
- IO passthrough
- statx
- lazy umount
- oom_score_adj
- fuse_msg_size
- kernel symlink caching
- kernel readdir caching
- writeback caching
- ...
Supplemental user groups
Supplemental group caching
Due to the overhead of getgroups/setgroups mergerfs utilizes a cache. As necessary the supplemental group information will be queried and cached. That cached list of groups will be used to set the supplement groups as necessary. Due to the high cost of querying the group list the default expiry for said data is 1 hour and after 12 hours of no usage will be removed from the cache all together.
Host vs Container identity
While not a bug some users have found when using containers that supplemental groups defined inside the container don't work as expected. Since mergerfs lives outside the container it is querying the host's group database. Containers have their own user and group definitions unless setup otherwise just as different systems would.
Users should mount in the host group file into the containers or use a standard shared user and groups technology like NIS or LDAP.
directory mtime is not being updated
Remember that the default policy for getattr
is ff
. The
information for the first directory found will be returned. If it
wasn't the directory which had been updated then it will appear
outdated.
The reason this is the default is because any other policy would be more expensive and for many applications it is unnecessary. To always return the directory with the most recent mtime or a faked value based on all found would require a scan of all filesystems.
If you always want the directory information from the one with the
most recent mtime then use the newest
policy for getattr
.
'mv /mnt/pool/foo /mnt/disk1/foo' removes 'foo'
This is not a bug.
Run in verbose mode to better understand what's happening:
$ mv -v /mnt/pool/foo /mnt/disk1/foo
copied '/mnt/pool/foo' -> '/mnt/disk1/foo'
removed '/mnt/pool/foo'
$ ls /mnt/pool/foo
ls: cannot access '/mnt/pool/foo': No such file or directory
mv
, when working across devices, is copying the source to target and
then removing the source. Since the source is the target in this
case, depending on the unlink policy, it will remove the just copied
file and other files across the branches.
If you want to move files to one filesystem just copy them there and use mergerfs.dedup to clean up the old paths or manually remove them from the branches directly.
cached memory appears greater than it should be
Use cache.files=off
and/or dropcacheonclose=true
. See the section
on page caching.
3rd Party Software
Software using mmap
- software using sqlite3 w/ mmap
- Plex (for config)
- Jellyfin (for config)
- Emby (for config)
- Radarr
- Sonarr
- Lidarr
- other
- rtorrent
- qbittorrent (with 'Disk IO type = Memory mapped files')
mmap is a filesystem and
operating system feature allowing files to be interacted with as if it
was normal memory. Support for mmap
is required by a number of
projects. FUSE (and therefore mergerfs), in order to support mmap
,
requires page caching to be
enabled. IE cache.files
must not be set to off
(if using Linux before
v6.6 or mergerfs before v2.41.0.)
In newer versions of mergerfs and Linux off
will work as it can now
auto enable page caching (and therefore mmap
) as needed.
That said it is recommended that config and runtime files be stored on SSDs on a regular filesystem for performance reasons. See What should mergerfs NOT be used for?.
Technically sqlite3 does not require mmap
, and in fact the
default is not to use it, but many programs enable the feature and
provide no error handling when not supported. Ideally the software
would catch the error and use traditional IO instead. Not only is it
more compatible but could also be more performant in certain
situations.
backup software
Some software, like borgbackup and others, leverage inodes as one of a number of metrics to notice changes in files. Depending on how inode generation is configured it is possible that the same file may have different inodes one mount to another and as such interfere with the expected stability of the value. In such cases you may need to disable the use of the inode in the software if possible or change the inodecalc setting if possible.
Generally speaking, in modern setups where there are many non-fully complient POSIX filesystem in use, reliance on inode to have a specific meaning is not always reasonable. FUSE, for instance, has no requirement that inode values need to be unique or that two files that happen to be the same underlying file have the same inode.
NFS clients returning ESTALE / Stale file handle
NFS generally does not like out of band changes. Take a look at the section on NFS in the remote-filesystems for more details.
my 32bit software has problems
Some software have problems with 64bit inode values. The symptoms can
include EOVERFLOW errors when trying to list files. You can address
this by setting inodecalc
to one of the 32bit based algos as
described in the relevant section.
Moving files and directories fails with Samba
Workaround: Copy the file/directory and then remove the original rather than move.
This isn't an issue with Samba but some SMB clients. GVFS-fuse v1.20.3
and prior (found in Ubuntu 14.04 among others) failed to handle
certain error codes correctly. Particularly STATUS_NOT_SAME_DEVICE
which comes from the EXDEV
that is returned by rename
when the
call is crossing mount points. When a program gets an EXDEV
it needs
to explicitly take an alternate action to accomplish its goal. In the
case of mv
or similar it tries rename
and on EXDEV
falls back to
a copying the file to the destination and deleting the source. In
these older versions of GVFS-fuse if it received EXDEV
it would
translate that into EIO
. This would cause mv
or most any
application attempting to move files around on that SMB share to fail
with a generic IO error.
GVFS-fuse v1.22.0
and above fixed this issue but a large number of systems use the older
release. On Ubuntu, the version can be checked by issuing apt-cache showpkg gvfs-fuse
. Most distros released in 2015 seem to have the
updated release and will work fine but older systems may
not. Upgrading gvfs-fuse or the distro in general will address the
problem.
In Apple's MacOSX 10.9 they replaced Samba (client and server) with
their own product. It appears their new client does not handle
EXDEV
either and responds similarly to older releases of gvfs on
Linux.
Trashing files occasionally fails
This is the same issue as with Samba. rename
returns EXDEV
(in our
case that will really only happen with path preserving policies like
epmfs
) and the software doesn't handle the situation well. This is
unfortunately a common failure of software which moves files
around. The standard indicates that an implementation MAY choose
to support non-user home directory trashing of files (which is a
MUST). The implementation MAY also support "top directory
trashes" which many probably do.
To create a $topdir/.Trash
directory as defined in the standard use
the mergerfs-tools tool
mergerfs.mktrash
.
FUSE and Linux kernel
There have been a number of kernel issues / bugs over the years which mergerfs users have run into. Here is a list of them for reference and posterity.
NFS and EIO errors
https://lore.kernel.org/linux-fsdevel/20240228160213.1988854-1-mszeredi@redhat.com/T/
Over the years some users have reported that while exporting mergerfs via NFS, after significant filesystem activity, not only will the NFS client start returning ESTALE and EIO errors but mergerfs itself would start returning EIO errors. The problem was that no one could reliability reproduce the issue. After a string of reports in late 2023 and early 2024 more investigation was done.
In Linux 5.14 new validation was put into FUSE which caught a few invalid situations and would tag a FUSE node as invalid if a check failed. Such checks include invalid file type, changing of type from one request to another, a size greater than 63bit, and the generation of a inode changing while in use.
What happened was that mergerfs was using a different fixed, non-zero value for the generation of all nodes as it was suggested that unique inode + generation pairs are needed for proper integration with NFS. That non-zero value was being sent back to the kernel when a lookup request was made for root. The reason this was hard to track down was because NFS almost uniquely uses an API which can lead to a lookup of the root node that simply won't happen under normal workloads and usage. And that lookup will only happen if child nodes of the root were forgotten but NFS still had a handle to that node and later asked for details about it. It would trigger a set of requests to lookup info on those nodes.
This wasn't a bug in FUSE but mergerfs. However, the incorrect behavior of mergerfs lead to FUSE behave in an unexpected and incorrect manner. It would issue a lookup of the "parent of a child of the root" and mergerfs would send the invalid generation value. As a result the kernel would mark the root node as "bad" which would then trigger the kernel to issue a "forget root" message. In between those it would issue a request for the parent of the root... which doesn't exist.
So the kernel was doing two invalid things. Requesting the parent of the root and then when that failed issuing a forget for the root. These led to chasing after the wrong possible causes.
The change was for FUSE to revert the marking of root node bad if the generation is non-zero and warn about it. It will mark the node bad but not unhash/forget/remove it.
mergerfs in v2.40.1 ensures that generation for root is always 0 on lookup which should work across any kernel version.
Truncated files
This was a bug with mmap
and FUSE
on 32bit platforms. Should be
fixed in all LTS releases.
Crashing on OpenVZ
There was a bug in the OpenVZ kernel with regard to how it handles
ioctl
calls. It was making invalid requests which would lead to
crashes due to mergerfs not expecting them.
Really bad mmap performance
There was a bug in caching which affects overall performance of mmap
through FUSE
in Linux 4.x kernels. It is fixed in 4.4.10 and 4.5.4.