From the ext4(5) man page:
dir_nlink
Normally, ext4 allows an inode to have no more than 65,000 hard links. This applies to regular files as well as directories, which means that there can be no more than 64,998 sub‐directories in a directory (because each of the '.' and '..' entries, as well as the directory entry for the directory in its parent directory counts as a hard link). This feature lifts this limit by causing ext4 to use a link count of 1 to indicate that the number of hard links to a directory is not known when the link count might exceed the maximum count limit.
large_dir
This feature increases the limit on the number of files per directory by raising the maximum size of directories and, for hashed b-tree directories (see dir_index), the maximum height of the hashed b-tree used to store the directory entries.
Okay, let's take a look at dir_index too.
dir_index
Use hashed b-trees to speed up name lookups in large directories. This feature is supported by ext3 and ext4 file systems, and is ignored by ext2 file systems.
According to my understanding to the man page:
- When there is no item in a directory, there are 2 hard links, namely .(the directory pointing to itself) and..(this points to the parent directory, but the hard link refers to the reference from the parent directory to the directory), and the value of thest_nlinkfield in thestatstructure is2. When there is 1 item (say, 1 file) in a directory, there are 3 hard links, andst_nlinksays3;
- Therefore, to represent the actual number of hard links, st_nlinkmust be at least2. As a result, the value1is freed to represent something else, instead of having only 1 hard link, which does not make sense;
- If the ext4 file system is formatted without dir_nlink, thenst_nlinkcannot be greater than65000, and the system refuse to add more items when the limit is reached (I read about that modern kernels may automatically turn ondir_nlink, but let's ignore this for simplicity of the discussion);
- If the ext4 file system is formatted with dir_nlink, then when there are more than65000items in a directory, the value1will be written to thest_nlinkfield to indicate "unknown number of hard links". Clients (the code using the ext4 file system) must traverse the list of files (the data block of the directory) to count the actual number of items inside it;
- large_dirincreases the maximum size of directories;
- If dir_indexis used,large_dirincreases the maximum height of the hashed B-Tree.
My Questions
- If dir_nlinkalone is used, what is the maximum size of a directory? I used to think that it is infinite as long as there is enough data blocks to store the list of files in the directory, but now it seems that it is not.
- I think using large_diralone does not make sense, since the directory size is still capped by65000. Am I correct?
- If large_diris used, what is the maximum size of a directory?
- Is there any disadvantage using large_dir? The reason why I have this question is that in Ubuntu 20.04 LTS,dir_nlinkis set by default (see/etc/mke2fs.conf, it is in the list at[fs_types]>ext4>features), butlarge_diris not.