bcachefs.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2020-10-26	xfs: repair extended attributes	Darrick J. Wong
	If the extended attributes look bad, try to sift through the rubble to find whatever keys/values we can, stage a new attribute structure in a temporary file and use the atomic extent swapping mechanism to commit the results in bulk. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: create a blob array data structure	Darrick J. Wong
	Create a simple 'blob array' data structure for storage of arbitrarily sized metadata objects that will be used to reconstruct metadata. For the intended usage (temporarily storing extended attribute names and values) we only have to support storing objects and retrieving them. Use the xfile abstraction to store the attribute information in memory that can be swapped out. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: online repair of realtime summariesrepair-rtsummary_2020-10-26	Darrick J. Wong
	Repair the realtime summary data by constructing a new rtsummary file in the scrub temporary file, then atomically swapping the contents. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: create temporary files and directories for online repair	Darrick J. Wong
	Teach the online repair code how to create temporary files or directories. These temporary files can be used to stage reconstructed information until we're ready to perform an atomic extent swap to commit the new metadata. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: create a new inode fork block unmap helper	Darrick J. Wong
	Create a new helper to unmap blocks from an inode's fork. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: enable atomic swapext featureatomic-file-updates_2020-10-26	Darrick J. Wong
	Add the atomic swapext feature to the set of features that we will permit. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: condense directories after an atomic swap	Darrick J. Wong
	The previous commit added a new swapext flag that enables us to perform post-swap processing on file2 once we're done swapping the extent maps. Now add this ability for directories. This isn't used anywhere right now, but we need to have the basic ondisk flags in place so that a future online directory repair feature can create salvaged dirents in a temporary directory and swap the data forks when ready. If one file is in extents format and the other is inline, we will have to promote both to extents format to perform the swap. After the swap, we can try to condense the fixed directory down to inline format if possible. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: condense extended attributes after an atomic swap	Darrick J. Wong
	Add a new swapext flag that enables us to perform post-swap processing on file2 once we're done swapping the extent maps. If we were swapping the extended attributes, we want to be able to convert file2's attr fork from block to inline format. This isn't used anywhere right now, but we need to have the basic ondisk flags in place so that a future online xattr repair feature can create salvaged attrs in a temporary file and swap the attr forks when ready. If one file is in extents format and the other is inline, we will have to promote both to extents format to perform the swap. After the swap, we can try to condense the fixed file's attr fork back down to inline format if possible. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: remove old swap extents implementation	Darrick J. Wong
	Migrate the old XFS_IOC_SWAPEXT implementation to use our shiny new one. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: allow xfs_swap_range to use older extent swap algorithms	Darrick J. Wong
	If userspace permits non-atomic swap operations, use the older code paths to implement the same functionality. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: refactor reflink flag handling in xfs_swap_extent_forks	Darrick J. Wong
	Refactor the old data fork swap function to use the new reflink flag helpers to propagate reflink flags between the two files. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: consolidate all of the xfs_swap_extent_forks code	Darrick J. Wong
	Consolidate the bmbt owner change scan code in xfs_swap_extent_forks, since it's not needed for the deferred bmap log item swapext implementation. The goal is to package up all three implementations into functions that have the same preconditions and leave the system in the same state. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: port xfs_swap_extents_rmap to our new code	Darrick J. Wong
	The inner loop of xfs_swap_extents_rmap does the same work as xfs_swapext_finish_one, so adapt it to use that. Doing so has the side benefit that the older code path no longer wastes its time remapping shared extents. This forms the basis of the non-atomic swaprange implementation. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: add error injection to test swapext recovery	Darrick J. Wong
	Add an errortag so that we can test recovery of swapext log items. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: add a ->swap_file_range handler	Darrick J. Wong
	Add a function to handle range swap requests from the vfs. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: create deferred log items for extent swapping	Darrick J. Wong
	Now that we've created the skeleton of a log intent item to track and restart extent swap operations, add the upper level logic to commit intent items and turn them into concrete work recorded in the log. We use the deferred item "multihop" feature that was introduced a few patches ago to constrain the number of active swap operations to one per thread. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: introduce a swap-extent log intent item	Darrick J. Wong
	Introduce a new intent log item to handle swapping extents. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: create a log incompat flag for atomic extent swapping	Darrick J. Wong
	Create a log incompat flag so that we only attempt to process swap extent log items if the filesystem supports it. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: support two inodes in the defer capture structure	Darrick J. Wong
	Make it so that xfs_defer_ops_capture_and_commit can capture two inodes. This will be needed by the atomic extent swap log item so that it can recover an operation involving two inodes. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: xfs_bmap_finish_one should map unwritten extents properly	Darrick J. Wong
	The deferred bmap work state and the log item can transmit unwritten state, so the XFS_BMAP_MAP handler must map in extents with that unwritten state. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: support deferred bmap updates on the attr fork	Darrick J. Wong
	The deferred bmap update log item has always supported the attr fork, so plumb this in so that higher layers can access this. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	vfs: introduce new file extent swap ioctl	Darrick J. Wong
	Introduce a new ioctl to handle swapping extents between two files. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: repair summary countersrepair-hard-problems_2020-10-26	Darrick J. Wong
	Use the same summary counter calculation infrastructure to generate new values for the in-core summary counters. The difference between the scrubber and the repairer is that the repairer will freeze the fs during setup, which means that the values should match exactly. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: repair the rmapbt	Darrick J. Wong
	Rebuild the reverse mapping btree from all primary metadata. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: ask to freeze if fscounters scrubber fails	Darrick J. Wong
	If the fscounters scrubber notices incorrect summary counters, it's entirely possible that scrub is simply racing with other threads that are updating the incore counters. Therefore, if there's a mismatch and the fs isn't frozen, ask userspace if we can freeze the fs to eliminate the race condition. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: introduce online scrub freeze	Darrick J. Wong
	Introduce a new 'online scrub freeze' that we can use to lock out all filesystem modifications and background activity so that we can perform global scans in order to rebuild metadata. This introduces a new IFLAG to the scrub ioctl to indicate that userspace is willing to allow a freeze. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: update health status if we get a clean bill of healthindirect-health-reporting_2020-10-26	Darrick J. Wong
	If scrub finds that everything is ok with the filesystem, we need a way to tell the health tracking that it can let go of indirect health flags, since indirect flags only mean that at some point in the past we lost some context. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: remember sick inodes that get inactivated	Darrick J. Wong
	If an unhealthy inode gets inactivated, remember this fact in the per-fs health summary. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: add secondary and indirect classes to the health tracking system	Darrick J. Wong
	Establish two more classes of health tracking bits: * Indirect problems, which suggest problems in other health domains that we weren't able to preserve. * Secondary problems, which track state that's related to primary evidence of health problems; and The first class we'll use in an upcoming patch to record in the AG health status the fact that we ran out of memory and had to inactivate an inode with defective metadata. The second class we use to indicate that repair knows that an inode is bad and we need to fix it later. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: create a polled function to force inode inactivationdeferred-inactivation_2020-10-26	Darrick J. Wong
	Create a polled version of xfs_inactive_force so that we can force inactivation while holding a lock (usually the umount lock) without tripping over the softlockup timer. This is for callers that hold vfs locks while calling inactivation, which is currently unmount, iunlink processing during mount, and rw->ro remount. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: parallelize inode inactivation	Darrick J. Wong
	Split the inode inactivation work into per-AG work items so that we can take advantage of parallelization. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: force inactivation before fallocate when space is low	Darrick J. Wong
	If we think that inactivation will free enough blocks to make it easier to satisfy an fallocate request, force inactivation. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: force inode inactivation and retry fs writes when there isn't space	Darrick J. Wong
	Any time we try to modify a file's contents and it fails due to ENOSPC or EDQUOT, force inactivation work to free up some resources and try one more time. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: deferred inode inactivation	Darrick J. Wong
	Instead of calling xfs_inactive directly from xfs_fs_destroy_inode, defer the inactivation phase to a separate workqueue. With this we avoid blocking memory reclaim on filesystem metadata updates that are necessary to free an in-core inode, such as post-eof block freeing, COW staging extent freeing, and truncating and freeing unlinked inodes. Now that work is deferred to a workqueue where we can do the freeing in batches. We introduce two new inode flags -- NEEDS_INACTIVE and INACTIVATING. The first flag helps our worker find inodes needing inactivation, and the second flag marks inodes that are in the process of being inactivated. A concurrent xfs_iget on the inode can still resurrect the inode by clearing NEEDS_INACTIVE (or bailing if INACTIVATING is set). Unfortunately, deferring the inactivation has one huge downside -- eventual consistency. Since all the freeing is deferred to a worker thread, one can rm a file but the space doesn't come back immediately. This can cause some odd side effects with quota accounting and statfs, so we also force inactivation scans in order to maintain the existing behaviors, at least outwardly. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: track unlinked inactive inode quota counters	Darrick J. Wong
	Set up quota counters to track the number of inodes and blocks that will be freed from inactivating unlinked inodes. We'll use this in the deferred inactivation patch to hide the effects of deferred processing. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: track unlinked inactive inode fs summary counters	Darrick J. Wong
	Set up counters to track the number of inodes and blocks that will be freed from inactivating unlinked inodes. We'll use this in the deferred inactivation patch to hide the effects of deferred processing. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: decide if inode needs inactivation	Darrick J. Wong
	Add a predicate function to decide if an inode needs (deferred) inactivation. Any file that has been unlinked or has speculative preallocations either for post-EOF writes or for CoW qualifies. This function will also be used by the upcoming deferred inactivation patch. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: parallelize block preallocation garbage collectioneofblocks-consolidation_2020-10-26	Darrick J. Wong
	Split the block preallocation garbage collection work into per-AG work items so that we can take advantage of parallelization. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: rename block gc start and stop functions	Darrick J. Wong
	Shorten the names of the two functions that start and stop block preallocation garbage collection and move them up to the other blockgc functions. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: only walk the incore inode tree once per blockgc scan	Darrick J. Wong
	Perform background block preallocation gc scans more efficiently by walking the incore inode tree once. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: consolidate the eofblocks and cowblocks workers	Darrick J. Wong
	Remove the separate cowblocks work items and knob so that we can control and run everything from a single blockgc work queue. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: consolidate incore inode radix tree posteof/cowblocks tags	Darrick J. Wong
	The clearing of posteof blocks and cowblocks serve the same purpose: removing speculative block preallocations from inactive files. We don't need to burn two radix tree tags on this, so combine them into one. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: refactor the predicate part of xfs_free_eofblocks	Darrick J. Wong
	Refactor the part of _free_eofblocks that decides if it's really going to truncate post-EOF blocks into a separate helper function. The upcoming deferred inode inactivation patch requires us to be able to decide this prior to actual inactivation. No functionality changes. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: flush speculative space allocations when we run out of spacereclaim-space-harder_2020-10-26	Darrick J. Wong
	If a fs modification (creation, file write, reflink, etc.) is unable to reserve enough space to handle the modification, try clearing whatever space the filesystem might have been hanging onto in the hopes of speeding up the filesystem. The flushing behavior will become particularly important when we add deferred inode inactivation because that will increase the amount of space that isn't actively tied to user data. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: flush speculative space allocations when we run out of quota	Darrick J. Wong
	If a fs modification (creation, file write, reflink, etc.) is unable to reserve enough quota to handle the modification, try clearing whatever space the filesystem might have been hanging onto in the hopes of speeding up the filesystem. The flushing behavior will become particularly important when we add deferred inode inactivation because that will increase the amount of space that isn't actively tied to user data. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: xfs_inode_free_quota_blocks should scan project quota	Darrick J. Wong
	Buffered writers who have run out of quota reservation call xfs_inode_free_quota_blocks to try to free any space reservations that might reduce the quota usage. Unfortunately, the buffered write path treats "out of project quota" the same as "out of overall space" so this function has never supported scanning for space that might ease an "out of project quota" condition. We're about to start using this function for cases where we actually /can/ tell if we're out of project quota, so add in this functionality. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: don't stall cowblocks scan if we can't take locks	Darrick J. Wong
	Don't stall the cowblocks scan on a locked inode if we possibly can. We'd much rather the background scanner keep moving. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: refactor messy xfs_inode_free_quota_* functions	Darrick J. Wong
	The functions to run an eof/cowblocks scan to try to reduce quota usage are kind of a mess -- the logic repeatedly initializes an eofb structure and there are logic bugs in the code that result in the cowblocks scan never actually happening. Replace all three functions with a single function that fills out an eofb if we're low on quota and runs both eof and cowblocks scans. Fixes: 83104d449e8c4 ("xfs: garbage collect old cowextsz reservations") Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: hide most of the incore inode walk interface	Darrick J. Wong
	Hide the incore inode walk interface because callers outside of the icache code don't need to know about iter_flags and radix tags and other implementation details of the incore inode cache. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
2020-10-26	xfs: report XFS_CORRUPT_ON errors to the health systemcorruption-health-reports_2020-10-26	Darrick J. Wong
	Whenever we encounter XFS_CORRUPT_ON failures, we should report that to the health monitoring system for later reporting. I started with this and massaged everything until it built: @@ expression mp, test; @@ - if (XFS_CORRUPT_ON(mp, test)) return -EFSCORRUPTED; + if (XFS_CORRUPT_ON(mp, test)) { xfs_btree_mark_sick(cur); return -EFSCORRUPTED; } @@ expression mp, test; identifier label, error; @@ - if (XFS_CORRUPT_ON(mp, test)) { error = -EFSCORRUPTED; goto label; } + if (XFS_CORRUPT_ON(mp, test)) { xfs_btree_mark_sick(cur); error = -EFSCORRUPTED; goto label; } Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>