[PATCH v4 07/28] block: Introduce zone write plugging
Damien Le Moal
dlemoal at kernel.org
Tue Apr 2 16:38:15 PDT 2024
On 4/3/24 01:12, Christoph Hellwig wrote:
>> +static inline struct blk_zone_wplug *
>> +disk_lookup_zone_wplug(struct gendisk *disk, sector_t sector)
>> +{
>> + unsigned int zno = disk_zone_no(disk, sector);
>> + unsigned int idx = hash_32(zno, disk->zone_wplugs_hash_bits);
>> + struct blk_zone_wplug *zwplug;
>> +
>> + rcu_read_lock();
>> + hlist_for_each_entry_rcu(zwplug, &disk->zone_wplugs_hash[idx], node) {
>> + if (zwplug->zone_no == zno)
>> + goto unlock;
>> + }
>> + zwplug = NULL;
>> +
>> +unlock:
>> + rcu_read_unlock();
>> + return zwplug;
>> +}
>
> Did we lose an atomic_inc_unless_zero here? This now just does a lookup
> under RCU, but nothing to prevent the zwplug from beeing freed?
Nope. When disk_lookup_zone_wplug() is called directly, it is always for
handling requests/bios which are holding a reference on the plug and because
there are requests/BIOs in-flight, the plug is marked as busy
(BLK_ZONE_WPLUG_PLUGGED or BLK_ZONE_WPLUG_ERROR are set). In such state, the
plug is always hashed given that disk_should_remove_zone_wplug() retturns false
for busy plugs. So there is no reference increase here. The
atomic_inc_not_zero() is in disk_get_zone_wplug() which calls
disk_lookup_zone_wplug() + atomic_inc_not_zero() within an
rcu_read_lock()/rcu_read_unlock() section.
>
>> + /* Resize the zone write plug memory pool if needed. */
>> + if (disk->zone_wplugs_pool->min_nr != pool_size)
>> + return mempool_resize(disk->zone_wplugs_pool, pool_size);
>
> Note that a mempool_resize to the current size work just fine. It takes
> a pointless lock, but given that this is something that doesn't happen
> frequently that probably doesn't matter.
>
>> +#include <linux/mempool.h>
>
>> + mempool_t *zone_wplugs_pool;
>
> Please use struct mempool_s here so that you only need a forward
> declaration instead of pulling in another header.
--
Damien Le Moal
Western Digital Research
More information about the Linux-nvme
mailing list