[PATCH 3/3] media: hantro: add per-context fdinfo usage stats
Detlev Casanova
detlev.casanova at collabora.com
Fri Jun 12 10:25:02 PDT 2026
Hi Nicolas,
On 6/12/26 11:18, Nicolas Dufresne wrote:
> Hi,
>
> Le vendredi 12 juin 2026 à 10:26 -0400, Detlev Casanova a écrit :
>> From: Christopher Healy <healych at amazon.com>
>>
>> Add per-file-descriptor hardware utilization tracking to the Hantro
>> VPU stateless codec driver, exposed via /proc/<pid>/fdinfo/<fd>.
>>
>> Record a ktime timestamp when each job is submitted to hardware in
>> device_run(), and accumulate the elapsed nanoseconds when the job
>> completes in hantro_job_finish(). Report the accumulated time along
>> with the current clock frequency through the new V4L2 show_fdinfo
>> callback.
>>
>> The output uses a media- key prefix with the following keys:
>> media-driver: driver name
>> media-engine-<eng>: accumulated busy time in nanoseconds
>> media-maxfreq-<eng>: maximum engine frequency in Hz
>> media-curfreq-<eng>: current engine frequency in Hz
>>
>> Where <eng> is "decoder" or "encoder" depending on the context.
>>
>> This enables userspace monitoring tools to compute per-process decoder
>> and encoder utilization. The current and max frequency keys report the
>> same value today since the driver lacks devfreq support, but will
>> diverge once DVFS is added, allowing userspace to approximate true
>> capacity utilization without any fdinfo code changes. A future series
>> can add hardware cycle counter support (via media-cycles-<eng>) for
>> exact utilization under DVFS, with no changes to the existing uAPI.
>>
>> Signed-off-by: Christopher Healy <healych at amazon.com>
>> Signed-off-by: Detlev Casanova <detlev.casanova at collabora.com>
>> ---
>> drivers/media/platform/verisilicon/hantro.h | 5 +++++
>> drivers/media/platform/verisilicon/hantro_drv.c | 22 ++++++++++++++++++++++
>> 2 files changed, 27 insertions(+)
>>
>> diff --git a/drivers/media/platform/verisilicon/hantro.h b/drivers/media/platform/verisilicon/hantro.h
>> index 0353de154a1e..10287e5cebac 100644
>> --- a/drivers/media/platform/verisilicon/hantro.h
>> +++ b/drivers/media/platform/verisilicon/hantro.h
>> @@ -16,6 +16,7 @@
>> #include <linux/videodev2.h>
>> #include <linux/wait.h>
>> #include <linux/clk.h>
>> +#include <linux/ktime.h>
>> #include <linux/reset.h>
>>
>> #include <media/v4l2-ctrls.h>
>> @@ -268,6 +269,10 @@ struct hantro_ctx {
>> struct hantro_postproc_ctx postproc;
>> bool need_postproc;
>>
>> + /* Statistics for debugging and performance measurements. */
>> + ktime_t start_time;
>> + u64 total_ns;
>> +
>> /* Specific for particular codec modes. */
>> union {
>> struct hantro_h264_dec_hw_ctx h264_dec;
>> diff --git a/drivers/media/platform/verisilicon/hantro_drv.c b/drivers/media/platform/verisilicon/hantro_drv.c
>> index 2e81877f640f..139c2e6a83aa 100644
>> --- a/drivers/media/platform/verisilicon/hantro_drv.c
>> +++ b/drivers/media/platform/verisilicon/hantro_drv.c
>> @@ -90,6 +90,8 @@ static void hantro_job_finish(struct hantro_dev *vpu,
>> struct hantro_ctx *ctx,
>> enum vb2_buffer_state result)
>> {
>> + ctx->total_ns += ktime_to_ns(ktime_sub(ktime_get(), ctx->start_time));
>> +
>> pm_runtime_put_autosuspend(vpu->dev);
>>
>> clk_bulk_disable(vpu->variant->num_clocks, vpu->clocks);
>> @@ -186,6 +188,8 @@ static void device_run(void *priv)
>>
>> v4l2_m2m_buf_copy_metadata(src, dst);
>>
>> + ctx->start_time = ktime_get();
>> +
>> if (ctx->codec_ops->run(ctx))
>> goto err_cancel_job;
>>
>> @@ -701,10 +705,28 @@ static int hantro_release(struct file *filp)
>> return 0;
>> }
>>
>> +static void hantro_show_fdinfo(struct seq_file *m, struct file *f)
>> +{
>> + struct hantro_ctx *ctx = file_to_ctx(f);
>> + struct hantro_dev *vpu = ctx->dev;
>> +
>> + seq_printf(m, "media-driver:\t%s\n", DRIVER_NAME);
>> + seq_printf(m, "media-engine-%s:\t%llu ns\n",
>> + ctx->is_encoder ? "encoder" : "decoder",
>> + ctx->total_ns);
>> + seq_printf(m, "media-maxfreq-%s:\t%lu Hz\n",
>> + ctx->is_encoder ? "encoder" : "decoder",
>> + clk_get_rate(vpu->clocks[0].clk));
>> + seq_printf(m, "media-curfreq-%s:\t%lu Hz\n",
>> + ctx->is_encoder ? "encoder" : "decoder",
>> + clk_get_rate(vpu->clocks[0].clk));
> Similar to what was done in DRM helpers, I would create a common structure,
> maybe in a v4l2-stats library (or some better name). The driver would fill that
> structure directly, or through helpers (such as software timer helper, cycle
> count translation helper), and finally, this show_fd_info() would be a one
> liners, just like DRM do.
>
> This will ensure full consistency in the trace, and avoid possible future
> unresolvable compatibility issue due to past mistakes. If we ever let a bug
> through (of course we won't :-D) all driver will have the same bug, and it will
> be easier to not break userspace.
Yes, that makes a lot of sense actually, And will also help with the
ftrace HW info if we have stats and debug info centralized in v4l2
> Nicolas
>
>> +}
>> +
>> static const struct v4l2_file_operations hantro_fops = {
>> .owner = THIS_MODULE,
>> .open = hantro_open,
>> .release = hantro_release,
>> + .show_fdinfo = hantro_show_fdinfo,
>> .poll = v4l2_m2m_fop_poll,
>> .unlocked_ioctl = video_ioctl2,
>> .mmap = v4l2_m2m_fop_mmap,
More information about the Linux-rockchip
mailing list