[PATCH v1 3/5] drm/i915/gvt: GVTg support context submission pvmmio optimization

Zhang, Xiaolin xiaolin.zhang at intel.com
Tue Nov 6 05:42:02 UTC 2018


On 11/05/2018 06:03 PM, Zhenyu Wang wrote:
> On 2018.11.05 17:20:47 +0800, Xiaolin Zhang wrote:
>> implemented context submission pvmmio optimizaiton with GVTg.
>>
>> GVTg to read context submission data (elsp_data) from the shared_page
>> directly without trap cost to improve guest GPU peformrnace.
>>
>> v1: rebase
>> v0: RFC
>>
>> Cc: Zhenyu Wang <zhenyuw at linux.intel.com>
>> Cc: Zhi Wang <zhi.a.wang at intel.com>
>> Cc: Min He<min.he at intel.com>
>> Cc: Fei Jiang <fei.jiang at intel.com>
>> Cc: Zhipeng Gong <zhipeng.gong at intel.com>
>> Cc: Hang Yuan <hang.yuan at intel.com>
>> Cc: Zhiyuan Lv <zhiyuan.lv at intel.com>
>> Signed-off-by: Xiaolin Zhang <xiaolin.zhang at intel.com>
>> ---
>>  drivers/gpu/drm/i915/gvt/handlers.c | 12 ++++++++++++
>>  1 file changed, 12 insertions(+)
>>
>> diff --git a/drivers/gpu/drm/i915/gvt/handlers.c b/drivers/gpu/drm/i915/gvt/handlers.c
>> index bf14c66..cd3b602 100644
>> --- a/drivers/gpu/drm/i915/gvt/handlers.c
>> +++ b/drivers/gpu/drm/i915/gvt/handlers.c
>> @@ -1667,6 +1667,8 @@ static int elsp_mmio_write(struct intel_vgpu *vgpu, unsigned int offset,
>>  	int ring_id = intel_gvt_render_mmio_to_ring_id(vgpu->gvt, offset);
>>  	struct intel_vgpu_execlist *execlist;
>>  	u32 data = *(u32 *)p_data;
>> +	u32 elsp_data[4];
>> +	u32 elsp_data_off;
>>  	int ret = 0;
>>  
>>  	if (WARN_ON(ring_id < 0 || ring_id >= I915_NUM_ENGINES))
>> @@ -1674,6 +1676,16 @@ static int elsp_mmio_write(struct intel_vgpu *vgpu, unsigned int offset,
>>  
>>  	execlist = &vgpu->submission.execlist[ring_id];
>>  
>> +	if (VGPU_PVMMIO(vgpu) & PVMMIO_ELSP_SUBMIT) {
>> +		elsp_data_off = offsetof(struct gvt_shared_page, elsp_data);
>> +		intel_gvt_read_shared_page(vgpu, elsp_data_off, &elsp_data, 16);
>> +		execlist->elsp_dwords.data[3] = elsp_data[0];
>> +		execlist->elsp_dwords.data[2] = elsp_data[1];
>> +		execlist->elsp_dwords.data[1] = elsp_data[2];
>> +		execlist->elsp_dwords.data[0] = data;
>> +		return intel_vgpu_submit_execlist(vgpu, ring_id);
>> +	}
> I think we still need to do more check, e.g if ctx address is in valid vgpu range, etc?
I think your concern to check input data eslp_datt[4], but elsp_data[4]
store the context descriptor, not ctx address.  so I am not sure is
there any mechanism to validate context descriptor.
>
>> +
>>  	execlist->elsp_dwords.data[3 - execlist->elsp_dwords.index] = data;
>>  	if (execlist->elsp_dwords.index == 3) {
>>  		ret = intel_vgpu_submit_execlist(vgpu, ring_id);
>> -- 
>> 2.7.4
>>



More information about the intel-gvt-dev mailing list