Hi Zhangfei,
On Fri, Mar 26, 2021 at 12:35:49PM +0000, Zhangfei Gao via Linaro-open-discussions wrote:
Hi,
I am looking for some suggestions about DVM [1].
We are testing SVA with OpenSSL. DVM is enabled by default to broadcast TLB maintenance, and THP is also enabled in the system by default, so the khugepaged daemon (mm/khugepaged.c) periodically collapses memory into huge pages.
We found that THP's khugepaged may cause problems for the SVA test cases: https://github.com/Linaro/uadk/issues/215
Two cases:
- Heavyweight test case, async mode, 36+ jobs:
  openssl speed -elapsed -engine uadk -async_jobs 36 rsa2048
  Once collapse_huge_page happens, the hardware may hang on I/O page faults: huge numbers of page faults keep occurring, whereas usually only a few I/O page faults are reported.
- High THP scan frequency, lightweight test case, sync mode, 1 job:
  sudo openssl speed -engine uadk -seconds 1 rsa2048
  The data may be incorrect:
  Doing 2048 bits public rsa's for 1s: RSA verify failure
Two workarounds:
- Disable THP:
  echo never > /sys/kernel/mm/transparent_hugepage/enabled
- Keep THP enabled, but add an explicit TLBI instead of relying on DVM:
  Add a call to arm_smmu_tlb_inv_range() in arm_smmu_mm_invalidate_range() (arm-smmu-v3-sva.c), which is called by khugepaged via collapse_huge_page() -> mmu_notifier_invalidate_range_end(). A rough sketch is shown below.
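For illustration, the change might look roughly like the sketch below. The mn_to_smmu() helper, the smmu_mn->domain field and the arm_smmu_tlb_inv_range() signature are assumptions about the driver internals of that period, not the actual patch:

    static void arm_smmu_mm_invalidate_range(struct mmu_notifier *mn,
                                             struct mm_struct *mm,
                                             unsigned long start,
                                             unsigned long end)
    {
            struct arm_smmu_mmu_notifier *smmu_mn = mn_to_smmu(mn);

            /*
             * Workaround 2: push the TLB invalidation through the SMMU
             * command queue instead of relying on broadcast DVM.
             */
            arm_smmu_tlb_inv_range(start, end - start, PAGE_SIZE, false,
                                   smmu_mn->domain);

            /* ... keep whatever invalidation the callback already does ... */
    }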
It looks like DVM is not taking effect in some corner cases.
Questions:
- If khugepaged collapses memory used by the device and then invalidates the TLB, can DVM synchronize this TLB change to the SMMU?
- Is it possible that DMA is actively using memory at the moment khugepaged collapses it? Can DVM handle this case, or should khugepaged avoid touching memory that is in use by a device? It looks like khugepaged cannot tell the difference.
I think the huge page is just one symptom, and may not be the only way to trigger the issue. On khugepaged collapse, a large invalidation is issued, which arch/arm64/include/asm/tlbflush.h turns into a TLBI by ASID:
    if ((!system_supports_tlb_range() &&
         (end - start) >= (MAX_TLBI_OPS * stride)) ||
        pages >= MAX_TLBI_RANGE_PAGES) {
            flush_tlb_mm(vma->vm_mm);
            return;
    }
With MAX_TLBI_OPS = 512 and stride = 0x1000 we hit the size limit of 2M, which is the size of a huge page. My guess is that on khugepaged collapse we issue a TLBI ASIDE1IS (rather than a TLBI VALE1IS), which somehow isn't taken into account by the SMMU, and we end up with stale TLB entries leading to memory corruption. If that's the case, I'd suggest keeping DVM disabled on this platform (workaround 2), to force all SVA invalidations to go through the command queue.
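To make the threshold concrete, with the numbers quoted above:

    MAX_TLBI_OPS * stride = 512 * 0x1000 = 0x200000 = 2 MiB

so any range invalidation covering 2 MiB or more, such as a collapsed huge page, falls back to flush_tlb_mm(), i.e. an invalidation by ASID rather than by VA.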
Thanks, Jean
On 2021/3/26 at 9:36 PM, Jean-Philippe Brucker wrote:
We have found the reason: hpre is connected to another SMMU, whose DVM is not enabled by the BIOS :(.

    sudo busybox devmem 0x2001c0030 32
    0x1 // wrong
    0x9 // correct

With the updated BIOS, the stress test passed over the weekend: 1000 runs of the async test and 50,000 runs of the sync test. We will update the BIOS of the openlab board as well.
By the way, one uncertainty remains: is it possible for THP to collapse memory that is currently being used by DMA? The spin lock has no effect on the device. Though in the stress test we did not see such an issue.
Thanks
On Mon, Mar 29, 2021 at 10:15:14AM +0800, Zhangfei Gao wrote:
We have found the reason: hpre is connected to another SMMU, whose DVM is not enabled by the BIOS :(.
Oh OK, that's good news; it explains why I couldn't reproduce it with the zip engine.
    sudo busybox devmem 0x2001c0030 32
    0x1 // wrong
    0x9 // correct

With the updated BIOS, the stress test passed over the weekend: 1000 runs of the async test and 50,000 runs of the sync test. We will update the BIOS of the openlab board as well.
By the way, one uncertainty remains: is it possible for THP to collapse memory that is currently being used by DMA?
I'm not sure I understand the question. You can have DMA trigger THP collapse, but the collapse is done by a separate thread, khugepaged, which regularly scans memory looking to transform contiguous small-page mappings into huge pages:
1. Create a virtually-contiguous 2MB region of 4k pages:
    for (page = 0; page < SZ_2M; page += SZ_4K) {
            mmap(base + page, SZ_4K, PROT_READ | PROT_WRITE,
                 MAP_PRIVATE | MAP_ANONYMOUS | MAP_FIXED, 0, 0);
            /* Allocate half of the 4k pages now */
            if (page % SZ_8K)
                    *(char *)(base + page) = 1;
    }
2. Issue DMA for the other pages in the range, causing IOPF to allocate the remaining pages.
3. If khugepaged scans that region (reduce scan_sleep_millisecs to increase the chances of that), it will collapse it into a huge page. I check /proc/self/smaps before and after to see whether an address range has AnonHugePages or not.
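For example, something along these lines (the sysfs knob is the standard khugepaged tunable, and <pid> stands for the test process):

    # Make khugepaged scan more frequently (default interval is 10000 ms)
    echo 100 > /sys/kernel/mm/transparent_hugepage/khugepaged/scan_sleep_millisecs

    # Before and after, check whether the range was collapsed
    grep AnonHugePages /proc/<pid>/smaps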
You can also cause huge page allocation from IOPF, by allocating a 2MB mapping and initializing it using DMA.
the spin lock has no effect on the device.
Which spinlock?
Thanks, Jean