LuoYuanke added a comment.

@lebedev.ri, this patch is mainly for discussing the approach that Florian 
proposed, so I didn't polish my code. Nevertheless your comments for amx_cast.c 
is right. For __tile_loadd() is to load a 2d tile from memory. There is an 
extra parameter stride. As I explain in llvm-dev, it load each row from memory 
to tile register and then base += stride. So the data is not contiguous in 
memory.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D99152/new/

https://reviews.llvm.org/D99152

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to