# Introduction

The TVM community has worked since the v0.14.0 release to deliver the following 
new exciting improvements! The main tags are below (**bold text is with lots of 
progress**):

- Community, RFCs
- Adreno, ArmComputeLibrary, Metal, cuda & cutlass & tensorrt, micoNPU, Runtime
- **Frontend & Relay**
- Arith, **TOPI**, **TIR**, TVMScript
- Docs, CI, **Misc**, **BugFix**

Please visit the full listing of commits for a complete view: 
[v0.14.0...v0.15.0](https://github.com/apache/tvm/compare/v0.14.0...v0.15.0).

### Community
 * [#16172](https://github.com/apache/tvm/pull/16172) - Yixin Dong -> Reviewer
 * [#16162](https://github.com/apache/tvm/pull/16162) - Shuai Yuan -> Committer
 * [#16164](https://github.com/apache/tvm/pull/16164) - Qiang Zhang -> Committer
 * [#16166](https://github.com/apache/tvm/pull/16166) - Bohan Hou -> PMC
 * [#16165](https://github.com/apache/tvm/pull/16165) - Ruihang Lai -> PMC

### RFCs

 * [#105](https://github.com/apache/tvm-rfcs/pull/105) - Add a new backend 
language——SYCL

----

### Adreno
 * [#15991](https://github.com/apache/tvm/pull/15991) - [CI] Enhancements to 
Adreno specific CI utils
 * [#15786](https://github.com/apache/tvm/pull/15786) - [TOPI] Add conv2d 
transpose nchw texture schedule

### Arith
 * [#16227](https://github.com/apache/tvm/pull/16227) - Simplify nested 
if_then_else when constant is appearing in then_expr

### ArmComputeLibrary
 * [#15990](https://github.com/apache/tvm/pull/15990) - [ACL] Update Compute 
Library to v23.08

### Metal
 * [#16192](https://github.com/apache/tvm/pull/16192) - [Device] Fix metal warp 
size
 * [#16033](https://github.com/apache/tvm/pull/16033) - [Codegen] Disable 
cross-function call in Metal codegen

### cuda & cutlass & tensorrt
 * [#16061](https://github.com/apache/tvm/pull/16061) - [CUDA] Add an option 
for profiling cuda kernels

### micoNPU
 * [#16003](https://github.com/apache/tvm/pull/16003) - [microNPU][ETHOSU] Fix 
ConcatRewriter args processing
 * [#15929](https://github.com/apache/tvm/pull/15929) - [microNPU][ETHOSU] Fix 
rounding mode in requantize operation

### Runtime
 * [#15896](https://github.com/apache/tvm/pull/15896) - [CLML] Fix for CLML ops 
and enable more test case
 * [#16133](https://github.com/apache/tvm/pull/16133) - Parallel-for with 
threading backend
 * [#16066](https://github.com/apache/tvm/pull/16066) - Support clear global 
memory allocators
 * [#16030](https://github.com/apache/tvm/pull/16030) - Introduce 
`TVM_MODULE_VTABLE` Macros

### BugFix
 * [#16269](https://github.com/apache/tvm/pull/16269) - Update pillow usage
 * [#16272](https://github.com/apache/tvm/pull/16272) - Fixed Inappropriate 
Logical Expression
 * [#16216](https://github.com/apache/tvm/pull/16216) - [TIR] Fix dynamic smem 
merge leaf alloc
 * [#16190](https://github.com/apache/tvm/pull/16190) - Fix the error of 
reloading the model library on the ROCm platform: "MIOpen Error: No invoker was 
registered for convolution forward.”
 * [#16167](https://github.com/apache/tvm/pull/16167) - [Relay][Pytorch] Fix 
missing `.dtype`
 * [#16091](https://github.com/apache/tvm/pull/16091) - [Fix] Fix 
`topi.rms_norm` with float32 upscale
 * [#16081](https://github.com/apache/tvm/pull/16081) - [Fix] Broken Windows 
Build with LLVM
 * [#16051](https://github.com/apache/tvm/pull/16051) - [Fix][TIR] Fix dtype 
issues for match_buffer and ramp node
 * [#14655](https://github.com/apache/tvm/pull/14655) - [VTA] Fix FSIM compile 
error on macOS
 * [#16021](https://github.com/apache/tvm/pull/16021) - [FFI] Typo fix of 
IncRef to DecRef
 * [#16010](https://github.com/apache/tvm/pull/16010) - [Fix][TIR] fix mul 
dtype mismatch
 * [#16000](https://github.com/apache/tvm/pull/16000) - [Fix][TIR] fix symbolic 
strides lower
 * [#15970](https://github.com/apache/tvm/pull/15970) - [Hotfix] Mark 
python-FFI handling with TVM_DLL
 * [#15965](https://github.com/apache/tvm/pull/15965) - [CI] Better to pass the 
build folder

### CI
 * [#16110](https://github.com/apache/tvm/pull/16110) - Refactor unittest folder
 * [#16055](https://github.com/apache/tvm/pull/16055) - Fix broken links about 
Jenkins
 * [#16062](https://github.com/apache/tvm/pull/16062) - Use LLVM 17 for tests 
on `ci_arm`
 * [#16018](https://github.com/apache/tvm/pull/16018) - [Tests] Fix work_dir 
location used by test_micro_tuning_with_meta_schedule
 * [#16019](https://github.com/apache/tvm/pull/16019) - [Tests] Check 
int8+int32 testcases in test_estimate_peak_flops_cpu
 * [#16017](https://github.com/apache/tvm/pull/16017) - [Tests] Fix str vs. int 
comparison in test_num_threads

### Docs
 * [#16282](https://github.com/apache/tvm/pull/16282) - [Doc] Fix minor error 
in doc (Add an operator to Relay)
 * [#16152](https://github.com/apache/tvm/pull/16152) - [DOC] Add v0.14.0 docs 
to site
 * [#16127](https://github.com/apache/tvm/pull/16127) - Revert 
"[#15157][Rust][Doc] Re-enable the Rust documentation build (#15213)"
 * [#16097](https://github.com/apache/tvm/pull/16097) - Add missing backtick to 
contribute/code_guide.rst
 * [#16089](https://github.com/apache/tvm/pull/16089) - Fix error on linting by 
adding `--rev` argument
 * [#16024](https://github.com/apache/tvm/pull/16024) - Update 
release_process.rst about version number modification

### Frontend & Relay
 * [#16243](https://github.com/apache/tvm/pull/16243) - [TFLite] Add support 
for quantized mirror pad
 * [#15914](https://github.com/apache/tvm/pull/15914) - [TFLite]Support 
quantized SQUARE
 * [#16159](https://github.com/apache/tvm/pull/16159) - [KERAS] Fix bug concat 
convert for NCHW
 * [#16319](https://github.com/apache/tvm/pull/16319) - [Torch] add 
aten:broadcast_to
 * [#16131](https://github.com/apache/tvm/pull/16131) - [Pytorch] Add support 
for `aten::unflatten`
 * [#16105](https://github.com/apache/tvm/pull/16105) - [Pytorch] Add support 
for `aten::bitwise_and`
 * [#16079](https://github.com/apache/tvm/pull/16079) - [Pytorch] Add support 
for aten::swapaxes operator
 * [#15502](https://github.com/apache/tvm/pull/15502) - [Pytorch] aten::copy_ 
support for pytorch
 * [#16180](https://github.com/apache/tvm/pull/16180) - [Pytorch] Fix bug when 
converting models with torch.nn.ParameterList
 * [#16143](https://github.com/apache/tvm/pull/16143) - [Pytorch] Add support 
for `aten::scaled_dot_product_attention`
 * [#16123](https://github.com/apache/tvm/pull/16123) - [Pytorch] Add support 
for `aten::linalg_vector_norm`
 * [#16171](https://github.com/apache/tvm/pull/16171) - [Frontend] Preserve 
Pytorch Span Names
 * [#16217](https://github.com/apache/tvm/pull/16217) - [Frontend][QNN] fix 
access `param_debug_name_map` to node output name in fx-quantized graph node 
replacement
 * [#16199](https://github.com/apache/tvm/pull/16199) - [Frontend] Add support 
for aten::concat
 * [#16151](https://github.com/apache/tvm/pull/16151) - conv3d depthwise bug fix
 * [#15928](https://github.com/apache/tvm/pull/15928) - Expose qnn ops directly 
from relay.qnn module

### TOPI
 * [#16259](https://github.com/apache/tvm/pull/16259) - Add support for 
group_conv3d_transpose_ncdhw for generic
 * [#16052](https://github.com/apache/tvm/pull/16052) - Enhance `topi.nn.matmul`
 * [#16080](https://github.com/apache/tvm/pull/16080) - Reduce code redundancy 
in conv2d weights transformation
 * [#16248](https://github.com/apache/tvm/pull/16248) - [TOPI] Add support for 
group_conv1d_transpose_ncw for generic
 * [#16106](https://github.com/apache/tvm/pull/16106) - [TOPI] Add conv2d NHWC 
hybrid schedule for `arm_cpu`

### TIR 
 * [#16239](https://github.com/apache/tvm/pull/16239) - [Schedule] 
TileWithTensorIntrin skip incorrect ComputeInline for input-padding
 * [#16236](https://github.com/apache/tvm/pull/16236) - ConvertSSA process 
entry func first
 * [#16070](https://github.com/apache/tvm/pull/16070) - [Transform] Introduce 
new `InjectPermutedLayout` pass
 * [#16083](https://github.com/apache/tvm/pull/16083) - Enhance Python Type 
Annotations for TIR Expr
 * [#16073](https://github.com/apache/tvm/pull/16073) - Support more mma 
intrinsics and `get_mma_intrin_group` utility
 * [#16076](https://github.com/apache/tvm/pull/16076) - Enhance Python Type 
Annotations for TIR stmt
 * [#16074](https://github.com/apache/tvm/pull/16074) - Fix the thread binding 
iter_var dtype in `Bind` primitive
 * [#16063](https://github.com/apache/tvm/pull/16063) - Fix pass RenewDefs 
error in gather/take case
 * [#16027](https://github.com/apache/tvm/pull/16027) - Fix software pipeline 
with dynamic loop extent

### TVMScript
 * [#16271](https://github.com/apache/tvm/pull/16271) - Disable concise scoping 
when the scope stmt is explicitly annotated
 * [#16041](https://github.com/apache/tvm/pull/16041) - Fix mismatched dtype of 
IterVar in `T.thread_binding`
 * [#15953](https://github.com/apache/tvm/pull/15953) - [TIR] Pretty print TIR 
LLVM function name
 * [#15972](https://github.com/apache/tvm/pull/15972) - delete print extra info 
at parsing

### Misc
 * [#16279](https://github.com/apache/tvm/pull/16279) - replace deprecated 
np.int with int to avoid crash
 * [#16262](https://github.com/apache/tvm/pull/16262) - Update conv2d.py
 * [#16255](https://github.com/apache/tvm/pull/16255) - [Support] Add Interrupt 
Handling in Pipe
 * [#16104](https://github.com/apache/tvm/pull/16104) - [LoopPartition] Fix a 
bug of LoopPartition in single point scenarioes
 * [#16231](https://github.com/apache/tvm/pull/16231) - [Target] Add Jetson AGX 
Orin tags
 * [#16221](https://github.com/apache/tvm/pull/16221) - remove deprecated 
np.int in slice converter (pytorch)
 * [#16214](https://github.com/apache/tvm/pull/16214) - [Python] Fix setup.py 
for inplace build
 * [#16174](https://github.com/apache/tvm/pull/16174) - Bump cryptography from 
37.0.2 to 41.0.6 in /docker/python
 * [#16202](https://github.com/apache/tvm/pull/16202) - Fix IRModule 
initialization with attrs
 * [#16176](https://github.com/apache/tvm/pull/16176) - Enable ccache to 
accelerate contrib compilation
 * [#15968](https://github.com/apache/tvm/pull/15968) - Add missing backtick
 * [#16034](https://github.com/apache/tvm/pull/16034) - [Packaging] Include 
BYOC dynamic libraries into wheel
 * [#16087](https://github.com/apache/tvm/pull/16087) - Add _ffi_api.py under 
script folder
 * [#16039](https://github.com/apache/tvm/pull/16039) - [Target] Support obtain 
l2 cache size from target
 * [#16065](https://github.com/apache/tvm/pull/16065) - [Pylint] fix pylint 
issues from test_random to test_tedd
 * [#16031](https://github.com/apache/tvm/pull/16031) - [TRT] fix outdated 
module building method in tensorrt
 * [#16032](https://github.com/apache/tvm/pull/16032) - [CMake] Use llvm-config 
to locate Findzstd.cmake
 * [#16023](https://github.com/apache/tvm/pull/16023) - [Pylint] fix pylint 
issues for thrust&tflite_runtime&util
 * [#15998](https://github.com/apache/tvm/pull/15998) - [Codegen] Add shuffle 
for cuda and metal
 * [#16015](https://github.com/apache/tvm/pull/16015) - [Pylint] fix pylint 
issues for cblas
 * [#15955](https://github.com/apache/tvm/pull/15955) - [FFI][Python] Handle 
error propagation when line number is missing
 * [#15982](https://github.com/apache/tvm/pull/15982) - Bump werkzeug from 
2.2.3 to 3.0.1 in /apps/microtvm
 * [#15966](https://github.com/apache/tvm/pull/15966) - [CMake] Fix order of 
GNUInstallDirs module
 * [#15952](https://github.com/apache/tvm/pull/15952) - Update ci_arm Docker tag
 * [#15940](https://github.com/apache/tvm/pull/15940) - [Minor] Fix compilation 
warnings for clang
 * [#15947](https://github.com/apache/tvm/pull/15947) - Bump urllib3 from 
1.26.9 to 1.26.18 in /docker/python
 * [#15835](https://github.com/apache/tvm/pull/15835) - [CodeGenC][Redo] Handle 
GlobalVar callee as internal function call
 * [#15945](https://github.com/apache/tvm/pull/15945) - Bump urllib3 from 
1.26.15 to 1.26.18 in /apps/microtvm

-- 
View it on GitHub:
https://github.com/apache/tvm/releases/tag/v0.15.0.rc0
You are receiving this because you are subscribed to this thread.

Message ID: <apache/tvm/releases/137574...@github.com>

Reply via email to