* Move triton_pb.Tensor to C++
* Add support for DLPack GPU tensor
* Add support for conversion between DLPack data types and Triton data types
* Add support for output tensors in GPU
* Fix bug when the output tensors contain one of the input tensors
* Add cleanup for DLPack tensors
* Fix deadlock when init takes long
* Fix DLPack for CPU tensors
* Clean up
* Fix up
* Add FORCE_CPU_ONLY_INPUT_TENSORS
* Minor fixes for TYPE_STRING and contiguous tensors
* Review edits
* Manually adjust the offset for cudaIpcMemHandle
* Print appropriate error when the tensor is not contiguous
* Refactor communication between the main process and the stub process (triton-inference-server#65)
* Fix CPU_PINNED_MEMORY
* Delete shared memory region if there is an exception
* Minor improvements
* Fix shutdown and identity tensor return
* Add more comments