GPU implementation of singular value decomposition for high rank tensors
Abstract
Programming using the Python API (application programming interface) offers some advantages over using compiled languages. Here we implement a high rank tensor decomposition routine using the TensorFlow library which has native support for utilizing multi-core CPU, GPU, and TPU hardware. Specifically, a singular value decomposition algorithm was performed on a rank-5 tensor. The performance of this Python implementation was compared with a known C++ based library written specifically for tensor manipulations but without native GPU support. We report some use cases where the implementation on a consumer grade GPU was empirically faster than the C++ based library when the rank-5 tensor has more than 2 × 106 elements. With the acceptable performance of the implementation, it may be beneficial to have have a native implementation of tensor network operations on TensorFlow.
Downloads
Published
Issue
Section
License
By submitting their manuscript to the Samahang Pisika ng Pilipinas (SPP) for consideration, the Authors warrant that their work is original, does not infringe on existing copyrights, and is not under active consideration for publication elsewhere.
Upon acceptance of their manuscript, the Authors further agree to grant SPP the non-exclusive, worldwide, and royalty-free rights to record, edit, copy, reproduce, publish, distribute, and use all or part of the manuscript for any purpose, in any media now existing or developed in the future, either individually or as part of a collection.
All other associated economic and moral rights as granted by the Intellectual Property Code of the Philippines are maintained by the Authors.








