Use a ternary op since it seems the cuda compiler cannot resolve overloads by going through simple typedefs.
拖放文件到此处或者 点击上传