Hi,. So you exported your model using OpenNMT-tf 2.x? If yes, you should use a custom serving image
opennmt/tensorflow-serving:2.0.0-gpu
which includes additional op. See here for more info: https://github.com/OpenNMT/OpenNMT-tf/tree/master/examples/serving/tensorflow_serving#custom-tensorflow-serving-image
Is the openmt’s tf serving built using optimized version or not?