infer
> length_bucket_width
in https://opennmt.net/OpenNMT-tf/configuration.html. It is set to 5 with auto_config
but you can disable it with 0.
maximum_decoding_length
in the parameters: https://opennmt.net/OpenNMT-tf/configuration.html. Maybe length_penalty
can also help.