Sounds like a feature engineering problem, and we can't really help you here. That being said, categorical variables are one hot encoded in DD, and this can lead to a (too) high number of variables. You can try to preprocess some fields with a simpler scheme, e.g. mapping discrete values to integer. Some more complex scheme exist, from embeddings to grouping based on your underlying application.