For rows with NaN values, impute -9999 for tree-based models; impute the median otherwise. Must also add an indicator column (0 or 1) marking whether imputation was applied.
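
A minimal sketch of that rule for a single numeric column, using only the standard library. The names `impute_with_indicator`, `tree_based`, and `TREE_SENTINEL` are hypothetical, not from the note. It also assumes that `None` counts as missing alongside NaN, that the median is taken over the observed values passed in, and that an all-missing column should raise an error.

```python
import math
from statistics import median

TREE_SENTINEL = -9999  # sentinel value named in the note


def is_missing(value):
    """Treat None and float NaN as missing (assumption: no other markers)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def impute_with_indicator(values, tree_based):
    """Return (imputed_values, indicator) for one numeric column.

    indicator[i] is 1 where values[i] was missing and has been filled, else 0.
    """
    indicator = [1 if is_missing(v) else 0 for v in values]

    if tree_based:
        fill = TREE_SENTINEL
    else:
        observed = [v for v in values if not is_missing(v)]
        # Assumption: a column with no observed values has no median.
        if not observed:
            raise ValueError("cannot take the median of an all-missing column")
        fill = median(observed)

    imputed = [fill if flag else v for v, flag in zip(values, indicator)]
    return imputed, indicator


col = [1.0, float("nan"), 3.0, 8.0]
tree_values, tree_flags = impute_with_indicator(col, tree_based=True)
other_values, other_flags = impute_with_indicator(col, tree_based=False)
```

The indicator is built before any value is filled, so it records the original missingness. If the median path is used for train and test data, the median would normally be computed on the training column and reused, which this sketch does not handle.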