Development of a Computer-Guided Workflow for Catalyst Optimization. Descriptor Validation, Subset Selection, and Training Set Analysis