ePoster

The role of mixed selectivity and representation learning for compositional generalization

Samuel Lippl, Kimberly Stachenfeld
COSYNE 2025 (2025)
Montreal, Canada

Abstract

Humans and animals routinely generalize their behavior to situations they have never encountered before, often by understanding them in terms of familiar components arranged in unfamiliar combinations. This kind of compositional generalization is thought to be a cornerstone of intelligent behavior. Uncovering the underlying neural mechanisms is a long-standing challenge in computational neuroscience, and both theoretical and experimental work has suggested that subjects generalize in part through learning a representation that encodes the compositional structure (Whittington et al., 2020; Ito et al., 2022). However, some compositional tasks can also be solved by simpler linear readout models with a fixed representation (Lippl et al., 2024). Thus, it has remained unclear when representation learning is actually necessary for compositional generalization. To address this gap, we present a general theory of compositional generalization in linear readout models with fixed, nonlinearly mixed representations. We find that these models are constrained to compositional generalization strategies that weight different representational components and add them together ("conjunction-wise additivity"). While a surprisingly broad range of compositional tasks can be solved with this simple mechanism, it also imposes fundamental restrictions: for example, linear readout models are unable to transitively generalize on equivalence relations. We then show that even for conjunction-wise additive tasks, generalization can be highly sensitive to representational geometry and training data. Finally, we show that representation learning in neural networks enables them to generalize on transitive equivalence by learning an abstract relational representation. Overall, our work clarifies how representational geometry and task statistics influence compositional generalization, and when representations that encode the task structure are necessary for successful generalization. This has important practical implications: in particular, empirical studies should design non-additive tasks to investigate the importance of representation learning for compositional generalization.
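
To make the setup concrete, the following is a minimal sketch (not the authors' code) of a linear readout trained on a fixed, nonlinearly mixed representation of two stimulus factors and evaluated on a held-out factor combination. The random embeddings, the additive target function, and the mix parameter are illustrative assumptions, not details taken from the poster.

# Minimal illustrative sketch: a linear readout on a fixed, nonlinearly
# mixed representation of two factors, trained on a subset of factor
# combinations and tested on one held-out combination.
import numpy as np
from itertools import product

rng = np.random.default_rng(0)

n_a, n_b = 4, 4          # number of values per factor
dim = 200                # dimensionality of the fixed representation

# Fixed random codes for each factor value and each conjunction (pair).
emb_a = rng.normal(size=(n_a, dim))
emb_b = rng.normal(size=(n_b, dim))
emb_ab = rng.normal(size=(n_a, n_b, dim))

def represent(a, b, mix=0.5):
    # Additive factor codes plus a conjunctive (nonlinearly mixed)
    # component weighted by `mix`; the mixing scheme is an assumption.
    return emb_a[a] + emb_b[b] + mix * emb_ab[a, b]

# Illustrative target: an additive function of the two factors.
value_a = rng.normal(size=n_a)
value_b = rng.normal(size=n_b)
def target(a, b):
    return value_a[a] + value_b[b]

# Train on all factor combinations except one held-out pair.
held_out = (n_a - 1, n_b - 1)
train_pairs = [(a, b) for a, b in product(range(n_a), range(n_b))
               if (a, b) != held_out]

X = np.stack([represent(a, b) for a, b in train_pairs])
y = np.array([target(a, b) for a, b in train_pairs])

# Minimum-norm linear readout (pseudoinverse solution).
w = np.linalg.pinv(X) @ y

pred = represent(*held_out) @ w
print(f"held-out target: {target(*held_out):+.3f}, prediction: {pred:+.3f}")

Varying the mix parameter or the set of training pairs in this sketch gives a rough sense of how representational geometry and training data can affect generalization on the held-out combination.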

Unique ID: cosyne-25/role-mixed-selectivity-representation-5f70610f