Soft-assignment random-forest with an application to discriminative representation of human actions in videos