Training different policies in different styles is a really interesting idea. Yo...

zardo · on March 3, 2016

Or, rather than multiple policies, one policy that takes a player vector as an input along with the board position. Players that you predict will make the same move from a given board have their vectors adjusted toward each other and away from a random sample of other player vectors.

If it works, you would be able to perform player vector math ala word2vec. (No idea if it will work)

momerath · on March 3, 2016

I don't know a lot about chess, but I would try picking several prolific players with what seem to you to be different styles, and training a classifier to identify the player, as an experiment in viability.