For some reason this doesn't seem to translate to games, or at least I'm not familiar with any good kids' games without audio cues. I'm not sure why that is. Maybe it's because the player is in control, so the camera can't direct the view of the narrative as easily?
Early Lego games made great use of intonation and miming for their gags and storytelling. For example, this is their version of the famous "I am your father" scene: https://www.youtube.com/watch?v=fvX3MaFB9TI