What kind of voice are we talking about? TTS or normal voice?
The installed TTS engine might be a bit slow in actually speaking the text passed to it