You're better off detecting when the user begins typing a message. That will actually work properly, and it tells you that the user really does want to talk to the bot and didn't just open the conversation window by accident.
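As a rough sketch of the idea (not tied to any particular bot platform): the names `TypingGreeter`, `on_typing`, `on_window_opened`, and the `send` callback below are all hypothetical, and you would wire them to whatever "user is typing" event your platform exposes, assuming it has one. The only point is that the greeting is triggered by typing, and only once per conversation.

```python
from typing import Callable, Set


class TypingGreeter:
    """Greets a user the first time they start typing, once per conversation."""

    def __init__(self, send: Callable[[str, str], None]) -> None:
        # Hypothetical transport: send(conversation_id, text).
        self._send = send
        self._greeted: Set[str] = set()

    def on_window_opened(self, conversation_id: str) -> None:
        # Deliberately does nothing: opening the window may be accidental.
        pass

    def on_typing(self, conversation_id: str) -> None:
        # Hypothetical hook for a platform's "user is typing" event.
        if conversation_id not in self._greeted:
            self._greeted.add(conversation_id)
            self._send(conversation_id, "Hi! How can I help?")


# Stand-in transport that just records outgoing messages.
sent = []
greeter = TypingGreeter(lambda cid, text: sent.append((cid, text)))

greeter.on_window_opened("conv-1")  # no greeting yet
greeter.on_typing("conv-1")         # greeting sent here
greeter.on_typing("conv-1")         # ignored, already greeted
```

Keeping the set of greeted conversations means repeated typing events don't make the bot greet the user over and over.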
EDIT: That said, I don't really see the problem with just waiting for the user to send a message. Why does the bot have to talk before the user does?