I don't think I could detect a clap reliably, as that's just a peak in the amplitude, and even typing could cause that. What I am doing now is using an algorithm (Goertzel) to detect the "volume" of a specific frequency (in my case 1680 Hz). If that volume stays above a certain threshold for x milliseconds, it sends Winamp a command to pause/unpause.
It works pretty reliably, although my sister, for example, is unable to whistle at my frequency.
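For anyone who wants to try the same idea, here is a rough Python sketch of it, not my actual code. The 8000 Hz sample rate, the 20 ms block size, the threshold of 1000 and the 100 ms hold time are all made-up values for illustration, and `ToneTrigger` is just a name I picked. It only decides when to fire; capturing audio and actually sending the pause command to Winamp are left out.

```python
import math


def goertzel_power(samples, sample_rate, target_hz):
    """Return the squared magnitude of `target_hz` in one block of samples."""
    w = 2.0 * math.pi * target_hz / sample_rate
    coeff = 2.0 * math.cos(w)
    s_prev = 0.0   # s[n-1]
    s_prev2 = 0.0  # s[n-2]
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2 = s_prev
        s_prev = s
    # Power of the target frequency after the last sample.
    return s_prev * s_prev + s_prev2 * s_prev2 - coeff * s_prev * s_prev2


class ToneTrigger:
    """Fire once when the tone stays above `threshold` for `hold_ms`."""

    def __init__(self, threshold, hold_ms, block_ms):
        self.threshold = threshold
        # Number of consecutive loud blocks needed before firing.
        self.blocks_needed = max(1, math.ceil(hold_ms / block_ms))
        self.run = 0        # consecutive loud blocks seen so far
        self.fired = False  # True until the tone drops again

    def feed(self, power):
        """Feed one block's power; return True exactly once per whistle."""
        if power > self.threshold:
            self.run += 1
            if self.run >= self.blocks_needed and not self.fired:
                self.fired = True
                return True
        else:
            # Tone dropped: reset the counter and re-arm the trigger.
            self.run = 0
            self.fired = False
        return False


def sine_block(freq_hz, sample_rate, n):
    """Helper for experiments: one block of a unit-amplitude sine."""
    return [math.sin(2.0 * math.pi * freq_hz * i / sample_rate) for i in range(n)]
```

In a real program you would call `goertzel_power` on each incoming 20 ms block from the microphone, pass the result to `feed`, and toggle playback whenever `feed` returns True. The threshold depends heavily on the microphone gain and block size, so it has to be tuned by hand.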