For the purposes of the rhythm and the gag, it's almost forced to be two syllables (or one syllable held for two beats). To my ear, the forced nature of it adds to the humor, but maybe that's just me.

I think of the word as one syllable, but it sounds like 1.5 when I say it.