There you go: two Send Live Audio nodes. Also note that 8 kHz is quite a low sample rate and may add unwanted artifacts of its own. I'd be more inclined to keep a higher sample rate (44.1 kHz or 48 kHz, or at least go back to the original audio file for the highest rate available), and if you want to add some lo-fi quality, try, for example, Vuo's audio filter.
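To put rough numbers on why 8 kHz is limiting, here's a small Python sketch (not Vuo code; the function names are just made up for illustration). It assumes the simplest case, where the audio is sampled without an anti-aliasing filter, so anything above half the sample rate folds back down as a different frequency. A proper resampler would filter that content out instead, but either way nothing above 4 kHz survives at 8 kHz.

```python
def nyquist_hz(sample_rate_hz):
    """Highest frequency a given sample rate can represent."""
    return sample_rate_hz / 2.0


def aliased_frequency_hz(tone_hz, sample_rate_hz):
    """Frequency a pure tone shows up at after sampling,
    assuming no anti-aliasing filter is applied first."""
    folded = tone_hz % sample_rate_hz
    return min(folded, sample_rate_hz - folded)


if __name__ == "__main__":
    # Compare a 5 kHz tone at the three sample rates mentioned above.
    for rate in (8000, 44100, 48000):
        print(rate, nyquist_hz(rate), aliased_frequency_hz(5000, rate))
```

At 8 kHz the 5 kHz tone lands at 3 kHz, while at 44.1 kHz and 48 kHz it stays where it is.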
I could be missing something, but it seems you simply need to select the protocol again to deselect it, then add a window and account for the width/height and the events used to run the comp. If you were using the time port in a protocol, use Fire on Display Refresh. Jaymie also advised at some point that, for some things, to make sure events fire in the proper order, you should use Allow First Event (after Fire on Display Refresh) instead of the Fire on Start that loads with the default empty comp.