I’d love of text to speech engines got smart enough to understand contextual visual things with Unicode abuse. Like smol text being whispered or Zalgo getting run through a bitcrusher with progressively fucked up formant manipulation
Instead they just spew a litany of codepoint descriptors
i made a thing
https://soundcloud.com/plaidfluff/all-work-and-no-play