Imagine training a voice‑recognition system without hand‑transcribing thousands of hours of audio. Traditional supervised learning demands that developers label every snippet with exact text—a costly, ...