Do speech sounds contain enough information to generate images?
I built a visualizer for human speech with a generative model called a Variational Auto Encoder (VAE), for Fundamentals of Speech Recognition (E6998), a graduate level speech recognition class at Columbia. The project was inspired by a paper by Zach Lieberman.