alex calderwood

Phonaesthesia

research language code

Do speech sounds contain enough information to generate images?

I built a visualizer for human speech with a generative model called a Variational Auto Encoder (VAE), for Fundamentals of Speech Recognition (E6998), a graduate level speech recognition class at Columbia. The project was inspired by a paper by Zach Lieberman.