#HTE
“This algorithm uses a recurrent neural network to predict sound features from videos and then produces a waveform from these features with an example-based synthesis procedure. We show that the sounds predicted by our model are realistic enough to fool participants in a “real or fake” psychophysical experiment, and that they convey significant information about material properties and physical interactions.”
Visually Indicated Sounds by Andrew Owens, Phillip Isola, Josh McDermott, Antonio Torralba, Edward H. Adelson, William T. Freeman
http://redirect.viglink.com?u=http%3A%2F%2Fjournal.benbashford.com%2Fpost%2F146224073508&key=ddaed8f51db7bb1330a6f6de768a69b8