DSpace Repository

A NEURAL NETWORK BASED STITCHING OF REALISTIC SOUND TO SILENT VIDEO

Show simple item record

dc.contributor.author Isaac, Renson Reg # 35686
dc.contributor.author Hashmi, Muzammil Reg # 32667
dc.contributor.author Naqvi, Anas Reg # 35627
dc.contributor.author Nawaz, Asif Reg # 35631
dc.date.accessioned 2020-11-28T01:06:33Z
dc.date.available 2020-11-28T01:06:33Z
dc.date.issued 2017
dc.identifier.uri http://hdl.handle.net/123456789/10369
dc.description Supervised by Azmat Khan en_US
dc.description.abstract Objects make distinctive sounds when they are hit or scratched. These sounds reveal aspects of an object s material properties, as well as the actions that produced them. In this paper, we propose the task of predicting what sound an object makes when struck as a way ofstudying physical interactions within a visual scene. We present an algorithm that synthesizes sound from silent videos of people hitting and scratching objects with a drumstick. This algorithm uses a recurrent neural network to predict sound features from videos and then produces a waveform from these features with an example-based synthesis procedure. This project uses the Recurrent Neural Network technique to develop the software. The main advantage of using this technique is that it predict sound features from videos and then produces a waveform from these features with an example-based synthesis procedure. Different models of neural network are discuss and Feed forward, Back propagation. After trials and errors, a suitable set of training parameters are define and network structure that consist of 1 input layer, 2 hidden layers and 1 output layer with 69 input neurons, 324 neurons for both hidden layers and 38 neurons for output layer is created. We show that the sounds predicted by our model are realistic enough to fool participants in a “real or fake” psychophysical experiment, and that they convey significant information about material properties and physical interactions en_US
dc.language.iso en_US en_US
dc.publisher Bahria University Karachi Campus en_US
dc.relation.ispartofseries BS CS;MFN BSCS 68
dc.title A NEURAL NETWORK BASED STITCHING OF REALISTIC SOUND TO SILENT VIDEO en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account