Inventors:
Oytun Turk - Istanbul, TR
Levent Mustafa Arslan - Istanbul, TR
Fred Deutsch - New York NY, US
Assignee:
Voxonic, Inc. - New York NY
International Classification:
G10L 13/06
Abstract:
The conversion of speech can be used to transform an utterance by a source speaker to match the speech characteristics of a target speaker, for applications such as dubbing a motion picture. During a training phase, utterances corresponding to the same sentences by both the target speaker and source speaker are force-aligned according to the phonemes within the sentences. A transformation or mapping is trained so that each frame of the source utterances is mapped to a corresponding frame of the target utterances. After the completion of the training phase, a source utterance is divided into frames, which are transformed into target frames. After all target frames are created from the sequence of frames from the source utterance, a target utterance is created having the speech of the source speaker, but with the vocal characteristics of the target speaker.
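The training and conversion steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not specify the form of the mapping or the frame features, so the sketch assumes a simple affine least-squares mapping over generic feature vectors, and it assumes the forced alignment is already available as (phoneme, start, end) frame ranges. The function names pair_frames, train_mapping, and convert are hypothetical.

```python
import numpy as np


def pair_frames(src_segments, tgt_segments):
    """Pair source and target frame indices phoneme by phoneme.

    Each segment is (phoneme, start, end) in frame indices, end exclusive.
    Assumption: both lists come from forced alignment of the same sentence,
    so they contain the same phonemes in the same order.
    """
    pairs = []
    for (p_src, s0, s1), (p_tgt, t0, t1) in zip(src_segments, tgt_segments):
        if p_src != p_tgt:
            raise ValueError(f"phoneme mismatch: {p_src!r} vs {p_tgt!r}")
        n_src, n_tgt = s1 - s0, t1 - t0
        if n_src == 0 or n_tgt == 0:
            continue  # nothing to pair in an empty segment
        for i in range(n_src):
            # Map the relative position inside the source phoneme onto
            # the target phoneme (simple linear time warping).
            j = t0 + min(n_tgt - 1, (i * n_tgt) // n_src)
            pairs.append((s0 + i, j))
    return pairs


def train_mapping(src_frames, tgt_frames, pairs):
    """Fit an affine frame-to-frame mapping by least squares.

    Assumption: an affine map stands in for whatever transformation
    the actual system trains.
    """
    src_idx = [s for s, _ in pairs]
    tgt_idx = [t for _, t in pairs]
    # Append a constant column so the mapping can learn an offset.
    X = np.hstack([src_frames[src_idx], np.ones((len(pairs), 1))])
    Y = tgt_frames[tgt_idx]
    W, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return W


def convert(src_frames, W):
    """Transform every source frame into a target-like frame."""
    X = np.hstack([src_frames, np.ones((len(src_frames), 1))])
    return X @ W
```

In use, pair_frames would be called once per training sentence, the resulting pairs pooled across sentences before train_mapping is run, and convert applied to the frames of each new source utterance. Resynthesizing audio from the converted frames is outside the scope of this sketch.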