Networked_Music_Review

Learning Auditory Models of Machine Voices

LEARNING AUDITORY MODELS OF MACHINE VOICES [PDF] by Kelly Dobson and Brian Whitman (MIT Media Lab) and Daniel P.W. Ellis (LabROSA, Electrical Engineering, Columbia University).

ABSTRACT: Vocal imitation is often found useful in Machine Therapy sessions as it creates an empathic relational bridge between human and machine. The feedback of the machine directly responding to the person’s imitation can strengthen the trust of this connection. However, vocal imitations of machines often bear little resemblance to the target due to physiological limitations. In practice, we need a way to detect human vocalization of machine sounds that can generalize to new machines. In this study we learn the relationship between vocal imitation of machine sounds and the target sounds to create a predictive model of vocalization of otherwise humanly impossible sounds. After training on a small set of machines and their imitations, we predict the correct target of a new set of imitations with high accuracy. The model outperforms distance metrics between human and machine sounds on the same task and takes into account auditory perception and constraints in vocal expression.
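To make the comparison in the abstract concrete, here is a minimal sketch of the general idea and not the authors' method. It assumes each sound is reduced to a hypothetical two-number feature vector (a pitch-like value and a noisiness-like value), uses noise-free toy data, and swaps in a per-dimension least-squares affine map as a stand-in for the learned model. The function `toy_imitation` and all numbers are invented for illustration. The baseline matches an imitation to the nearest machine directly; the learned variant first maps the imitation into machine-feature space using pairs from other machines, then matches.

```python
from math import dist

def fit_line(xs, ys):
    """Ordinary least squares for y ~ a*x + b on a single feature dimension."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = cov_xy / var_x
    return a, mean_y - a * mean_x

def nearest(query, candidates):
    """Index of the candidate closest to query under Euclidean distance."""
    return min(range(len(candidates)), key=lambda i: dist(query, candidates[i]))

def toy_imitation(machine):
    # Hypothetical stand-in for a human voice: a compressed range plus an offset.
    return (150.0 + 0.1 * machine[0], 0.1 + 0.5 * machine[1])

# Invented (pitch-like, noisiness-like) features for the training machines.
train_machines = [(100.0, 0.9), (400.0, 0.2), (800.0, 0.6), (1500.0, 0.4)]
train_imitations = [toy_imitation(m) for m in train_machines]

# Learn one affine map per feature dimension: imitation feature -> machine feature.
maps = [
    fit_line([v[d] for v in train_imitations], [m[d] for m in train_machines])
    for d in range(2)
]

def predict_target(imitation):
    """Map an imitation into machine-feature space with the learned affine maps."""
    return tuple(a * x + b for (a, b), x in zip(maps, imitation))

# New machines that were not seen while fitting the maps.
test_machines = [(200.0, 0.3), (1000.0, 0.8), (2000.0, 0.1)]
test_imitations = [toy_imitation(m) for m in test_machines]

# Baseline: raw distance between the imitation and each candidate machine.
baseline_hits = sum(
    nearest(v, test_machines) == i for i, v in enumerate(test_imitations)
)
# Learned: map the imitation first, then measure distance.
learned_hits = sum(
    nearest(predict_target(v), test_machines) == i
    for i, v in enumerate(test_imitations)
)

print(f"baseline: {baseline_hits}/3, learned: {learned_hits}/3")
```

On this toy data the raw-distance baseline matches only the lowest-pitched machine, because every imitation sits in the narrow range the simulated voice can reach, while the mapped imitations land on their targets. Real recordings are noisy and high-dimensional, so the gap reported in the paper should not be read off this sketch.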


May 22, 2007


