YouTube - Speech Interface
YouTube now offers open access to key parts of its video repository and user community via an open API and RSS feeds. Using the APIs, you can integrate online videos from YouTube's rapidly growing repository into your own application. Once you have created a developer profile, you are ready to start building against the YouTube API.
This project is concerned with the creation of a party video jukebox. Imagine the scene: you have all your friends around and your media center PC is blasting out an Arctic Monkeys number. That's fine, but wouldn't it be nice if anyone at the party could shout "stop the music" and then "play me 'Brown Eyed Girl' by Van Morrison"? In the background, your application, which has voice recognition built in, searches YouTube for 'Brown Eyed Girl' by Van Morrison and automatically starts playing it.
Therefore this project involves linking up a voice recognition system with the YouTube API.
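As a rough illustration of the glue between the two halves, the sketch below turns a recognised phrase into a jukebox command and a search string. It assumes the speech recogniser has already produced a plain-text transcript; the names `Command`, `parse_command` and `search_query` are hypothetical, and the actual YouTube search and playback calls are deliberately left out.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class Command:
    """A jukebox command (hypothetical structure, for illustration only)."""
    action: str                   # "play" or "stop"
    title: Optional[str] = None
    artist: Optional[str] = None


# "stop" or "stop the music"
_STOP = re.compile(r"^\s*stop(\s+the\s+music)?\s*$", re.IGNORECASE)

# "play [me] <title> [by <artist>]"
# Limitation: the title is split at the first " by ", so a title that itself
# contains "by" (e.g. "Stand by Me") would be parsed incorrectly.
_PLAY = re.compile(
    r"^\s*play(?:\s+me)?\s+(?P<title>.+?)(?:\s+by\s+(?P<artist>.+?))?\s*$",
    re.IGNORECASE,
)


def parse_command(transcript: str) -> Optional[Command]:
    """Map a recognised phrase to a Command, or None if it is not understood."""
    text = transcript.replace('"', "").strip()
    if _STOP.match(text):
        return Command("stop")
    match = _PLAY.match(text)
    if match:
        return Command("play", match.group("title"), match.group("artist"))
    return None


def search_query(cmd: Command) -> str:
    """Build the text that would be handed to a video search."""
    return " ".join(part for part in (cmd.title, cmd.artist) if part)


if __name__ == "__main__":
    cmd = parse_command('play me "Brown Eyed Girl" by Van Morrison')
    if cmd is not None and cmd.action == "play":
        print(search_query(cmd))
```

Keeping the parsing separate from both the recogniser and the search call means either side can be swapped out, for example when trying a different speech toolkit, without touching the command grammar.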
You also need to check out which speech resources will integrate nicely with your application.
Options worth evaluating include the Hidden Markov Model Toolkit (HTK), a portable toolkit for building and manipulating hidden Markov models; Wizzard Voice Tools, a voice interaction development toolset that can be applied to new or existing programs; the Java Speech API; and this List of Speech Technology systems.