YouTube - Speech Interface

Interacting with YouTube through Speech

YouTube is a popular free video sharing website which lets users upload, view, and share video clips. Videos can be rated; the average rating and the number of times a video has been watched are both published.

YouTube now offers open access to key parts of its video repository and user community, via an open API and RSS feeds. Using the APIs, you can integrate online videos from YouTube's rapidly growing repository into an application. Once you have created a developer profile, you'll be ready to tap into the power of YouTube.

This project is concerned with the creation of a party video jukebox. Imagine the scene: you have all your friends around and your media center PC is blasting out an Arctic Monkeys number. That's fine, but wouldn't it be nice if anyone at the party could shout "Stop the music, play me 'Brown Eyed Girl' by Van Morrison"? In the background, your application, which has voice recognition, runs a search for "Brown Eyed Girl" by Van Morrison and automatically starts playing it.

Therefore this project involves linking up a voice recognition system with the YouTube API.
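As a starting point, the step between the recogniser and the video search can be sketched in Python. This is only a minimal sketch under stated assumptions: it assumes the speech recogniser hands back a plain-text transcript, and the command grammar ("stop ..." and "play [me] <title> [by <artist>]"), the function names parse_command and search_url, and the use of the public YouTube results-page URL are all illustrative choices, not part of the YouTube API or of any of the speech toolkits.

```python
import re
from typing import Optional
from urllib.parse import urlencode

# Hypothetical command grammar: "play [me] <title> [by <artist>]".
# The source does not define a grammar; this one is an assumption.
PLAY_PATTERN = re.compile(
    r"^\s*play\s+(?:me\s+)?(?P<title>.+?)(?:\s+by\s+(?P<artist>.+?))?\s*$",
    re.IGNORECASE,
)


def parse_command(transcript: str) -> Optional[dict]:
    """Turn a recognised utterance into a jukebox action, or None if unknown."""
    text = transcript.strip().lower()
    if text.startswith("stop"):
        return {"action": "stop"}
    match = PLAY_PATTERN.match(text)
    if match is None:
        return None
    title = match.group("title")
    artist = match.group("artist")
    # Combine title and artist into one free-text search query.
    query = f"{title} {artist}" if artist else title
    return {"action": "play", "query": query}


def search_url(query: str) -> str:
    """Build a URL for the public YouTube results page (assumed, not the developer API)."""
    return "https://www.youtube.com/results?" + urlencode({"search_query": query})


if __name__ == "__main__":
    for utterance in ("stop the music", "play me Brown Eyed Girl by Van Morrison"):
        command = parse_command(utterance)
        print(utterance, "->", command)
        if command and command["action"] == "play":
            print(search_url(command["query"]))
```

Note that this simple pattern splits on the first " by ", so a title such as "Stand by Me" would be misread as a title plus an artist. A real jukebox would probably pass the whole phrase to the search and let the ranking sort it out.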

Task

To start, please visit the YouTube Developers site, sign up, and browse the documentation. See also how other developers are integrating YouTube.

You also need to check out which speech resources will integrate nicely with your application.

Ones to check out might include the Hidden Markov Model Toolkit (HTK), a portable toolkit for building and manipulating hidden Markov models; Wizzard Voice Tools, a voice interaction development toolset that can be applied to new or existing programs; the Java Speech API; or indeed this List of Speech Technology systems.
