Retrieves the data of the media
A uniform resource location for the media
Retrieves the contents of the visual media. Content should be yielded as soon as it is identified.
The transcription of the audio. Should keep yielding the built up transcription as it is known from the media
This input provides video input