No. The analysis focuses on contextual elements that support content processing, not exhaustive object classification.
No. Visual context analysis augments audio transcription by providing additional signals.
No. All visual analysis is performed locally.