Posted inlearn
Enhancing Video Understanding with Contrastive Learning and Large Language Models
Video-language pre-training has significantly advanced video understanding tasks. While previous efforts primarily concentrated on short-form videos and sentence pairings, the domain of long-form video-language pre-training remains largely unexplored. Understanding long-form…