Semantic Pooling for Complex Event Analysis in Untrimmed Videos

Abstract

This paper introduces a novel semantic pooling method for event analysis tasks like detection, recognition, and recounting in long untrimmed Internet videos. Using semantic saliency, the approach ranks video shots to prioritize the most relevant ones, improving the classifier’s accuracy. The paper proposes a nearly-isotonic SVM classifier, validated with experiments on real-world datasets, showcasing significant performance improvements.

Publication
IEEE Transactions on Pattern Analysis and Machine Intelligence