The arrival of the digital age has created the need to be able to store, manage, and digitally use an ever-increasing amount of video and audio material. Thus, video cataloguing has emerged as a requirement of the times. Video Cataloguing: Structure Parsing and Content Extraction explains how to efficiently perform video structure analysis as well as extract the basic semantic contents for video summarization, which is essential for handling large-scale video data.
This book addresses the issues of video cataloguing, including video structure parsing and basic semantic word extraction, particularly for movie and teleplay videos. It starts by providing readers with a fundamental understanding of video structure parsing. It examines video shot boundary detection, recent research on video scene detection, and basic ideas for semantic word extraction, including video text recognition, scene recognition, and character identification.
The book lists and introduces some of the most commonly used features in video analysis. It introduces and analyzes the most popular shot boundary detection methods and also presents recent research on movie scene detection as another important and critical step for video cataloguing, video indexing, and retrieval.
The authors propose a robust movie scene recognition approach based on a panoramic frame and representative feature patch. They describe how to recognize characters in movies and TV series accurately and efficiently as well as how to use these character names as cataloguing items for an intelligent catalogue.
The book proposes an interesting application of highlight extraction in basketball videos and concludes by demonstrating how to design and implement a prototype system of automatic movie and teleplay cataloguing (AMTC) based on the approaches introduced in the book.
Introduction to Movie and Teleplay Cataloguing
Related Research State and Progress
Main Research Work
Visual Features Extraction
Scale-Invariant Feature Transform
Histogram of Oriented Gradients (HOG)
Maximally Stable Extremal Regions
Local Binary Pattern (LBP)
Accelerating Shot Boundary Detection
Frame Difference Calculation
Temporal Redundant Frame Reduction
Corner Distribution-Based MCFB Removal
Key Frame Extraction
Size of the Key Frame Set
Categories of Key Frame Extraction Methods
Key Frame Extraction Using a Panoramic Frame
Multimodality Movie Scene Detection
KCCA and Feature Fusion-Based Method
Experiment and Results
Video Text Detection and Recognition
Implementation of Video Text Recognition
‘‘Where" Entity: Video Scene Recognition
Representative Feature Patch Extraction
Scene Classification Using Latent Dirichlet Analysis
Enhanced Recognition Based on VSC Correlation
‘‘Who" Entity: Character Identification
Overview of Adaptive Learning
Adaptive Learning with Related Samples
Audiovisual Information-Based Highlight Extraction
Unrelated Scene Removal
Demo System of Automatic Movie or Teleplay Cataloguing
General Design of the Demo