A Python application that combines semantic analysis (CLIP) and object detection (YOLOv5) to extract and analyze key frames from video footage. This project provides a sophisticated video analysis ...