Read local video files and perform object detection and text recognition frame by frame
Support pause/resume functionality, controllable via on-screen buttons or the spacebar
Automatically output ...
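
The pause/resume behavior listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: `PlaybackState`, `run_playback`, `SPACE_KEY`, and `QUIT_KEY` are hypothetical names, and the frame source, per-frame processing, and key polling are passed in as plain callables so the sketch runs without any video library. In a real player the frames would likely come from something like OpenCV's `cv2.VideoCapture`, and the key codes from `cv2.waitKey`, but that wiring is an assumption here.

```python
from typing import Any, Callable, Iterable, List, Optional

SPACE_KEY = 32        # ASCII code of the spacebar
QUIT_KEY = ord("q")   # hypothetical quit key, not mentioned in the feature list


class PlaybackState:
    """Holds the pause/run flags shared by keyboard and button handlers."""

    def __init__(self) -> None:
        self.paused = False
        self.running = True

    def toggle_pause(self) -> None:
        # An on-screen button callback could call this directly.
        self.paused = not self.paused

    def handle_key(self, key: int) -> None:
        if key == SPACE_KEY:
            self.toggle_pause()
        elif key == QUIT_KEY:
            self.running = False


def run_playback(
    frames: Iterable[Any],
    process_frame: Callable[[Any], Any],
    poll_key: Callable[[], Optional[int]],
    state: Optional[PlaybackState] = None,
) -> List[Any]:
    """Process frames one at a time, skipping work while paused."""
    if state is None:
        state = PlaybackState()
    results: List[Any] = []
    frame_iter = iter(frames)
    while state.running:
        key = poll_key()          # returns None when no key was pressed
        if key is not None:
            state.handle_key(key)
        if not state.running:
            break
        if state.paused:
            continue              # keep polling input, do not advance the video
        try:
            frame = next(frame_iter)
        except StopIteration:
            break                 # end of video
        # Stand-in for object detection / text recognition on this frame.
        results.append(process_frame(frame))
    return results
```

Keeping the pause flag in one small state object means the spacebar handler and an on-screen button can both call `toggle_pause()` without knowing about each other. While paused, the loop keeps polling for input but does not read the next frame, so no frames are skipped on resume. A real loop would normally block briefly while polling (for example with a short wait) rather than spin.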