The project develops a dense video captioning method that combines a neural network for classification, YOLO for object detection, key frame extraction, and abstractive summarization. It targets a classroom environment, with the goal of mapping the state of the students in the classroom.
Abraham Marquez Meza, a01651150@tec.mx
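
The description names key frame extraction as one stage of the pipeline but does not say how it is done. As a minimal sketch, assuming a simple frame-differencing strategy (not necessarily the one used in this project), a frame is kept when it differs enough from the last kept frame. The names `Frame`, `mean_abs_diff`, `select_key_frames`, and the `threshold` value are hypothetical and chosen only for illustration; frames are represented as flattened grayscale intensities so the example runs without any video library.

```python
from typing import List, Sequence

# Hypothetical representation: one grayscale frame flattened to pixel intensities.
Frame = Sequence[float]


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Mean absolute per-pixel difference between two equally sized frames."""
    if len(a) != len(b):
        raise ValueError("frames must have the same number of pixels")
    if not a:
        return 0.0
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def select_key_frames(frames: Sequence[Frame], threshold: float = 10.0) -> List[int]:
    """Return indices of frames that differ from the last key frame by more than threshold.

    Assumption: frame differencing stands in for whatever key frame
    extraction method the project actually uses.
    """
    if not frames:
        return []
    key_indices = [0]  # the first frame is always kept as a reference
    last_key = frames[0]
    for i in range(1, len(frames)):
        if mean_abs_diff(frames[i], last_key) > threshold:
            key_indices.append(i)
            last_key = frames[i]  # later frames are compared against this one
    return key_indices
```

In a full pipeline, the selected indices would identify the frames passed on to object detection and classification, so that near-duplicate frames are not processed repeatedly. The threshold would need tuning on real classroom footage.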