📊💡Dataset for video analysis
CinePile is a question and answer based video understanding dataset. It was created using advanced large language models (LLMs). The dataset includes approximately 300,000 data points for training and 5,000 data points for testing.
Each row in the dataset consists of a question (dtype: string), five answer choices (dtype: list), and an answer_key (dtype: string). The auxiliary columns store the movie title, movie genre, video clip titles, etc.
To load a dataset via Python script, you can use the following command:
from datasets import load_dataset
dataset = load_dataset("tomg-group-umd/cinepile")
Обсуждение 0
Обсуждение не доступно в веб-версии. Чтобы написать комментарий, перейдите в приложение Telegram.
Обсудить в Telegram