Big Data Science
@bdscience
19.06.2024 20:59
📊💡Dataset for video analysis

CinePile is a question-answering dataset for video understanding, created with the help of large language models (LLMs). It includes approximately 300,000 data points for training and 5,000 for testing.

Each row in the dataset consists of a question (dtype: string), five answer choices (dtype: list), and an answer_key (dtype: string). The auxiliary columns store the movie title, movie genre, video clip titles, etc.
To load the dataset in a Python script, you can use the following code:
from datasets import load_dataset
dataset = load_dataset("tomg-group-umd/cinepile")
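
Once a row is available, its question and answer choices can be rendered as a lettered multiple-choice prompt. The snippet below is only a minimal sketch: the column name "choices", the helper format_question, and the sample row are assumptions made up for illustration and are not taken from the dataset itself.

```python
from string import ascii_uppercase


def format_question(row: dict) -> str:
    """Render one row as a lettered multiple-choice prompt.

    Assumes the row has a "question" string and a "choices" list
    (the name "choices" is an assumption, check the dataset card).
    """
    lines = [row["question"]]
    # Pair each answer choice with a letter: A, B, C, ...
    for letter, choice in zip(ascii_uppercase, row["choices"]):
        lines.append(f"{letter}) {choice}")
    return "\n".join(lines)


# Hypothetical row, invented for illustration (not real CinePile data)
sample_row = {
    "question": "What does the character do after entering the room?",
    "choices": [
        "Sits down",
        "Opens the window",
        "Leaves",
        "Makes a call",
        "Turns on the light",
    ],
    "answer_key": "Opens the window",
}

print(format_question(sample_row))
```

With the real dataset, the same helper could presumably be applied to a loaded row, for example dataset["train"][0], assuming the training split is named "train" and the column names match.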
huggingface.co
tomg-group-umd/cinepile · Datasets at Hugging Face
