Big Data Science

@bdscience

23.08.2024 20:59

⚡️InfinityMATH: A Scalable Instruction Tuning Dataset for Programmatic Mathematical Reasoning

The InfinityMATH construction pipeline separates the numbers from math problems and synthesizes number-independent programs. Because each program works for any numerical values, the dataset can be scaled efficiently and flexibly with minimal dependence on the specific numbers in the original problems.
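To make the idea of a number-independent program concrete, here is a minimal Python sketch. It is an illustration only, not the authors' pipeline: the problem template, the `solve` function and the `make_example` helper are all hypothetical names invented for this example.

```python
import random

# Hypothetical template: the concrete numbers of a word problem
# are replaced by named placeholders.
TEMPLATE = (
    "A shop sells pencils at {price} dollars each. "
    "How much do {count} pencils cost?"
)

def solve(price: int, count: int) -> int:
    """Number-independent program: valid for any placeholder values."""
    return price * count

def make_example(rng: random.Random) -> dict:
    """Sample fresh numbers and pair the filled-in question with its computed answer."""
    values = {"price": rng.randint(1, 20), "count": rng.randint(2, 50)}
    return {"question": TEMPLATE.format(**values), "answer": solve(**values)}

# One template plus one program yields as many training examples as needed.
rng = random.Random(0)
examples = [make_example(rng) for _ in range(3)]
for example in examples:
    print(example["question"], "->", example["answer"])
```

Since the answer is computed by the program rather than stored with the problem, new question and answer pairs can be generated without re-annotating anything.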

As the authors note in their paper, fine-tuning experiments with open-source language and code models such as Llama2 and CodeLlama demonstrate the practical benefits of the InfinityMATH dataset.

In addition, the fine-tuned models proved highly robust on GSM8K+ and MATH+, versions of the GSM8K and MATH benchmarks in which only the numerical values have been slightly changed.
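The post does not describe how the numerical variations were produced, so the following Python sketch only illustrates the general idea of a number-perturbed test question. The `perturb_numbers` helper and the sample question are assumptions made for this example.

```python
import re

def perturb_numbers(question: str, delta: int = 1) -> str:
    """Return the question with every integer shifted by delta (hypothetical helper)."""
    return re.sub(r"\d+", lambda m: str(int(m.group()) + delta), question)

original = "Anna has 12 stickers and gives away 5. How many are left?"
variant = perturb_numbers(original)
print(variant)  # Anna has 13 stickers and gives away 6. How many are left?
```

A model that has memorized the answer to the original question would fail on the variant, while a model that actually reasons about the problem should solve both.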

📊Dataset
📖Research paper

huggingface.co

Paper page - InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning

