avatar
Big Data Science
@bdscience
23.08.2024 20:59
⚡️A Scalable Dataset for Tuning Instructions in Software Mathematical Reasoning

The Mathematical Reasoning pipeline emphasizes separating numbers from mathematical problems to synthesize number-independent programs, enabling efficient and flexible scaling while minimizing dependence on specific numerical values.

As the authors note in their paper, experiments in fine-tuning open-source language and code models such as Llama2 and CodeLlama demonstrate the practical benefits of the InfinityMATH dataset.

In addition, these models have shown high reliability on the GSM8K+ and MATH+ benchmarks, which are improved versions of the benchmarks with minor changes to the numerical values.

📊Dataset
📖Research paper
huggingface.co
Paper page - InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning
Join the discussion on this paper page
👍 1
1.7K

Обсуждение 0

Обсуждение не доступно в веб-версии. Чтобы написать комментарий, перейдите в приложение Telegram.

Обсудить в Telegram