Open-Omega-Atom-1.5M Mathematical and Scientific Reasoning Dataset
Date
Size
License
Apache 2.0
Open-Omega-Atom-1.5M is a mathematics and science reasoning dataset designed to enhance reasoning capabilities in mathematics and science.
The dataset contains about 1.5 million data and is designed for mathematics, science, and code applications, with mathematical data accounting for an important part of its composition.
Dataset features:
- Concise, high quality: Focus on clear, challenging problems and step-by-step solutions.
- STEM Emphasis: Integrate math, reasoning with code, and scientific thinking with a math major.
- Curated and optimized: Data is selectively sourced from high-quality open datasets and custom data to achieve optimal diversity and coherence.
- Good for reasoning: Has strong coverage of step-based and logic-based problem solving, serving as a baseline for reasoning engines.
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.