awesome-deepseek/docs/curator
Shreyas Pimpalgaonkar 1547c531a2 add curator
2025-01-27 11:11:52 -08:00


Curator

Curator is an open-source tool for curating large-scale datasets for post-training LLMs.

Curator was used to curate Bespoke-Stratos-17k, the reasoning dataset used to train Bespoke-Stratos, a fully open reasoning model.

Curator supports:

  • Calling the DeepSeek API for scalable synthetic data curation
  • Easy structured data extraction
  • Caching and automatic recovery
  • Dataset visualization
  • Lower API costs through batch mode
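
To make the "caching and automatic recovery" item concrete, here is a minimal, standard-library-only sketch of the general idea: each response is written to disk under a hash of its prompt, so a rerun after a crash only pays for the prompts that never finished. This is an illustration of the technique, not Curator's actual implementation or API; `cached_generate`, `fake_model`, and the on-disk layout are hypothetical names chosen for this example.

```python
import hashlib
import json
import tempfile
from pathlib import Path
from typing import Callable, List


def cached_generate(
    prompts: List[str],
    generate: Callable[[str], str],
    cache_dir: Path,
) -> List[str]:
    """Return one response per prompt, reusing responses already saved on disk.

    Hypothetical helper for illustration only; it is not part of Curator.
    """
    cache_dir.mkdir(parents=True, exist_ok=True)
    responses = []
    for prompt in prompts:
        # The cache key depends only on the prompt text.
        key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
        path = cache_dir / f"{key}.json"
        if path.exists():
            # Cache hit: an earlier (possibly interrupted) run already did this call.
            saved = json.loads(path.read_text(encoding="utf-8"))
            responses.append(saved["response"])
            continue
        response = generate(prompt)
        # Persist immediately so a later failure loses no finished work.
        record = {"prompt": prompt, "response": response}
        path.write_text(json.dumps(record), encoding="utf-8")
        responses.append(response)
    return responses


calls = []  # records every prompt that actually reaches the stand-in model


def fake_model(prompt: str) -> str:
    """Stand-in for a real API call (assumption: a real call would go here)."""
    calls.append(prompt)
    return prompt.upper()


cache = Path(tempfile.mkdtemp())
first = cached_generate(["a", "b"], fake_model, cache)
# Second run repeats two prompts and adds one; only the new prompt is generated.
second = cached_generate(["a", "b", "c"], fake_model, cache)
```

After both runs, `calls` contains each prompt exactly once, which is the property that makes an interrupted curation job cheap to resume.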

Calling the DeepSeek API with Curator:

[image: example of calling the DeepSeek API with Curator]

Get Started here