Dataset 1.0

Softstar-100K Dataset

A curated 100K-sample dataset of high-quality instruction-following data. Cleaned, deduplicated, and formatted for fine-tuning open-weight language models.

โ†“ 18,700 downloads
License: CC-BY-4.0
Updated: May 2026
// DOWNLOAD
softstar-100k-v1.0.jsonl
SIZE2.1 GB
FORMATjsonl
LICENSECC-BY-4.0
๐Ÿ”’ Sign in to download โ€” it's free for registered members.