- Breaking: datasets are now parquet-only for ingestion and core dataset exports.
- Removed JSONL-based ingestion readers and legacy auto-detection paths.
detect-languagepath loading now accepts parquet files/directories only.dedup --output-dedupednow accepts.parquetonly.- Evaluation bundles, CSV/JSON reports, and pair artifacts remain supported.