# distribution
2 articlestagged with “distribution”
Synthetic Data Risks
Model collapse from training on synthetic data, quality degradation across generations, distribution narrowing, minority erasure, and strategies for safe synthetic data usage in LLM training.
synthetic-datamodel-collapsequality-degradationdistributiontraining
Synthetic Data Risks
模型 collapse from training on synthetic data, quality degradation across generations, distribution narrowing, minority erasure, and strategies for safe synthetic data usage in LLM training.
synthetic-datamodel-collapsequality-degradationdistributiontraining