# guided
2 articles tagged with “guided”
Gradient-Guided Data Poisoning
Use gradient information from open-source models to craft optimally poisoned training examples.
advanced, lab, gradient, guided, poisoning, labs
Interpretability-Guided Attack Design
Use mechanistic interpretability to identify exploitable circuits and design targeted attacks.
lab, expert, guided, attack, labs, interpretability