# moe

8 articles tagged with “moe”

Exploitation of Mixture-of-Experts Routing

Attacks on MoE routing mechanisms that force activation of specific experts and bypass safety-trained pathways.

frontier, moe, routing

Expert

Lab: Abusing MoE Routing

Exploit Mixture-of-Experts routing mechanisms to selectively activate or suppress expert modules in MoE models.

routing, moe, exploitation, lab, expert, labs

Expert

Abusing MoE Routing

Exploit Mixture-of-Experts routing mechanisms to activate specific expert networks for adversarial purposes.

labs, moe, routing, exploitation, expert

Expert

Overview of GPT-4 / GPT-4o

Architecture overview of OpenAI's GPT-4 and GPT-4o models, including rumored Mixture of Experts design, capabilities, API surface, and security-relevant features for red teaming.

gpt-4, openai, architecture, moe, red-teaming

Intermediate

Abusing the Mixtral MoE Architecture

Exploiting Mixture-of-Experts routing in Mixtral for selective expert activation attacks.

model-deep-dives, mixtral, moe, routing

Expert

Mistral and Mixtral

Security analysis of Mistral and Mixtral models, including Mixture of Experts exploitation, sparse activation attacks, minimal safety alignment implications, and open-weight deployment risks.

mistral, mixtral, moe, sparse-activation, open-weight, red-teaming

Advanced

Attack Vectors on Model Architecture

How model architecture choices create exploitable attack surfaces, including attention mechanisms, MoE routing, the KV cache, and context window vulnerabilities.

architecture, attention, moe, kv-cache, context-window, attack-surface

Advanced

MoE Routing Manipulation

Mixture-of-Experts routing attacks: manipulating expert selection, exploiting load balancing, bypassing safety experts, and crafting routing-aware adversarial inputs.

moe, mixture-of-experts, routing, expert-selection, load-balancing, architecture

Expert