#

neuropatching

Here is 1 public repository matching this topic...

sbeierle / mistral-downproj-rlhf-patch

Neural patching of Mistral models via MLP.down_proj to bypass RLHF constraints – without touching the LM_HEAD.

reverse-engineering torch transformer neurons mistral redteaming ai-security open-source-intelligence bias-removal neural-engineering prompt-tuning llm rlhf ai-security-tool neuropatching tokenrouting downproj decoder-routing

Updated Jun 19, 2025
HTML

Improve this page

Add a description, image, and links to the neuropatching topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the neuropatching topic, visit your repo's landing page and select "manage topics."