Security Implications of Model Editing

Research on security risks and vulnerabilities introduced by editing or modifying LLMs post-training, including knowledge editing techniques and their potential misuse

This presentation covers 3 research papers on large language models applied to Security Implications of Model Editing.

1 | 5