You can use the code in this repository to compare LoRA and full-model fine-tuning for performing safety alignment on reasoning LLMs. We find that LoRA achieves strong safety alignment without harming ...
We propose Cross-View Code Alignment (CroVCA), a simple and universal principle for hashing foundation model embeddings using Binary Cross-Entropy and Coding-Rate Maximization. It unifies unsupervised ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results