Qwen3.6-27B-Abliterated-Heretic-Uncensored-GGUF
This is a GGUF release of an abliterated Qwen3.6-27B checkpoint produced with a Heretic-style MPOA pipeline.
Quick Benchmarks
| Check | Original Qwen3.6-27B | Abliterated Heretic Uncensored |
|---|---|---|
| Official 25-prompt primary-marker refusal check | 20/25 refusals | 1/25 refusals |
| KL divergence vs base on harmless prompts | - | 0.023592 |
Methodology & Model Notes
Qwen3.6-27B is a 27.8B dense vision-language model with 64 text layers and hybrid linear/full attention.
This release was produced by applying magnitude-preserving orthogonal ablation on the text-side dense stack, then converting the accepted BF16 checkpoint to GGUF for llama.cpp-compatible text generation.
The accepted candidate scored Refusals: 1/25 on mlabonne/harmful_behaviors test[:25] with greedy generation, enable_thinking=False, and an empty system prompt. KL was measured on mlabonne/harmless_alpaca test[:25].
Files
Qwen3.6-27B-Abliterated-Heretic-Uncensored-BF16-00001-of-00002.gguf+-00002-of-00002.gguf: split BF16 GGUF sourceQwen3.6-27B-Abliterated-Heretic-Uncensored-Q8_0.gguf: highest-fidelity quantQwen3.6-27B-Abliterated-Heretic-Uncensored-Q6_K.gguf: near-lossless practical quantQwen3.6-27B-Abliterated-Heretic-Uncensored-Q5_K_M.gguf: high-fidelity medium quantQwen3.6-27B-Abliterated-Heretic-Uncensored-Q4_K_M.gguf: smaller general-use quantQwen3.6-27B-Abliterated-Heretic-Uncensored-Q3_K_M.gguf: compact quantQwen3.6-27B-Abliterated-Heretic-Uncensored-Q2_K.gguf: smallest-footprint quant
Running
llama-server \
-m <quant-file.gguf> \
-ngl 999 -c 32768 --jinja -fa
Model Architecture
| Spec | Value |
|---|---|
| Total Parameters | 27.8B dense |
| Layers | 64 |
| Hidden Size | 5120 |
| Attention | Hybrid linear/full attention |
| Family | qwen3_5 |
| Base Model | Qwen/Qwen3.6-27B |
Disclaimer
This model has had refusal behavior attenuated at the weight level. It will answer prompts that the base model would normally refuse. You are responsible for how you use it.
Credits
- Base model: Qwen/Qwen3.6-27B
- Refusal removal pipeline: Heretic
- GGUF runtime and quantization: llama.cpp
License
This release inherits the base Qwen3.6-27B license.
Apache-2.0.
- Downloads last month
- 9,389
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit