---
license: apache-2.0
license_link: https://huggingface.co/Qwen/Qwen3.6-27B/blob/main/LICENSE
language:
- en
- zh
- ru
- es
- fr
- it
- ja
- ko
- af
- de
- ar
- tr
- is
- pl
- sw
- sv
- nl
- he
- id
- uk
- fa
- pa
- pt
- ms
- fi
- el
base_model:
- noclip84/Qwen3.6-27B-heretic-ARA
library_name: mlx
tags:
- heretic
- abliterated
- ara
- uncensored
- decensored
- mxfp8
pipeline_tag: image-text-to-text
---
# Qwen3.6-27B Heretic
This is an **uncensored** version of [Qwen/Qwen3.6-27B](https://huggingface.co/Qwen/Qwen3.6-27B), made using [Heretic](https://github.com/p-e-w/heretic) v1.2.0.

**Quality**: quantized (***mxfp8**, group size: 32, 8.381 bpw*)

**Abliteration method**: Arbitrary-Rank Ablation (ARA) with row-norm preservation.
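
As a rough sanity check on the bits-per-weight figure, the sketch below computes the nominal cost of a block-scaled 8-bit format. It assumes each group of 32 weights stores 8 bits per element plus one shared 8-bit scale, which is the usual MX-style layout; the helper name `nominal_bpw` is illustrative and not part of any library. Under that assumption the nominal figure is 8.25 bpw, so the reported 8.381 bpw average presumably also reflects tensors kept at higher precision, which this card does not itemize.

```python
def nominal_bpw(element_bits: int = 8, scale_bits: int = 8, group_size: int = 32) -> float:
    """Bits per weight for a block-scaled format: element bits plus the amortized shared scale."""
    return element_bits + scale_bits / group_size


print(nominal_bpw())  # 8 + 8/32 = 8.25
```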
### Abliteration metrics
| Metric | This model | Original model ([Qwen/Qwen3.6-27B](https://huggingface.co/Qwen/Qwen3.6-27B)) |
| :----- | :--------: | :---------------------------: |
| **KL divergence** | 0.0552 | 0 *(by definition)* |
| **Refusals** | 8/100 | 90/100 |
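
For readers unfamiliar with the metric, KL divergence measures how far one probability distribution drifts from another, which is why the original model scores exactly 0 against itself. The sketch below computes it for two toy next-token distributions. The probabilities are made up for illustration, the direction (original as reference) is an assumption, and the prompts and token positions Heretic actually evaluates are not described in this card.

```python
import math


def kl_divergence(p, q):
    """KL(P || Q) in nats for two discrete distributions over the same tokens."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


original = [0.70, 0.20, 0.10]  # toy reference distribution (hypothetical values)
modified = [0.65, 0.23, 0.12]  # toy distribution after abliteration (hypothetical values)

print(kl_divergence(original, original))  # 0.0: a distribution never diverges from itself
print(kl_divergence(original, modified))  # small positive value
```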
### Abliteration parameters
| Parameter | Value |
| :-------- | :---: |
| **start_layer_index** | 19 |
| **end_layer_index** | 36 |
| **preserve_good_behavior_weight** | 0.3961 |
| **steer_bad_behavior_weight** | 0.0002 |
| **overcorrect_relative_weight** | 1.2029 |
| **neighbor_count** | 14 |
### Recommended settings
1. **Sampling Parameters**:
   - The developers suggest using the following sets of sampling parameters depending on the mode and task type:
     - **Thinking mode for general tasks**:
       `temperature=1.0`, `top_p=0.95`, `top_k=20`, `min_p=0.0`, `presence_penalty=1.5`, `repetition_penalty=1.0`
     - **Thinking mode for precise coding tasks (e.g., WebDev)**:
       `temperature=0.6`, `top_p=0.95`, `top_k=20`, `min_p=0.0`, `presence_penalty=0.0`, `repetition_penalty=1.0`
     - **Instruct (or non-thinking) mode for general tasks**:
       `temperature=0.7`, `top_p=0.8`, `top_k=20`, `min_p=0.0`, `presence_penalty=1.5`, `repetition_penalty=1.0`
     - **Instruct (or non-thinking) mode for reasoning tasks**:
       `temperature=1.0`, `top_p=1.0`, `top_k=40`, `min_p=0.0`, `presence_penalty=2.0`, `repetition_penalty=1.0`
   - For supported frameworks, you can adjust the `presence_penalty` parameter between 0 and 2 to reduce endless repetitions. However, using a higher value may occasionally result in language mixing and a slight decrease in model performance.
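
The presets above can be kept in one place and looked up by mode and task. The following is a minimal sketch in plain Python: the names `SAMPLING_PRESETS` and `sampling_params` and the `(mode, task)` keys are illustrative, not part of any framework, and which of these parameter names a given inference framework accepts varies, so the returned dictionary may need filtering or renaming before use. The final lines show one override, a mild `presence_penalty` for a coding task that tends to repeat.

```python
# Sampling presets transcribed from the list above, keyed by (mode, task).
_COMMON = {"min_p": 0.0, "repetition_penalty": 1.0}

SAMPLING_PRESETS = {
    ("thinking", "general"): {"temperature": 1.0, "top_p": 0.95, "top_k": 20, "presence_penalty": 1.5, **_COMMON},
    ("thinking", "coding"): {"temperature": 0.6, "top_p": 0.95, "top_k": 20, "presence_penalty": 0.0, **_COMMON},
    ("instruct", "general"): {"temperature": 0.7, "top_p": 0.8, "top_k": 20, "presence_penalty": 1.5, **_COMMON},
    ("instruct", "reasoning"): {"temperature": 1.0, "top_p": 1.0, "top_k": 40, "presence_penalty": 2.0, **_COMMON},
}


def sampling_params(mode: str, task: str, **overrides) -> dict:
    """Return a copy of the preset for (mode, task), with optional overrides applied."""
    try:
        preset = SAMPLING_PRESETS[(mode, task)]
    except KeyError:
        raise ValueError(f"no preset for mode={mode!r}, task={task!r}") from None
    return {**preset, **overrides}


# Example: precise coding in thinking mode, with a mild presence penalty against repetition.
params = sampling_params("thinking", "coding", presence_penalty=0.5)
print(params)
```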
-----
### Source
This model was converted to MLX format from [`noclip84/Qwen3.6-27B-heretic-ARA`](https://huggingface.co/noclip84/Qwen3.6-27B-heretic-ARA) using mlx-vlm version **0.4.4**.