---
license: apache-2.0
license_link: https://huggingface.co/Qwen/Qwen3.6-27B/blob/main/LICENSE
language:
- en
- zh
- ru
- es
- fr
- it
- ja
- ko
- af
- de
- ar
- tr
- is
- pl
- sw
- sv
- nl
- he
- id
- uk
- fa
- pa
- pt
- ms
- fi
- el
base_model:
- noclip84/Qwen3.6-27B-heretic-ARA
library_name: mlx
tags:
- heretic
- abliterated
- ara
- uncensored
- decensored
- mxfp8
pipeline_tag: image-text-to-text
---
<div align="center"><img width="400px" src="https://qianwen-res.oss-accelerate.aliyuncs.com/Qwen3.6/logo.png"></div>
<div style="text-align:center; margin-bottom:12pt">If you like my work, you can <a href="https://donatr.ee/thecluster/">support me</a><br/></div>

# Qwen3.6-27B Heretic

This is an **uncensored** version of [Qwen/Qwen3.6-27B](https://huggingface.co/Qwen/Qwen3.6-27B), made using [Heretic](https://github.com/p-e-w/heretic) v1.2.0.

**Quality**: quantized (***mxfp8**, group size: 32, 8.381 bpw*)

**Abliteration method**: Arbitrary-Rank Ablation (ARA) with row-norm preservation.

### Abliteration metrics
| Metric | This model | Original model ([unsloth/Qwen3.6-27B](https://huggingface.co/Qwen/Qwen3.6-27B)) |
| :----- | :--------: | :---------------------------: |
| **KL divergence** | 0.0552 | 0 *(by definition)* |
| **Refusals** | 8/100 | 90/100 |

### Abliteration parameters

| Parameter | Value |
| :-------- | :---: |
| **start_layer_index** | 19 |
| **end_layer_index** | 36 |
| **preserve_good_behavior_weight** | 0.3961 |
| **steer_bad_behavior_weight** | 0.0002 |
| **overcorrect_relative_weight** | 1.2029 |
| **neighbor_count** | 14 |

### Recommended settings

1. **Sampling Parameters**:  
   - The developers suggest using the following sets of sampling parameters depending on the mode and task type:  
     - **Thinking mode for general tasks**:  
       `temperature=1.0`, `top_p=0.95`, `top_k=20`, `min_p=0.0`, `presence_penalty=1.5`, `repetition_penalty=1.0`  
     - **Thinking mode for precise coding tasks (e.g., WebDev)**:  
       `temperature=0.6`, `top_p=0.95`, `top_k=20`, `min_p=0.0`, `presence_penalty=0.0`, `repetition_penalty=1.0`  
     - **Instruct (or non-thinking) mode for general tasks**:  
       `temperature=0.7`, `top_p=0.8`, `top_k=20`, `min_p=0.0`, `presence_penalty=1.5`, `repetition_penalty=1.0`  
     - **Instruct (or non-thinking) mode for reasoning tasks**:  
       `temperature=1.0`, `top_p=1.0`, `top_k=40`, `min_p=0.0`, `presence_penalty=2.0`, `repetition_penalty=1.0`  
   - For supported frameworks, you can adjust the `presence_penalty` parameter between 0 and 2 to reduce endless repetitions. However, using a higher value may occasionally result in language mixing and a slight decrease in model performance.

-----
### Source
This model was converted to MLX format from [`noclip84/Qwen3.6-27B-heretic-ARA`](https://huggingface.co/noclip84/Qwen3.6-27B-heretic-ARA) using mlx-vlm version **0.4.4**.