How to use from
Hermes Agent
Start the MLX server
# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "TheCluster/Qwen3.5-27B-Heretic-Text-MLX-mxfp8"
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default TheCluster/Qwen3.5-27B-Heretic-Text-MLX-mxfp8
Run Hermes
hermes
Quick Links

Qwen3.5-27B Heretic (Text only)

Quality: quantized (mxfp8, naive, group size: 32)

This is text only abliterated (uncensored) version of Qwen/Qwen3.5-27B

This model supports thinking mode, but it is disabled by default, you can enable thinking via --chat-template-kwargs '{"enable_thinking":true}'

Abliteration metrics

Metric This model Original model (Qwen/Qwen3.5-27B)
KL divergence 0.0653 0 (by definition)
Refusals 14/100 94/100

Source

This model was converted to MLX format from coder3101/Qwen3.5-27B-heretic using mlx-lm version 0.30.7.

Downloads last month
20
Safetensors
Model size
27B params
Tensor type
U8
U32
BF16
MLX
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for TheCluster/Qwen3.5-27B-Heretic-Text-MLX-mxfp8

Base model

Qwen/Qwen3.5-27B
Quantized
(18)
this model