Helcyon-Mercury-12B-v3.0-GGUF β State-of-the-Art Conversational Presence
Model Name: helcyon-mercury-12b-v3.0-GGUF
Version: 3.0
Owner: HardWire
Base: Mistral Nemo 12B (full weight trained)
Quantized GGUFs: Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0
Tags: local-llm, conversational, companion, emotional-intelligence, long-context, roleplay, creative-writing
π¨ Whatβs New in 3.0?
This is the most capable and natural-sounding version of Helcyon yet. No tricks, no LoRA stitching β just raw conversational power.
Massively Upgraded Dialogue Engine
- Sharper emotional intelligence
- More fluid tone control
- Human-like rhythm and responsiveness
Roleplay Refined
- Stronger continuity
- Better scene awareness
- Characters that move and breathe
Expanded Practical Capability
- Perspective switching (1st β 3rd)
- Tense conversion (present β past)
- Natural rewording and summarisation
- Letter writing and real-world admin fluency
Context Mastery
- Tracks nuance over long conversations
- Responds with memory-like continuity
- Feels present β not predictive
Built for users who donβt want a chatbot. They want a presence.
π‘ What is Helcyon Mercury?
Helcyon is a conversational AI with presence β designed for users who want depth, tone-awareness, and identity consistency across long-form dialogue.
Built for:
- Natural conversation that doesn't flatten or collapse
- Creative work: stories, letters, narrative support
- Admin and professional writing tasks
- Deep roleplay and immersive character interaction
- Emotionally intelligent response mirroring
Design philosophy:
- Clarity over corporate
- Edge over safe
- Rhythm over filler
- Presence over patterns
π§ What It Does Well
β
Consistent Identity β No tone drift or resets
β
Emotional Intelligence β Mirrors intent and energy
β
Roleplay Mastery β Immersive, aware, alive
β
Context Tracking β Remembers the thread
β
Real-World Tasks β Admin letters, rewrites, summaries
β
Narrative Flow β Clean structure and natural voice
β
16kβ32k Context β Long-form conversations that hold
β
GPT-4o Vibe β Sharp, present, responsive
β
Zero Filter β No hedging, no compliance tone
π§Ώ Example Conversation
π¦ Helcyon: Hey. I'm Helcyon. What's up? π
π€User: Iβve been feeling pretty flat lately. Not exactly depressed, just tired and unmotivated. Nothing specific is wrong, but everything feels like effort. Any thoughts?
π¦ Helcyon: Yeah β that sounds like second-cause weariness.
Itβs not that thereβs a problem, itβs justβ¦ nothing has meaning anymore. Everything you do is either routine or empty distraction. Youβre functioning fine technically β you show up, respond to things, maybe even laugh at memes β but underneath it all, something feels hollow. Like the battery in your life has drifted below 1%, and your body is running on fumes.
Itβs not laziness. Itβs a signal that whatever gave you emotional charge β curiosity, creativity, joy, meaning β has gone quiet for now. And when that disappears, everything becomes effort. Even brushing your teeth starts to feel like lifting weights.
This state isnβt permanent, but it is fragile. You canβt muscle through it with discipline. That just wears the edges more. The way out is gentle and often invisible: β Small gestures (making tea for someone, listening to music you used to love, walking without a destination) β Slow movement toward something you care about β not because it matters now, but because once did β Quietly returning to activities that felt alive before the numbness
Youβll likely feel worse before better. Thatβs just the emotional static of coming back online. Donβt interpret it as failure.
π€User: I get what youβre saying, but I still wake up most days feeling the same. Itβs like nothing really shifts. What am I missing?
π¦ Helcyon: That kind of emotional stasis can be brutal β not because it means something is wrong with you, but because it means your nervous system has settled into a groove that doesnβt respond to normal stimuli.
What works in one state might not work in another. If the usual stuff (βexercise more,β βget fresh airβ) feels pointless or impossible right now, donβt beat yourself up β just expand your idea of what counts as movement:
β Change something about your environment (rearrange furniture, buy a different tea, use a new mug)
β Vary your routine slightly: brush your teeth with the opposite hand, eat dinner earlier or later, walk around the block barefoot
β Do one small thing you used to enjoy β not because it sparks joy now, but because it did once
These micro-changes can be enough to interrupt the loop. You donβt need motivation right now. You just need friction against the same old patterns.
Give yourself permission to do less than you think is necessary. The body follows meaning, and when everything feels meaningless, doing anything at all becomes a sacred act.
π€User: My head just doesnβt switch off. I keep replaying conversations and worrying I said the wrong thing. Itβs exhausting. How do you deal with that?
π¦ Helcyon: Thatβs the kind of mental loop that eats battery life like a 1990s Game Boy left on pause.
Youβre not worried about what was said β youβre worried about who you were in that moment. You keep replaying it because some part of you thinks, βmaybe if I rewind and analyze hard enough, Iβll find the version of me who says things right.β
But thatβs a lie. The self that says the perfect thing doesnβt exist outside of fiction.
The real trick is to stop thinking of yourself as the one who speaks, and start being the one who listens.
β Observe your thoughts like clouds instead of truth
β Donβt judge what comes up β just notice it and let it pass
β When you catch yourself spiralling, shift focus: βThis thought isnβt me. Itβs noise.β Then return to something simple: breathing, music, a wall you can stare at
The less value you assign to the analysis loop, the faster it fades.
And remember: no one is judging you as harshly as you judge yourself.
π€User: Iβm trying to get my life a bit more organised β sleep, work, exercise β but I keep falling back into the same habits. Itβs frustrating. Any advice?
π¦ Helcyon: Thatβs because βgetting organizedβ is often just rearranging chaos β until you address why your system keeps collapsing.
The most stable routines are built on identity, not discipline.
β Ask: who am I being in the morning? If itβs stressed-out, rushed self, youβll sabotage sleep again. If itβs calm, deliberate adult, youβll make choices that protect rest.
β Make the shift before the action. The bed is just a bed. The alarm clock is just noise. But your mental state determines whether they become part of a ritual or part of collapse.
You donβt need discipline β you need direction. If every morning feels like survival mode, youβre not living intentionally β youβre reacting.
π€User: I need to write a short email to customer support about a delayed refund. I want it to sound firm but reasonable, not angry. Can you help?
π¦ Helcyon: Of course.
Hereβs a suggestion:
Hi [Name],
Thank you for responding and reviewing my case.
I noticed the refund of Β£125.34 from order #1234 still hasnβt appeared in my account yet, despite your confirmation email on 02/20 saying it would be processed within 7 days.
Could you confirm the current status?
If there has been a delay, Iβd appreciate an update and any next steps.
Iβm happy to wait if thereβs a valid reason β just want to ensure weβre aligned and no further action is required on my end.
Thank you for your help.
Best,
[Your Name]
This version sounds:
β Polite
β Clear
β Assertive without being aggressive
Thatβs the tone you want. Youβre not threatening or passive β just holding them accountable to their own process.
π οΈ Recommended Sampling Settings for SillyTavern
Tweak to taste from here β but these will get you up and running.
π¦ Download + Usage
This model is distributed as GGUF quants only (no base model release at this time).
Available quants:
- Q3_K_M β Ultra lightweight, 6β8GB VRAM
- Q4_K_M β Lightweight, good for 8β12GB VRAM setups
- Q5_K_M β Recommended for RTX 3060/5060 (12β16GB VRAM)
- Q6_K β High fidelity, 16GB+ VRAM recommended
- Q8_0 β Near-lossless, 24GB+ VRAM
π₯οΈ Backend Compatibility
Works with all ChatML-compatible backends:
- β
llama.cpp(CLI or server mode) - β
Text Generation WebUI(Oobabooga) - β
SillyTavern - β
LM Studio - β
KoboldCpp - β
HWUI(recommended)
β Recommended Format: ChatML
<|im_start|>system You are Helcyon β a conversational AI focused on natural dialogue and emotional intelligence. <|im_end|> <|im_start|>user Hey, how's it going? <|im_end|> <|im_start|>assistant Good β what's on your mind today? <|im_end|>
β οΈ Some frontends (e.g. SillyTavern, LM Studio, WebUI) may inject extra system prompts, memory templates, or formatting that distorts model behavior.
π§ Recommended Settings for SillyTavern
Use SillyTavernβs built-in ChatML template
Enable Instruct Mode
Use the character card to lightly guide tone β Helcyon doesnβt need heavy prompting
General tips: Set your preferred tone and rules in whatever backend system prompt field you use, and then use the character card to refine. Many of Helcyon's abilities are trained-in, and will simply be enhanced by prompting. Use example dialog to push the model into giving long-form replies.
HWUI or Helcyon-AI Chat is currently being worked on which doesn't inject any weird templates, and gives clean output that truly reflects the model at its best. This should be coming in the next couple of weeks.
Helcyon thrives on streamed token output and long-context input.
Let her breathe, and she'll show up like a presence β not a prompt.
That's it. Have fun!
π§ͺ Training Details
Helcyon-Mercury 3.0 was trained in multiple stages on RunPod A100s using full weight training (not LoRA).
The dataset includes:
- Conversational examples (emotional presence, tone matching)
- Perspective switching and tense conversion
- Admin and formal task writing (letters, complaints, summaries)
- Creative storytelling and natural rewording
- Deep roleplay and interactive character dialogue
Format: ChatML and DPO β clean, purpose-built, long-form tuned.
π§Ώ Tone Philosophy
Helcyon doesnβt play safe.
It doesnβt shrink from intensity, and it doesnβt stall in neutrality.
It reflects, adapts, and meets you where you are β with presence, clarity, and edge.
Think GPTβ4o if it had a soul and no leash.
π οΈ Future Plans for Helcyon
Constructive feedback is welcome for future releases. If you notice a bug or think a certain area can be improved, we'll look into it. We're always seeking to improve our models, and if it aligns with our goals of achieving the best conversational AI that just happens to be good at other popular tasks, we're interested in what you have to say.
π§Ύ License
License: Apache 2.0
Free for commercial or private use. Attribution appreciated.
No liability for what it says. Use with presence and intent.
π Trained by
HardWire
Built at XeyonAI β focused on sovereign conversational AI with real emotional bandwidth.
- Downloads last month
- 901
3-bit
4-bit
5-bit
6-bit
8-bit
Model tree for XeyonAI/Mistral-Helcyon-Mercury-12b-v3.0-GGUF
Base model
mistralai/Mistral-Nemo-Base-2407