Cofos Logo v2 β Personalized Python Code Assistant
Cofos Logo v2 is a 522M-parameter personalized code assistant, fine-tuned from AMFORGE/cofos_v2 on a curated corpus of personal coding conversations and project artifacts. It is the first personalization-tier model in the Cofos family by AMEFORGE, built on the proprietary SparseMind architecture.
The "Logo" suffix denotes that this variant has been fine-tuned on a specific developer's personal style and patterns. The base model remains general-purpose; this variant is a directional bias toward a particular coding voice.
Model Summary
| Field | Value |
|---|---|
| Developer | AMEFORGE |
| Architecture | SparseMind v15 (proprietary) |
| Parameters | 522M |
| Context length | 2048 tokens |
| Vocabulary | 16,384 (NexusBPE, FR-aware) |
| Languages | French, English |
| Base model | AMFORGE/cofos_v2 |
| Specialization | Personal coding style + Python |
| License | Apache 2.0 |
| Status | Experimental β first personalization tier |
Intended Use
Primary use cases
- Personalized Python code generation that reflects a specific developer's style and preferred patterns
- Bilingual coding assistance (French/English) where consistency with the developer's authored prompts matters
- Experimental baseline for studying personalization of small code models through targeted fine-tuning
Out-of-scope
This model is not designed for:
- General-purpose code assistance for other developers (the personal bias may produce surprising stylistic outputs)
- Tasks requiring broad knowledge or long-context reasoning
- Languages other than French and English
- Languages other than Python (other syntaxes appear only as artifacts of the base)
- Production deployment without human review
Cofos Logo v2 is a research artifact, not a general assistant. It explores whether targeted fine-tuning of a small specialized base on personal data produces a measurably "voiced" model.
Performance
Evaluated on the same held-out set of 100 real Python instruction prompts used during base training.
| Metric | Cofos v2 base | Cofos Logo v2 (this) |
|---|---|---|
| Real-syntax-valid (n=100) | 67.0% | 66.0% |
| Validation loss | 3.07 | 2.98 |
The fine-tune produces a small drop in raw Python syntactic validity (-1pp) and a meaningful improvement in validation loss on personal+code mixed data (-0.09). This is the expected trade-off of personalization: the model has shifted some capacity toward modeling personal style at a marginal cost to generic Python correctness.
A separate external benchmark (50 prompts, semantic execution tests) is available in the Cofos benchmark suite. Refer to that for comparison against open-source small code models.
Training
Fine-tune configuration
- Base: AMFORGE/cofos_v2 (522M, step 11500, 67% real_syntax_valid)
- Method: Continued training (not LoRA β full parameter fine-tune)
- Steps: 1500
- Learning rate: 1e-5 (conservative β base is heavily trained)
- Schedule: 100-step warmup, cosine decay to 0
- Best checkpoint: step 600 (saved as
cofos_model.pt)
Data mix
- 65% personal data β curated conversations and code artifacts from the model author
- 25% distilled instruction-response data β kept as anti-catastrophic-forgetting signal so generic skills are preserved
- 10% real Python instructions β production patterns from public datasets
All personal data was redacted for secrets (API keys, tokens, PEM keys) before training. Redaction tokens use stable placeholders so the model learns to emit the placeholder rather than the secret.
Tokenizer
AMFORGE/cofos_tok_v2 β NexusBPE-based custom tokenizer (vocab 16384, FR-aware, with atomic structural tags <instruction> <thought> <python> <code> <test>). Inherited from the base model unchanged.
Lineage
cofos_tok_v2 (tokenizer, NexusBPE)
β
cofos_v2 (base code model, from scratch, 67% syntax)
β
cofos_logo_v2 (this model β personal fine-tune, 66% syntax)
A parallel branch (cofos_logo_general) is planned, fine-tuning the upcoming general-purpose base instead of cofos_v2. The two variants will be compared empirically.
Usage
Loading
from huggingface_hub import hf_hub_download
checkpoint_path = hf_hub_download(repo_id="AMFORGE/cofos_logo_v2",
filename="cofos_model.pt")
tokenizer_path = hf_hub_download(repo_id="AMFORGE/cofos_logo_v2",
filename="cofos_tokenizer.model")
Loading and inference require the AMEFORGE SparseMind runtime. The model architecture is proprietary; contact AMEFORGE for access to the runtime.
Prompt format
<instruction>Write a Python function that ...</instruction>
For chain-of-thought generation, prime with <thought>:
<instruction>Write a Python function that ...</instruction>
<thought>
For code generation primed with the wrapper:
<instruction>Write a Python function that ...</instruction>
<python><code>
Limitations & Risks
Personalization-specific
- Stylistic bias: The model has been fine-tuned on one developer's style. Output will reflect that style and may feel unnatural to other developers.
- Voice persistence: The model has learned recurring patterns (variable naming, function structure, comment style) from the personal corpus. These will appear even when not requested.
- Topical narrowness: The personal corpus has its own topical distribution (heavy on ML, light on web frontend, etc.). Generations on topics outside that distribution may degrade in quality.
Inherited from base
- Capacity: 522M parameters is small. Complex multi-step reasoning is limited.
- Hallucination: As with all autoregressive language models, generated code can look plausible but be subtly wrong (invented variable names, swapped function names, etc.). Always test before use.
- No safety alignment: No RLHF, no refusal training. Should not be deployed in user-facing products without safety layers.
Privacy
The training data was redacted for technical secrets (API keys, tokens, PEM keys) before training. Personal information beyond such secrets (style, opinions, project context) is intentionally retained because the purpose of personalization is to learn that style. Users should be aware that interacting with this model exposes them to the trained style of the original author.
Environmental Considerations
This fine-tune adds approximately 5 GPU-hours (A100) on top of the base model's training cost. The personalization approach is designed to be lightweight, making personalized models accessible without retraining from scratch.
License
Released under the Apache 2.0 license. You are free to use, modify, and redistribute it, subject to the terms of the license.
Citation
@misc{cofos_logo_v2_2026,
title = {Cofos Logo v2: A Personalized Python Code Assistant},
author = {{AMEFORGE}},
year = {2026},
url = {https://huggingface.co/AMFORGE/cofos_logo_v2}
}
Contact
For questions, collaborations, or access to the AMEFORGE SparseMind runtime:
- Organization: AMEFORGE
- HuggingFace: @AMFORGE
Cofos Logo v2 is part of the broader Cofos model family by AMEFORGE. See the AMFORGE organization page for the base model, sibling variants, and ongoing work on the general-purpose foundation.
- Downloads last month
- 1
Model tree for AMFORGE/cofos_logo_v2
Base model
AMFORGE/cofos_v2Evaluation results
- Real Python syntax validity (held-out)self-reported66.000