How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="squ11z1/LeChatonFat",
	filename="",
)
llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

Le Chaton Fat 🐱

Guys this model its my respect for meme about Le Chaton Fat model from Mistral AI, surely benchmarks not real like not real this model or any Le Chaton Fat models

le1

le2

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
tok = AutoTokenizer.from_pretrained("squ11z1/LeChatonFat")
model = AutoModelForCausalLM.from_pretrained("squ11z1/LeChatonFat", dtype=torch.bfloat16, device_map="cuda")
msgs = [{"role": "user", "content": "Who are you?"}]
ids = tok.apply_chat_template(msgs, add_generation_prompt=True, return_tensors="pt").to("cuda")
print(tok.decode(model.generate(ids, max_new_tokens=128)[0], skip_special_tokens=True))

License

Inherits the base model's license (Apache-2.0).

God bless AI

Downloads last month
528
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 1 Ask for provider support

Model tree for squ11z1/LeChatonFat