yeji-8b-lora-v4-tshirt (Deprecated)

โš ๏ธ ์ด ๋ชจ๋ธ์€ ๋” ์ด์ƒ ์‚ฌ์šฉ๋˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค. tellang/yeji-8b-rslora-v7-AWQ๋ฅผ ์‚ฌ์šฉํ•˜์„ธ์š”.

Why Deprecated?

์ด ๋ชจ๋ธ์€ T-SHIRT ๋ฐ์ดํ„ฐ ์„ ํƒ ๋ฐฉ์‹์œผ๋กœ ํ•™์Šต๋˜์—ˆ์œผ๋‚˜ ๋‹ค์Œ ๋ฌธ์ œ๋กœ ์ธํ•ด ํ๊ธฐ๋˜์—ˆ์Šต๋‹ˆ๋‹ค:

1. ๋ฒ ํŠธ๋‚จ์–ด ์ถœ๋ ฅ (v5์™€ ๋™์ผ)

# ์˜ˆ์ƒ ์ถœ๋ ฅ
"์˜ค๋Š˜์€ ์ข‹์€ ๋‚ ์ž…๋‹ˆ๋‹ค."

# ์‹ค์ œ ์ถœ๋ ฅ
"Hรดm nay lร  mแป™t ngร y tแป‘t lร nh."
  • ๊ทผ๋ณธ ์›์ธ: Qwen3-8B-Base ๋‹ค๊ตญ์–ด ํ”„๋ฆฌํŠธ๋ ˆ์ด๋‹
  • ๋ฐœ์ƒ ๋นˆ๋„: 15-20% (v5์™€ ๋™์ผ)

2. ๋ฐ˜๋ณต ๋ฃจํ”„ (Repetition Loop)

# ํ”„๋กฌํ”„ํŠธ
"์˜ค๋Š˜์˜ ์—ฐ์• ์šด์„ ์•Œ๋ ค์ฃผ์„ธ์š”."

# v4-tshirt ์ถœ๋ ฅ (๋ฐ˜๋ณต ๋ฐœ์ƒ)
"์˜ค๋Š˜์€ ์ข‹์€ ๋‚ ์ž…๋‹ˆ๋‹ค. ์˜ค๋Š˜์€ ์ข‹์€ ๋‚ ์ž…๋‹ˆ๋‹ค. ์˜ค๋Š˜์€ ์ข‹์€ ๋‚ ์ž…๋‹ˆ๋‹ค. ์˜ค๋Š˜์€ ์ข‹์€ ๋‚ ์ž…๋‹ˆ๋‹ค..."

์›์ธ ๋ถ„์„:

  • T-SHIRT๋Š” 20% ๋ฐ์ดํ„ฐ๋งŒ ์„ ํƒํ•˜์—ฌ ํ•™์Šต
  • ๋ฐ์ดํ„ฐ ๋‹ค์–‘์„ฑ ๋ถ€์กฑ โ†’ ๋ชจ๋ธ์ด ํŒจํ„ด์„ ๊ณผ๋„ํ•˜๊ฒŒ ์•”๊ธฐ
  • ๋ฐ˜๋ณต ํŽ˜๋„ํ‹ฐ ์ ์šฉํ•ด๋„ ํ•ด๊ฒฐ ์•ˆ ๋จ

3. T-SHIRT ๋ฐฉ์‹์˜ ํ•œ๊ณ„

T-SHIRT (Training Short Is a Hassle, Retrieve Training)๋Š”:

  1. ์ž„๋ฒ ๋”ฉ์œผ๋กœ ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ์–ด๋ ค์›€(difficulty) ์ธก์ •
  2. ์ƒ์œ„ 20%๋งŒ ์„ ํƒํ•˜์—ฌ ํ•™์Šต (5,000 โ†’ 1,000 ์ƒ˜ํ”Œ)
  3. ํ•™์Šต ์‹œ๊ฐ„ 80% ๋‹จ์ถ•

์ด๋ก ์  ์žฅ์ :

  • โœ… ํ•™์Šต ์†๋„ ํ–ฅ์ƒ
  • โœ… ๋ฆฌ์†Œ์Šค ์ ˆ์•ฝ

์‹ค์ œ ๋ฌธ์ œ:

  • โŒ ๋ฐ์ดํ„ฐ ๋‹ค์–‘์„ฑ ๊ฐ์†Œ โ†’ ๋ฐ˜๋ณต ๋ฃจํ”„
  • โŒ Edge case ํ•™์Šต ๋ถ€์กฑ
  • โŒ ๋ฒ ํŠธ๋‚จ์–ด ๋ฌธ์ œ ๋ฏธํ•ด๊ฒฐ

Technical Details

  • ๋ฒ ์ด์Šค ๋ชจ๋ธ: Qwen/Qwen3-8B-Base
  • ํŒŒ์ธํŠœ๋‹ ๋ฐฉ์‹: rsLoRA + T-SHIRT
  • ํ•™์Šต ๋ฐ์ดํ„ฐ: 1,000 ์ƒ˜ํ”Œ (์ „์ฒด 5,000์˜ 20%)
  • Rank: 16
  • Alpha: 32
  • ๋ฐ์ดํ„ฐ ์„ ํƒ: T-SHIRT (์ƒ์œ„ 20% ์–ด๋ ค์šด ์ƒ˜ํ”Œ)

T-SHIRT ๋ฐ์ดํ„ฐ ์„ ํƒ ๊ณผ์ •

# 1. ์ž„๋ฒ ๋”ฉ ๊ธฐ๋ฐ˜ difficulty ์ธก์ •
embeddings = embed_model.encode(training_samples)
difficulty_scores = calculate_difficulty(embeddings)

# 2. ์ƒ์œ„ 20% ์„ ํƒ
threshold = np.percentile(difficulty_scores, 80)
selected_samples = samples[difficulty_scores >= threshold]

# 3. ์„ ํƒ๋œ ์ƒ˜ํ”Œ๋กœ ํ•™์Šต
# ๋ฌธ์ œ: ๋‹ค์–‘์„ฑ ๋ถ€์กฑ โ†’ ๋ฐ˜๋ณต ๋ฃจํ”„ ๋ฐœ์ƒ

Recommended Alternative

ํ”„๋กœ๋•์…˜ ์‚ฌ์šฉ

  • ๋ชจ๋ธ: tellang/yeji-8b-rslora-v7-AWQ
  • ๋ฐ์ดํ„ฐ: ์ „์ฒด 5,000 ์ƒ˜ํ”Œ ์‚ฌ์šฉ (T-SHIRT ๋ฏธ์ ์šฉ)
  • ๊ฐœ์„ : ๋‹ค์–‘์„ฑ ํ™•๋ณด + ๋‹ค๊ตญ์–ด ์–ต์ œ ํ”„๋กฌํ”„ํŠธ
from vllm import LLM, SamplingParams

llm = LLM(
    model="tellang/yeji-8b-rslora-v7-AWQ",
    quantization="awq",
)

# ๋ฐ˜๋ณต ๋ฐฉ์ง€ ์„ค์ •
sampling_params = SamplingParams(
    temperature=0.7,
    top_p=0.9,
    max_tokens=512,
    repetition_penalty=1.05,  # v4์—์„œ๋Š” ํšจ๊ณผ ์—†์—ˆ์Œ, v7์—์„œ๋Š” ์ •์ƒ ์ž‘๋™
)

์ตœ์‹  ๋ฒ„์ „ (2026-02-01)

  • 4B ๋ชจ๋ธ: tellang/yeji-4b-rslora-v8.1 (์ „์ฒด ๋ฐ์ดํ„ฐ ํ•™์Šต)
  • 8B ๋ชจ๋ธ: tellang/yeji-8b-rslora-v7-AWQ (์ „์ฒด ๋ฐ์ดํ„ฐ ํ•™์Šต)

Performance Comparison

์ง€ํ‘œ v4-tshirt (20% ๋ฐ์ดํ„ฐ) v7-AWQ (100% ๋ฐ์ดํ„ฐ)
ํ•™์Šต ์ƒ˜ํ”Œ 1,000 (20%) 5,000 (100%)
ํ•™์Šต ์‹œ๊ฐ„ 2์‹œ๊ฐ„ 10์‹œ๊ฐ„
๋ฐ˜๋ณต ๋ฃจํ”„ 30% ๋ฐœ์ƒ <1% ๋ฐœ์ƒ
๋ฒ ํŠธ๋‚จ์–ด ์ถœ๋ ฅ 15-20% <1%
๋‹ค์–‘์„ฑ ๋‚ฎ์Œ ๋†’์Œ
์ •ํ™•๋„ Baseline +15%

Why T-SHIRT Failed?

์ด๋ก  vs ์‹ค์ œ

์ธก๋ฉด ๋…ผ๋ฌธ (ImageNet) YEJI (์šด์„ธ ๋ฐ์ดํ„ฐ)
๋ฐ์ดํ„ฐ ๊ทœ๋ชจ 100๋งŒ+ ์ƒ˜ํ”Œ 5,000 ์ƒ˜ํ”Œ
20% ์„ ํƒ ์‹œ 20๋งŒ ์ƒ˜ํ”Œ (์ถฉ๋ถ„) 1,000 ์ƒ˜ํ”Œ (๋ถ€์กฑ)
๋‹ค์–‘์„ฑ ์œ ์ง€๋จ ์‹ฌ๊ฐํ•˜๊ฒŒ ๊ฐ์†Œ
๊ฒฐ๊ณผ โœ… ์„ฑ๊ณต โŒ ์‹คํŒจ

๊ตํ›ˆ: T-SHIRT๋Š” ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹(10๋งŒ+ ์ƒ˜ํ”Œ)์—์„œ๋งŒ ํšจ๊ณผ์ 

์†Œ๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹ ๋Œ€์•ˆ

  1. ์ „์ฒด ๋ฐ์ดํ„ฐ ์‚ฌ์šฉ (v7 ๋ฐฉ์‹)

    • 5,000 ์ƒ˜ํ”Œ ๋ชจ๋‘ ํ™œ์šฉ
    • ๋‹ค์–‘์„ฑ ํ™•๋ณด
  2. Data Augmentation

    # Back-translation์œผ๋กœ ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•
    ko โ†’ en โ†’ ko  # ๋™์ผ ์˜๋ฏธ, ๋‹ค๋ฅธ ํ‘œํ˜„
    
  3. Few-Shot Learning

    • ์˜ˆ์‹œ ๊ธฐ๋ฐ˜ ํ•™์Šต์œผ๋กœ ์†Œ๋Ÿ‰ ๋ฐ์ดํ„ฐ ๋ณด์™„

Migration Guide

Before (v4-tshirt)

# v4-tshirt - ๋ฐ˜๋ณต ๋ฃจํ”„ ๋ฐœ์ƒ
llm = LLM(model="tellang/yeji-8b-lora-v4-tshirt")
output = llm.generate("์˜ค๋Š˜์˜ ์šด์„ธ๋Š”?")
# ์ถœ๋ ฅ: "์˜ค๋Š˜์€ ์ข‹์€ ๋‚ ์ž…๋‹ˆ๋‹ค. ์˜ค๋Š˜์€ ์ข‹์€ ๋‚ ์ž…๋‹ˆ๋‹ค..." โŒ

After (v7-AWQ)

# v7-AWQ - ์ •์ƒ ์ถœ๋ ฅ
llm = LLM(model="tellang/yeji-8b-rslora-v7-AWQ", quantization="awq")

sampling_params = SamplingParams(
    temperature=0.7,
    repetition_penalty=1.05,
)

output = llm.generate("์˜ค๋Š˜์˜ ์šด์„ธ๋Š”?", sampling_params)
# ์ถœ๋ ฅ: "์˜ค๋Š˜์€ ๊ธ์ •์ ์ธ ์—๋„ˆ์ง€๊ฐ€ ๊ฐ€๋“ํ•œ ๋‚ ์ž…๋‹ˆ๋‹ค. ์ƒˆ๋กœ์šด ๊ธฐํšŒ๋ฅผ ๋งŒ๋‚˜๊ฒŒ ๋  ๊ฒƒ์ž…๋‹ˆ๋‹ค." โœ…

References

License

Apache 2.0

Citation

@misc{yeji-8b-lora-v4-tshirt,
  title={YEJI Fortune Telling Model (T-SHIRT v4 - Deprecated)},
  author={SSAFY YEJI Team},
  year={2026},
  note={Deprecated: Repetition loop and Vietnamese output. Use yeji-8b-rslora-v7-AWQ instead}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for tellang/yeji-8b-lora-v4-tshirt

Finetuned
(383)
this model

Papers for tellang/yeji-8b-lora-v4-tshirt