OsirisWhisper-STT v1.0

Sovereign Speech-to-Text engine for OsirisBrain AGI.

600M parameter ASR model supporting 52 languages with automatic language detection.

Specifications

Property Value
Parameters 600M
Architecture Encoder-Decoder (Speech-to-Text)
Languages 52 (auto-detected)
Latency ~100-200ms per utterance
License Apache 2.0
Organization OsirisBrain

Usage

from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor

processor = AutoProcessor.from_pretrained("osirisbrain/osiriswhisper-stt", trust_remote_code=True)
model = AutoModelForSpeechSeq2Seq.from_pretrained("osirisbrain/osiriswhisper-stt", trust_remote_code=True)

OsirisBrain Integration

Part of the OsirisBrain Sovereign AI ecosystem:

  • OsirisSound TTS: Text-to-Speech (Meritaten v3.0)
  • OsirisWhisper STT: Speech-to-Text (this model)
  • OsirisCortex v6: 9B Reasoning + Vision
  • OsirisPtah-Coder v5: 7B Coding

API Endpoint

When running OsirisSound server:

POST http://127.0.0.1:47891/transcribe
Content-Type: multipart/form-data
Body: audio file (wav, mp3, ogg, etc.)
Downloads last month
102
Safetensors
Model size
0.9B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support