OsirisWhisper-STT v1.0

Sovereign Speech-to-Text engine for OsirisBrain AGI.

A 600M-parameter automatic speech recognition (ASR) model that supports 52 languages with automatic language detection.

Specifications

Property	Value
Parameters	600M
Architecture	Encoder-Decoder (Speech-to-Text)
Languages	52 (auto-detected)
Latency	~100-200 ms per utterance
License	Apache 2.0
Organization	OsirisBrain

Usage

from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor

processor = AutoProcessor.from_pretrained("osirisbrain/osiriswhisper-stt", trust_remote_code=True)
model = AutoModelForSpeechSeq2Seq.from_pretrained("osirisbrain/osiriswhisper-stt", trust_remote_code=True)
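
The two calls above only load the processor and the model. A minimal sketch of a full transcription pass follows. It assumes the checkpoint follows the usual Transformers speech-seq2seq pattern (processor builds the inputs, then `generate`, then `batch_decode`); the repository's remote code may expose a different interface. The helper names `read_wav_mono` and `transcribe`, and the 16-bit PCM WAV input, are illustrative assumptions and not part of this model card.

```python
import struct
import wave

MODEL_ID = "osirisbrain/osiriswhisper-stt"


def read_wav_mono(path):
    """Read a 16-bit PCM WAV file; return (float samples in [-1, 1], sample rate)."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit PCM audio")
        channels = wav.getnchannels()
        rate = wav.getframerate()
        raw = wav.readframes(wav.getnframes())
    values = struct.unpack("<%dh" % (len(raw) // 2), raw)
    # Average the interleaved channels down to mono, then scale to [-1, 1].
    frames = [values[i:i + channels] for i in range(0, len(values), channels)]
    return [sum(frame) / (channels * 32768.0) for frame in frames], rate


def transcribe(path):
    """Assumed flow: features -> generate -> decode. Not confirmed for this checkpoint."""
    # Imported lazily so the WAV helper works without transformers installed.
    from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor

    processor = AutoProcessor.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = AutoModelForSpeechSeq2Seq.from_pretrained(MODEL_ID, trust_remote_code=True)

    samples, rate = read_wav_mono(path)
    # Assumption: the processor accepts raw samples plus their sampling rate, and
    # the audio is already at the rate the model expects (no resampling is done here).
    inputs = processor(samples, sampling_rate=rate, return_tensors="pt")
    generated = model.generate(**inputs)
    return processor.batch_decode(generated, skip_special_tokens=True)[0]
```

Calling `transcribe("sample.wav")` (a hypothetical file) would then return the decoded text, provided the assumptions above hold for this checkpoint.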

OsirisBrain Integration

Part of the OsirisBrain Sovereign AI ecosystem:

OsirisSound TTS: Text-to-Speech (Meritaten v3.0)
OsirisWhisper STT: Speech-to-Text (this model)
OsirisCortex v6: 9B Reasoning + Vision
OsirisPtah-Coder v5: 7B Coding

API Endpoint

When the OsirisSound server is running, send audio for transcription to:

POST http://127.0.0.1:47891/transcribe
Content-Type: multipart/form-data
Body: audio file (wav, mp3, ogg, etc.)
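
A client for the endpoint above might look like the following sketch, which uses only the Python standard library. The model card does not specify the form field name or the response format, so the field name `file` and the plain-text return value are assumptions; `build_multipart` and `transcribe_file` are hypothetical helper names.

```python
import mimetypes
import os
import urllib.request
import uuid

TRANSCRIBE_URL = "http://127.0.0.1:47891/transcribe"


def build_multipart(field_name, filename, data, boundary=None):
    """Encode one file as a multipart/form-data body; return (body, content_type)."""
    boundary = boundary or uuid.uuid4().hex
    file_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{filename}"\r\n'
        f"Content-Type: {file_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def transcribe_file(path, url=TRANSCRIBE_URL, field_name="file"):
    """POST an audio file to the local server and return the raw response text."""
    # Assumption: the server reads the upload from a form field named "file".
    with open(path, "rb") as handle:
        body, content_type = build_multipart(
            field_name, os.path.basename(path), handle.read()
        )
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

If the server expects a different field name, pass it as `field_name`; the response is returned undecoded because its schema is not documented here.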


Safetensors

Model size: 0.9B params
Tensor type: BF16