Lumees
company
AI & ML interests
LLM, OCR, Embedding Models, Private Intelligence
-
lumees/turkish-corpus-100b
Viewer β’ Updated β’ 107M β’ 913 β’ 5 -
lumees/multilingual-safety-classification-dataset
Viewer β’ Updated β’ 213k β’ 4.98k β’ 4 -
lumees/bulgarian-corpus-33b
Viewer β’ Updated β’ 34.9M β’ 110 β’ 4 -
lumees/dutch-corpus-200b
Viewer β’ Updated β’ 170M β’ 652 β’ 4
-
lumees/turkish-corpus-100b
Viewer β’ Updated β’ 107M β’ 913 β’ 5 -
lumees/multilingual-safety-classification-dataset
Viewer β’ Updated β’ 213k β’ 4.98k β’ 4 -
lumees/bulgarian-corpus-33b
Viewer β’ Updated β’ 34.9M β’ 110 β’ 4 -
lumees/dutch-corpus-200b
Viewer β’ Updated β’ 170M β’ 652 β’ 4
Comprehensive collection of high-quality multilingual datasets for NLP research and production.