Ingrid Tveten
ingridtv
·
AI & ML interests
Medical image analysis and machine learning
Recent Activity
updated
a collection
about 2 months ago
GenAI/LLM updated
a collection
3 months ago
Document understanding updated
a collection
3 months ago
Document understanding Organizations
None yet
Medical images, encoding
Multimodal/VLM
-
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • Updated • 335k • 1.58k -
microsoft/Phi-4-mini-instruct
Text Generation • Updated • 228k • 686 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 147 -
Emerging Properties in Unified Multimodal Pretraining
Paper • 2505.14683 • Published • 133
Medical LM, Specific
GenAI/LLM
-
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • Updated • 1.32M • • 1.45k -
Qwen/CodeQwen1.5-7B-Chat
Text Generation • 7B • Updated • 1.77k • 352 -
lmstudio-community/gemma-3-12b-it-GGUF
Image-Text-to-Text • 12B • Updated • 61.3k • 41 -
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text • 12B • Updated • 9.02k • 241
Document understanding
Medical LM, Specific
Medical images, encoding
GenAI/LLM
-
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • Updated • 1.32M • • 1.45k -
Qwen/CodeQwen1.5-7B-Chat
Text Generation • 7B • Updated • 1.77k • 352 -
lmstudio-community/gemma-3-12b-it-GGUF
Image-Text-to-Text • 12B • Updated • 61.3k • 41 -
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text • 12B • Updated • 9.02k • 241
Multimodal/VLM
-
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • Updated • 335k • 1.58k -
microsoft/Phi-4-mini-instruct
Text Generation • Updated • 228k • 686 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 147 -
Emerging Properties in Unified Multimodal Pretraining
Paper • 2505.14683 • Published • 133