zhang
-
Running • 7
Browser only - Screen Capture & OCR
One-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running • 633
First Agent Template
Generate images from text descriptions using AI
-
Runtime error • Featured • 128
OctoTools
An Agentic Framework with Tools for Complex Reasoning
-
Running • Featured • 140
smolagents LLM leaderboard
A leaderboard for LLMs powering smolagents
-
Running on Zero • Featured • 1.65k
Joy Caption Alpha Two
Generate captions for images in various styles and lengths
-
Runtime error • 40
Florence Llama
Generate text responses from images and text input
-
trollek/ImagePromptHelper-danube3-500M
Text Generation • 0.5B • Updated • 4 • 3
-
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B • Updated • 80 • 3
-
laion/laion-audio-preview
Viewer • Updated • 4.15M • 1.06k • 11
-
Running on Zero • Featured • 2.8k
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Paused • Featured • 2.21k
FacePoke
Import a portrait, click to move the head!
-
Runtime error • Featured • 695
Fish Audio S1
Convert text to natural-sounding speech audio
-
allenai/olmOCR-7B-0225-preview
Image-to-Text • 8B • Updated • 4.7k • 706
-
Build error • Featured • 81
Nanonets OCR
Demo for Nanonets-OCR
-
Running on Zero • MCP • 396
Multimodal OCR
nanonets ocr2 / olmocr / qwen2vl ocr / aya vision / rolmocr
-
Running on Zero • MCP • Featured • 140
Multimodal OCR2
nanonets ocr / smoldocling / monkey ocr / typhoon ocr
-
chflame163/ComfyUI_LayerStyle
Updated • 86 • 104
-
allenai/Molmo-7B-D-0924
Image-Text-to-Text • 8B • Updated • 14.3k • 564
-
Running on Zero • 248
Chroma
Generate detailed fantasy and realistic images from text descriptions
-
Running • MCP • 44
Doc Mcp
RAG on documentations for your agent
-
Running on Zero • 1.65k
Flux.1-dev Upscaler
Enhance low-resolution images to high-definition quality
-
Running on Zero • 457
InvSR
Image Super-resolution via Diffusion Inversion
-
Paused • 242
FLUX Upsacle Image
Upscale images with control and customization
-
Running on L4 • Featured • 282
Thera Arbitrary-Scale Super-Resolution
Enhance photo quality and resolution with AI-powered super-resolution
-
Djrango/Qwen2vl-Flux
Text-to-Image • Updated • 509
-
Running on Zero • Featured • 933
OminiControl
Generate an edited image based on a prompt and input image
-
Running on Zero • 394
FLUXllama gpt-oss
mcp_server & FLUX 4-bit Quantization + Enhanced
-
Running on L4 • Featured • 2.22k
MagicQuill
Enhance images using scribbles and prompts