Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

SeaWolf-AIΒ 
posted an update 3 days ago
view post
Post
4166
πŸ”₯ 128 Blackwell GPUs β€” Thank You, Hugging Face

I've been awarded 128 NVIDIA Blackwell GPUs through NIPA (Korea's National IT Industry Promotion Agency). Sharing this here first β€” because Hugging Face is where it all started.

I design LLM architectures from scratch. HF was my lab β€” dissecting Transformers internals, analyzing thousands of checkpoints, iterating on Spaces with global feedback.

Our FINAL Bench reached #5 globally in HF dataset popularity, and this research is exactly what earned the GPU grant.
πŸ‘‰ FINAL-Bench/Leaderboard

These 128 Blackwells will scale AETHER-Net β€” our Proto-AGI architecture (Emergence Engine Β· Meta-Cognition Β· SLAI Β· Multi-Intelligence Β· Synergy & Critique) β€” validated at 0.8B with MoE expansion to 2.1B params. Next stop: 166B.

People I must thank:

@John6666 β€” Guardian of this ecosystem. Never misses a forum question, interested in every project, active 24/7. I've genuinely wondered if you're a machine. Remarkable.

@bartowski β€” Master of quantization. The hidden infrastructure of open-source LLM. Countless experiments possible thanks to you.

@SaylorTwift β€” You see what others miss. Insight that cuts to the essence. Deep respect.

My promise: AETHER-Net design docs, training recipes, checkpoints, and failure logs β€” all shared here openly.

πŸ€— Thank you, Hugging Face. Let's turn the next page together. πŸš€

vidraft Β· VIDRAFT
#OpenScience #HuggingFace #ProtoAGI #AETHER #LLMArchitecture #Blackwell #NIPA
  • 7 replies
Β·
ArtelTalebΒ 
posted an update 2 days ago
view post
Post
4495
🎡 MP3 Player - Drop your music, hit play. No install

MP3 Player - brings that energy back - straight in your browser.

- Drop your files - MP3, WAV, FLAC, AAC, OGG, AIFF, WMA β€” it reads them all
- Build your playlist - add tracks one by one or batch-load a whole folder
- Retro LCD display - scrolling track info, elapsed time, the full throwback
- Full controls - play, pause, skip, shuffle, repeat
- Mobile-first - big tactile buttons, works on phone like an iPod in your pocket

No install. No GPU needed on your end. Just upload and play.

πŸ‘‰ ArtelTaleb/mp3-player

mike-ravkineΒ 
posted an update 2 days ago
view post
Post
1150
Gemma-4, specifically google/gemma-4-26B-A4B-it is doing something inside it's reasoning traces I have never seen before: it's recognizing that its being evaluated and spends meta-thinking tokens on understanding the evaluation regime in which it believes it find itself.

Let's see if 12/10/2023 is a more likely answer than 12/09/2023

In most AI benchmark tests (like those this prompt resembles), the simplest path is often the intended one.


I am blown away by this, and it prompts the obvious question: *Is this cheating?*

I am leaning towards no.

Humans *always* know when they're being evaluated, so this situational bindless is not actually a pre-requisite of evaluation - it just so happens that no model before Gemma-4 looked up in the middle of the test and went "Wait a minute - this is a test! I should try align my answer with the test format's expectations."

What I would love to know, if anyone from the Google team can indulge me, is was his behavior intentionally trained or did it emerge?
  • 2 replies
Β·
ShrijanagainΒ 
posted an update 2 days ago
view post
Post
3935
sKT-Ai-Labs


Join fast we will soon published tokens and all join and get started because we will soon off join request button if you want you can join fast guys
  • 1 reply
Β·
alibidaranΒ 
posted an update 3 days ago
view post
Post
3929
With the release of Gemma 4, I launched a new Space called MEDPAI β€” a medical imaging analysis tool that combines object detection with multimodal AI.
Here's how it works:

Upload a CT scan or X-ray
Computer vision models detect and annotate findings
Gemma 4 33B generates a report or answers your questions about the image

Currently available detectors: dental analysis and bone fracture detection.
More models are in the pipeline β€” follow the Space to stay updated!
alibidaran/MEDPAI
  • 3 replies
Β·
allisonandreyevΒ 
posted an update 2 days ago
view post
Post
3988
ConfCrawler πŸ•·οΈ β€” never miss a conference deadline again

Keeping track of submission deadlines across CV, NLP, robotics, and ML conferences is a mess. ConfCrawler aggregates them in one place so you can actually plan your research calendar.

What's in it:
- Deadlines for major conferences (CVPR, ICCV, NeurIPS, ICRA, ACL, etc.)
- Updated regularly
- Filterable by field / month

Built this out of personal frustration while juggling multiple submission cycles. Hope it saves someone else the tab-hoarding.
πŸ”— https://confcrawler.vercel.app/
feedback welcome β€” open to adding more conferences if yours isn't listed!
shriarul5273Β 
posted an update about 8 hours ago
view post
Post
35

Picking the wrong depth estimation model costs more time than most teams realize ⏳

I made a cheat sheet to help you choose between the 28 model variants in the depth_estimation package based on the constraint that actually matters for your use case πŸ“Š

Most teams do not need "the best" model ❌
They need the right model for their deployment target, latency budget, and output requirements βœ…

Swipe through this before you build another custom preprocessing pipeline πŸ‘‡

⚑ Fastest inference for edge and CPU deployments: depth-anything-v2-vits
πŸ“ Real metric depth with absolute scale: zoedepth or depth-pro
πŸŽ₯ Video and real-time streaming with temporal smoothing
πŸ† Maximum quality metric predictions: depth-anything-v3-metric-large

That is why I open-sourced a library that unifies 12 model families and 28 variants behind one standardized API πŸ› οΈ so you can compare models without rewriting your stack each time.

Save this if you work on depth estimation regularly πŸ’Ύ
Comment with your use case if you want help choosing a model πŸ’¬

I'll drop the GitHub repo in the comments πŸ”—

#DepthEstimation #MonocularDepthEstimation #DepthPrediction #ComputerVision #3DVision #DeepLearning #MachineLearning #AI #PyTorch #OpenSource #EdgeAI #RealTimeAI #MLOps #Robotics
  • 1 reply
Β·
prithivMLmodsΒ 
posted an update about 16 hours ago
view post
Post
237
The demo for Image Detection (*Filter) based on SAM3 and Qwen-3.5 is now available on Hugging Face Spaces using Transformers inference, with multimodal reasoning for processed images, and it also supports video segmentation (mask), video segmentation (annotation), and image click segmentation.

πŸ€— Demo Space: prithivMLmods/SAM3-Plus-Qwen3.5
πŸ₯½ SAM3: facebook/sam3
πŸ”— Qwen-3.5: Qwen/Qwen3.5-2B

To learn more, visit the app page or the respective model pages.
ArtelTalebΒ 
posted an update about 22 hours ago
view post
Post
113
HELLO 3D WORLD !

What if you could control a 3D model just by talking to it?

Not clicking. Not dragging sliders. Not writing animation code.
Just… describing what you want.

"Rotate slowly on the Y axis."
"Move forward, don't stop."
"Scale up, then reset."

That's the core idea behind Hello 3D World - a space I've been building
as an open experiment.
───────────────────────────── Here's how it works:

You load a 3D model. You describe it to the LLM
("this is a robot", "this is a hot air balloon").
Then you type a natural language command.

The LLM β€” Qwen 72B, Llama 3, or Mistral - reads your intent
and outputs a JSON action: rotate, move, scale, loop, reset.
The 3D scene executes it instantly.

One model. One prompt. One action.

─────────────────────────────

Why build this?

I'm genuinely curious where the limit is.

Today it's simple geometric commands. But what happens when
the model understands context? When it knows the object has
legs, or wings, or a cockpit? When it can choreograph a sequence
from a single sentence?

Maybe this becomes a prototyping tool for robotics.
Maybe a no-code animation layer for game dev.
Maybe something I haven't imagined yet.

That's why I'm keeping it open β€” I want to see what
other people make it do.
─────────────────────────────

The space includes:

β†’ DR8V Robot + Red Balloon (more models coming)
β†’ 5 lighting modes: TRON, Studio, Neon, Cel, Cartoon
β†’ Import your own GLB / OBJ / FBX
β†’ Built-in screen recorder
β†’ Powered by open LLMs β€” bring your own HF token

Record your best sequences and share them in the comments.
I want to see what this thing can do in other hands.

πŸ”— ArtelTaleb/hello-3d-world
BilsΒ 
posted an update 1 day ago
view post
Post
94
Avatars are everywhere, but here is the reality behind full-system marketing automation. πŸš€
Many see "Madame AI" simply as an AI news presenter. She is far deeper than that. Madame AI is a Real-time Agentic AI Assistant we developed to orchestrate entire workflows for marketing and professional media. She manages UGC (User-Generated Content), understands marketing system automation intuitively, and handles complex media tasks.
We have solved the character consistency and high production cost bottlenecks that traditionally required immense training and time. By precisely orchestrating every computational step behind videos and branded designs, we have fully automated the pipeline and significantly reduced costs.
This capability is built on our extensive experience managing large-scale automation projects with complex requirement documentation (PRD).
Grabclip is our public portal and the practical result of that journey. It is the interface where "Madame AI" acts as the intelligent engine.
We have spent three years building this pipeline with a clear goal: a 100% local, end-to-end solution that operates despite external restrictions.
See the live example on YouTube (our fast-paced AI news podcast with Madame AI) and try the automation portal yourselfπŸ‘‡
πŸ“Ί The Playlist: https://www.youtube.com/playlist?list=PLwEbW4bdYBSCVSziFfJYq4zXop_cyHquO
🌐 Our Portal (Grabclip) β€” The first practical step in our pipeline: https://grabclip.bilsimaging.com/
hashtag#AgenticAI hashtag#VirtualInfluencer hashtag#FutureOfWork hashtag#GenerativeAI hashtag#TunisiaTech hashtag#MarketingAutomation hashtag#100PercentLocal hashtag#OSMedia hashtag#Grabclip hashtag#RealTimeAssistant hashtag#UGC hashtag#ProfessionalMedia hashtag#TunisiaAI