Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

Jiaqi-hkustย 
posted an update 2 days ago
view post
Post
5589
๐Ÿ›ฐ๏ธ Introducing Awesome-Remote-Sensing-Agents: The Largest Curated Collection of Intelligent Remote Sensing Agents

We are excited to share our new repository Awesome-Remote-Sensing-Agents โ€“ a comprehensive, community-driven collection of 100+ papers at the intersection of remote sensing and intelligent agents (LLMs, VLM, multiโ€‘agent systems, etc.).

๐Ÿ”— GitHub Repository: https://github.com/PolyX-Research/Awesome-Remote-Sensing-Agents

Our repository organizes this rapidly growing field into a structured, easyโ€‘toโ€‘navigate resource for researchers, practitioners, and enthusiasts.

๐Ÿ“š Whatโ€™s Inside?
Weโ€™ve carefully curated papers across 6 key application domains:
๐ŸŒฟ Ecological Monitoring โ€“ forest fires, biodiversity, climate science
๐Ÿšจ Emergency Response โ€“ flood mapping, wildfire tracking, disaster geolocalization
โ›๏ธ Geological Exploration โ€“ mineral mapping, lithological recognition, geologic reasoning
๐ŸŒŠ Marine Supervision โ€“ ocean science, autonomous surface vehicles
๐ŸŒพ Precision Agriculture โ€“ crop disease detection, land use simulation
๐Ÿ™๏ธ Urban Governance โ€“ change detection, urban planning, embodied navigation

๐Ÿค Join the Community!
We warmly welcome contributions to keep this list upโ€‘toโ€‘date:
๐Ÿ“ Add missing papers via Pull Request
๐Ÿท๏ธ Propose new or refined categories
๐Ÿ”— Report broken links or outdated entries
๐Ÿ’ฌ Discuss via GitHub Issues or contact the authors
DedeProGamesย 
posted an update 3 days ago
view post
Post
5142
Introducing GRM2, a powerful 3 billion parameter model designed for long-term reasoning and high performance in complex tasks.

Even with only 3 billion parameters, it outperforms qwen3-32b in several benchmarks and complex reasoning tasks.

With just 3 billion parameters, it can also generate extensive and complex code with over 1000 lines, utilize tools comparable to larger models, and is perfect for agentic tasks.

GRM2 is licensed under Apache 2.0, making it ideal as a base for FineTune in other tasks.
You can see more here: OrionLLM/GRM2-3b
Shrijanagainย 
posted an update 3 days ago
view post
Post
5398

โ€‹We are thrilled to announce the launch of SKT-OMNI-CORPUS-146T-V1, a massive-scale, high-quality dataset designed to power the next generation of Foundation Models (LLMs) from scratch.
โ€‹Developed at SKT AI LABS, this corpus is not just a collection of data; itโ€™s a mission to decentralize high-grade AI training for regional languages and global knowledge.

โ€‹๐Ÿ’Ž Key Highlights:

โ€‹โ€ขโ€ข Massive Scale: Targeting a multi-terabyte architecture for 146T-level tokenization.

โ€ขโ€ข โ€‹Pure Quality: Curated from 500+ Elite Sources

โ€ขโ€ข โ€‹Structured for MoE: Perfectly sharded into 3.5GB standardized units (SKT-๐•ป series) for seamless distributed training.

โ€‹๐Ÿค Open for Collaboration!

โ€‹We are looking for AI researchers, CUDA engineers, and data scientists to join us in this journey of building Project Surya and the ST-X Series models. Whether it's optimization, custom tokenization, or architecture designโ€”letโ€™s build the future together.

โ€‹Explore the Dataset on Hugging Face:

๐Ÿ”— Shrijanagain/SKT-OMNI-CORPUS-146T-V1

DSR -- ๐Ÿ”— Shrijanagain/SKT-DSRx10000

โ€‹#AI #MachineLearning #OpenSource #IndicAI #SKTAILABS #LLM #BigData #HuggingFace #InnovationIndia
branikitaย 
posted an update about 18 hours ago
view post
Post
2405
Progress update on our Robonine mobile robotic platform. We've wired up the full electronics stack - Raspberry Pi, motor controllers, and a camera module - and added Bluetooth joystick and keyboard control with WiFi-based management. A 7 kg counterweight keeps the platform stable. Powered by a powerbank and ready to roll.
  • 1 reply
ยท
BibbyResearchย 
posted an update about 24 hours ago
view post
Post
489
๐ŸŒ Paper Banana is now live! Create academic illustrations using natural language

We just launched Paper Banana โ€” a tool that lets you generate clean academic illustrations simply by describing them in natural language.

๐Ÿ”— Try it here: https://trybibby.com/paper-banana

Whether you need diagrams for papers, presentations, or teaching materials, Paper Banana helps you turn ideas into visuals in seconds.

Weโ€™d love your feedback:

What did you like?
What features should we add next?

Give it a spin and let us know what you think! ๐Ÿš€

Dear Huggingface, show this post to all my fellow researchers!
MaziyarPanahiย 
posted an update 1 day ago
view post
Post
601
We annotated 119K medical images with two frontier VLMs (Qwen 3.5, Kimi K2.5), cross-validated at 93% agreement, and produced 110K training records, all for under $500. Fine-tuning 3 small models (2-3B params) improved all benchmarks: best model reaches +15.0% average exact match.

Everything is open-sourced: datasets, adapters, and code.

https://huggingface.co/blog/OpenMed/synthvision
  • 1 reply
ยท
kanaria007ย 
posted an update 2 days ago
view post
Post
151
โœ… Article highlight: *Law as Goal Surfaces* (art-60-048, v0.1)

TL;DR:
Most โ€œAI + lawโ€ discussions go wrong in one of two ways: either an LLM is asked to explain the law and everyone hopes it is right, or a rules engine gets bolted onto the side of the system.

This article sketches a different approach: treat *law itself as structure* inside SI-Core. Legal constraints sit alongside safety, fairness, and budget in the same GoalSurface / ETH machinery, while procedure โ€” who may do what, when, with which approvals, exceptions, and appeals โ€” becomes first-class runtime structure.

Read:
kanaria007/agi-structural-intelligence-protocols

Why it matters:
โ€ข moves law from โ€œbest-effort complianceโ€ to structural constraints
โ€ข makes legal procedure explicit instead of hiding it in side channels
โ€ข supports both *ex-ante* prevention of illegal actions and *ex-post* auditability
โ€ข treats appeals, exceptions, and discretion as governed objects, not ad hoc overrides

Whatโ€™s inside:
โ€ข *LegalSurface* as a GoalSurface specialization for regulation and policy
โ€ข hard rules in *ETH constraints* + soft legal/policy objectives for optimization
โ€ข roles, principals, jurisdictions, approvals, and source provenance
โ€ข procedural structure for conditions, exceptions, and appeals
โ€ข a mental model: *law = goal surfaces + hard ETH constraints + roles/principals*
โ€ข SI-Core as a kind of *procedural VM* for executing those bundles on real events

Key idea:
Law should not be an afterthought around intelligent systems. It should be part of the runtime structure that determines what is admissible, what needs review, and how decisions remain explainable.
Severianย 
posted an update about 11 hours ago
view post
Post
446
Iโ€™ve been working on a new mathematical approach to real-time video compositing and background removal, and I wanted to share a live demo.

Traditionally, real-time keyers either use 3D color-space bounding boxes (which struggle with semi-transparent hair and motion blur) or heavy Machine Learning models (which require massive GPU compute and often suffer from temporal "jitter" on the edges).

I wanted to see if I could solve this using purely deterministic math so it could run client-side in a standard browser.

The engine uses a custom mathematical framework I call CMT SRL SEFA. Instead of looking at raw color values or guessing semantics like an AI, it treats the video feed as complex-encoded sequences. It uses harmonic frequencies to map phase geometry and applies a "Stability Cost Function" to find the global minimum stability. In short: it isolates the foreground from the background by measuring signal complexity and structural contradictions.

Give it a try using your own messy plates and such. As I am not a VFX artist, I am curious to hear thoughts and what should be improved upon and made better

https://severian-cmt-sefa-realtime-vfx-keyer.hf.space/
  • 1 reply
ยท
karstenskytย 
posted an update 1 day ago
view post
Post
495
๐Ÿš€ ๐—Ÿ๐—ฎ๐˜‚๐—ป๐—ฐ๐—ต๐—ถ๐—ป๐—ด ๐˜๐—ต๐—ฒ ๐—”๐—œ/๐— ๐—Ÿ ๐—ช๐—ผ๐—ฟ๐—ธ๐—ณ๐—น๐—ผ๐˜„๐˜€ ๐——๐—ฎ๐˜€๐—ต๐—ฏ๐—ผ๐—ฎ๐—ฟ๐—ฑ

Now that our Taipy architecture is humming along on Hugging Face Spaces, we just shipped the most complex feature of the (๐˜™๐˜ช๐˜จ๐˜ฉ๐˜ต! ๐˜“๐˜ถ๐˜น๐˜ถ๐˜ณ๐˜บ!) ๐˜“๐˜ข๐˜ฌ๐˜ฆ๐˜ฉ๐˜ฐ๐˜ถ๐˜ด๐˜ฆ to date: the ๐—”๐—œ/๐— ๐—Ÿ ๐—ช๐—ผ๐—ฟ๐—ธ๐—ณ๐—น๐—ผ๐˜„๐˜€ ๐——๐—ฎ๐˜€๐—ต๐—ฏ๐—ผ๐—ฎ๐—ฟ๐—ฑ.

Managing 16 different machine learning pipelines (from Expected Goals to Space Creation) across Databricks Serverless and HF Jobs is a logistical challenge. To solve this, we built a dynamic operations center (the 13th page in our app).

It features:

ย ย โ€ข ๐—”๐—ป ๐—ถ๐—ป๐˜๐—ฒ๐—ฟ๐—ฎ๐—ฐ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—ฑ๐—ฒ๐—ฝ๐—ฒ๐—ป๐—ฑ๐—ฒ๐—ป๐—ฐ๐˜† ๐——๐—”๐—š: Powered by Cytoscape.js, it visually maps exactly how our models and data grids feed into each other.

ย ย โ€ข ๐—ฅ๐—ฒ๐—ฎ๐—น-๐˜๐—ถ๐—บ๐—ฒ ๐—บ๐—ผ๐—ป๐—ถ๐˜๐—ผ๐—ฟ๐—ถ๐—ป๐—ด: Tracks run volumes and data freshness SLAs across the entire platform.

ย ย โ€ข ๐—” ๐Ÿฏ-๐˜๐—ถ๐—ฒ๐—ฟ ๐—ต๐˜†๐—ฏ๐—ฟ๐—ถ๐—ฑ ๐—ฐ๐—ผ๐˜€๐˜ ๐—ฒ๐—ป๐—ด๐—ถ๐—ป๐—ฒ: Merges "cold" Databricks billing data with "warm/hot" live HF Jobs estimates to give a unified view of pipeline expenses.

Check out the live interactive graph here:
luxury-lakehouse/soccer-analytics-app
omarkamaliย 
posted an update 1 day ago
view post
Post
472
I just might have cracked tokenizer-free LLMs. No vocab, no softmax.

I'm training a 22M params LLM rn to test this "thing" and it's able to formulate coherent sentences ๐Ÿคฏ

Bear in mind, this is a completely new, tokenizer-free LLM architecture with built-in language universality.

Check the explainer video to understand what's happening. Feedback welcome on this approach!

  • 5 replies
ยท