Shravan Nayak

BAJUKA

4 21

https://bajuka.github.io/

BAJUKA

AI & ML interests

NLP

Recent Activity

upvoted a paper 9 days ago

DataComp-VLM: Improved Open Datasets for Vision-Language Models

updated a dataset 10 days ago

BAJUKA/data_v2

published a dataset 10 days ago

BAJUKA/data_v2

View all activity

Organizations

upvoted a paper 9 days ago

DataComp-VLM: Improved Open Datasets for Vision-Language Models

Paper • 2606.28551 • Published 20 days ago • 51

updated a dataset 10 days ago

BAJUKA/data_v2

Preview • Updated 10 days ago • 569

published a dataset 10 days ago

BAJUKA/data_v2

Preview • Updated 10 days ago • 569

upvoted 2 papers about 1 month ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Paper • 2606.11176 • Published Jun 9 • 130

TRL-Bench: Standardizing Cross-Paradigm Representation-Level Evaluation of Tabular Encoders

Paper • 2606.09323 • Published Jun 8 • 53

upvoted 4 papers about 2 months ago

How and What to Imagine? Visual Thinking in Unified Multimodal Models for Cross-View Spatial Reasoning

Paper • 2605.27310 • Published May 26 • 20

updated a dataset 2 months ago

BAJUKA/data

Preview • Updated May 16 • 14

published a dataset 2 months ago

BAJUKA/data

Preview • Updated May 16 • 14

upvoted a paper 2 months ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 65

upvoted 2 papers 3 months ago

Sema Code: Decoupling AI Coding Agents into Programmable, Embeddable Infrastructure

Paper • 2604.11045 • Published Apr 13 • 26

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published Apr 8 • 97

updated a model 3 months ago

BAJUKA/llavanext-qwen25-3b-siglip-train1p5m-ovvideo

3B • Updated Apr 10 • 1

published a model 3 months ago

BAJUKA/llavanext-qwen25-3b-siglip-train1p5m-ovvideo

3B • Updated Apr 10 • 1

upvoted 2 papers 3 months ago

Communicating about Space: Language-Mediated Spatial Integration Across Partial Views

Paper • 2603.27183 • Published Mar 28 • 20

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 98

New activity in ServiceNow/VideoCUA 4 months ago

Add video-text-to-text task category and usage instructions

#3 opened 4 months ago by

nielsr

authored a paper 4 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 99

Shravan Nayak

AI & ML interests

Recent Activity

Organizations

BAJUKA's activity

Add video-text-to-text task category and usage instructions