AI & ML interests
Enterprise AI and ML, Foundation Models, Responsible AI
Recent Activity
Papers
VAREX: A Benchmark for Multi-Modal Structured Extraction from Documents
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs
Articles
BlueBench Leaderboard
An open-source benchmark for enterprise use cases.
ScarfBench
Java Framework migration
VAKRA Leaderboard
Benchmark AI agents on multi-hop, multi-source enterprise tasks
BPO Benchmark Evaluation
Run BPO recruiting benchmark with CUGA SDK
AI Atlas Nexus Capabilities
Intent to AI Capability with AI Atlas Nexus
AssetOpsBench
Generate and benchmark machine learning models with ease
ITBench-Lite-Space
Develop and run interactive code notebooks with JupyterLab
CUGA Agent
Configurable Generalist Agent, leader on the AppWorld benchmark
3DGrid-VQGAN Demo
SMILES to 3D grid and reconstruction comparison demo
AssetOpsBench
Evaluating Autonomous AI Agents for Industry 4.0 Tasks
SMI-TED-demo1
Generate embeddings from SMILES strings or CSV files
Fm4m Eval Demo
FM4M-demo2
Generate and analyze molecular structures
FM4M-demo1
Generate and analyze molecular structures
README
Biomed.sm.mv Te 84m
Prediction task tests for biomed-multi-view models