IBM Research

company

https://research.ibm.com/

AI & ML interests

Enterprise AI and ML, Foundation Models, Responsible AI

Recent Activity

brmcg updated a Space about 5 hours ago

ibm-research/ScarfBench

wmgifford updated a model about 9 hours ago

ibm-research/patchtst-fm-r1

thrumbel updated a model 1 day ago

ibm-research/biomed.rna.llama.47m.wced.multitask.v1

View all activity

Papers

VAREX: A Benchmark for Multi-Modal Structured Extraction from Documents

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs

View all Papers

Articles

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

CUGA on Hugging Face: Democratizing Configurable AI Agents

View all articles

ibm-research 's Spaces 16

BlueBench Leaderboard

An open-source benchmark for enterprise use cases.

ScarfBench

Java Framework migration

VAKRA Leaderboard

Benchmark AI agents on multi‑hop, multi‑source enterprise tasks

BPO Benchmark Evaluation

Run BPO recruiting benchmark with CUGA SDK

Ai Atlas Nexus Capabilities

Intent to AI Capability with AI Atlas Nexus

AssetOpsBench

Generate and benchmark machine learning models with ease

ITBench-Lite-Space

Develop and run interactive code notebooks with JupyterLab

CUGA Agent

Configurable Generalist Agent, leader in AppWorld Benchmark

3DGrid-VQGAN Demo

SMILES to 3D grid and reconstruction comparison demo

AssetOpsBench

Evaluating Autonomous AI Agents for Industry 4.0 Tasks

SMI-TED-demo1

Generate embeddings from SMILES strings or CSV files

Fm4m Eval Demo

FM4M-demo2

Generate and analyze molecular structures

FM4M-demo1

Generate and analyze molecular structures

README

Biomed.sm.mv Te 84m

Prediction task tests for biomed-multi-view models