Layer 6 AI

company

https://layer6.ai/

layer6ai-labs

layer-6-ai

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

alex-layer6 updated a model about 12 hours ago

Layer6/TabDPT

AnthonyCaterini authored a paper about 15 hours ago

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

AnthonyCaterini authored a paper about 15 hours ago

TabDPT: Scaling Tabular Foundation Models

View all activity

Papers

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator

View all Papers

alex-layer6

updated a model about 12 hours ago

Layer6/TabDPT

Other • Updated about 12 hours ago • 15 • 5

AnthonyCaterini

authored 2 papers about 15 hours ago

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Paper • 2306.04675 • Published Jun 7, 2023 • 1

TabDPT: Scaling Tabular Foundation Models

Paper • 2410.18164 • Published Oct 23, 2024 • 2

JesseCresswell

authored a paper about 16 hours ago

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Paper • 2606.05296 • Published 3 days ago • 8

AnthonyCaterini

authored a paper about 16 hours ago

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Paper • 2606.05296 • Published 3 days ago • 8

lilvjosephtang

updated a dataset 6 days ago

Layer6/RankJudge

Viewer • Updated 6 days ago • 14.3k • 260 • 10

JesseCresswell

authored 7 papers 9 days ago

Response Quality Assessment for Retrieval-Augmented Generation via Conditional Conformal Factuality

Paper • 2506.20978 • Published Jun 26, 2025 • 1

Conformal Agent Error Attribution

Paper • 2605.06788 • Published about 1 month ago • 7

RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator

Paper • 2605.21748 • Published 17 days ago • 16

Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models

Paper • 2605.27311 • Published 11 days ago • 3

authored a paper 10 days ago

RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator

Paper • 2605.21748 • Published 17 days ago • 16

lilvjosephtang

submitted a paper to Daily Papers 11 days ago

RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator

Paper • 2605.21748 • Published 17 days ago • 16

lilvjosephtang

published a dataset 16 days ago

Layer6/RankJudge

Viewer • Updated 6 days ago • 14.3k • 260 • 10

lilvjosephtang

authored 3 papers 24 days ago

LLM Safety From Within: Detecting Harmful Content with Internal Representations

Paper • 2604.18519 • Published Apr 20 • 26

Maia-2: A Unified Model for Human-AI Alignment in Chess

Paper • 2409.20553 • Published Oct 31, 2024

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Paper • 2605.02913 • Published Apr 8 • 9

JesseCresswell

submitted a paper to Daily Papers 25 days ago

Conformal Agent Error Attribution

Paper • 2605.06788 • Published about 1 month ago • 7

AI & ML interests

Recent Activity

Papers

Team members 7

Layer6's activity