ServiceNow-AI

company

AI & ML interests

None defined yet.

Recent Activity

gabegma authored a paper about 5 hours ago

Azimuth: Systematic Error Analysis for Text Classification

lindsaybrin authored a paper about 5 hours ago

Azimuth: Systematic Error Analysis for Text Classification

tarabogavelli authored a paper about 6 hours ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

View all activity

Papers

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

View all Papers

Articles

vLLM V0 to V1: Correctness Before Corrections in RL

A New Framework for Evaluating Voice Agents (EVA)

Introducing SyGra Studio

🚀 SyGra V2.0.0

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

View all articles

authored a paper about 5 hours ago

Developing Safe and Responsible Large Language Models -- A Comprehensive Framework

Paper • 2404.01399 • Published Apr 1, 2024 • 1

authored a paper about 5 hours ago

Azimuth: Systematic Error Analysis for Text Classification

Paper • 2212.08216 • Published Dec 16, 2022

authored a paper about 5 hours ago

M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models

Paper • 2406.16783 • Published Jun 24, 2024 • 4

authored a paper about 6 hours ago

Prompting with Phonemes: Enhancing LLM Multilinguality for non-Latin Script Languages

Paper • 2411.02398 • Published Nov 4, 2024 • 1

authored a paper about 6 hours ago

Prompting with Phonemes: Enhancing LLM Multilinguality for non-Latin Script Languages

Paper • 2411.02398 • Published Nov 4, 2024 • 1

authored a paper about 6 hours ago

DNA Bench: When Silence is Smarter -- Benchmarking Over-Reasoning in Reasoning LLMs

Paper • 2503.15793 • Published Mar 20, 2025

authored a paper about 6 hours ago

Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels

Paper • 2406.17415 • Published Jun 25, 2024

authored a paper about 6 hours ago

Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models

Paper • 2503.01781 • Published Mar 3, 2025 • 2

authored a paper about 6 hours ago

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 6

authored 2 papers about 6 hours ago

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 6

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

authored a paper about 6 hours ago

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

authored a paper about 6 hours ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 123

authored a paper about 6 hours ago

RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback

Paper • 2510.06186 • Published Oct 7, 2025

authored a paper about 6 hours ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published 7 days ago • 60

authored a paper about 6 hours ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published 6 days ago • 60

authored a paper about 6 hours ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published 6 days ago • 60

authored a paper about 6 hours ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published 6 days ago • 60

authored a paper about 6 hours ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published 6 days ago • 60

authored a paper about 6 hours ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published 6 days ago • 60