13 9

Алексеев Алексей

VictoriaWilliam

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

Shallow Prefill, Deep Decoding: Efficient Long-Context Inference via Layer-Asymmetric KV Visibility

upvoted a paper 4 days ago

R^3-SQL: Ranking Reward and Resampling for Text-to-SQL

upvoted a paper 8 days ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

View all activity

Organizations

None yet

upvoted a paper about 20 hours ago

Shallow Prefill, Deep Decoding: Efficient Long-Context Inference via Layer-Asymmetric KV Visibility

Paper • 2605.06105 • Published 8 days ago • 3

upvoted a paper 4 days ago

R^3-SQL: Ranking Reward and Resampling for Text-to-SQL

Paper • 2604.25325 • Published 17 days ago • 3

upvoted a paper 8 days ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published 9 days ago • 96

liked a dataset 8 days ago

cadene/droid

Preview • Updated Feb 27, 2025 • 287k • 15

liked a dataset 14 days ago

allenai/c4

Viewer • Updated Jan 9, 2024 • 10.4B • 814k • 572

liked a model 21 days ago

aioaneid/nanochat_n_layer_12_seq_len_1024_n_embd_1024

Updated about 2 hours ago • 2

upvoted 2 papers 22 days ago

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Paper • 2604.18224 • Published 25 days ago • 22

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 23 days ago • 240

upvoted 4 papers about 1 month ago

Self-Execution Simulation Improves Coding Models

Paper • 2604.03253 • Published Mar 11 • 35

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 501

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 628

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

liked 4 models about 1 month ago

arithmetic-circuit-overloading/Llama-3.3-70B-Instruct-v2-3d-4M-400K-0.1-reverse-padzero-99-128D-2L-2H-512I

Text Generation • 662k • Updated Apr 4 • 73 • 1

liked 2 datasets about 1 month ago

Eimhin03/NM3-irish-augmented-iter5

Viewer • Updated Apr 1 • 10.8k • 206 • 1

HuggingFaceFW/finephrase

Viewer • Updated Mar 31 • 1.02B • 427k • 111

upvoted a paper about 1 month ago

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

Paper • 2603.26728 • Published Mar 20 • 12

upvoted a paper about 2 months ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

Алексеев Алексей

AI & ML interests

Recent Activity

Organizations

VictoriaWilliam's activity