- StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
  Paper • 2510.02209 • Published • 57
- AI-Trader: Benchmarking Autonomous Agents in Real-Time Financial Markets
  Paper • 2512.10971 • Published • 4
- When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world Environments
  Paper • 2407.18957 • Published • 3
- INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent
  Paper • 2412.18174 • Published • 1
Kuo-Hsin Tu
dapumptu
AI & ML interests
None yet
Recent Activity
liked a model about 23 hours ago: janhq/Jan-code-4b
liked a model about 23 hours ago: ByteDance-Seed/cudaLLM-8B
liked a dataset about 23 hours ago: ByteDance-Seed/cudaLLM-data
Organizations
llm
agent
- PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World
  Paper • 2412.17589 • Published • 14
- Agent-SafetyBench: Evaluating the Safety of LLM Agents
  Paper • 2412.14470 • Published • 13
- Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions
  Paper • 2506.08234 • Published • 9
- Adaptive Domain Modeling with Language Models: A Multi-Agent Approach to Task Planning
  Paper • 2506.19592 • Published • 1
finance
- UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
  Paper • 2410.14059 • Published • 63
- A Survey of Large Language Models in Finance (FinLLMs)
  Paper • 2402.02315 • Published • 1
- Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training
  Paper • 2404.10555 • Published • 3
- Baichuan4-Finance Technical Report
  Paper • 2412.15270 • Published • 4
code
- OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
  Paper • 2411.04905 • Published • 127
- Granite Code Models: A Family of Open Foundation Models for Code Intelligence
  Paper • 2405.04324 • Published • 25
- Seed-Coder: Let the Code Model Curate Data for Itself
  Paper • 2506.03524 • Published • 6
- Qwen2.5-Coder Technical Report
  Paper • 2409.12186 • Published • 153
reasoning
trade