HyperAI
HyperAI
Home
Console
Docs
News
Papers
Tutorials
Datasets
Wiki
SOTA
LLM Models
GPU Leaderboard
Events
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
Search the site…
⌘
K
Command Palette
Search for a command to run...
Console
Home
SOTA
Code Generation
Code Generation On Res Q
Code Generation On Res Q
Metrics
pass@1
Results
Performance results of various models on this benchmark
Columns
Model Name
pass@1
Paper Title
QurrentOS-coder + Claude 3.5 Sonnet
58.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + GPT-4o
46.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + GPT-4 Turbo
37.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + Claude 3 Opus
36.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + Gemini 1.5 Pro
30.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + GPT-4
30.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + DeepSeek-Coder-V2
29.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + Llama 3 70b
20.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + Qwen-72B-Instruct
18.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
0 of 9 row(s) selected.
Previous
Next
HyperAI
HyperAI
Home
Console
Docs
News
Papers
Tutorials
Datasets
Wiki
SOTA
LLM Models
GPU Leaderboard
Events
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
Search the site…
⌘
K
Command Palette
Search for a command to run...
Console
Home
SOTA
Code Generation
Code Generation On Res Q
Code Generation On Res Q
Metrics
pass@1
Results
Performance results of various models on this benchmark
Columns
Model Name
pass@1
Paper Title
QurrentOS-coder + Claude 3.5 Sonnet
58.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + GPT-4o
46.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + GPT-4 Turbo
37.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + Claude 3 Opus
36.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + Gemini 1.5 Pro
30.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + GPT-4
30.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + DeepSeek-Coder-V2
29.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + Llama 3 70b
20.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
QurrentOS-coder + Qwen-72B-Instruct
18.0
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
0 of 9 row(s) selected.
Previous
Next
Code Generation On Res Q | SOTA | HyperAI