Code Generation On Turbulence
Metrics
CorrSc
Results
Performance results of various models on this benchmark
| Model Name | CorrSc | Paper Title |
|---|---|---|
| GPT-4 | 0.848 | Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code |
| GPT-3.5-Turbo | 0.617 | Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code |
| CodeLlama:13B-4bit-quantised | 0.327 | Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code |
| CodeLlama:7B-4bit-quantised | 0.289 | Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code |
| Command | 0.063 | Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code |