Evaluating Large Language Models Trained on Code

paper-0131 · paper · 2021

Mark Chen et al.

Codex; LLMs write code, the capability that transformed software work.

Academic, score -0.2039

Metric	Status	Value	Norm.	Weight	Contribution	Source	Confidence	Provenance
citation_count	present	1428.0	0.006424	0.5	0.003212	OpenAlex	high	link
library_holdings	missing	recorded as missing, penalized by rule, never imputed			−0.1	recorded as missing; penalized by rule, never imputed
readership_persistence	present	6.0	0.357143	0.05	0.017857	OpenAlex	medium	link
syllabus_adoptions	missing	recorded as missing, penalized by rule, never imputed			−0.125	recorded as missing; penalized by rule, never imputed

Broad Influence, score -0.0559

Metric	Status	Value	Norm.	Weight	Contribution	Source	Confidence	Provenance
citation_count	present	1428.0	0.006424	0.2	0.001285	OpenAlex	high	link
library_holdings	missing	recorded as missing, penalized by rule, never imputed			−0.125	recorded as missing; penalized by rule, never imputed
readership_persistence	present	6.0	0.357143	0.4	0.142857	OpenAlex	medium	link
syllabus_adoptions	missing	recorded as missing, penalized by rule, never imputed			−0.075	recorded as missing; penalized by rule, never imputed

Governance Practitioner, score -0.2877

Metric	Status	Value	Norm.	Weight	Contribution	Source	Confidence	Provenance
citation_count	present	1428.0	0.006424	0.25	0.001606	OpenAlex	high	link
library_holdings	missing	recorded as missing, penalized by rule, never imputed			−0.15	recorded as missing; penalized by rule, never imputed
readership_persistence	present	6.0	0.357143	0.1	0.035714	OpenAlex	medium	link
syllabus_adoptions	missing	recorded as missing, penalized by rule, never imputed			−0.175	recorded as missing; penalized by rule, never imputed

A rank is not a verdict on intrinsic worth. It is a transparent output of declared evidence, weights, and missing-data rules at a specific release date.

Disagree with this rank or a number? Challenge it with your evidence. Every challenge gets a public identifier and a published resolution.