Score breakdown

Training Language Models to Follow Instructions with Human Feedback

paper-0144 · paper · 2022

Long Ouyang et al.

InstructGPT; RLHF at scale, the technique behind aligned chat models.

Academic, score -0.1939

MetricStatusValueNorm.WeightContributionSourceConfidenceProvenance
citation_countpresent4292.00.0193160.50.009658OpenAlexhighlink
library_holdingsmissingrecorded as missing, penalized by rule, never imputed−0.1recorded as missing; penalized by rule, never imputed
readership_persistencepresent7.00.4285710.050.021429OpenAlexmediumlink
syllabus_adoptionsmissingrecorded as missing, penalized by rule, never imputed−0.125recorded as missing; penalized by rule, never imputed

Broad Influence, score -0.0247

MetricStatusValueNorm.WeightContributionSourceConfidenceProvenance
citation_countpresent4292.00.0193160.20.003863OpenAlexhighlink
library_holdingsmissingrecorded as missing, penalized by rule, never imputed−0.125recorded as missing; penalized by rule, never imputed
readership_persistencepresent7.00.4285710.40.171429OpenAlexmediumlink
syllabus_adoptionsmissingrecorded as missing, penalized by rule, never imputed−0.075recorded as missing; penalized by rule, never imputed

Governance Practitioner, score -0.2773

MetricStatusValueNorm.WeightContributionSourceConfidenceProvenance
citation_countpresent4292.00.0193160.250.004829OpenAlexhighlink
library_holdingsmissingrecorded as missing, penalized by rule, never imputed−0.15recorded as missing; penalized by rule, never imputed
readership_persistencepresent7.00.4285710.10.042857OpenAlexmediumlink
syllabus_adoptionsmissingrecorded as missing, penalized by rule, never imputed−0.175recorded as missing; penalized by rule, never imputed

A rank is not a verdict on intrinsic worth. It is a transparent output of declared evidence, weights, and missing-data rules at a specific release date.

Disagree with this rank or a number? Challenge it with your evidence. Every challenge gets a public identifier and a published resolution.