Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

paper-0033 · paper · 1992

Ronald J. Williams

REINFORCE; the original policy-gradient method.

Academic, score -0.1582

Metric	Status	Value	Norm.	Weight	Contribution	Source	Confidence	Provenance
citation_count	present	7463.0	0.033589	0.5	0.016795	OpenAlex	high	link
library_holdings	missing	recorded as missing, penalized by rule, never imputed			−0.1	recorded as missing; penalized by rule, never imputed
readership_persistence	present	15.0	1.0	0.05	0.05	OpenAlex	medium	link
syllabus_adoptions	missing	recorded as missing, penalized by rule, never imputed			−0.125	recorded as missing; penalized by rule, never imputed

Broad Influence, score 0.2067

Metric	Status	Value	Norm.	Weight	Contribution	Source	Confidence	Provenance
citation_count	present	7463.0	0.033589	0.2	0.006718	OpenAlex	high	link
library_holdings	missing	recorded as missing, penalized by rule, never imputed			−0.125	recorded as missing; penalized by rule, never imputed
readership_persistence	present	15.0	1.0	0.4	0.4	OpenAlex	medium	link
syllabus_adoptions	missing	recorded as missing, penalized by rule, never imputed			−0.075	recorded as missing; penalized by rule, never imputed

Governance Practitioner, score -0.2166

Metric	Status	Value	Norm.	Weight	Contribution	Source	Confidence	Provenance
citation_count	present	7463.0	0.033589	0.25	0.008397	OpenAlex	high	link
library_holdings	missing	recorded as missing, penalized by rule, never imputed			−0.15	recorded as missing; penalized by rule, never imputed
readership_persistence	present	15.0	1.0	0.1	0.1	OpenAlex	medium	link
syllabus_adoptions	missing	recorded as missing, penalized by rule, never imputed			−0.175	recorded as missing; penalized by rule, never imputed

A rank is not a verdict on intrinsic worth. It is a transparent output of declared evidence, weights, and missing-data rules at a specific release date.

Disagree with this rank or a number? Challenge it with your evidence. Every challenge gets a public identifier and a published resolution.