Computational references are not experiments: pre-registered validation of machine-learned sodium-cathode voltages

arXiv:2606.23725v1 (cross-listed). Abstract: Machine-learning screens for battery materials are trained and judged almost entirely against computed reference voltages, and those references carry their own systematic errors. We report a case in which this matters quantitatively: our own screening stack (a graph-network voltage screen, a prior-art triage layer, and a local PBE+U bench) fails pre-registered validation against experiment-anchored literature values. Verdict thresholds, failure modes, and the primary metric were committed before analysis. On an operator-audited set of known Na-
This research arrives as machine learning is increasingly applied to materials science, particularly battery discovery, which makes a critical look at the computed reference values used to train and judge ML models timely.
It highlights a methodological weakness in prevailing machine-learning practice for battery materials: computational references are not substitutes for experimental validation, and treating them as such undermines the reliability of ML-driven discoveries.
The key takeaway is that ML models trained solely on computational references can fail pre-registered validation against experiment, which calls for a re-evaluation of current screening practices and a greater emphasis on anchoring models to experimental data.
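As a rough illustration of what pre-registered validation means in practice, the sketch below freezes a metric and verdict thresholds before any comparison is run, then scores predicted voltages against experimental ones. Everything in it is an assumption made for illustration: the choice of mean absolute error, the threshold values, the `PreRegistration` and `verdict` names, and the voltages are hypothetical placeholders, not the paper's metric, thresholds, or data.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)  # frozen: thresholds cannot be changed after commitment
class PreRegistration:
    """Hypothetical record of choices committed before analysis."""
    metric: str            # name of the primary metric
    pass_below_v: float    # MAE at or below this (in volts) counts as a pass
    fail_above_v: float    # MAE at or above this (in volts) counts as a fail


def mean_absolute_error(predicted, reference):
    """Mean absolute difference between two equal-length voltage lists."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("need two non-empty lists of equal length")
    return mean(abs(p - r) for p, r in zip(predicted, reference))


def verdict(prereg, predicted, experimental):
    """Apply the frozen thresholds; returns (label, mae)."""
    mae = mean_absolute_error(predicted, experimental)
    if mae <= prereg.pass_below_v:
        return "pass", mae
    if mae >= prereg.fail_above_v:
        return "fail", mae
    return "inconclusive", mae


# Placeholder thresholds and voltages, invented for this example only.
PREREG = PreRegistration(metric="mae", pass_below_v=0.20, fail_above_v=0.40)
predicted_v = [3.10, 2.60, 3.90, 3.30]      # model output (made up)
experimental_v = [3.40, 3.20, 3.40, 2.80]   # experiment-anchored values (made up)

label, mae = verdict(PREREG, predicted_v, experimental_v)
print(f"{PREREG.metric} = {mae:.3f} V -> {label}")
```

The point of the frozen dataclass is that the verdict rule is fixed before the experimental comparison is seen, so a poor result cannot be rescued by adjusting the threshold afterwards.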
- Experimental materials scientists
- Battery manufacturers prioritizing robust validation
- Data scientists focused on experimental data integration
- ML models based solely on computational references
- Researchers over-relying on theoretical simulations for validation
- Investment in ML-driven battery startups without strong experimental ties
Increased scrutiny and demand for experimental validation in machine learning for materials science.
Development of new ML methodologies and benchmarks that incorporate experimental data more rigorously, potentially slowing down initial screening but improving accuracy.
A shift in funding and research priorities towards hybrid computational-experimental approaches in materials discovery, impacting the pace and direction of battery technology development.
Read at arXiv cs.LG