Evaluation of Population Initialization Methods for Genetic Programming-based Symbolic Regression

arXiv:2606.31990v1 Announce Type: cross Abstract: We analyze the effect of optimizing the initial population of genetic programming (GP) for symbolic regression (SR) on the accuracy and complexity of solutions. We compare three well-established random initialization methods as well as initialization with small optimized solutions from exhaustive symbolic regression (ESR) using a GP/SR implementation which is based on the multi-objective evolutionary algorithm NSGA-II. We compare the final Pareto fronts found with each initialization method on twelve synthetic problems of varying complexity and
The continuous drive for more efficient and robust AI models underpins research into optimizing foundational processes like genetic programming initialization.
Improving genetic programming's efficiency and accuracy directly contributes to the development of more sophisticated and performant AI systems, impacting various downstream applications.
Optimized initialization methods could lead to faster development cycles and more effective solutions for symbolic regression, a key component in complex AI problem-solving.
- · AI developers
- · Machine learning researchers
- · SaaS companies leveraging AI
- · Computational science
- · Inefficient AI development paradigms
More accurate and less complex AI models are developed using genetic programming.
Accelerated progress in fields relying on symbolic regression, such as drug discovery and materials science, where AI finds complex relationships in data.
Enhanced AI capabilities could allow for the automation of increasingly nuanced and previously manual white-collar tasks, further enabling AI agents.
This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.
Read at arXiv cs.LG