You are here: Decision Strategy Manager > Managing predictive and text analytics > Predictive analytics > Settings for the Genetic algorithm model

Settings for the genetic algorithm model

In the Edit project settings section of a predictive model, you can change settings for creating a genetic algorithm model. The settings include default values for particular options.

Button Description
Add pool Add a pool (or population) of predictive models under the control of the genetic algorithm. Click the name of the pool to change its settings.
Default settings Define the settings for the construction and operation of a pool.

Pool settings

Setting

Description

Pool details

Pool name Enter a name of the pool.
Pool size Enter several models in the pool.

Technique

Technique

Select the type of genetic algorithm that is used for developing the pool:

Optimize new (sub)models Select this option to optimize the parameters of each part of the model as it is changed. This ensures the best use of the predictive information is made throughout the model.
Sampling mechanism

Select the sampling mechanism that is used for developing the pool:

Scaling method

Select the scaling method that is used for developing the pool:

Elite size Number of the top-performing models in one generation that are carried onto the next generation. Enter 1 to prevent the pool from losing its best model.
Replacement count Enter the number of models to replace at each generation of the steady state algorithm.
Tournament size Enter the number of tournament contestants for the tournament sampling.
Scaling parameter Enter the number for the parameter or parameters that are used in each scaling method for fine-tuning.

Model construction

Use bivariate statistic Select this option to use the operators and their parameters that are identified as best at modeling the interactions between predictors when you create a bivariate model.
Use predictor groups Select this option to use one predictor from each of the groups that are identified during predictor grouping and only replace a predictor with another one from the same group. This option prevents the inclusion of duplicate predictors and minimizes the size of the model that is required to incorporate all information. Clear this option to increase model depth and allow more freedom to the genetic algorithm
Enable intelligent genetics Enable intelligent genetics to develop non-linear models (where non-linearity is assumed from the outset) that might outperform models that are developed by structural genetics. This strategy initially generates models with a lower performance, and it is a slow and computationally more expensive process. The result is identical size models and, if the relationship between data and behavior is non-linear, these models have greater predictive power.
Enable structural genetics Structural genetics is the default strategy to develop near-linear models that are at least as powerful as regression models. Non-linear operators are introduced only where they improve performance. Initially, structural genetics generates models with higher performance, and model generation is faster. The result is variable size models with greater data efficiency, which is translated in achieving more power from the same data. The models are easier to understand because they are more linear and robust, and more likely to perform as expected on different data.
Maximum tree depth

Specify the maximum number of levels in the models. For balanced models, the minimum is given by the following formula:

nodes=(2*(number of predictors+number of constants))-1

Crossover mutation

Crossover probability Specify the probability of crossover occurrence during the creation of the offspring. Crossover is the process of creating models by exchanging branches of parent trees.
Mutation probability Specify the probability of mutation occurrence on the created offspring. Mutation is the random alteration of a (randomly selected) node in a model.
Branch replacement Specify the probability of replacing whole branches with randomly created ones during mutation.
Node replacement Specify the probability of changing only the type of a node in a model.
Argument swapping Specify the probability of changing the child order (argument order) of a node in a model.

Simulated annealing

Initial temperature Specify the initial value of the temperature that controls the amount of change to models.
Temperature decrease Specify the rate at which the temperature decreases with each generation.