simRegression Model Specifications

Model Specifications modify a model's behavior and access to server resources.

simRegression parameters guide the model training process by dictating the number and size of decision trees to be constructed, as well as other aspects of the learning procedure.

Iterations: Number of gradient boosting iterations, that is, the number of trees built. More iterations can improve accuracy but increase the risk of overfitting.
Max Depth: Maximum depth of each tree. Deeper trees capture more complex patterns but are more prone to overfitting.
ETA: Learning rate (step-size shrinkage) applied to each new tree's contribution. Smaller values slow down learning and help avoid overfitting, but typically require more iterations to reach the same fit.
Col Sample by Tree: Fraction of features sampled when constructing each tree. Lower ratios increase diversity among trees and can reduce overfitting.
Regression Objective: Loss function used for training. It should match the characteristics of the regression problem. The allowed values are:
  squared_error: Minimizes the squared difference between actual and predicted values, which penalizes large errors more heavily.
  squared_log_error: Uses squared logarithmic loss. Suitable only for targets greater than -1.
  logistic: Uses the logistic loss, as in logistic regression, and outputs probabilities.
  pseudo_huber_error: Uses Pseudo-Huber loss, a smooth alternative to absolute loss.
  absolute_error: Uses absolute (L1) error.
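The Regression Objective names correspond to well-known loss functions. As a rough illustration, the sketch below writes out the standard textbook per-sample form of each one in plain Python. It is not simRegression's implementation: the function signatures, the `delta` parameter of the Pseudo-Huber loss, and the choice to pass a raw (pre-sigmoid) score to the logistic loss are assumptions made for this example, and constant scaling factors that some libraries apply are omitted.

```python
import math


def squared_error(y_true, y_pred):
    # Squared difference: grows quadratically with the residual.
    return (y_pred - y_true) ** 2


def squared_log_error(y_true, y_pred):
    # log1p(x) = log(1 + x) is only defined for x > -1,
    # which is why this objective needs targets greater than -1.
    if y_true <= -1 or y_pred <= -1:
        raise ValueError("squared_log_error needs values greater than -1")
    return (math.log1p(y_pred) - math.log1p(y_true)) ** 2


def _softplus(x):
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def logistic(y_true, raw_score):
    # Negative log-likelihood of a target in [0, 1] when the predicted
    # probability is sigmoid(raw_score). Taking a raw score as input is
    # an assumption of this sketch.
    return y_true * _softplus(-raw_score) + (1.0 - y_true) * _softplus(raw_score)


def pseudo_huber_error(y_true, y_pred, delta=1.0):
    # Smooth approximation of absolute error; delta (hypothetical default
    # of 1.0) sets where it switches from quadratic to linear growth.
    r = (y_pred - y_true) / delta
    return delta ** 2 * (math.sqrt(1.0 + r * r) - 1.0)


def absolute_error(y_true, y_pred):
    # L1 error: grows linearly with the residual.
    return abs(y_pred - y_true)


# Compare how three of the losses react to small and large residuals.
for residual in (0.5, 5.0, 50.0):
    print(
        residual,
        squared_error(0.0, residual),
        pseudo_huber_error(0.0, residual),
        absolute_error(0.0, residual),
    )
```

For large residuals the squared error grows quadratically, while the Pseudo-Huber and absolute errors grow roughly linearly, which is why the latter two are less sensitive to outliers in the target.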