XGBoost Flashcards

Extreme Gradient Boosting for fast and accurate decision tree models

⚡ What is XGBoost?

XGBoost (eXtreme Gradient Boosting) is an optimized gradient boosting algorithm for supervised learning, designed for speed, efficiency, and predictive performance.

🎯 Primary Strength

High speed and accuracy on structured/tabular data; outperforms many models in ML competitions.

📦 Installation

Install with pip install xgboost or from conda-forge. The standard pip wheels already include GPU support; pip install xgboost[dask] adds the optional Dask dependencies for distributed training.

🧠 Use Cases

Widely used for classification, regression, learning-to-rank, fraud detection, and Kaggle competitions.

โš™๏ธ Important Parameters

n_estimators, max_depth, eta, gamma, subsample, colsample_bytree.

๐Ÿ” Evaluation Metrics

Supports AUC, RMSE, MAE, Logloss, and custom evaluation functions.

🧪 Regularization

Includes L1 (alpha / reg_alpha) and L2 (lambda / reg_lambda) regularization to reduce overfitting.

📈 Early Stopping

Stops training when the validation metric has not improved for a given number of rounds (early_stopping_rounds).

💾 Save & Load

Save a model with model.save_model() and load it with model.load_model(); the JSON format is recommended for portability across versions.

🧩 scikit-learn API

Use XGBClassifier and XGBRegressor seamlessly with pipelines and GridSearchCV.

⚡ GPU Support

Accelerate training on large datasets with device="cuda" (XGBoost 2.0+); earlier versions used tree_method="gpu_hist".

🔗 Dask Integration

Distributed training for big-data workloads via the xgboost.dask API.