catboost.save_pool {catboost}    R Documentation

Save the dataset

Description

Save the dataset in the CatBoost format. The following files are created:

  • The dataset description (written to pool_path)

  • The column descriptions (written to cd_path)

Use the catboost.load_pool function to read the resulting files. These files can also be used in the Command-line version and the Python library.

Usage

catboost.save_pool(data, label = NULL, weight = NULL,
  baseline = NULL, pool_path = "data.pool", cd_path = "cd.pool")

Arguments

data

A data.frame with features. The following column types are supported:

  • double

  • factor. Categorical features are assumed to be stored in columns of this type. The standard CatBoost processing procedure is applied to these columns:

    1. The values are converted to strings.

    2. The ConvertCatFeatureToFloat function is applied to the resulting strings.

Default value: Required argument
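
As a minimal sketch of a data.frame that fits the supported column types, the snippet below builds one double column and one factor column and reproduces only the first processing step (conversion to strings) in base R. The column names and values are hypothetical, and the hashing step performed by ConvertCatFeatureToFloat is internal to CatBoost and is not shown.

```r
# Hypothetical example data: one double column and one factor column.
df <- data.frame(
  age = c(25, 32.5, 47),
  city = factor(c("Paris", "Oslo", "Paris"))
)

# Inspect the column classes before passing the data.frame on.
col_classes <- sapply(df, function(col) class(col)[1])

# Step 1 of the procedure described above, reproduced in base R:
# the factor values are converted to strings.
city_strings <- as.character(df$city)
```

Checking the column classes up front may help catch columns (for example, character columns) that are not among the types listed above.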

label

The label vector.

weight

The weights of the label vector.

baseline

Vector of initial (raw) values of the label function for the object. Used in the calculation of final values of trees.

pool_path

The path to the output file that contains the dataset description.

cd_path

The path to the output file that contains the column descriptions.
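
Examples

The following is a sketch of a typical call, not an official example. The data, labels, and weights are hypothetical, and the output files are written to a temporary directory. Reading the files back relies on the assumption that catboost.load_pool accepts the column description path through a column_description argument; check the catboost.load_pool help page for the installed version.

```r
# Hypothetical example data (not taken from the package documentation).
features <- data.frame(
  age = c(25, 32.5, 47, 51),
  city = factor(c("Paris", "Oslo", "Paris", "Rome"))
)
labels <- c(0, 1, 0, 1)
weights <- c(1, 1, 2, 1)

# Write both output files to a temporary directory.
pool_file <- file.path(tempdir(), "data.pool")
cd_file <- file.path(tempdir(), "cd.pool")

# Run the CatBoost calls only when the package is installed.
if (requireNamespace("catboost", quietly = TRUE)) {
  catboost::catboost.save_pool(
    data = features, label = labels, weight = weights,
    pool_path = pool_file, cd_path = cd_file
  )

  # Assumption: the column description path is passed via the
  # column_description argument of catboost.load_pool.
  pool <- catboost::catboost.load_pool(pool_file, column_description = cd_file)
}
```

Passing explicit pool_path and cd_path values avoids writing data.pool and cd.pool into the current working directory, which is where the default values shown in Usage would place them.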


[Package catboost version 0.20 Index]