| dataset_factory {arrow} | R Documentation |
A Dataset can constructed using one or more DatasetFactorys.
This function helps you construct a DatasetFactory that you can pass to
open_dataset().
dataset_factory(
x,
filesystem = c("auto", "local"),
format = c("parquet", "arrow", "ipc", "feather"),
partitioning = NULL,
allow_not_found = FALSE,
recursive = TRUE,
...
)
x |
A string file x containing data files, or
a list of |
filesystem |
A string identifier for the filesystem corresponding to
|
format |
A string identifier of the format of the files in |
partitioning |
One of
|
allow_not_found |
logical: is |
recursive |
logical: should files be discovered in subdirectories of
|
... |
Additional arguments passed to the FileSystem |
If you would only have a single DatasetFactory (for example, you have a
single directory containing Parquet files), you can call open_dataset()
directly. Use dataset_factory() when you
want to combine different directories, file systems, or file formats.
A DatasetFactory object. Pass this to open_dataset(),
in a list potentially with other DatasetFactory objects, to create
a Dataset.