Datasets #
Note
There are two main classes related to datasets handling in Dataiku’s Python APIs:
-
dataiku.core.dataset.Dataset
in thedataiku
package, which deals primarily with reading and writing data. It has the most flexibility when it comes to reading and writing -
dataikuapi.dss.dataset.DSSDataset
in thedataikuapi
package which is mostly used for creating datasets, managing their settings, building flows, creating ML models, and performing a wider range of operations on datasets.
For more details on the two packages, please see Getting started
For starting code samples, please see Python Recipes .
Detailed samples about interacting with datasets can be found in:
Reference documentation for the classes supporting interaction with datasets can be found in Datasets