Datasets

Note

There are two main classes related to dataset handling in Dataiku’s Python APIs:

  • dataiku.core.dataset.Dataset in the dataiku package, which deals primarily with reading and writing data and offers the most flexibility for both.

  • dataikuapi.dss.dataset.DSSDataset in the dataikuapi package, which is mostly used for creating datasets, managing their settings, building flows, creating ML models, and performing a wider range of operations on datasets.
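
The split between the two classes can be sketched as follows. This is a minimal illustration rather than a reference implementation: it assumes the code runs inside DSS (for example in a Python recipe or notebook), where the dataiku package is available, and the helper function names and the dataset name "mydataset" are hypothetical.

```python
def read_dataset_as_dataframe(dataset_name):
    """Read a dataset's rows through dataiku.Dataset (data access)."""
    # Imported lazily: the dataiku package is provided by the DSS environment.
    import dataiku

    dataset = dataiku.Dataset(dataset_name)
    # Loads the dataset content as a pandas DataFrame.
    return dataset.get_dataframe()


def get_dataset_settings(dataset_name):
    """Fetch a dataset's settings through DSSDataset (management)."""
    import dataiku

    # api_client() returns a client that is already authenticated inside DSS.
    client = dataiku.api_client()
    project = client.get_default_project()
    # project.get_dataset() returns a dataikuapi.dss.dataset.DSSDataset handle.
    dss_dataset = project.get_dataset(dataset_name)
    return dss_dataset.get_settings()


if __name__ == "__main__":
    # "mydataset" is a placeholder; replace it with a dataset from your project.
    df = read_dataset_as_dataframe("mydataset")
    print(df.head())
    settings = get_dataset_settings("mydataset")
    print(type(settings))
```

The first helper only touches data, while the second only touches configuration, which mirrors the division of responsibilities described above.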

For more details on the two packages, please see Getting started.

For starting code samples, please see Python Recipes.

Detailed samples about interacting with datasets can be found in:

Reference documentation for the classes supporting interaction with datasets can be found in Datasets.