Datasets are tabular data objects that are searchable and live in folders rather than within schemas, essentially like a Benchling CSV.
What are datasets for?
Datasets are representations of tabular data within Benchling for the purpose of saving a discrete set of data without adding it to your published databases. Some examples of how Datasets can be used as a fast and flexible way to track intermediate data from an instrument, a list of limits or parameters for a particular assay, or temporary storage for well roles. Datasets are often intermediate pieces of a larger data analysis pipeline.
Datasets In Analysis
Datasets can be both data inputs and/or outputs to an Analysis. Below are examples of both import and export:
To ingest a Dataset into Analysis:
- Click the + icon to add a new Analysis table
- From the list of table creation options, click From dataset
- Use the text box to search for your source dataset
- Define the table name, then click the Add table button
Once you have completed your analysis, you can export a Dataset as an output as follows:
- Select the dataset that contains data you want to use to create an output
- Click create outputs in the top right corner
- Select items to associate with an output
- Define the folder where the dataset will be located
- Click create outputs at the bottom of the modal
These outputs will populate at the bottom of Outputs as shown in the selected Analysis in the left toolbar
Datasets In Connect
When configuring an Output file in a run schema, the “Record dataset” option (under Benchling action) allows for the Output file to be processed and recorded as Datasets. Using a Dataset eliminates the need for schema configuration, such as registry or result schemas. Datasets are recommended when the output data will serve as an input for subsequent analysis.
When a run is executed using the "Record dataset" option to process an output file, the processed data is stored and displayed in a tabular format and as a new Dataset object, as illustrated below:
This Dataset can be sent directly to either a new analysis or an existing analysis.
Datasets & Studies
To tag a Study within an Analysis:
1. Click on the info icon in the upper right corner of the Analysis to view the metadata
2. Click on the edit icon under Studies to add or edit the Study that the Analysis is tagged with
3. Use the text box to search for the relevant Study then click on the check mark icon to save the tag
This will allow the Datasets associated with an Analysis to appear within the Study Items, where all Datasets and Analyses linked with the Study will appear. When you are done viewing or updating metadata in this modal, click the Save button.
Datasets are not the same as results tables, registry tables, or unstructured tables within the notebook
This is the Dataset workspace. Datasets can be optionally tagged with a Study, or they can have Custom Fields to tag with custom metadata.