DNA and RNA sequence overview

Benchling’s DNA and RNA sequence features offer a modern, collaborative way to manage, analyze, and design genetic constructs. Instead of relying on disparate software and files, you can centralize your RNA sequences in Benchling, creating a single source of truth for research. This not only improves organization but also facilitates collaboration and ensures data integrity.

Benchling's cloud-based nature allows for seamless collaboration. Teams can work on the same sequences in real-time, reducing redundancy and improving hand-offs. The platform helps standardize sequence analysis and registration across an entire organization.

This article provides comprehensive guidance on managing DNA and RNA sequences within Benchling, covering importing, viewing, editing, exporting, bulk operations, sharing, and locking sequences.

Supported sequence file formats

Benchling supports many common and proprietary file formats. Using a supported file format means that Benchling will import the sequence, annotations, and comments.

We recommend GenBank for processed sequences, ab1 for trace data, and then FASTA and raw bases for raw sequence files over any other file types. The list below provides additional non-principally supported file types that can still be imported into Benchling.

If you encounter an import error, ensure your file extensions are formatted correctly. If you encounter an import error due to an unsupported file format, try saving or exporting your file in GenBank or FASTA format. Benchling does not currently support .pdf, .doc, or .docx file types for sequence import. To import sequences from these files, you can manually copy and paste the bases into Benchling.

Format Name	Extension [1]	Multi [2]	Trace [3]
Ape	*.ape
Applied Biosystems Inc. Format	*.ab1		x
SCF Trace Format	*.scf		x
Zip Archive	*.zip	x
Clone Manager	*.cm5
DNASTAR Lasergene Seqbuilder	*.seq
MacVector	*.nucl
Snapgene	*.dna
Serial Cloner	*.xdna
DNA Strider	*.xdna
Gene Construction Kit	.gcc, .gck
INSDC XML	*.xml	x
JBEI-ICE XML	*.xml	x
FinchTV	*.ftv		x
SBOL RDF	.xml, .rdf	x
Geneious Archive	*.geneious	x
Raw Bases	.txt, .rdf
GenBank	.gb, .gbk	x
Fasta	.txt, .fasta	x
Vector NTI Archive	*.ma4	x

Certain formats support additional capabilities:

Multi-file types – support multiple sequences that are automatically split by Benchling when imported at once
Trace file types – results from Sanger sequencing and contain chromatogram traces and quality data when imported as an alignment

Fields on GenBank files

GenBank files come with fields of different categories and sub-categories which are respected when loaded into Benchling.

There are:

1. Fields that characterize the whole sequence like accession, notes, etc

When imported into Benchling these will populate as custom fields on the metadata page for the whole sequence

2. Features that characterize regions of the sequence

Features of type CDS will become translations on the sequence. In GenBank these have trans_table properties that set the genetic code. Translations can have custom fields
Features of other types will become annotations on the sequence. Annotations can also have custom fields

Create DNA and RNA sequences

You can manually type in a sequence or import a sequence from a file on your computer, a database, or a genomic locus.

To create a sequence in Benchling:

Click the Global Create in the left sidebar
Hover over DNA/RNA Sequence, then click New DNA/RNA Sequence
This opens the Create DNA / RNA Sequence modal, which displays four ways to create a sequence, select the sequence input method by clicking:
- Create New to manually input a sequence by pasting the bases into the text bar
- Upload Files to upload a sequence file from your computer
- Import From Database to import common sequences from external databases
- Select Chromosomal Region to import from a genomic locus

The sections below discuss the steps for each method of sequence creation.

Create new

Click into the Create New tab
On this page define the sequence name, set the nucleotide type as DNA or RNA based on the sequence type you’re creating and select a folder to save it to
In the Bases text box, paste the sequence from your clipboard (recommended) or type it manually
After ensuring all sequence information is complete, click Create

Upload file

Click into the Upload Files tab
Select the folder to save your sequence to
Click Choose a file and select the file(s) from your computer or drag and drop the file(s) into the box
Update the file location, sequence topology, or tags associated with the file, as needed
Click Open Sequence to open the file, or click Close to close the modal

Import from database

To import sequences from external databases, click into the Import From Database tab
Use the Sequence text box to enter the name of the gene before clicking Search
- You can specify different IDs depending on where you want to import the sequence from
If multiple matches are found, use the Genome dropdown to select the species for the sequence you want to import, then click Search
Define the attributes of the sequence by editing the name so it displays as you want it to appear in Benchling
Select how you want to import the sequence as either a Genomic Sequence or DNA or RNA
Define a folder for storage, nucleotide type and optionally a schema if you wish to register the sequence

Benchling supports the following inputs for pulling active DNA sequences from external databases:

Addgene URL
BioBrick ID
Ensembl ID
Gene name or symbol in Ensembl
Joint BioEnergy Institute (JBEI) ID
NCBI Accession Number

Tip: If searching by a gene name returns a result in Ensembl, you can update or select multiple transcripts.

Create DNA sequences by selecting chromosomal region

Unlike the options listed above that are shared sequence creation methods between DNA and RNA sequences, selecting a chromosomal region is an option limited to creating a new DNA sequence.

Click into the Select Chromosomal Region tab
Use the Genome text box to enter your search terms to find active genomes, then click Search
- You can search by common name, species, or version
- We support the most recent Ensembl genome versions
  - In limited cases, we support prior versions
Use the dropdown to select the chromosome of interest and the start and end loci of the target sequences
Define a folder for storage, nucleotide type and optionally a schema if you wish to register the sequence
Click Import

Create sequences using a Registration table

If your experimental protocol results in new sequences that you want associated with the experimental context in the Notebook, you can create and register the sequence using a Registration table.

From the Insert menu in the toolbar, select Registration table
In the Insert registration table modal that opens, choose whether you want to Create new entities or Register existing entities, depending on whether you want to import new entities or register existing, unregistered ones
Use the dropdown to select the Registry schema for the entity. A table preview will display for you to review before inserting into the entry. Use this to confirm your selection, then click Insert
- If you select a DNA or RNA sequence, a Bases column populates next to the Entity Name column
Enter information about the entity into a row of the Registration table. Each row corresponds to one entity
Click Submit to register the entities
- Note: The columns of the table are defined in the Registry configuration

Create sequences using spreadsheet import

Benchling allows for the efficient creation and registration of many DNA or RNA sequences at once. This feature is particularly useful for larger data sets. In some cases you may want to bring in Registry metadata and fields as you create the sequence entities. This can be done through a spreadsheet upload then “reimporting” can be used to bring in the sequence annotations in bulk.

Prepare a spreadsheet for DNA or RNA import

For a successful import, the spreadsheet should be formatted as outlined below:

Name (Required) - The values in the column need to exactly match the name of your DNA or RNA sequence files from Snapgene, Geneious, or other software
Bases (Required) - If you are hoping to import annotations and other sequence features, keep this blank for now, the sequence maps can be reimported in a later step
Schema fields (Optional) - Fields that have been configured on the schema to capture entity metadata as part of a data model ex. Vendor, Antibiotic resistance, etc.

Navigate to the Registry on the navigation bar
Click the + and then click Import Entities
Select the schema of the entities you're importing and use the check box to indicate your DNA or RNA is circular, if necessary, then click Next
- Note: If you're using the Inventory application and would like to simultaneously create a container and deposit it into a Location (i.e. Box), then select Transfer entities to new or existing containers and designate the container schema
Now add your spreadsheet, you can either import the full spreadsheet or copy and paste the fields from your spreadsheet template into the raw text box. Please ensure that you include the column names.
Once this is done, click Next
In the next menu, ensure that the column names pulled from your spreadsheet match the column type fields. Once they're matched click Next
Benchling will now check for errors, if no there are no errors, click Import
- If there are errors, you will need to resolve them before you can continue
Your entities will now be viewable in the Registry

Reimport sequences

Reimporting is useful in cases where you are trying to bring in sequence features (such as annotations, translations, etc.) for sequences that have already been imported and registered in Benchling.

Within the Registry click the local create button and select Reimport DNA/RNA Sequences
In this menu, you can now drag and drop in the sequence files, the file name must be identical to the entity name in Benchling to be matched with the entities you created earlier
Once you add all the files, select Reimport
- Each sequence file should contain only one sequence per file, as Benchling will generate an error if multiple sequences are found in a single sequence file

Now that you have imported your sequences you can analyze various information about them.

View DNA and RNA sequences and change display features

When viewing a sequence in Benchling, you can highlight any section of RNA by clicking and dragging your mouse over that section. The bottom status bar will populate with information about that sequence. You can view the following properties:

Bases, or the length of the entire sequence including parts that are not highlighted
Start base
End base
Length of the highlighted section
Melting temperature (Tm)
GC content

Edit and personalize your sequence view

You may want to have different sequence tabs open at once to compare and contrast different views or easily gather data from one tab into another.

Toggle on or off the split workspace by clicking Split Workspace on the bottom right corner. You can also drag the tabs of the plasmid map around in your preferred order.

Control sequence map visual settings

Click the gear symbol on the upper right corner to toggle on and off different features of the plasmid map.

Annotations - determines whether annotations linked to the sequence will be displayed on the sequence map
Primers - determines whether primers linked to the sequence will be displayed on the sequence map
Primer bases - determines whether primers are displayed simply as arrows or if you will see the bases that make up the primer on the sequence map
Cut sites - determines if the various cut sites within the sequence are displayed on the sequence map
Complement - determines if the sequence is shown as a single or double strand within the sequence map
Axis - determines whether the sequence length ruler is shown on the sequence map
Translations - determines whether translations linked to the sequence will be displayed on the sequence map
Amino acid indices - determines whether the amino acid numbers are displayed on the sequence map (only available for AA sequences)
Show cuts - determines if the restriction enzymes cuts are displayed on the sequence map

Change the topology of DNA sequences

To change the topology of a plasmid from circular to linear:

Click the Information button on the Molecular Biology toolbar
Then, change Topology from Circular to Linear and click Update Information

Edit DNA and RNA sequences

Benchling's "edit sequences" functionality enables you to manually modify DNA or RNA sequences similar to a word processor, allowing for insertion and deletion of bases. This feature is valuable as it provides flexibility and direct control over sequence data, making it easy to make precise adjustments as needed during experimental design or analysis.

It targets workflows such as cloning, primer design, and any other process requiring direct manipulation of DNA RNA sequences.

Navigate to the sequence you would like to edit and select the Sequence map sub-tab
To manually insert bases, click on the position within your sequence where you would like to insert
Type or paste (cmd+V or ctrl+V) in the bases you wish to add, and click ENTER in the resulting modal
- You can also copy and paste regions of a sequence and paste them into other regions of the same or another sequence and this will preserve annotations
To delete bases, highlight the region of your sequence you would like to delete by clicking and dragging your mouse
You can also highlight an annotation by clicking on it. Next, either press the delete button on your keyboard, or right-click and select Delete Bases

Update a linear DNA sequence start index

Indexes on DNA sequences help assess the relative location of bases on your linear and circular sequences. Keep your indexes updated in Benchling to more easily manage your sequences.

When importing linear sequences from databases, Benchling maintains the genomic coordinates of the sequence. For sequence files uploaded using a different method, all sequence indexes begin at 1.

To update a linear sequence to match genomic coordinates:

When viewing a DNA sequence, click the Information button at the bottom of the right-side menu to open the Information panel
Enter the new start coordinate in the Start Index field
Click Update Information
- The updated genomic coordinates are reflected in the numbering on the sequence axis. The first number is re-labeled as the new Start Index number

Update the start index of a circular sequence

Reindexing of circular sequences works similarly but is accessed differently. It can be done from the sequence map, linear map, or plasmid views:

Right-click the base to re-index and select Re-index
In the modal, enter the axial location you want the location to inherit. Then click Re-index
Confirm your selected location in the Current Location field
- For example, if you enter 75 into the New Location field, the selected current location becomes the 75th base in the sequence, and the location of all other bases adjusts accordingly.

Export DNA and RNA sequences (.gb, .fasta, .csv, .svg, .rdf, .zip)

Benchling's export sequences functionality enables you to export individual or multiple DNA or RNA sequences in various common file formats. This feature is valuable because it allows you to share your sequence data with collaborators, transfer it to other software, or archive it for future use, ensuring data portability and interoperability.

Export an individual sequence

To perform a single sequence export:

Click into your object and click on the Information icon located on the Molecular Biology Panel
Scroll to the bottom of the Sequence Info modal to set export format and choose parameters
Click Download to export your sequence

Note: This is the only interface where you can export your sequence as Vector Linear Map (.svg), SINGLE FASTA (.fasta), or SBOL RDF (.rdf) formats.

Export multiple files from expanded view

Benchling supports easy bulk file export in the expanded view from Global search:

Click the Global Search icon
On the side tab that pops up click the Arrow Icon to pull up the expanded view
Use the checkboxes on the leftmost side to bulk-select only RNA sequences
Once your files are selected, click on the export icon in the top right corner
From here you can either export your files as Multipart Genbank (.gb), Multi-FASTA (.fasta), or CSV (.csv), or as individual genbank (.gb) files in a ZIP folder

Export sequences in bulk

To export your sequences in bulk, use the Export Data modal.

Note: Certain enterprise tenants do not have the Export Data modal enabled due to company policy.

To access this modal, click on your avatar (located on the bottom-left corner of the navigation bar), and select Data Export
In the Export Data modal, click on the Sequences export option. In this pop-up, you can designate the locations from which you'd like to export your sequences and also whether or not you'd like to include archived sequences (by reason) in your export
Finally, you can designate whether or not you'd like your files as individual Genbank (.gb) files in a ZIP folder or as a single multipart Genbank (.gb) file

Share a sequence file

Once a sequence has been imported:

Click on the sequence file that you’d like to share, click the Share button, and copy the Read-only access link
Send the link to the person you wish to share this sequence with
Anyone with the link will now be able to view your sequence (provided they have an account on the tenant), but will not be able to edit or analyze the sequence
You can also turn off link sharing by clicking the Turn off link sharing button

Lock Sequence(s)

Individual sequences can be locked to prevent accidental edits by you or your collaborators. The bases, annotations, primers, and translations of a locked sequence cannot be edited. Linked analyses and metadata can still be updated.

To lock a sequence:

Select the sequence you intend to lock
Click on the Information icon in the Molecular Biology toolbar
Select Yes next to Locked and click Update Information
Locked sequences will display an indicator and cannot be edited until unlocked by an authorized user
To unlock a locked sequence open the same menu and select No next to Locked

Note: Any user who has WRITE access to a sequence can lock or unlock it.

Note: You can also lock projects. Locking a project will lock all of its current sequences and new ones you add. You can still unlock individual sequences in a locked project.

Frequently asked questions

Q: Can I add overhangs on my sequences?

A: Benchling does not currently support overhangs on sequences. However, you can specify overhangs on attached primers.

Q: Can I annotate methylation on my DNA sequences?

A: Benchling does not currently support methylation sites.