Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

...

...

...

...

...

...

...

Intro

Welcome to the 3DM walkthrough. A 3DM system is a data integration platform, we collect and integrate data about a protein superfamily. 

Registration

Most importantly, to go through this walkthrough you will need a 3DM account (if you already have one you can skip this step).

...

Once you submit the registration form you will receive an e-mail with an activation link. Once you follow the link to activate your account you're ready to use the 3DM services.

YASARA or PyMol

If you want to use 3DM's protein visualization features you will need Yasara or PyMol and the 3DM plugin (installation instructions). This tutorial uses Yasara to demonstrate the visualization options.

System selection page

Now go again to http://3dm.bio-prodict.nl and log in. You land on a page where you can select the system you'll be working on. If you have already purchased a system(s) it will be listed here. For this tutorial we will use a nuclear receptors system, which is publicly available.

Click on Public 3DM Systems and choose the Nuclear Receptors system from the list. You will be redirected to the system's main page.

System overview

From the menu in the panel on the left-hand side click on System > System info. Here you see the summary of the system, that can give you an idea of how much data there is - how many sequences are aligned, how many mutations are mapped on proteins in the system, etc.


Protein detail page

For now we're going to work on a human androgen receptor (UniProt accession P10275) - follow this link to go to its protein detail page. 

...

Let's now again go back to the INFORMATION tab and click on the link in the aligned in subfamily field, show protein in subfamily alignment - the ID is the PDB id of the structure that was used as a template for the subfamily alignment.

Alignment page

Now we're at the alignment where your protein of interest is aligned. The protein is marked with a slightly lighter shade (see the arrow on the picture above). The displayed residues are only residues that are aligned in the core regions - core regions is part of the alignment that is aligned across the whole superfamily. 

...

The lighter coloured residues are the variable regions and the bright-coloured ones are core regions. Keep in mind that only the aligned parts of the variable regions are displayed so you often don't see the full sequence in this view.

Alignment statistics

Let's now navigate to the alignment statistics page, which you can access from the menu on the left side of the page. 

...

Another thing you can do here is visualise this data in yasara - to do that click on the little button with a protein helix symbol . You will be redirected to the visualize page, but don't do it for now, we'll get to it later.


Alignment position details pages

You can also view protein data that's available for a specific position in the alignment (protein residue data from all aligned proteins on a structurally conserved position) - the alignment position pages can be accessed in multiple ways, e.g.

...

On the histogram click on the bar for position 42. Now we can take a look around the alignment position page. Here in the different tabs you can see what data is mapped onto this position across all proteins in the 3DM systems. For example, in the mutations tab you can see all the mutations that we've found in the literature for this alignment position across all proteins in the system.

Visualize

Go to the Visualize page from the menu, and click on the structures tab. From the templates menu select the 1IE8A structure - this is the template of the subfamily where our protein of interest is aligned.

...

Go to the contacts tab and click on the topmost checkbox in the Ligand Contacts table (by clicking on the checkbox in the top row you toggle between selecting and deselecting all positions). Now click on the VISUALIZE IN YASARA button and a yasara scene with the selected data mapped onto the 1IE8A structure will be downloaded.


YASARA

Open the downloaded scene in YASARA. You can see that some parts of the backbone are green(ish) and some are gray - the gray color indicates that these residues fall in the variable regions, while green are core residues. 

...

You can also access 3DM data from within yasara using our 3DM plugin - in the menu bar there is a '3DM' tab - there are numerous options of mapping your data onto the structure(s). Let's for example have a look at the mutation data. Go to 3DM > Show superfamily data > Mutations - now residues corresponding to alignment positions with the highest number of mutations are shown - you can see that they are mostly located in the pocket where you previously saw a lot of residues with ligand contacts. It makes sense that these residues are the ones that are most often mutated by researchers to investigate their function and the effect of mutations on ligand binding. (You can switch between the views with the tabs right below the menu bar)


Phylogeny

Now click on the phylogeny item from the menu on the left. This shows an overall phylogeny tree where each node is one subfamily template. When you mouse over a subfamily ID you can display more information about the template structure. 

Search options

There are multiple ways to search proteins in a 3DM system. While most of them are quite straightforward and probably don't need to be explained, we'll have a closer look at the 'Search proteins by position motif' and 'Search proteins by sequence motif'. 

...

In the case of search proteins by position motif, we're looking for specific motifs on specific positions - for example we might want to find all proteins that have a tyrosine on alignment position 35 and an isoleucine on alignment position 90, then we will use the search term "Y35,I90"


Advanced


Numbering schemes

You can simplify your work with the protein of interest even more by creating a custom numbering scheme - that will cause the alignment positions to be renumbered to match the residue numbering of your protein.

...

To switch between the different numbering schemes click on the dropdown menu in the numbering scheme at the top of the page. And don't worry, creating new numbering schemes doesn't erase the previously existing ones, so after creating a custom numbering scheme you can still switch to the original 3DM numbering or other numbering schemes that you created.

Subsets

If we want to analyse only some of the proteins present in the system - for example only the closest homologs of the query protein than we'll need the subsets functionality. We're going to create a subset of 100 closest proteins to our P10275 protein. To do that we need to again go to the protein detail page P10275 and click in the sequence tab.

...

Note that you can also create subsets from results from all the other searches not only BLAST search - these will be described in the next section.

Hotspots

You can also use 3DM to find hotspots - important residues, affecting e.g. protein specificity or thermostability.

What if you don't have a protein of interest?

Panel design - protein selection tool

This is a tool to facilitate the design of enzyme selection panels (for example based on the sequence diversity on the selected hotspots). For this you're gonna need a more advanced course.

...

You can mail us or use the "Send feedback" form (linked on the bottom of the page)







Appendix

Correlation network and enrichment

Let's now go back to the website. Click on the correlated mutations item in the menu on the left. Now you see a network of correlated mutations - you can play around with the score cut-offs using the slider on the right. What you can also do is check if the positions involved in this network have any mutations with certain keywords assigned to them.

...