Skip to the content.

Peak Picking in 4D Spectrum with POKY

Overview

The general workflow of this tutorial starts with referencing and converting the spectra to POKY/Sparky format. Then, we will create 2D projections from the 4D spectrum, which will help us reference it properly.

For peak picking, we will follow a systematic strategy that, although more involved, will allow for higher precision in identifying peaks in the 4D spectrum while minimizing noise. Since the 2D projections are derived directly from the 4D spectrum, they provide a more accurate reference than the HSQC spectra, whose peak centers may deviate slightly from those in the projections.

To ensure accurate peak selection, we will use the 2D projections as intermediate reference points for restricted peak picking in the 4D spectrum. The workflow is as follows:

  1. Overlay the projections onto the corresponding HSQC spectra
  2. Identify peak centers by using the HSQC spectra as references
  3. Use these peak centers to perform restricted peak picking in the 4D spectrum
  4. Unfold or unalias peaks as necessary
  5. Remove noise peaks from the 4D spectrum

By following this approach, we ensure that the final set of picked peaks in the 4D spectrum is as accurate and noise-free as possible.

Prerequisites

Steps


Step 1. Reference the HSQC Spectra in Topspin

Follow the instructions to reference the 1H-15N and 1H-13C HSQC in Topspin with BioTop.


2.2 Convert Spectra to UCSF Format

Enter the directory where each spectrum is saved in Bruker format and run bruk2ucsf from there—running it from another directory will fail.
For example, to convert the 1H-15N, 1H-13C HSQC spectra, and the 4D HCNH NOESY:

 bruk2ucsf_run 6/pdata/1/2rr /srv/NMR/Peak_Picking/Nanoluc/15N_HSQC.ucsf
 bruk2ucsf_run 7/pdata/1/2rr /srv/NMR/Peak_Picking/Nanoluc/13C_HSQC.ucsf
 bruk2ucsf_run 5/pdata/1/4rrr /srv/NMR/Peak_Picking/Nanoluc/4D_HCNH_NOESY.ucsf

Note: You can also convert the spectra from Bruker to UCSF format in POKY/Sparky, but you cannot rename the axes in that process.

2.3 Rename Axes

Rename the axes in the 1H-15N and 1H-13C HSQC spectra:

ucsfdata -a1 N -a2 HN 15N_HSQC.ucsf
ucsfdata -a1 C -a2 HC 13C_HSQC.ucsf

Print the axis values of the 4D HCNH NOESY:

ucsfdata 4D_HCNH_NOESY.ucsf

Example output:

axis                          w1          w2          w3          w4
nucleus                       1H         13C         15N          1H
matrix size                  256         256         256         416
block size                     8           8           8          13
upfield ppm                1.194       6.301     101.402       5.279
downfield ppm              8.208      73.001     133.002      10.622
spectrum width Hz       6666.667   15939.978    3043.445    5078.125
transmitter MHz          950.374     238.980      96.311     950.374

From the upfield and downfield rows, you can guess which axis is HC and which is HN. In this example, the following command renames them properly—amidic protons have higher shift values than the aliphatic protons:

ucsfdata -a1 HC -a2 C -a3 N -a4 HN 4D_HCNH_NOESY.ucsf

IMPORTANT: Make sure that axes are named consistently in all spectra; otherwise, you will encounter problems during peak picking.

2.4 Create C-HC and N-HN Projections

For a detailed tutorial, see Create_2D_projections_from_4D_spectrum. Briefly, extract the N-HN projection from the 4D HCNH NOESY. You may need to adjust the -p[1-4] values according to your 4D spectrum dimension order:

ucsfdata -p1 -r -o C-N-HN.ucsf 4D_HCNH_NOESY.ucsf
ucsfdata -p1 -r -o 2D_N-HN_proj.ucsf C-N-HN.ucsf

Similarly, for the C-HC projection:

ucsfdata -p4 -r -o HC-C-N.ucsf 4D_HCNH_NOESY.ucsf
ucsfdata -p3 -r -o 2D_HC-C_proj.ucsf HC-C-N.ucsf

Step 3. Loading the Spectra

Load the spectra


Step 4. Adjusting the Spectra

Synchronize Spectra

Correct the contour levels and colors

Align the 2D_N-HN_proj to the 15N_HSQC

Align the 2D_HC-C_proj to the 13C_HSQC
Follow the same procedure described in the previous step.

Reference the 4D_HCNH_NOESY


Step 5. Peak Picking

Adjusting Contour Levels and Preparing Reference Peaks

Below you see the final peak selection on the N-HN_proj (orange) adjusted to the peak centers. The 15N_HSQC is overlaid with cyan and yellow color. Notice that some peaks that were not present in the N-HN_proj (less sensitive experiment) were added from the 15N_HSQC and the inverse (although much fewer peaks were added like this).

here you see

#——————————————

Pick all peaks in the HC-C projection that match with reference 1H-13C HSQC
Since 1H-13C HSQC is very crowded and is not ideal for setting landmarks for restricted peak selection, we will use it in combinations ith the 2D_HC-C_proj. Overlay the two spectra and select with F8 all the peaks in 2D_HC-C_proj. The add manually the peaks that you believe are within the general boundaries defined by the 1H-13C HSQC and are not noise - a bit of intuition will be helpful here. Synchronizing the two spectra with yt and viewing them next to each other will help you. In the following Figure I show how I add a new peak in the 2D_HC-C_proj based on the density of the 1H-13C HSQC.

These peaks in 2D_HC-C_proj will be your landmarks for restrictive peak selection.

Optimizing Restricted Peak Picking for Higher Accuracy

Since this protein is large, we will perform restricted peak picking in two rounds.
This is necessary because the screen updates every time a picking cycle completes, and for large proteins, this eventually becomes terribly slow.

⚠️ Important: Do not delete any peaks yet, as we did in the first round—otherwise, you will lose some of the peaks identified earlier.

💡 For large proteins with tens of thousands of peaks, it is recommended to delete them in two batches rather than all at once.

This is how the final peak selection on the HC-C plane of the 4D NOESY should look.

[FIGURE]

Step X. Unalias/Unfold 4D Peaks

Next, we will perform unaliasing/unfolding of peaks. For more details, please read the respective article.

Aliased Peaks usually occur in the ranges C < 25 ppm and HC > 3 ppm.

In this spectrum, we have some aliased peaks that appear on top.

[FIGURE here]


Step X. Manual Refinement of 4D Peak List

Next, we will manually inspect all the peaks and remove those that are not located in density regions—neither of the HC-C projection nor of the N-HN projection.

You should now have two different views of the 4D spectrum:

  1. One showing the selected peaks on the HC-C plane
  2. The other showing the selected peaks on the N-HN plane

What you must do next is manually inspect the peaks and delete those not in density regions. This requires a bit of intuition and a sharp eye. Unfortunately, it cannot be automated—it must be supervised manually by pressing st.

Discard the noise peaks using a S/N cutoff

** OPTIONAL: Discard the noise peaks using a weak peak as a reference**

Note: This comparison is made using signed intensities: negative peaks with large absolute intensity will also be selected! Check them manually and deselect if needed: hold Ctrl and drag over the selected peaks with your mouse.

peak list

(^ There should be a GIF here showing large negative peaks.)


Step X. Exporting Peak Lists

Export Picked Peaks for 4D-GRAPHS
Go to the 4D peak list (type lt) and select the columns w1, w2, w3, w4, Data Height and Note. Click Apply, then Save…. Further editing of the projections and the HSQCs is needed.

Improve the Precision of the 13C HSQC Peak List using the 4D HCNH NOESY Peak List

We follow this approach because the 13C HSQC spectrum is very noisy, with large dispersion effects, meaning that the peak centers deviate from those identified in 4D HCNH NOESY spectrum.
Consequently, since the entire assignment relies on the 4D spectrum, it is more accurate to use the peak markers from it, but enhanced with the intensity signs present in the 13C HSQC spectrum.
The 13C HSQC spectrum provides information on whether a peak corresponds to a methylene group, which improves both accuracy and coverage for chemical shift assignment in 4D-GRAPHS.

Improve the Precision of the 15N HSQC Peak List using the 2D N-HN projection Peak List

The 15N HSQC peak list for 4D-GRAPHS must have only one peak for each spin system, therefore we will apply the previous trick but using the `*N-HN projection peak list instead of the 4D HCNH NOESY peak list.


Notes for Special Cases

Unaliasing Peaks in POKY
When you do restricted peak picking (kr) using as a reference Peaks that have not been unaliased or unfolded, POKY will automatically check for possible aliased peaks. If the spectrum width of the source 2D is larger than that of the nD (n = [3,4]), POKY will find and mark the peaks in the 3D as aliased.

However, BEWARE that when your reference peaks are aliased or unfolded, POKY won’t match the correct peaks in the target spectrum unless they are also unalias/unafolded. It may catch some peaks but they will be irrelevant. Therefore, do not unalias/unfold the peaks in the 2D HC-C and N-HN projections! Do the unaliasing/unfolding directly on the 4D HCNH NOESY.

Below are examples of the 13C-HSQC spectra with aliased peaks (in yellow boxes):

Protein 1 Example 13C-HSQC - Protein 2
13C-HSQC-ac1 13C-HSQC-sy15

Authors