Tutorial for model-free analysis sam mahdi

From relax wiki
Jump to navigation Jump to search

Background

This follow the discussion with Sam Mahdi, at the mailing list: https://www.nmr-relax.com/mail.gna.org/public/relax-users/2016-09/threads.html#00001

Files has been uploaded to: Bug #25044

  • Model Free Analysis problems when attempting to run on multi-processors and using the script to run

Available data

  • R1 600 MHz
  • R2 600 MHz
  • R1 800 MHz
  • R2 800 MHz
  • ssNOE 600 MHz
  • ssNOE 800 MHz

Version of relax and computer

Version of relax is 4.0.2

Hardware information:
    Machine:                 x86_64
    Processor:               x86_64
    Processor name:          Intel(R) Core(TM)2 Duo CPU     E7500  @ 2.93GHz
    Endianness:              little
    Total RAM size:          2756 Mb
    Total swap size:         2815 Mb

Operating system information:
    System:                  Linux
    Release:                 4.5.7-300.fc24.x86_64
    Version:                 #1 SMP Wed Jun 8 18:12:45 UTC 2016
    GNU/Linux version:       Fedora 24 Twenty Four
    Distribution:            fedora 24 Twenty Four
    Full platform string:    Linux-4.5.7-300.fc24.x86_64-x86_64-with-fedora-24-Twenty_Four

Python information:
    Architecture:            64bit
    Python version:          2.7.12
    Python branch:          
    Python build:            default, Aug  9 2016 15:48:18
    Python compiler:         GCC 6.1.1 20160621 (Red Hat 6.1.1-3)
    Libc version:            glibc 2.2.5
    Python implementation:   CPython
    Python revision:        
    Python executable:       /usr/bin/python
    Python flags:            sys.flags(debug=0, py3k_warning=0, division_warning=0, division_new=0, inspect=0, interactive=0, optimize=0, dont_write_bytecode=0, no_user_site=0, no_site=0, ignore_environment=0, tabcheck=0, verbose=0, unicode=0, bytes_warning=0, hash_randomization=0)
    Python float info:       sys.float_info(max=1.7976931348623157e+308, max_exp=1024, max_10_exp=308, min=2.2250738585072014e-308, min_exp=-1021, min_10_exp=-307, dig=15, mant_dig=53, epsilon=2.220446049250313e-16, radix=2, rounds=1)
    Python module path:      ['/home/student/relax-4.0.2', '/usr/lib/python27.zip', '/usr/lib64/python2.7', '/usr/lib64/python2.7/plat-linux2', '/usr/lib64/python2.7/lib-tk', '/usr/lib64/python2.7/lib-old', '/usr/lib64/python2.7/lib-dynload', '/usr/lib64/python2.7/site-packages', '/usr/lib64/python2.7/site-packages/openmpi', '/usr/lib64/python2.7/site-packages/gtk-2.0', '/usr/lib/python2.7/site-packages']

Python packages and modules (most are optional):

Name               Installed    Version             Path                                                
minfx              True         1.0.12              /home/student/relax-4.0.2/minfx                     
bmrblib            True         1.0.4               /home/student/relax-4.0.2/bmrblib                   
numpy              True         1.11.0              /usr/lib64/python2.7/site-packages/numpy            
scipy              True         0.16.1              /usr/lib64/python2.7/site-packages/scipy            
wxPython           False                                                                                
matplotlib         True         1.5.2rc2            /usr/lib64/python2.7/site-packages/matplotlib       
mpi4py             True         2.0.0               /usr/lib64/python2.7/site-packages/openmpi/mpi4py   
epydoc             False                                                                                
optparse           True         1.5.3               /usr/lib64/python2.7/optparse.pyc                   
readline           True                             /usr/lib64/python2.7/lib-dynload/readline.so        
profile            True                             /usr/lib64/python2.7/profile.pyc                    
bz2                True                             /usr/lib64/python2.7/lib-dynload/bz2.so             
gzip               True                             /usr/lib64/python2.7/gzip.pyc                       
io                 True                             /usr/lib64/python2.7/io.pyc                         
xml                True         0.8.4 (internal)    /usr/lib64/python2.7/xml/__init__.pyc               
xml.dom.minidom    True                             /usr/lib64/python2.7/xml/dom/minidom.pyc            

relax information:
    Version:                 4.0.2
    Processor fabric:        Uni-processor.

Current script

Sam has provided this script.
It is not entirely clear how the "The dauvergne_protocol model-free auto-analysis" was initiated. Maybe a combination of GUI and scripts?

relax> from time import asctime, localtime
relax> from auto_analyses.dauvergne_protocol import dAuvergne_protocol 
relax> DIFF_MODEL=['local_tm','sphere','prolate','oblate','ellipsoid','final']
relax> MF_MODELS=['m0','m1','m2','m3','m4','m5','m6','m7','m8','m9']
relax> LOCAL_TM_MODELS=['tm0','tm1','tm2','tm3','tm4','tm5','tm6','tm7','tm7','tm8','tm9']
relax> GRID_INC=11
relax> MIN_ALGOR='newton'
relax> MC_NUM=500
relax> CONV_LOOP=True
relax> pipe_bundle="mf(%s)"%asctime(localtime())
relax> name="origin-"+pipe_bundle
relax> pipe.create(name,'mf',bundle=pipe_bundle)
relax> structure.read_pdb('2d9j.pdb',set_mol_name='hRGS7')
relax> structure.load_spins('@N',ave_pos=True)
relax> structure.load_spins('@NE1',ave_pos=True)
relax> structure.load_spins('@H',ave_pos=True)
relax> structure.load_spins('@HE1',ave_pos=True)
relax> spin.isotope('15N',spin_id='@N*')
relax> spin.isotope('1H',spin_id='@H*')
relax> interatom.define(spin_id1='@N',spin_id2='@H', direct_bond=True)
relax> interatom.define(spin_id1='@NE1',spin_id2='@HE1', direct_bond=True)
relax> interatom.set_dist(spin_id1='@N*',spin_id2='@H*',ave_dist=1.02*1e-10)
relax> interatom.unit_vectors()
relax> value.set(-172*1e-6,'csa',spin_id='@N*')

Make suggestion script

Copy sample script

To provide a suggestion script, the relax example script is copied to the folder with the data.

# Find where relax is called from. It's a symbolic link in the users bin folder.
tlinnet@linmac:Sam_25044$ which relax
/Users/tlinnet/bin/relax

# Read the link to the folder where relax is installed.
tlinnet@linmac:Sam_25044$ readlink `which relax`
/Users/tlinnet/software/relax_trunk_svn/relax

# The current working folder with data
tlinnet@linmac:Sam_25044$ pwd
/Users/tlinnet/Desktop/Sam_25044

# Copy sample script
cp /Users/tlinnet/software/relax_trunk_svn/sample_scripts/model_free/dauvergne_protocol.py sam_script.py

Now modify sample script

We are going to edit "sam_script.py".

Initial script

# Python module imports.
from time import asctime, localtime
# relax module imports.
from auto_analyses.dauvergne_protocol import dAuvergne_protocol


# Analysis variables.
#####################
# The diffusion model.
DIFF_MODEL = 'local_tm'

# The model-free models.  Do not change these unless absolutely necessary, the protocol is likely to fail if these are changed.
MF_MODELS = ['m0', 'm1', 'm2', 'm3', 'm4', 'm5', 'm6', 'm7', 'm8', 'm9']
LOCAL_TM_MODELS = ['tm0', 'tm1', 'tm2', 'tm3', 'tm4', 'tm5', 'tm6', 'tm7', 'tm8', 'tm9']

# The grid search size (the number of increments per dimension).
GRID_INC = 11

# The optimisation technique.
MIN_ALGOR = 'newton'

# The number of Monte Carlo simulations to be used for error analysis at the end of the analysis.
MC_NUM = 10

# Automatic looping over all rounds until convergence (must be a boolean value of True or False).
CONV_LOOP = True

# Set up the data pipe.
#######################

# The following sequence of user function calls can be changed as needed.

# Create the data pipe.
pipe_bundle = "mf (%s)" % asctime(localtime())
name = "origin - " + pipe_bundle
pipe.create(name, 'mf', bundle=pipe_bundle)

Now we are just going to try this.

relax sam_script.py

This works fine. Lets add the next line.

Loading model from pdb file

From Sams script, we see that he uses the PDB ID: 2d9j

# Get pdb
wget http://www.rcsb.org/pdb/files/2d9j.pdb.gz
# Extract
gzip -d 2d9j.pdb.gz

We open the .pdb file in PyMOL and see that it is a NMR file with 20 states.
We need to figure out how to use the command structure.read_pdb

So we open relax in the prompt, and use the help command.

relax
# Relax start
help(structure.read_pdb)
exit

In the help menu, it says:

    read_model:  If set, only the given model number(s) from the PDB file will be read.  Otherwise
all models will be read.  This can be a single number or list of numbers.

Now add the following line to "sam_script.py".

structure.read_pdb('2d9j.pdb', set_mol_name='hRGS7', read_model=1)

Now we are just going to try this.

relax sam_script.py

This works fine. Lets add the next line.

Create data containers

In the sample script, we see that we should use the command "structure.load_spins".

Since we do not know what that is, we use the help command.

So we open relax in the prompt, and use the help command.

relax
# Relax start
help(structure.load_spins)
exit

There is a lengthy description. But what relax need to do internal inside python, is to create data-containers where it can add data.
It is as simple as that. We need some place to store relax data, and relax generate some "variables" on a naming scheme from the pdb file.
After this, we alter the nitrogens to be isotope N15, and the same for hydrogen.

Now add the following line to "sam_script.py".

# Set up the 15N and 1H spins (both backbone and Trp indole sidechains).
structure.load_spins('@N', ave_pos=True)
structure.load_spins('@NE1', ave_pos=True)
structure.load_spins('@H', ave_pos=True)
structure.load_spins('@HE1', ave_pos=True)
spin.isotope('15N', spin_id='@N*')
spin.isotope('1H', spin_id='@H*')

Now we are just going to try this.

relax sam_script.py

Check the spin containers via GUI

Right now, we want to check the spin containers.

First we do in the the GUI.
Now add the following line to "sam_script.py".

state.save('ini_setup', force=True)

Now we are just going to try this. Run relax to save the state, and then start in GUI.

relax sam_script.py

# Start relax in GUI
relax -g

Now do in the GUI

  1. File -> Open relax state
  2. Point to "ini_setup.bz2"
  3. Go to: View -> Spin Viewer

And inspect the spins!
Close relax

Check the spin containers via script

Instead of using the GUI to inspect the spins, we can use the command

Add the following line to "sam_script.py".

# Import
from pipe_control.mol_res_spin import spin_loop

# Loop and print
for spin, mol_name, res_num, res_name, spin_id in spin_loop(full_info=True, return_id=True, skip_desel=True):
    print spin, mol_name, res_num, res_name, spin_id

Now we are just going to try this.

relax sam_script.py

That gives some output in the command as this.

Objects:
  element: 'H'
  isotope: '1H'
  name: 'H'
  num: 2204
  pos: array([-26.478999999999999,  -4.86             ,   6.999            ])
  select: True
 hRGS7 139 GLY #hRGS7:139@H

This looks good.

Try to load the relaxation data

Now comes the time to load the relaxation data.

In the sample script, we see that we should use the command "relax_data.read".

Since we do not know what that is, we use the help command.

So we open relax in the prompt, and use the help command.

relax
# Relax start
help(relax_data.read)
exit

Let us inspect just one file date. Open in a text browser "R1_600".

It says

Residue	R1	Error
1		
2		
3		
4		
5		
6		
7		
8		
9		
10		
11		
12	1.58	0.06
13	1.23	0.05
14	1.48	0.1
15	2.06	0.04
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
139

This will probably not work. The file does not contain the same separators in the file.
But let us try!

Add the following line to "sam_script.py".

relax_data.read(ri_id='R1_600',  ri_type='R1',  frq=599.719*1e6, file='R1_600',  res_num_col=1, data_col=2, error_col=3)

Now we are just going to try this.

relax sam_script.py

That gives some output in the command as this.

RelaxWarning: The sequence data in the line ['Residue', 'R1', 'Error'] is invalid, the residue number data 'Residue' is invalid.
RelaxWarning: The sequence data in the line ['1'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['2'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['3'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['4'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['5'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['6'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['7'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['8'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['9'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['10'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['11'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['17'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['18'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['19'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['20'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['21'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['22'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['23'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['24'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['25'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['26'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['27'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['28'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['29'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['30'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['31'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['32'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['33'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['34'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['35'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['36'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['37'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['38'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['39'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['40'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['41'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['42'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['43'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['44'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['45'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['46'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['47'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['48'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['49'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['50'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['51'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['52'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['53'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['54'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['55'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['56'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['57'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['58'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['59'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['60'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['61'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['62'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['63'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['64'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['65'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['66'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['67'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['68'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['69'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['70'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['71'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['72'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['73'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['74'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['75'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['76'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['77'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['78'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['79'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['80'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['81'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['82'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['83'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['84'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['85'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['86'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['87'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['88'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['89'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['90'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['91'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['92'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['93'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['94'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['95'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['96'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['97'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['98'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['99'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['100'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['101'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['102'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['103'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['104'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['105'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['106'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['107'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['108'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['109'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['110'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['111'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['112'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['113'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['114'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['115'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['116'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['117'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['118'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['119'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['120'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['121'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['122'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['123'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['124'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['125'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['126'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['127'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['128'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['129'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['130'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['131'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['132'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['133'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['134'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['135'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['136'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['137'] is invalid, the data is missing.
RelaxWarning: The sequence data in the line ['139'] is invalid, the data is missing.
RelaxError: The spin ID '#hRGS7:12' corresponds to multiple spins, including '#hRGS7:12@N' and '#hRGS7:12@H'.

A lot of problems!
We modify the data file to look like this. Note the hashtag "#".

#Residue	R1	Error	
12	1.58	0.06
13	1.23	0.05
14	1.48	0.1
15	2.06	0.04

Now we are just going to try this.

relax sam_script.py

That gives some output in the command as this.

RelaxError: The spin ID '#hRGS7:12' corresponds to multiple spins, including '#hRGS7:12@N' and '#hRGS7:12@H'.

NOTE: Instead of a "Warning", this is an "Error".
This will not work.

And why? Relax does not know if the associated data is the R1 for the nitrogen or proton of spin 12. And then relax STOPS.
Errors make relax STOP.

The data is insufficient or not labelled correct.

How could it look like?
Have a look in: "relax_installation/test_suite/shared_data/model_free/sphere"

r1.500.out

# mol_name     res_num    res_name    spin_num    spin_name    value                 error                   
sphere_mol1    1          GLY         1           N            1.4639691658223886    0.029279383316447773    
sphere_mol1    1          GLY         2           H            None                  None                    
sphere_mol1    2          GLY         3           N            1.4639691658223886    0.029279383316447773    
sphere_mol1    2          GLY         4           H            None                  None

Let us then alter R1_600 file.

#res_num    spin_name    R1     Error
12          N           1.58	0.06
13	        N           1.23	0.05
14	        N           1.48	0.1
15	        N           2.06	0.04

Now MODIFY the following line in "sam_script.py".

# From
relax_data.read(ri_id='R1_600',  ri_type='R1',  frq=599.719*1e6, file='R1_600',  res_num_col=1, data_col=2, error_col=3)

# To
relax_data.read(ri_id='R1_600',  ri_type='R1',  frq=599.719*1e6, file='R1_600',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)

Now we are just going to try this.

relax sam_script.py

Now it works.

Opening the file 'R1_600' for reading.

The following 599.719 MHz R1 relaxation data with the ID 'R1_600' has been loaded into the relax data store:

# Spin_ID      Value    Error    
#hRGS7:12@N    1.58     0.06     
#hRGS7:13@N    1.23     0.05     
#hRGS7:14@N    1.48     0.1      
#hRGS7:15@N    2.06     0.04

Alter rest of the data

Let us then alter R1_800 file.

#res_num    spin_name    R1     Error	
12          N           1.15	0.03
13	        N           0.832	0.011
14	        N           1.07	0.05
15	        N           0.78	0.17

Let us then alter R2_600 file.

#res_num    spin_name    R2     Error	
12	        N           18.6	0.6
13	        N           14.58	0.39

Let us then alter ssNOE_600 file.

#res_num    spin_name    Noe            Error
13	        N           0.663870952	    0.0386263336
14	        N           0.8132898597    0.1578765885

Let us then alter ssNOE_800 file.

#res_num    spin_name    Noe            Error
12	        N           0.4833302155	0.1078027894
13	        N           0.8142253659	0.0290794774
14	        N           0.7922308188	0.0801096067

Now add the following lines in "sam_script.py", so there is 6 in total..

# From before
relax_data.read(ri_id='R1_600', ri_type='R1', frq=599.719*1e6, file='R1_600', res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
# Add this
relax_data.read(ri_id='R2_600', ri_type='R2', frq=599.719*1e6, file='R2_600',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='ssNOE_600',  ri_type='NOE',  frq=599.719*1e6, file='ssNOE_600',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)

relax_data.read(ri_id='R1_800', ri_type='R1',  frq=799.719*1e6, file='R1_800',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='R2_800', ri_type='R2',  frq=799.719*1e6, file='R2_800',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='ssNOE_800', ri_type='NOE',  frq=799.719*1e6, file='ssNOE_800', res_num_col=1, spin_name_col=2, data_col=3, error_col=4)

Now we are just going to try this.

relax sam_script.py


This looks fine

relax> relax_data.read(ri_id='R1_600', ri_type='R1', frq=599719000.0, file='R1_600', dir=None, spin_id_col=None, mol_name_col=None, res_num_col=1, res_name_col=None, spin_num_col=None, spin_name_col=2, data_col=3, error_col=4, sep=None, spin_id=None)
Opening the file 'R1_600' for reading.

The following 599.719 MHz R1 relaxation data with the ID 'R1_600' has been loaded into the relax data store:

# Spin_ID      Value    Error    
#hRGS7:12@N    1.58     0.06     
#hRGS7:13@N    1.23     0.05     
#hRGS7:14@N    1.48     0.1      
#hRGS7:15@N    2.06     0.04     

relax> relax_data.read(ri_id='R2_600', ri_type='R2', frq=599719000.0, file='R2_600', dir=None, spin_id_col=None, mol_name_col=None, res_num_col=1, res_name_col=None, spin_num_col=None, spin_name_col=2, data_col=3, error_col=4, sep=None, spin_id=None)
Opening the file 'R2_600' for reading.

The following 599.719 MHz R2 relaxation data with the ID 'R2_600' has been loaded into the relax data store:

# Spin_ID      Value    Error    
#hRGS7:12@N    18.6     0.6      
#hRGS7:13@N    14.58    0.39     

relax> relax_data.read(ri_id='ssNOE_600', ri_type='NOE', frq=599719000.0, file='ssNOE_600', dir=None, spin_id_col=None, mol_name_col=None, res_num_col=1, res_name_col=None, spin_num_col=None, spin_name_col=2, data_col=3, error_col=4, sep=None, spin_id=None)
Opening the file 'ssNOE_600' for reading.

The following 599.719 MHz NOE relaxation data with the ID 'ssNOE_600' has been loaded into the relax data store:

# Spin_ID      Value           Error           
#hRGS7:13@N    0.663870952     0.0386263336    
#hRGS7:14@N    0.8132898597    0.1578765885    

relax> relax_data.read(ri_id='R1_800', ri_type='R1', frq=799719000.0, file='R1_800', dir=None, spin_id_col=None, mol_name_col=None, res_num_col=1, res_name_col=None, spin_num_col=None, spin_name_col=2, data_col=3, error_col=4, sep=None, spin_id=None)
Opening the file 'R1_800' for reading.

The following 799.719 MHz R1 relaxation data with the ID 'R1_800' has been loaded into the relax data store:

# Spin_ID      Value    Error    
#hRGS7:12@N    1.15     0.03     
#hRGS7:13@N    0.832    0.011    
#hRGS7:14@N    1.07     0.05     
#hRGS7:15@N    0.78     0.17     

relax> relax_data.read(ri_id='R2_800', ri_type='R2', frq=799719000.0, file='R2_800', dir=None, spin_id_col=None, mol_name_col=None, res_num_col=1, res_name_col=None, spin_num_col=None, spin_name_col=2, data_col=3, error_col=4, sep=None, spin_id=None)
Opening the file 'R2_800' for reading.

The following 799.719 MHz R2 relaxation data with the ID 'R2_800' has been loaded into the relax data store:

# Spin_ID      Value    Error    
#hRGS7:12@N    24.9     0.9      
#hRGS7:13@N    16.1     0.3      
#hRGS7:14@N    27.0     3.0      
#hRGS7:15@N    22.96    0.19     

relax> relax_data.read(ri_id='ssNOE_800', ri_type='NOE', frq=799719000.0, file='ssNOE_800', dir=None, spin_id_col=None, mol_name_col=None, res_num_col=1, res_name_col=None, spin_num_col=None, spin_name_col=2, data_col=3, error_col=4, sep=None, spin_id=None)
Opening the file 'ssNOE_800' for reading.

The following 799.719 MHz NOE relaxation data with the ID 'ssNOE_800' has been loaded into the relax data store:

# Spin_ID      Value           Error           
#hRGS7:12@N    0.4833302155    0.1078027894    
#hRGS7:13@N    0.8142253659    0.0290794774    
#hRGS7:14@N    0.7922308188    0.0801096067

Define the magnetic dipole-dipole relaxation interaction

In the sample script, we see that we should use the command "interatom.define".

Use the help command to find out more.

Add this

# Define the magnetic dipole-dipole relaxation interaction.
interatom.define(spin_id1='@N', spin_id2='@H', direct_bond=True)
interatom.define(spin_id1='@NE1', spin_id2='@HE1', direct_bond=True)
interatom.set_dist(spin_id1='@N*', spin_id2='@H*', ave_dist=1.02 * 1e-10)
interatom.unit_vectors()

# Define the chemical shift relaxation interaction.
value.set(-172 * 1e-6, 'csa', spin_id='@N*')

Now we are just going to try this.

relax sam_script.py

We see that relax calculates a lot of stuff, for spins we have no data for.

Let us practice to select and deselect spins.

So instead we do this:

# Deselect
deselect.all()
# Select
#select.spin(':12@N|:12@H|:13@N|:13@H|:14@N|:14@H|:15@N|:15@H')
select.spin(':12|:13|:14|:15')
for spin, mol_name, res_num, res_name, spin_id in spin_loop(full_info=True, return_id=True, skip_desel=True):
    print spin, mol_name, res_num, res_name, spin_id

# Define the magnetic dipole-dipole relaxation interaction.
interatom.define(spin_id1='@N', spin_id2='@H', direct_bond=True)
interatom.define(spin_id1='@NE1', spin_id2='@HE1', direct_bond=True)
interatom.set_dist(spin_id1='@N*', spin_id2='@H*', ave_dist=1.02 * 1e-10)
interatom.unit_vectors()

# Define the chemical shift relaxation interaction.
value.set(-172 * 1e-6, 'csa', spin_id='@N*')

Let us try to execute

Now add the following lines in "sam_script.py"

# Execution.
############

# Change some opt params.
# COMMENT THESE OUT FOR REAL CALCULATION. THIS IS ONLY TO SPEED UP THE PROCESS IN ERROR SEARCHING.
dAuvergne_protocol.opt_func_tol = 1e-5
dAuvergne_protocol.opt_max_iterations = 1000

# Do not change!
dAuvergne_protocol(pipe_name=name, pipe_bundle=pipe_bundle, diff_model=DIFF_MODEL, mf_models=MF_MODELS, local_tm_models=LOCAL_TM_MODELS, grid_inc=GRID_INC, min_algor=MIN_ALGOR, mc_sim_num=MC_NUM, conv_loop=CONV_LOOP)

Now we are just going to try this AND using a log file

relax sam_script.py -t log.txt

The whole script file looked like this

# Python module imports.
from time import asctime, localtime
# relax module imports.
from auto_analyses.dauvergne_protocol import dAuvergne_protocol


# Analysis variables.
#####################
# The diffusion model.
DIFF_MODEL = 'local_tm'

# The model-free models.  Do not change these unless absolutely necessary, the protocol is likely to fail if these are changed.
MF_MODELS = ['m0', 'm1', 'm2', 'm3', 'm4', 'm5', 'm6', 'm7', 'm8', 'm9']
LOCAL_TM_MODELS = ['tm0', 'tm1', 'tm2', 'tm3', 'tm4', 'tm5', 'tm6', 'tm7', 'tm8', 'tm9']

# The grid search size (the number of increments per dimension).
GRID_INC = 11

# The optimisation technique.
MIN_ALGOR = 'newton'

# The number of Monte Carlo simulations to be used for error analysis at the end of the analysis.
MC_NUM = 10

# Automatic looping over all rounds until convergence (must be a boolean value of True or False).
CONV_LOOP = True

# Set up the data pipe.
#######################

# The following sequence of user function calls can be changed as needed.

# Create the data pipe.
pipe_bundle = "mf (%s)" % asctime(localtime())
name = "origin - " + pipe_bundle
pipe.create(name, 'mf', bundle=pipe_bundle)

# Load the PDB file.
structure.read_pdb('2d9j.pdb', set_mol_name='hRGS7', read_model=1)

# Set up the 15N and 1H spins (both backbone and Trp indole sidechains).
structure.load_spins('@N', ave_pos=True)
structure.load_spins('@NE1', ave_pos=True)
structure.load_spins('@H', ave_pos=True)
structure.load_spins('@HE1', ave_pos=True)
spin.isotope('15N', spin_id='@N*')
spin.isotope('1H', spin_id='@H*')

state.save('ini_setup', force=True)

# Import
from pipe_control.mol_res_spin import spin_loop

# Loop and print
for spin, mol_name, res_num, res_name, spin_id in spin_loop(full_info=True, return_id=True, skip_desel=True):
    print spin, mol_name, res_num, res_name, spin_id

# From before
relax_data.read(ri_id='R1_600', ri_type='R1', frq=599.719*1e6, file='R1_600', res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
# Add this
relax_data.read(ri_id='R2_600', ri_type='R2', frq=599.719*1e6, file='R2_600',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='ssNOE_600',  ri_type='NOE',  frq=599.719*1e6, file='ssNOE_600',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)

relax_data.read(ri_id='R1_800', ri_type='R1',  frq=799.719*1e6, file='R1_800',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='R2_800', ri_type='R2',  frq=799.719*1e6, file='R2_800',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='ssNOE_800', ri_type='NOE',  frq=799.719*1e6, file='ssNOE_800', res_num_col=1, spin_name_col=2, data_col=3, error_col=4)

# Deselect
deselect.all()
# Select
#select.spin(':12@N|:12@H|:13@N|:13@H|:14@N|:14@H|:15@N|:15@H')
select.spin(':12|:13|:14|:15')
for spin, mol_name, res_num, res_name, spin_id in spin_loop(full_info=True, return_id=True, skip_desel=True):
    print spin, mol_name, res_num, res_name, spin_id

# Define the magnetic dipole-dipole relaxation interaction.
interatom.define(spin_id1='@N', spin_id2='@H', direct_bond=True)
interatom.define(spin_id1='@NE1', spin_id2='@HE1', direct_bond=True)
interatom.set_dist(spin_id1='@N*', spin_id2='@H*', ave_dist=1.02 * 1e-10)
interatom.unit_vectors()

# Define the chemical shift relaxation interaction.
value.set(-172 * 1e-6, 'csa', spin_id='@N*')

# Execution.
############

# Change some opt params.
# COMMENT THESE OUT FOR REAL CALCULATION. THIS IS ONLY TO SPEED UP THE PROCESS IN ERROR SEARCHING.
dAuvergne_protocol.opt_func_tol = 1e-5
dAuvergne_protocol.opt_max_iterations = 1000

# Do not change!
dAuvergne_protocol(pipe_name=name, pipe_bundle=pipe_bundle, diff_model=DIFF_MODEL, mf_models=MF_MODELS, local_tm_models=LOCAL_TM_MODELS, grid_inc=GRID_INC, min_algor=MIN_ALGOR, mc_sim_num=MC_NUM, conv_loop=CONV_LOOP)

Make suggestion script for GUI

Let us try to analyse it in the GUI.

First copy the script over

cp sam_script.py sam_script_GUI.py

Modify it, so it look like this

# Load the PDB file.
structure.read_pdb('2d9j.pdb', set_mol_name='hRGS7', read_model=1)

# Set up the 15N and 1H spins (both backbone and Trp indole sidechains).
structure.load_spins('@N', ave_pos=True)
structure.load_spins('@NE1', ave_pos=True)
structure.load_spins('@H', ave_pos=True)
structure.load_spins('@HE1', ave_pos=True)
spin.isotope('15N', spin_id='@N*')
spin.isotope('1H', spin_id='@H*')

# read data
relax_data.read(ri_id='R1_600', ri_type='R1', frq=599.719*1e6, file='R1_600', res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='R2_600', ri_type='R2', frq=599.719*1e6, file='R2_600',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='ssNOE_600',  ri_type='NOE',  frq=599.719*1e6, file='ssNOE_600',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)

relax_data.read(ri_id='R1_800', ri_type='R1',  frq=799.719*1e6, file='R1_800',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='R2_800', ri_type='R2',  frq=799.719*1e6, file='R2_800',  res_num_col=1, spin_name_col=2, data_col=3, error_col=4)
relax_data.read(ri_id='ssNOE_800', ri_type='NOE',  frq=799.719*1e6, file='ssNOE_800', res_num_col=1, spin_name_col=2, data_col=3, error_col=4)

# Deselect
deselect.all()
# Select
#select.spin(':12@N|:12@H|:13@N|:13@H|:14@N|:14@H|:15@N|:15@H')
select.spin(':12|:13|:14|:15')

# Define the magnetic dipole-dipole relaxation interaction.
interatom.define(spin_id1='@N', spin_id2='@H', direct_bond=True)
interatom.define(spin_id1='@NE1', spin_id2='@HE1', direct_bond=True)
interatom.set_dist(spin_id1='@N*', spin_id2='@H*', ave_dist=1.02 * 1e-10)
interatom.unit_vectors()

# Define the chemical shift relaxation interaction.
value.set(-172 * 1e-6, 'csa', spin_id='@N*')

Start relax in GUI

relax -g -t log_gui.txt

Now do

  1. File -> New analysis
  2. Select "Model-free" -> Next
  3. Just hit Start in the box for pipe name
  4. Goto "User functions (n-z)" -> Script
  5. Find and open the script file "sam_script_GUI.py"

This should now fill out the data. Inspect the data

  1. View -> Spin Viewer

Save the state before execution. This state is easy to load in the GUI afterward

  1. File -> Save relax state -> as "test.bz2"

Exit relax. And start relax again

relax -g -t log_gui_2.txt

Now do

  1. File -> Open relax state
  2. Select "test.bz2"

It should now show all the data.

Hit "Execute"

Consider

  1. Setting the number of Monte-Carlo simulations to 10, for initial faster calculation

Conclusion

The data was insufficient or not labelled correct.

That made relax make an ERROR and make a STOP.

Please use the help function.

See also