| Col | Sort | Min | Max | Num | % | Mean | Mean | Resolution | Type | Column | |
| num | order | Missing | complete | abs. | Low High | label | |||||
| 1 | ASC | 0 | 36 | 0 | 100.00 | 13.6 | 13.6 | 45.01 1.70 | H | H | |
| 2 | NONE | 0 | 38 | 0 | 100.00 | 14.1 | 14.1 | 45.01 1.70 | H | K | |
| 3 | NONE | 0 | 43 | 0 | 100.00 | 16.2 | 16.2 | 45.01 1.70 | H | L | |
| 4 | NONE | 6.3 | 1068.6 | 9890 | 41.71 | 162.59 | 162.59 | 45.01 1.70 | F | FSE1 | |
| 5 | NONE | 0.8 | 24.3 | 9890 | 41.71 | 4.09 | 4.09 | 45.01 1.70 | Q | SIGFSE1 | |
| 6 | NONE | 0.0 | 360.0 | 9796 | 42.26 | 171.93 | 171.93 | 45.01 1.70 | P | PHIB_123p | |
| 7 | NONE | 0.000 | 1.000 | 9796 | 42.26 | 0.560 | 0.560 | 45.01 1.70 | W | FOM_123p | |
| 8 | NONE | 0.0 | 360.0 | 9674 | 42.98 | 169.49 | 169.49 | 45.01 1.70 | P | PHIDM_123p | |
| 9 | NONE | 0.000 | 1.000 | 9674 | 42.98 | 0.736 | 0.736 | 45.01 1.70 | W | FOMDM_123p | |
| 10 | NONE | 1.3 | 930.8 | 1567 | 90.76 | 109.03 | 109.03 | 45.01 1.70 | F | F17 | |
| 11 | NONE | 0.6 | 36.2 | 1567 | 90.76 | 3.15 | 3.15 | 45.01 1.70 | Q | SIGF17 | |
| 12 | NONE | 9.9 | 1048.5 | 5126 | 69.79 | 152.04 | 152.04 | 45.01 1.70 | F | FP | |
| 13 | NONE | 0.9 | 46.8 | 5126 | 69.79 | 3.85 | 3.85 | 45.01 1.70 | Q | SIGFP | |
| 14 | NONE | 0.0 | 19.0 | 0 | 100.00 | 9.51 | 9.51 | 45.01 1.70 | I | FreeR_flag |
The initial question was if you you would like to use free atom model
density modification at all. If you had answered 'N' (if for example
you had good phases and just wanted to run warpNtrace) then the setup would skip
to the refinement protocol choosing, but in this example we assumed 'Y'.
We now have to decide how many cycles we need per iteration. In one iteration each model
is refined with unrestrained ARP
. After it has finished it rejects lots of bad atoms,
limits B factors and randomises coordinates a bit, to escape from local minima.
The higher the resolution, the fewer cycles you need. In the last iteration
it is recommended to use a few more cycles to let the models converge better.
In most cases you can just use just a few cycles for the 1st iteration (around 15) and
skip the second and the third one.
Do you want multiple free atom models averaging (Y/N) ? Y
How many models do you plan to use for averaging ? 6
You will be now asked how many processors you can use at the SAME time
for running arp/warp jobs. Remember that these machines should share
a common home directory.
If you are not sure of what you are doing please consult the local System manager.
How many processors can you use simultaneously ? 3
Processor 1 is in a machine named: edmund
edmund is OK.
Processor 2 is in a machine named: baldrick
baldrick is OK.
Processor 3 is in a machine named: percy
percy is OK.
Multiple models setup finished.
The use of multiple models is extensively described
in [3]. Given the power of maximum likelihood refinement we recommend to exploit this
(time consuming) option if your data are poorer than 1.8 - 2.0 Å, which is not
very unlikely to be honest.
Single unrestrained ARP jobs are perfectly reasonable provided your
data are higher than 1.8 Å.
Anyhow, here the answer was Y.
If you answer N, the setup assumes that you will be using a single model and goes
on to the refinement protocol choosing.
Since the answer was Y,
you are asked to provide some details for how many models you want to use,
the machine names, how many cycles of wARP you would like to run, etc.
In this case we have decided to average 6 models. Averaging 2 models is not very helpful.
Averaging of 3, 4 or 5 models is possible but not recommended, 6 is a much better number.
Then you are asked how many processors you can use. Suppose you have a 4-processor machine.
Using all 4 is not very wise and may be impolite to others.
Since we request 6 refinement runs the script will first run 4 of them and then the remaining 2.
In total this would take 2 'job cycles'. If we choose to use 3 processors, the script will
run 3+3 jobs while leaving the fourth processor free for something else.
Now, we will proceed to protocols choosing, where you end up all the time:
You can choose between the following REFMAC protocols:
F: A fast protocol that works with good data.
S: A considerably slower one which might work better in difficult cases.
R: The slow protocol together with Rfree.
P: Phased maximum likelihood refinement.
O: The good old SFALL ...
H: Optimised parameters for starting from heavy atoms alone.
W: Optimised parameters for solvent building.
A: Advanced mode for setting parameters manually.
What is your choice ? (F/S/O/R/H/W) F
Advanced parameters set to default values for mode F
These questions are basically specific to the use of REFMAC.
The Fast protocol will setup the job so as it
will not use an R
factor for monitoring
refinement progress. Lots of people like using R
(and in general they
do well to do so!) and you are right to get suspicious if this is not done.
Just to clarify things:
The ARP/wARP authors believe that R
is essential for a restrained model refinement
to validate the protocol (unless the protocol has already been proven to be valid under
the conditions used). However if no geometry is present there is certainly no danger of
over-weighting or down-weighting X-ray data against geometry terms,
which is what basically R
tells you ...
The Slow protocol will run CDIR minimisation applying 0.3 of calculated shifts.
It will also run 4 internal REFMAC cycles before the model is updated by ARP
.
The Rfree protocol, is the slow one plus usage of
R
,
not only to be used as a test set, but
most important for calculating
weights based on the free set.
Although theoretically more sound it often fails with very bad starting models.
But, it is worth a try.
The Phased protocol, enables to use phased maximum likelihood refinement as
implemented in REFMAC. It has not been much tested but we would use it preferably
with realistic HL coefficients (e.g. from SHARP).
The Heavy protocol is optimised for starting from very few atoms.
It runs lots of REFMAC cycles, fixes solvent scaling parameters, etc.
We must say, that we do not have much experience with it. The parameters chosen will
work with rubredoxin.
If you have some high resolution data on a metalloprotein and this protocol does not work,
we strongly encourage you to contact us.
The Water protocol is basically the same as Rfree one,
i.e. it DOES use R
,
since it refers
to serious model building and you should make sure you monitor R
to see if you are
doing anything sensible. It is also assumed that the model is in a good state.
The Advanced protocol is naturally meant for advanced users.
These parameters are REFMAC specific (see CCP4 documentation), if
you don't set them up then standard default values will be chosen, they should
work well but may be not optimally.
Before setting up advanced parameters on your own, please at least make sure you
understand the following points, otherwise don't bother.