Skip to content

Commit 35a18b9

Browse files
committed
Documentation for Linked GTR exchangeabilities model
1 parent 32cf7dc commit 35a18b9

2 files changed

Lines changed: 47 additions & 0 deletions

File tree

doc/Command-Reference.md

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -326,6 +326,21 @@ Further options:
326326
iqtree -s prot.phy -m LG+C10+F+G
327327

328328

329+
| Option | Usage and meaning |
330+
|----------|------------------------------------------------------------------------------|
331+
| `--link-exchange-rates` | Turn on linked exchangeability estimation for a profile mixture model. Note that the model must have specified `GTR20` exchangeabilities for eg.`GTR20+C20+G`. |
332+
| `--gtr20-model` | Specify the initial exchangeabilities for linked exchangeability estimation. Note that this must be used with `--link-exchange-rates.` |
333+
| `--rates-file` | Produces a nexus file with the exchangeability matrix obtained from the optimization. This file can be later used for phylogenetic inference with the use of the `-mdef` flag |
334+
335+
### Example usages:
336+
337+
* Estimate linked exchangeabilities for a protein alignment `prot.phy` under C60+G model and a guide tree `guide.treefile`, where optimization is initialized from LG exchangeabilities
338+
339+
iqtree -s prot.phy -m GTR20+C60+G --link-exchange-rates --gtr20-model LG -te guide.treefile
340+
341+
>**NOTE**: For better and faster performance, read the [recommendations](Complex-Models#linked-gtr-exchangeabilities-models) provided in the Complex Models section.
342+
343+
329344
Rate heterogeneity
330345
------------------
331346
<div class="hline"></div>

doc/Complex-Models.md

Lines changed: 32 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,8 @@ sections:
1414
url: partition-models
1515
- name: Mixture models
1616
url: mixture-models
17+
- name: Linked GTR exchangeabilities models
18+
url: linked-gtr-exchangeabilities-models
1719
- name: Site-specific frequency models
1820
url: site-specific-frequency-models
1921
- name: Heterotachy models
@@ -195,6 +197,36 @@ Sometimes one only wants to model the changes in nucleotide or amino-acid freque
195197

196198
>**NOTE**: The amino-acid order in this file is: A R N D C Q E G H I L K M F P S T W Y V.
197199
200+
Linked GTR exchangeabilities models
201+
---------------------------------------
202+
<div class="hline"></div>
203+
204+
Starting with version 2.2.X.X, IQ-TREE allows the user to estimate exchangeabilities under profile mixture models.
205+
206+
### Exchangeability estimation
207+
208+
To start with, we show an example:
209+
210+
iqtree -s <alignment> -m GTR20+C60+G4 --link-exchange-rates -te <guide_tree> -me 0.99
211+
212+
In this example exchangeabilities will be estimated for a profile mixture model `C60+G4` but any profile mixture model and rates can be used. To estimate a single set of linked exchangeabilities, in the model definition the matrix `GTR20` must be specified (resp. GTR for nucleotide data) together with the flag `--link-exchange-rates`. While a guide tree is not needed, we highly recommend using a fixed tree topology to estimate exchangeabilities. Since matrix estimation can be time-consuming, we also recommend using the flag `-me 0.99` to reduce the optimization threshold for faster optimization. Simulations have shown that changing this parameter has no significant effect on exchangeability estimation.
213+
214+
The user can determine the starting exchangeabilities before optimization. Choosing adequate exchangeabilities can make estimation considerably faster. For example:
215+
216+
iqtree -s example.phy -m GTR20+C60+G4 --link-exchange-rates --gtr20-model LG -te <guide_tree> -me 0.99
217+
218+
specifies the LG matrix as the starting matrix via the flag `--gtr20-model` (the default starting matrix is POISSON, i.e. equal exchangeabilities). For this flag, the user can specify any matrix, even those matrices defined by the user via the `-mdef` flag. If the user is agnostic of the exchangeabilities, we recommend using the default matrix (although it can be time-consuming).
219+
220+
Note that the user can estimate exchangeabilities jointly with weights of the profiles, branch lengths, and rates. This can be very time-consuming. If the goal is to optimize exchange abilities, one can fix the other parameters to reasonable estimates (for eg. fixing branch lengths and rates has been shown to perform adequately for estimation of exchangeabilities)
221+
222+
There is an additional flag `--rates-file` that will produce a nexus file with the exchangeability matrix obtained from the optimization. This file can be later used for phylogenetic inference with the use of the `-mdef` flag.
223+
224+
225+
If you use this routine in a publication please cite:
226+
227+
> __H. Banos et al.__ (2024) Estimating Linked Exchangeabilities for Profile Mixture Models. _Bioraxiv.
228+
229+
198230
Here, the NEXUS file contains a `models` block to define new models. More explicitly, we define four AA profiles `Fclass1` to `Fclass4`, each containing 20 AA frequencies. Then, the frequency mixture is defined with
199231

200232
FMIX{empirical,Fclass1,Fclass2,Fclass3,Fclass4}

0 commit comments

Comments
 (0)