You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
`j` option tells it to use all CPU cores to speed up the compilation. Without this option, `make` uses only one core, which might be slow.
108
108
109
-
This creates an executable `iqtree2` (`iqtree` for version 1). It can be copied to your system search path so that IQ-TREE can be called from the Terminal simply with the command line `iqtree2`.
109
+
This creates an executable `iqtree3` (`iqtree` for version 1). It can be copied to your system search path so that IQ-TREE can be called from the Terminal simply with the command line `iqtree3`.
110
110
111
111
To compile IQ-TREE under Linux with ARM processor, use either GCC 10 (but not above), or Clang 14 or above.
Copy file name to clipboardExpand all lines: doc/Complex-Models.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -347,7 +347,7 @@ If you use this model in a publication please cite:
347
347
348
348
Starting with version 2.3.0, the MAST model can be executed by adding `+T` to the model option (`-m`) and providing a newick file with multiple trees by the option (`-te`). For example if one wants to fit a MAST model with different topologies contained in `trees.nwk` in conjunction with the `GTR` model to sequences in `data.fst`, one would use the following command:
349
349
350
-
iqtree2 -s data.fst -m "GTR+T" -te trees.nwk
350
+
iqtree3 -s data.fst -m "GTR+T" -te trees.nwk
351
351
352
352
The above command will *link* GTR parameters across all the trees. That means all trees will have the same GTR model. IQ-TREE will check the number of trees inside the newick file, and then estimate the model parameters and the weights of each tree: the proportion of sites belonging to each tree.
353
353
@@ -359,11 +359,11 @@ An example of the newick file with 3 topologies:
359
359
360
360
You can also link the GTR parameters, frequency array, and the rate-heterogeneity-across-site (RHAS) model across all the trees by including the frequency and the RHAS model in the model option (`-m`). For example:
361
361
362
-
iqtree2 -s data.fst -m "GTR+FO+G+T" -te trees.nwk
362
+
iqtree3 -s data.fst -m "GTR+FO+G+T" -te trees.nwk
363
363
364
364
If one would like to have *unlink* components across the trees (for example, each tree has its own substitution model, frequency array and RHAS model), one can specify the unlinked components via the `TMIX` keyword in the model string. For example:
The above command specifies the `GTR+FO+G` model for the first topology (inside the newick file), the `F81+FO+R3` model for the second topology, and the `HKY+FO+I` model for the third topology. These components are given in curly brackets and separated with a comma. Note that the number of components has to match with the number of topologies in the newick file.
369
369
@@ -388,15 +388,15 @@ Note: subst - substitution model; freq - DNA/AA frequency array; RHAS - rate het
388
388
One can use `+TR` instead of `+T` to represent the branch-length-Restricted MAST model.
389
389
In this model, the length of branch `x` of a tree <code>T<sub>i</sub></code> is constrained to be equal to the length of branch `y` of a tree <code>T<sub>j</sub></code> if the branches `x` and `y` split the trees <code>T<sub>i</sub></code> and <code>T<sub>j</sub></code> into the same two sets of taxa. For example:
In the above command, all trees share the same GTR model, DNA frequencies and gamma model, and the lengths of the branches across the trees which split the taxa set into the same partition are restricted the same.
394
394
395
395
**Weight-constrained MAST model**
396
396
397
397
One can define a constraint array following `+T` to restrict the tree weights. The constraint array can be defined as <code>[s<sub>1</sub>,s<sub>2</sub>,...,s<sub>n</sub>]</code> where <code>s<sub>i</sub></code> can be any string. The weight of tree <code>T<sub>i</sub></code> and that of tree <code>T<sub>j</sub></code> are restricted the same value if <code>s<sub>i</sub> = s<sub>j</sub></code>. For example, assuming there are 3 topologies in the newick file:
In the above command, all trees share the same GTR model, DNA frequencies and gamma model, and the weight of the first tree is constrained as the same as the weight of the second tree.
Copy file name to clipboardExpand all lines: doc/Concordance-Factor.md
+23-23Lines changed: 23 additions & 23 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -46,25 +46,25 @@ First, you need to infer a reference tree (e.g. a species tree), on which the co
46
46
47
47
As an example, you can apply an [edge-linked proportional partition model](Complex-Models) with ultrafast bootstrap (1000 replicates; for comparison with concordance factors):
where `ALN_FILE` and `PARTITION_FILE` are your input files. `-T AUTO` is to detect the best number of CPU cores. Here we use a prefix `concat`, so that all output files (`concat.*`) do not interfere with analyses below. If `--prefix` is omitted, all output files will be `PARTITION_FILE.*`.
52
52
53
-
Moreover, IQ-TREE 2 provides a new convenient feature: if you have a directory with many (locus) alignments, you can specify this directory directly with `-p` option:
53
+
Moreover, IQ-TREE 3 provides a new convenient feature: if you have a directory with many (locus) alignments, you can specify this directory directly with `-p` option:
54
54
55
-
iqtree2 -p ALN_DIR --prefix concat -B 1000 -T AUTO
55
+
iqtree3 -p ALN_DIR --prefix concat -B 1000 -T AUTO
56
56
57
57
IQ-TREE detects if `-p` argument is a directory and automatically load all alignment files and concatenate them into a supermatrix for the partition analysis.
58
58
59
59
60
60
Inferring gene/locus trees
61
61
--------------------
62
62
63
-
We now construct a set of gene/locus trees. One can manually do a for-loop, but IQ-TREE 2 provides a new convenient option `-S` to compute individual locus trees given a partition file or a directory:
63
+
We now construct a set of gene/locus trees. One can manually do a for-loop, but IQ-TREE 3 provides a new convenient option `-S` to compute individual locus trees given a partition file or a directory:
64
64
65
-
iqtree2 -s ALN_FILE -S PARTITION_FILE --prefix loci -T AUTO
65
+
iqtree3 -s ALN_FILE -S PARTITION_FILE --prefix loci -T AUTO
66
66
# or
67
-
iqtree2 -S ALN_DIR --prefix loci -T AUTO
67
+
iqtree3 -S ALN_DIR --prefix loci -T AUTO
68
68
69
69
In the second case, IQ-TREE automatically detects that `ALN_DIR` is a directory and will load all alignment files within the directory. So `-S` takes the same argument as `-p` except that it performs model selection (ModelFinder) and tree inference separately for each partition/alignment. The output files are similar to those from a partitioned analysis, except that `loci.treefile` now contains a set of trees.
70
70
@@ -73,7 +73,7 @@ Gene concordance factor (gCF)
73
73
74
74
Given the species tree `concat.treefile` and the set of locus trees `loci.treefile` computed above, you can calculate gCF for each branch of the species tree as the fraction of decisive gene trees concordant with this branch:
Note that `-t` accepts any reference tree (e.g., by coalescent/reconciliation approach) and `--gcf` accepts any set of trees (e.g. locus trees and bootstrap trees), which may contain a subset of taxa from the reference tree. IQ-Tree will write three files:
79
79
@@ -92,7 +92,7 @@ Site concordance factor (sCF)
92
92
Given the species tree `concat.treefile` and the alignment, you can calculate sCF for each branch of the species tree as the fraction of decisive alignment sites supporting that branch:
Here, each branch of `concord.cf.tree` will be assigned (or appended) with `gCF/sCF` values and `concord.cf.stat` will be written with both gCF and sCF values.
116
116
@@ -122,30 +122,30 @@ Putting it all together
122
122
If you have separate alignments for each locus in a folder, then perform the following commands:
123
123
124
124
# infer a concatenation-based species tree with 1000 ultrafast bootstrap and an edge-linked partition model
125
-
iqtree2 -p ALN_DIR --prefix concat -B 1000 -T AUTO
125
+
iqtree3 -p ALN_DIR --prefix concat -B 1000 -T AUTO
That command will have figured out for you the model of evolution, all the parameters of that model, and the branch lengths of the corresponding tree. We can re-use all of that useful information in the final step. It just takes a little bit of effort to find what you need.
166
166
@@ -183,21 +183,21 @@ To put all of that together, we are going to change the final command of the tut
183
183
184
184
# simple command, with per-locus alignments
185
185
# compute site concordance factor using likelihood with v2.2.2
To one of these, where we add the two extra commands via `-blfix` and `-m`, to fix all the parameters we already calculated. A reminder - do NOT use the exact commandlines above. You have to replace everything after the `-m` with what you found in your own `concat.iqtree` file:
193
193
194
194
# faster analysis, using pre-computed model parameters, with per-locus alignments
195
195
# compute site concordance factor using likelihood with v2.2.2
All this does is tells IQ-TREE to use the model parameters and branch lengths you already calculated. On large datasets this can save a lot of analysis time.
0 commit comments