You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: doc/Estimating-gene,-site,-and-quartet-concordance-vectors.md
+6-4Lines changed: 6 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -52,7 +52,7 @@ You can download these 400 alignments from here: [bird_400.tar.gz](https://githu
52
52
tar -xzf bird_400.tar.gz
53
53
```
54
54
55
-
For the sake of reproducibility, you can also create your own set of 400 randomly-selected loci from the intergenic regions sequenced for this paper using the following commands:
55
+
For the sake of reproducibility, you can also create your own set of 400 randomlyselected loci from the intergenic regions sequenced for this paper using the following commands:
56
56
57
57
```bash
58
58
# Get the data from the paper's supplementary data repository
The last set of commands will produce a file just like the one you can download above, with 400 randomly-selected loci. Note that you should expect to get a slightly different species tree and concordance factors, because there's a *lot* of discordance along the backbone of the species tree of birds, so different groups of 400 loci are highly likely to give different species trees.
70
+
The last set of commands will produce a file just like the one you can download above, with 400 randomlyselected loci. Note that you should expect to get a slightly different species tree and concordance factors, because there's a *lot* of discordance along the backbone of the species tree of birds, so different groups of 400 loci are highly likely to give different species trees.
71
71
72
72
# Estimating the gene trees
73
73
@@ -109,6 +109,8 @@ This analysis will produce two files. For convenience you can download these her
109
109
110
110
Now we want to calculate gene, site, and quartet concordance vectors, and posterior probabilities (support values calculated by ASTRAL) for every branch in our species tree. To do that, we need our species tree (of course); our gene trees (gene and quartet concordance vectors are calculated from these); our alignments (site concordance vectors are calculated from these).
111
111
112
+
> Note that concordance factors and support values apply to *branches* in trees, not nodes.
113
+
112
114
### Estimate the support and quartet concordance vectors in ASTRAL
113
115
114
116
We use ASTRAL to calculate quartet concordance vectors and posterior support values (which are calculated from the quartet support values, see below for an explanation of both).
@@ -127,7 +129,7 @@ There are two output files here, which you can download here:
127
129
*`astral_species_annotated.tree`: the species tree with annotations on every branch
128
130
*`astral_species_annotated.log`: the log file for ASTRAL
129
131
130
-
The annotated tree contains a lot of extra information on every node, e.g.:
132
+
The annotated tree contains a lot of extra information on every branch, e.g.:
@@ -284,7 +286,7 @@ The concordance factors tell you a certain amount, but to understand things bett
284
286
285
287
# Generate concordance tables for branches of interest
286
288
287
-
A concordance table is just a table of the three concordance vectors, as shown in the Lanfear and Hahn paper. The `concordance_table.R` script lets you generate a concordance table for any node, based on the branch ID. Here we'll do that for two branches that were recovered in the original Nature paper, discussed in Lanfear and Hahn, and also recovered in the ASTRAL tree we estimated here from 400 loci (I found the branch IDs for these branches by studying the tree labelled with branch IDs that I made above):
289
+
A concordance table is just a table of the three concordance vectors, as shown in the Lanfear and Hahn paper. The `concordance_table.R` script lets you generate a concordance table for any branch, based on the branch ID. Here we'll do that for two branches that were recovered in the original Nature paper, discussed in Lanfear and Hahn, and also recovered in the ASTRAL tree we estimated here from 400 loci (I found the branch IDs for these branches by studying the tree labelled with branch IDs that I made above):
288
290
289
291
***Branch 598**: the Palaeognathae (kiwis and other cool birds)
290
292
***Branch 545**: the Telluraves (passerines and other closely related groups)
0 commit comments