Skip to content

Commit 6200421

Browse files
committed
Updated Estimating gene, site, and quartet concordance vectors (markdown)
1 parent ec98e63 commit 6200421

1 file changed

Lines changed: 6 additions & 4 deletions

File tree

doc/Estimating-gene,-site,-and-quartet-concordance-vectors.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -52,7 +52,7 @@ You can download these 400 alignments from here: [bird_400.tar.gz](https://githu
5252
tar -xzf bird_400.tar.gz
5353
```
5454

55-
For the sake of reproducibility, you can also create your own set of 400 randomly-selected loci from the intergenic regions sequenced for this paper using the following commands:
55+
For the sake of reproducibility, you can also create your own set of 400 randomly selected loci from the intergenic regions sequenced for this paper using the following commands:
5656

5757
```bash
5858
# Get the data from the paper's supplementary data repository
@@ -67,7 +67,7 @@ find 63k_alns/ -type f ! -name '.*' | shuf -n 400 | xargs -I {} mv {} bird_400/
6767
tar -czf bird_400.tar.gz -C bird_400 .
6868
```
6969

70-
The last set of commands will produce a file just like the one you can download above, with 400 randomly-selected loci. Note that you should expect to get a slightly different species tree and concordance factors, because there's a *lot* of discordance along the backbone of the species tree of birds, so different groups of 400 loci are highly likely to give different species trees.
70+
The last set of commands will produce a file just like the one you can download above, with 400 randomly selected loci. Note that you should expect to get a slightly different species tree and concordance factors, because there's a *lot* of discordance along the backbone of the species tree of birds, so different groups of 400 loci are highly likely to give different species trees.
7171

7272
# Estimating the gene trees
7373

@@ -109,6 +109,8 @@ This analysis will produce two files. For convenience you can download these her
109109

110110
Now we want to calculate gene, site, and quartet concordance vectors, and posterior probabilities (support values calculated by ASTRAL) for every branch in our species tree. To do that, we need our species tree (of course); our gene trees (gene and quartet concordance vectors are calculated from these); our alignments (site concordance vectors are calculated from these).
111111

112+
> Note that concordance factors and support values apply to *branches* in trees, not nodes.
113+
112114
### Estimate the support and quartet concordance vectors in ASTRAL
113115

114116
We use ASTRAL to calculate quartet concordance vectors and posterior support values (which are calculated from the quartet support values, see below for an explanation of both).
@@ -127,7 +129,7 @@ There are two output files here, which you can download here:
127129
* `astral_species_annotated.tree`: the species tree with annotations on every branch
128130
* `astral_species_annotated.log`: the log file for ASTRAL
129131

130-
The annotated tree contains a lot of extra information on every node, e.g.:
132+
The annotated tree contains a lot of extra information on every branch, e.g.:
131133

132134
```
133135
[q1=0.9130236794171221;q2=0.04753773093937029;q3=0.03943858964350768;f1=334.1666666666667;f2=17.398809523809526;f3=14.43452380952381;pp1
@@ -284,7 +286,7 @@ The concordance factors tell you a certain amount, but to understand things bett
284286
285287
# Generate concordance tables for branches of interest
286288

287-
A concordance table is just a table of the three concordance vectors, as shown in the Lanfear and Hahn paper. The `concordance_table.R` script lets you generate a concordance table for any node, based on the branch ID. Here we'll do that for two branches that were recovered in the original Nature paper, discussed in Lanfear and Hahn, and also recovered in the ASTRAL tree we estimated here from 400 loci (I found the branch IDs for these branches by studying the tree labelled with branch IDs that I made above):
289+
A concordance table is just a table of the three concordance vectors, as shown in the Lanfear and Hahn paper. The `concordance_table.R` script lets you generate a concordance table for any branch, based on the branch ID. Here we'll do that for two branches that were recovered in the original Nature paper, discussed in Lanfear and Hahn, and also recovered in the ASTRAL tree we estimated here from 400 loci (I found the branch IDs for these branches by studying the tree labelled with branch IDs that I made above):
288290

289291
* **Branch 598**: the Palaeognathae (kiwis and other cool birds)
290292
* **Branch 545**: the Telluraves (passerines and other closely related groups)

0 commit comments

Comments
 (0)