You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<li>`-m MIX"{GTR+FO,GTR+FO}"+I+R3` to specify a mixture model with two separate GTR models ([Ren et al., 2025]) with optimized nucleotide frequencies as well as invariant sites and 3 freerate categories. The quotes aroung the curly braces are important, otherwise bash will try to do variable expansion.</li>
279
-
<li>`--prefix turtle2` to keep file names informative and tidy. </li>
280
-
</ul>
281
-
282
-
> Now use the same command structure to try mixture models with 3, 4, and 5 classes!
283
-
284
-
</div>
285
-
286
-
287
-
> **QUESTIONS:**
288
-
>
289
-
> * Look at the report file `turtle2.iqtree`, 'turtle3.iqtree', etc. What are the AIC/AICc/BIC scores of the various mixture models? Is it better than a 1-class model?
290
-
>
291
-
> * Look at the tree in `turtle2.iqtree` or visualize `turtle2.treefile`
292
-
> in FigTree. What relationship among [three trees](#1-input-data) does this tree support? What about for the 3-, 4-, and 5-class models?
293
-
>
294
-
> * What is the ultrafast bootstrap support (%) for the relevant clade under each model?
295
-
>
296
-
> * Which of these trees agree with the published tree ([Chiari et al., 2012])?
297
-
{: .tip}
298
-
299
-
300
257
5) Tree topology tests
301
258
----------
302
259
<divclass="hline"></div>
@@ -411,7 +368,7 @@ Options explained:
411
368
412
369
> **QUESTIONS:**
413
370
>
414
-
> * Look at `turle.mix.iqtree` for the line printing the tree weights. Which tree has a higher weight?
371
+
> * Look at `turtle.mix.iqtree` for the line printing the tree weights. Which tree has a higher weight?
415
372
> * Is it the tree having higher likelihood found in step 5?
416
373
{: .tip}
417
374
@@ -461,7 +418,7 @@ log-likelikehood subtraction as pointed out above).
461
418
We now try to construct a tree without these "influential" genes.
462
419
To do so, copy the partition file `turtle.nex` to a new file and
463
420
remove the lines defining the `charset` of these genes, and then
464
-
repeat the IQ-TREE run with a parititon model (see section 4).
421
+
repeat the IQ-TREE run with a partition model (see section 4).
465
422
You will need to figure out a command line to run IQ-TREE yourself here.
466
423
467
424
> **QUESTIONS:**
@@ -546,11 +503,62 @@ Similarly, you can compute gCF and sCF for the tree under unpartitioned model:
546
503
> contradicting branch?
547
504
{: .tip}
548
505
506
+
507
+
10) Applying a mixture model
508
+
---------------------------
509
+
<divclass="hline"></div>
510
+
511
+
512
+
We now perform a mixture model analysis ([Ren et al., 2025]), where
513
+
each site is described as a mixture of models. For this exercise we will use a slightly different model, GTR+FO+I+R3, which is the best fit to the entire turtle dataset.
514
+
515
+
Estimate a phylogeny under this model as a baseline:
<li>`-m MIX"{GTR+FO,GTR+FO}"+I+R3` to specify a mixture model with two separate GTR models ([Ren et al., 2025]) with optimized nucleotide frequencies as well as invariant sites and 3 freerate categories. The quotes aroung the curly braces are important, otherwise bash will try to do variable expansion. </li>
533
+
<li>`--prefix turtle2` to keep file names informative and tidy. </li>
534
+
</ul>
535
+
536
+
> Now use the same command structure to try a mixture model with 4 classes, and another one with 6 classes! These will take some time to run, more as the number of classes increases. In the meantime, check out [Ren et al., 2025], which performed a similar analysis of mixture models with different numbers of model classes - on a larger turtles dataset.
537
+
538
+
</div>
539
+
540
+
541
+
> **QUESTIONS:**
542
+
>
543
+
> * Look at the report files `turtlebest.iqtree`, `turtle2.iqtree`, `turtle4.iqtree`, and `turtle6.iqtree`. What are the BIC scores of the various mixture models? Which is best?
544
+
>
545
+
> * Look at the tree in `turtle2.iqtree` or visualize `turtle2.treefile`
546
+
> in FigTree. What relationship among [three trees](#1-input-data) does this tree support? What about for the 4- and 6-class models?
547
+
>
548
+
> * What is the ultrafast bootstrap support (%) for the relevant clade under each model?
549
+
>
550
+
> * Which of these trees agree with the published tree ([Chiari et al., 2012])? Do the relationships or ultrafast bootstrap values change with model complexity?
551
+
> *[Ren et al., 2025] performed a similar analysis of the effect of number of model classes on the relationships between turtle, bird, and crocodile. What relationship among [three trees](#1-input-data) do these authors find supported by a 1-class model? What about a 6-class model? Do our results agree?
552
+
{: .tip}
553
+
554
+
549
555
> **FINAL QUESTIONS:**
550
556
>
551
557
> * Given all analyses you have done in this tutorial, which relationship between
552
558
> Turtle, Crocodile and Bird is true in your opinion?
553
559
560
+
561
+
554
562
[Adachi and Hasegawa, 1996]: http://www.is.titech.ac.jp/~shimo/class/doc/csm96.pdf
555
563
[Anisimova et al., 2011]: https://doi.org/10.1093/sysbio/syr041
556
564
[Brinkmann et al., 2005]: https://doi.org/10.1080/10635150500234609
@@ -573,6 +581,7 @@ Similarly, you can compute gCF and sCF for the tree under unpartitioned model:
573
581
[Minh et al., 2020]: https://doi.org/10.1093/molbev/msaa106
574
582
[Nei et al., 2001]: https://doi.org/10.1073/pnas.051611498
575
583
[Nguyen et al., 2015]: https://doi.org/10.1093/molbev/msu300
584
+
[Ren et al., 2025]: https://doi.org/10.1093/molbev/msae264
576
585
[Shimodaira and Hasegawa, 1999]: https://doi.org/10.1093/oxfordjournals.molbev.a026201
0 commit comments