A simple modular Bash pipeline to run multiple sequence alignment, trim it, and infer a phylogenetic tree using IQ-TREE 2.
conda env create -f environment.yml
conda activate msa-phyloThis will install:
- MAFFT (multiple sequence alignment)
- TrimAl (automated trimming)
- IQ-TREE 2 (phylogenetic inference)
sudo apt install mafft
sudo apt install trimal
conda install -c bioconda iqtreeinput/sequences.fasta: your input sequencesoutput/: all results will be saved here
cd msa-phylo-pipeline/
chmod +x *.sh
./run_pipeline.shYou can change threads or memory by editing run_pipeline.sh:
THREADS=86
MEMORY=200Goutput/aligned.fasta: MAFFT alignmentoutput/aligned_trimmed.fasta: TrimAl-trimmed and reformatted alignmentoutput/tree.treefile: Final phylogenetic tree from IQ-TREE
- IQ-TREE will detect identical sequences and remove them by default (see
.iqtreelog). - To retain duplicates, add
--keep-identtotree.sh.
MIT