Fast ITS region extraction in Rust (HMMER-based), designed for long-read amplicon data (ONT / PacBio HiFi) and general FASTA/FASTQ inputs.
+ITS subregion extraction for fungal metabarcoding at long-read scale.
+
+As long-read amplicon sequencing (Oxford Nanopore and PacBio HiFi) becomes routine, extracting ITS subregions (ITS1, 5.8S, ITS2, full ITS) reliably at scale can become a throughput and robustness bottleneck. ITSxRust is a Rust-based ITS extractor that follows the standard approach of locating conserved ribosomal flanks using profile-HMMs (via HMMER), while adding long-read–oriented features for reproducible, high-throughput processing.

## Features
-- Extract ITS1, ITS2, and/or full ITS region(s)
-- Works with FASTA and FASTQ inputs (optionally gzipped if supported in your build)
-- Produces extracted sequences plus optional boundary/anchor reporting
-- Designed to be fast and reproducible
+- HMMER/profile-HMM–based detection of conserved ribosomal flanks to extract ITS subregions
+- Optional dereplication to reduce redundant HMMER searches
+- Partial-chain fallback: recover subregions using two-anchor pairs when a full four-anchor chain is unavailable
+- Structured failure diagnostics and QC summaries to help understand why reads were skipped or partially recovered
+- Works with FASTA and FASTQ inputs
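
The partial-chain fallback named in the feature list can be pictured with a small Rust sketch. It is not taken from the ITSxRust source: the `Anchors` and `Regions` types, their field names, and `recover_regions` are hypothetical, and the sketch assumes the four anchors are the end of the SSU flank, the start and end of 5.8S, and the start of the LSU flank. Each subregion is derived from its own pair of anchors, so a read missing one flank can still yield the subregion whose two anchors were found.

```rust
/// Positions (0-based, on the read) of the four conserved anchors.
/// Hypothetical layout for illustration; not ITSxRust's actual types.
#[derive(Debug, Clone, Copy, Default)]
pub struct Anchors {
    pub ssu_end: Option<usize>,   // first base after the SSU flank
    pub s58_start: Option<usize>, // first base of 5.8S
    pub s58_end: Option<usize>,   // first base after 5.8S
    pub lsu_start: Option<usize>, // first base of the LSU flank
}

/// Half-open (start, end) intervals for each recoverable subregion.
#[derive(Debug, PartialEq, Eq, Default)]
pub struct Regions {
    pub its1: Option<(usize, usize)>,
    pub its2: Option<(usize, usize)>,
    pub full_its: Option<(usize, usize)>,
}

/// Return an interval only if both anchors exist and are in order.
fn span(start: Option<usize>, end: Option<usize>) -> Option<(usize, usize)> {
    match (start, end) {
        (Some(s), Some(e)) if s < e => Some((s, e)),
        _ => None,
    }
}

/// Evaluate every two-anchor pair independently, so a missing anchor
/// only disables the subregions that depend on it.
pub fn recover_regions(a: &Anchors) -> Regions {
    Regions {
        its1: span(a.ssu_end, a.s58_start),
        its2: span(a.s58_end, a.lsu_start),
        full_its: span(a.ssu_end, a.lsu_start),
    }
}
```

Under these assumptions, a read where only the 5.8S end and the LSU start were located would yield ITS2 and leave ITS1 and the full ITS empty.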

## Install

-### From source (developer install)
+### Prebuilt binaries (recommended)
+Download the appropriate binary for your OS from GitHub Releases:
+
+- GitHub → Releases → `v0.1.0`
+
+Then:
+
+```bash
+chmod +x itsxrust
+./itsxrust --help
+```
+
+### From source
Requires Rust (stable) and Cargo.
```bash
@@ -25,39 +41,35 @@ cargo install --path .
itsxrust --help
```

-### Planned distribution
-The manuscript version will provide:
-- Bioconda recipe
-- Prebuilt binaries (GitHub Releases)
-- Container images (GHCR)
+### Dependency: HMMER
+ITSxRust coordinates HMMER searches (e.g., `hmmscan`) to locate ribosomal flanks. Ensure HMMER is available in your environment for typical extraction workflows.
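
One way to confirm that HMMER is reachable before a run is to try launching `hmmscan` from Rust. This is a sketch, not ITSxRust's own check: `tool_available` is a hypothetical helper, and it relies only on `hmmscan` accepting `-h`, the standard HMMER help flag.

```rust
use std::process::{Command, Stdio};

/// Returns true if `tool` can be launched from the current PATH.
/// Hypothetical helper for illustration; it only tests that the
/// process starts, not that the HMMER version is suitable.
pub fn tool_available(tool: &str) -> bool {
    Command::new(tool)
        .arg("-h") // HMMER programs print brief help and exit
        .stdin(Stdio::null())
        .stdout(Stdio::null())
        .stderr(Stdio::null())
        .status()
        .is_ok()
}
```

A caller could run `tool_available("hmmscan")` at startup and report a clear error when it returns `false`, rather than failing partway through extraction.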

## Usage

-Basic help:
+Help:

```bash
itsxrust --help
itsxrust extract --help
```

-Example extraction (adjust flags to match your CLI):