You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To make obitools available everywhere, add both the obitools binary and the obitools `/export/bin` folder to your path.
12
13
@@ -125,19 +126,22 @@ Do the rest separately for your two amplicon types.
125
126
As above, use obiannotate to annotate your sequences.
126
127
`01_scripts/obiannotate_ident.sh`
127
128
129
+
After completing this step, one can use the following scripts to account for reads from each amplicon type:
130
+
`01_scripts/account_reads_annot.sh`
131
+
`02_scripts/account_reads_annot.R`
132
+
128
133
Move on to [Part 2](#part-2-main-analysis).
129
134
130
135
## Part 2. Main Analysis
131
136

132
137
133
138
### 2.1. Retain Only Unique Reads
134
-
Input is a single fastq file containing all samples for a specific amplicon, annotated with sample name.
139
+
Input is a single fastq file containing all samples for a specific amplicon, annotated in the fastq accession with sample name.
135
140
136
-
Use obiuniq to keep one record per unique amplicon in the fastq (outputs fasta).
141
+
Use obiuniq to keep one record per unique amplicon, retaining the count number (outputs fasta):
137
142
For paired-end data: `./01_scripts/03_retain_unique_PE.sh`
138
143
For single-end data: `./01_scripts/03_retain_unique_SE.sh`
139
-
(in brief: `obiuniq -m sample 04_samples/*assi.fq > 04_samples/*_uniq.fa`)
140
-
Note: one can also add other -m flags, such as `run`, or `pcr_rep`, etc., anything that you may want to summarize over using obitab later.
144
+
Note: one can also edit this script to add other -m flags, such as `run`, or `pcr_rep`, etc., anything that you may want to summarize over using obitab later.
141
145
142
146
Audit: sum up the count value to make sure all reads are accounted for:
@@ -207,8 +202,13 @@ This will use the R script `read_counts_to_annotations.R`, run interactively.
207
202
Necessary inputs:
208
203
Amplicon annotation output from MEGAN, and amplicon read count from `obitab`
209
204
205
+
This script is currently highly customized to the two projects using this analysis. To change it to fit your data, you would have to provide the appropriate filenames, datatypes, locations of samples. This script would be more considered to be a template to modify for the user for other studies.
206
+
210
207
In brief, this will merge these two inputs, attach locations, aggregate different amplicons with same annotation, calculate proportions, save out proportion plots and count/proportion tables.
211
208
212
209
Within here, one can apply a low expression filter to remove any counts less than 10.
210
+
Output will be saved to `06_output_figures` (for figures) and `05_annotated` (for counts/proportions).
211
+
212
+
There is also a second script that is more customized, specifically tailored to the HABs project to deal with the Variant pipeline types `01_scripts/read_counts_to_annotations_HABs.R`. This script is not currently for broader use, and is only used for this project.
213
213
214
-
(note: currently working on improving this script to be more universal. See a larger version on `read_counts_to_annotations_HABs.R`)
214
+
These Rscripts may be continually developed if there is interest in increasing the generality of this pipeline. Please contact the author for more information or comments.
0 commit comments