Skip to content

Update MEDI to new DB builder and NCBI Taxonomy#39

Open
cdiener wants to merge 85 commits into
Gibbons-Lab:mainfrom
dienerlab:feature/new_db
Open

Update MEDI to new DB builder and NCBI Taxonomy#39
cdiener wants to merge 85 commits into
Gibbons-Lab:mainfrom
dienerlab:feature/new_db

Conversation

@cdiener

@cdiener cdiener commented Jan 29, 2026

Copy link
Copy Markdown
Collaborator

TODO:

  • update architeuthis
  • explain need for Kraken2 patches
  • document the new Docker image
  • provide a small example DB

@szimmerman92

Copy link
Copy Markdown

Hello. I have been making some edits to MEDI to fix the rsync issues caused by making to many requests to download genomes. I switched rsync to databases CLI like as done in this pull request, but I noticed the dehydrate option was not included if I am reading your commits correctly.

If you do something like

datasets download genome accession --inputfile accessions.txt --dehydrated --filename my-genomes.zip
unzip my-genomes.zip -d my-genomes
datasets rehydrate --directory my-genomes/

the genomes download very fast, compared to not using the dehydrated/rehydrate commands.

I hope this helps.

Best,
Sam

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants