MiGA Manual
AboutCodebaseMiGA Online
Primary version
Primary version
  • Introduction
  • Part I: What is MiGA?
    • How can MiGA help me?
    • Who is using MiGA?
    • Who is behind MiGA?
    • Definitions
  • Part II: Getting started
    • Requirements
      • Using Homebrew
      • Using apt-get
      • Using Conda
      • Installing from source
      • MyTaxa Utils
    • Installation
    • MiGA types
    • Input data
    • Distances
    • Clustering
  • Part III: Interfaces
    • MiGA API
    • MiGA CLI
    • MiGA Web
  • Part IV: Deploying examples
    • RefSeq in MiGA
    • Build a clade collection
    • Launching daemons
    • Setting up MiGA in a cluster
  • Part V: Additional details
    • Advanced configuration
    • MiGA workflow
    • Metadata
    • External Software
  • Part VI: Workflows
    • Quality
    • Dereplicate
    • Classify
    • Preprocess
    • Index
    • Summaries
Powered by GitBook
On this page
  • Expected output
  • Indexing publicly available genomes
  1. Part VI: Workflows

Index

PreviousPreprocessNextSummaries

Last updated 4 years ago

This workflow automates the indexing of your genomic collection.

To execute the workflow, run:

miga index_wf -o my_project path/to/genomes/*.fasta

For additional options, run:

miga index_wf -h

Expected output

Once your run is complete, you may expect the for cds, assembly, essential_genes, and ssu. Additionally all the intermediate files are preserved, including assemblies, predicted genes, and detected essential and ribosomal genes. Importantly, all-vs-all comparisons are generated using Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), making this a queriable project.

Indexing publicly available genomes

It is also possible to use this workflow on genomes publicly available in NCBI. This enables taxonomic analysis, such as using this indexed project as a reference database for the .

For example, to download and index all the genomes from the species Xanthomonas vesicatoria using Diamond for AAI and FastANI for ANI estimation (--fast flag), run:

miga index_wf -o X_vesicatoria \
  -T 'Xanthomonas vesicatoria' --project-type clade --fast -v
standard summaries
classification workflow