Introduction to Pangenomics


Figure 1

 Venn diagram of a) a closed pangenome and b) an open pangenome, comparing the sizes of their core and accessory genomes. c) Graphic depicting the differences between closed and open pangenomes regarding their size, total genes in pangenome, and the number of sequenced genomes.

Downloading Genomic Data


Annotating Genomic Data


Measuring Sequence Similarity


Clustering with BLAST Results


Figure 1

 Bidirectional best-hit algorithm

Clustering Protein Sequences


Exploring Pangenome Graphs


Figure 1

Bar graph depicting the gene family frequency distribution, represented by a U-shaped plot. The number of organisms is plotted in the x-axis and the number of gene families in the y axis.

Figure 2

Tile plot displaying the gene families present within six strains of Streptococcus agalactiae, including the cloud gene families

Figure 3

default Gephi visualization after layout specifications

Figure 4

Gephi visualization with orange nodes for the persistent families, blue for cloud, and green for shell.

Figure 5

Gephi visualization with 8 different colors.

Figure 6

Gephi visualization with the nodes colored according to the number of genes that are part of the family.

Figure 7

Gephi visualization with all hypothetical proteins are in pink and the rest in gray.

Interactive Pangenome Plots


Figure 1

Interactive Anvio pan genome analysis of six S. agalactiae genomes.
                                                               	Each circle corresponds to one genome and each radius represents a gene family.

Other Resources


Figure 1

Example of network made with Graphia.