Table 2 Top 20 most abundant uncharacterized arCOGs in “dark matter islands”

From: Dark matter in archaeal genomes: a rich source of novel mobile elements, defense systems and secretory complexes

| arCOG | Number in islands | Annotation and comment |
| --- | --- | --- |
| arCOG08821 | 18 | Membrane protein, expanded in Thaumarchaeota |
| arCOG10873 | 14 | Membrane protein |
| arCOG10027 | 12 | Thermococcus-specific secreted protein; paralogs belong to arCOG10066 and arCOG10028 and form gene clusters |
| arCOG10864 | 11 | Predicted peptidase of C39 family; possibly associated with pseudomurein-binding domains |
| arCOG10897 | 10 | Small membrane protein |
| arCOG06558 | 10 | Likely a secreted protease; contains propeptide PepSY and peptidase M4 domains |
| arCOG10066 | 9 | Thermococcus-specific secreted protein (see arCOG10027) |
| arCOG10865 | 9 | Methanobacterium-specific |
| arCOG09441 | 8 | HJR family endonuclease, PD-DEXK superfamily; associated with arCOG07809, viral primase fused to AAA DnaA-like ATPase and Zn-finger domain |
| arCOG10959 | 8 | Virus/plasmid associated; often co-occurs with primase, in particular arCOG06914 |
| arCOG09176 | 8 | Often associated with viruses or plasmids |
| arCOG09593 | 8 | Secreted protein with immunoglobulin-like domain |
| arCOG10866 | 8 | Methanosaeta-specific |
| arCOG09761 | 8 | Large secreted protein; Pyrobaculum-specific expansion |
| arCOG11121 | 7 | Uncharacterized conserved membrane protein |
| arCOG03316 | 7 | Secreted enzyme present in bacteria and eukaryotes; duplication in Methanosarcina acetivorans; DUF3160 |
| arCOG03631 | 7 | Methanosarcina-specific, present in bacteria |
| arCOG07691 | 7 | Secreted protein associated with membrane protein of a number of related arCOGs (e.g., arCOG09771); mostly beta-stranded; Pyrobaculum-specific expansion |
| arCOG06827 | 7 | Membrane protein expansion in Methanosarcina; MGWCP motif; DUF1673 |
| arCOG10868 | 7 | PD-(D/E)XK nuclease family transposase |
| arCOG10363 | 7 | Associated with Zn-finger-containing protein from arCOG08887 |

  1. Genes that belong to the mobilome are highlighted in bold type