The ever-increasing number of sequenced genomes presents us with an exciting opportunity to discover highly conserved gene families of unknown function, then characterize them experimentally. We have detected these gene families across the kingdom Fungi (see Methods at the bottom of this page), and invite the international research community to functionally characterize their individual members and propagate their annotations across the Fungal Tree of Life. Investigators can register, login, click on any cluster below (the number of the left), and add notes to any protein from the list along with the methods used for functional characterization.

Conserved Genes Families of Unknown Function:2024
Total Families:
339
Total Genes:
162,087
Total Unique Species:
1,753
Total Annotated Genes:
0
Total Unique PFAM Domains:
80
Total Unique Uniprot:
126
Total Unique PDB:
66
Updated:
2024-03-08
##GenesExpressed Genes %Genes with PhenotypesUnique SpeciesProteins with PFAM DomainsUnique PFAM Domains CountProtein PFAM DomainsUniprot HMM HintPDB HMM HintAlphaFold pLDDTFoldseek PDB HintConserved InUser Curated ModelsAvg. Protein Length
11,6418414610078Fungi0244
21,348501726222OG-Fe dioxygenase:
1
2OG-Fe(II) oxygenase superfamily:
1
88Universal0356
31,301850356322Arrestin (or S-antigen), N-terminal domain:
31
REV protein (anti-repression trans-activator protein):
1
86Fungi0423
41,1276711,0670090Fungi and Viridiplantae0148
51,0636005600076Fungi0379
61,0456007300082Eukaryota0220
79698103620075Fungi0326
89676509470081Fungi and Viridiplantae0118
99566009160061Universal0495
109516109210060Eukaryota0641
1194461092911SOS response associated peptidase (SRAP):
1
71Fungi and Viridiplantae0391
129406309310083Fungi and Viridiplantae0303
139255908640051Universal0397
149206209110082Universal0354
159156408980076Fungi and Viridiplantae0183
1691364090421Membrane-associating domain:
2
74Eukaryota0203
179106308920058Fungi and Viridiplantae01,106
188996407240090Fungi and Viridiplantae0114
198976208850050Eukaryota0430
208956208730060Fungi and Viridiplantae0501
218956508900067Fungi0397
228946308880086Fungi0305
2389362088511Cytochrome c oxidase biogenesis protein Cmc1 like:
1
83Eukaryota0323
248886308820068Fungi and Viridiplantae0325
258806108400096Fungi0183
26879731839452NAD dependent epimerase/dehydratase family:
25
NAD(P)H-binding:
20
91Fungi and Viridiplantae0299
278726108480045Fungi and Viridiplantae0420
288676408570050Universal0563
2986562184351Bombesin-like peptide:
5
78Eukaryota0290
3086461082911Atrophin-1 family:
1
58Fungi and Viridiplantae0684
318616208500080Fungi and Viridiplantae0624
328586008340076Fungi and Viridiplantae0225
338466307590078Fungi and Viridiplantae0227
348426408310061Fungi0278
358406508270058Fungi and Viridiplantae0391
3683865182921Peptidase family M41:
2
60Universal0409
378376408200068Fungi and Viridiplantae0590
388376308290057Universal0438
398346207930078Fungi and Viridiplantae0964
408335908150079Eukaryota0263
418316707190091Fungi and Viridiplantae0162
428306308090069Fungi and Viridiplantae0365
438266408050055Fungi and Viridiplantae0527
448246708170068Fungi and Viridiplantae0182
45820661810241Glutathione-dependent formaldehyde-activating enzyme:
24
71Eukaryota0250
468176308021031Altered inheritance of mitochondria protein 19:
103
79Fungi and Viridiplantae0137
478146508060068Universal0596
488136608000071Fungi0284
498126307930069Universal0176
5081164078221BLOC-1-related complex sub-unit 6 C-terminal helix:
2
78Fungi and Viridiplantae0169
5180464076711HupE / UreJ protein:
1
87Fungi and Viridiplantae0179
528046807960075Fungi0341
538036507980068Eukaryota0424
5479859077911Protein of unknown function (DUF4030):
1
68Fungi and SAR0136
557986307720071Fungi and Viridiplantae0536
567976307900079Fungi0240
577956207860046Fungi and Viridiplantae0656
587906407810079Eukaryota0565
597906507840078Fungi and Viridiplantae0295
607896407850075Fungi and Viridiplantae0415
617895807800082Universal0683
627836217670049Fungi and Prokaryotes0528
637816407360051Eukaryota0562
647816507740082Fungi and Viridiplantae0351
6577978157721Fungal protein of unknown function (DUF1774):
2
90Fungi0270
667726007640081Universal0638
677706507640057Fungi and SAR0439
6876785040711COPI associated protein:
1
74Fungi0279
697626607460053Fungi0266
707476107150079Fungi and Viridiplantae0560
717456607430065Eukaryota0451
727396407350071Fungi and Viridiplantae0120
737366307340069Eukaryota0526
747306407180073Fungi and SAR0522
757296807120064Fungi0150
7672966072211Prefoldin subunit:
1
64Eukaryota01,017
777258103780089Eukaryota0319
787238603570090Fungi0188
797235606620070Fungi and Viridiplantae0342
807207107150056Fungi and Viridiplantae0484
817156307000068Eukaryota0289
826976606970063Fungi0333
836856606620085Fungi0329
846776606700077Fungi0334
8567762067253AT hook motif:
2
Leucine Rich Repeat:
2
Leucine Rich Repeat:
1
71Fungi and Viridiplantae0716
866726106550066Fungi and SAR0284
8766860065982F-box domain:
7
F-box-like:
1
73Fungi0592
886656506610083Fungi0152
896605906510061Eukaryota0410
9065772163311F-box-like:
1
89Fungi and Viridiplantae0544
916515706190071Fungi and Viridiplantae0360
926436806290090Fungi and Viridiplantae0183
936216006120064Eukaryota0271
946146605540078Universal0823
956126406080057Fungi and Viridiplantae01,169
966106506010057Fungi and Viridiplantae0374
976106706100050Fungi0329
986016605740062Fungi and Viridiplantae0373
9959957059011Metchnikowin family:
1
58Universal0240
1005946405880078Fungi and Viridiplantae0569
1015945905910081Eukaryota0596
102592751445951Domain of unknown function (DUF4336):
95
95Fungi0252
1035916705860062Eukaryota01,711
10458667058511Coiled-coil domain-containing protein 34:
1
56Fungi and SAR0408
1055805815780068Fungi and Viridiplantae0357
1065775705310073Eukaryota01,096
1075766105720072Universal0325
1085716405620079Fungi and Viridiplantae0218
1095665102940071Fungi0393
1105636605600058Fungi and Viridiplantae0233
1115586605570072Fungi and Viridiplantae0308
1125518403450082Fungi0182
11354468053111AN1-like Zinc finger:
1
63Fungi and Viridiplantae0156
11454165053421Small acid-soluble spore protein P family:
2
58Fungi and Viridiplantae0435
11554160052411Protein of unknown function (DUF1698):
1
93Fungi and Viridiplantae0397
1165396805370075Fungi and Viridiplantae0236
11753567052911Insulinase (Peptidase family M16):
1
81Fungi0771
1185317005290079Fungi and Viridiplantae0172
1195276305200070Fungi0117
1205106205050055Fungi and Viridiplantae0641
1215108703540090Fungi and Viridiplantae0203
1225086205010058Eukaryota0866
1235056504990070Fungi and Viridiplantae0456
1244996204810078Universal0379
12549469049352PPR repeat:
4
Pentatricopeptide repeat domain:
1
76Fungi and Viridiplantae0806
1264787204650073Fungi and Viridiplantae0167
1274655504110053Fungi0599
1284648113890093Fungi0176
129461690457124F-box domain:
9
Protein of unknown function (DUF2442):
1
Serum albumin family:
1
F-box-like:
1
78Fungi and Viridiplantae0507
1304607014290085Fungi0192
1314548403993031Protein of unknown function (DUF1308):
303
80Fungi0610
1324505804230056Fungi and SAR0388
1334487103210092Fungi0160
1344446404430078Fungi0411
1354436604390087Fungi and Viridiplantae0634
1364388603472014'-phosphopantetheinyl transferase superfamily:
20
92Fungi0289
1374376604340062Eukaryota0257
1384326604060076Fungi0222
1394318703630089Fungi0301
1404298203990073Fungi0259
14142866042341PT repeat:
4
60Eukaryota0449
1424268403980083Fungi0382
1434236304210058Universal0716
1444188603750088Fungi0205
1454178303990085Fungi and Viridiplantae0316
1464168204050070Fungi0290
1474148503880082Fungi and Viridiplantae0634
1484116204070055Fungi0534
1494096304090063Eukaryota0603
1504088603420070Fungi0236
15140860140511Chromatin assembly factor 1 complex p150 subunit, N-terminal:
1
58Universal0433
1524076504050088Fungi0370
1534068603750076Fungi0157
1544056404000058Fungi0472
1554046003980074Fungi0531
1564048603670094Fungi0182
1574038303883191WD domain, G-beta repeat:
319
89Fungi0372
1583998303850075Fungi0309
15939864039351Meiosis-specific protein Mei4:
5
72Fungi and Viridiplantae0514
1603987003860079Fungi and Viridiplantae0524
1613958403560094Fungi and Viridiplantae0208
1623947301480073Fungi and SAR0222
1633948403740087Fungi0232
1643938603490071Fungi0316
1653928803780081Fungi0197
1663917103830074Fungi0406
1673908803640052Fungi and Viridiplantae0470
16838987036211Heme oxygenase:
1
78Fungi0230
1693876203830044Fungi01,052
1703868703620094Fungi0194
17138685036421SUR7/PalI family:
2
88Fungi0235
1723858503701922V-type proton ATPase subunit S1, luminal domain:
191
PF08319:
1
80Fungi0281
1733856303640055Fungi and SAR0603
1743846403810050Fungi0325
1753848603780060Fungi0782
1763828203681683Ankyrin repeats (3 copies):
139
Ankyrin repeat:
29
Ankyrin repeats (many copies):
2
86Fungi0272
17738182023462EthD domain:
4
Antibiotic biosynthesis monooxygenase:
2
95Universal0235
17838062037021Chromatin assembly factor 1 complex p150 subunit, N-terminal:
2
49Fungi0481
1793808603750071Universal0383
18037866037711AT hook motif:
1
77Fungi0167
1813748903580080Fungi0247
1823738503640078Fungi0532
1833728803640060Fungi0393
1843716403690081Fungi0252
1853718803350085Fungi0143
1863718703540076Fungi0482
1873707103680053Fungi0347
18837088033622Nitrate and nitrite sensing:
1
Putative transmembrane protein:
1
79Fungi0203
1893708703640063Fungi0395
1903698503540072Fungi0371
1913686203670077Fungi0138
1923678703600088Fungi0372
1933666003640072Eukaryota0193
1943658703370093Fungi0177
1953648503440064Fungi0694
1963648403440059Fungi0246
197364850340293Leucine Rich repeat:
20
Leucine Rich Repeat:
8
Leucine Rich Repeat:
1
82Fungi0714
1983637503610070Fungi and Viridiplantae0664
1993638703500062Fungi0451
2003598413450080Fungi and Viridiplantae0348
20135989034211Protein of unknown function (DUF724):
1
58Fungi and Viridiplantae0925
2023588603520084Fungi0296
2033578703550051Fungi and Viridiplantae0948
2043565403040053Fungi0448
2053558703450059Fungi0863
2063558803380079Fungi0674
2073558503490083Fungi0437
2083558403180053Fungi and Viridiplantae0334
2093548503530079Fungi and Viridiplantae0527
2103537203520075Eukaryota0260
2113528603320057Fungi0213
21235086032743Protein of unknown function (DUF1180):
2
Ykl077w/Psg1 (Pma1 Stabilization in Golgi):
1
Glycophorin A:
1
67Fungi and Viridiplantae0428
2133508403440088Fungi0211
2143498803460060Fungi and Viridiplantae0392
2153498703420082Fungi0215
2163476403350071Fungi and Viridiplantae0136
2173468403350057Fungi and Viridiplantae0723
2183448603360057Fungi0225
2193438703290083Fungi0520
22034185033811HSCB C-terminal oligomerisation domain:
1
68Fungi and Viridiplantae0962
2213408703310087Fungi0152
2223388703360049Fungi0434
2233388903350072Fungi0313
2243376303340072Fungi and Viridiplantae01,019
2253377303360075Fungi0194
2263358803300094Fungi0130
2273348603150058Fungi0480
2283316603280056Fungi0263
22933088032831Transient receptor potential (TRP) ion channel:
3
76Eukaryota01,157
2303308603300072Fungi and Viridiplantae0516
2313288703060071Fungi and Viridiplantae0536
2323286303250049Fungi0501
2333288603240045Fungi0992
2343278503240054Fungi and Viridiplantae0300
23532388032111F-box domain:
1
88Fungi0434
23632372032211Collagen triple helix repeat (20 copies):
1
66Fungi and Viridiplantae0463
2373196303180052Fungi0453
2383188403120060Fungi0220
2393188703080087Fungi0341
240314850305721DASH complex subunit Spc34:
72
64Fungi0508
2413136503100048Fungi0679
2423116703080051Fungi and SAR0363
2433096103080053Fungi and Viridiplantae0974
2443088502830079Fungi0153
2453078413070066Fungi and Prokaryotes0473
2463067103040057Fungi and Viridiplantae0446
2473056503040058Fungi0425
2483058802990064Fungi0155
249303860290191Glutaredoxin:
19
80Fungi and Viridiplantae0267
2503025803010065Fungi0411
2513008802920063Fungi and Viridiplantae0291
2522988402960080Fungi0628
25329665029411Porin subfamily:
1
71Fungi0372
2542956402930059Fungi0160
2552946502930056Fungi0324
2562936502930069Fungi0293
2572916502900067Fungi and SAR0235
2582918702731592Ykl077w/Psg1 (Pma1 Stabilization in Golgi):
158
Amino acid permease:
1
59Fungi0516
2592916502900054Universal0393
2602884802450089Eukaryota0262
2612888602830063Fungi0227
2622856502850044Fungi0877
26328471027831AT hook motif:
3
63Fungi and SAR0668
2642837102830062Fungi0245
26528287026911Alba:
1
80Fungi0209
2662828802770048Fungi and Viridiplantae01,109
2672818802750072Fungi0361
2682818902810080Fungi0203
26928089027222UbiA prenyltransferase family:
1
Tetraspanin family:
1
70Fungi0303
2702806602800060Fungi and Viridiplantae0468
2712798802690072Fungi and Viridiplantae0360
2722796602790064Fungi0652
2732777902760073Fungi0213
2742758502660061Fungi0370
2752748702660080Fungi0467
2762716902690051Fungi0410
2772697202690071Fungi0331
2782688302360078Fungi0402
2792657002630067Fungi0158
28026488026221Nematode cuticle collagen N-terminal domain:
2
58Fungi and Viridiplantae0568
2812629202600054Fungi and Viridiplantae0933
2822628902590077Fungi0155
2832587202580062Fungi0610
2842567502560062Fungi0824
2852556902540088Fungi0140
286254500245151F-box domain:
15
76Universal0493
28725286024622Amastin surface glycoprotein:
1
PMP-22/EMP/MP20/Claudin tight junction:
1
87Fungi0216
28825082025083EF-hand domain pair:
8
EF hand associated:
6
EF hand associated:
6
79Fungi and Viridiplantae0153
2892468502430062Fungi0306
2902438802390065Fungi0813
2912417202360068Fungi and Viridiplantae0168
2922387602380057Fungi and Viridiplantae0274
2932374101400091Fungi0253
29423678023611Dephospho-CoA kinase:
1
52Fungi0549
2952367602350068Fungi0344
2962358802330048Fungi0829
2972347902330052Fungi and Viridiplantae0628
2982324501470080Fungi and Viridiplantae0965
2992318502100071Fungi0315
300226840205171Ricin-type beta-trefoil lectin domain:
17
83Fungi0270
3012229202090071Fungi0297
3022208802180068Fungi0557
3032177802060085Fungi0205
3042148002130055Fungi01,193
3052128802070064Fungi and Viridiplantae0581
3062115302040071Fungi and Viridiplantae0179
3072114601670084Universal0552
3082106801970088Fungi and Viridiplantae0481
30920560020191Nuclear pore complex subunit Nro1:
9
85Fungi and Prokaryotes0247
3102048402040074Fungi and SAR0370
3112037101930093Fungi0285
3122016301520070Fungi0396
3131998701990044Fungi and Viridiplantae01,131
3141972601310086Fungi and opisthokonts0153
3151823201250087Fungi0124
3161748301600082Fungi0210
3171696601660076Fungi and Viridiplantae0335
3181646501600067Fungi and Viridiplantae0175
3191556701520080Fungi and Viridiplantae0219
3201513201290090Universal0210
3211516801430052Fungi0230
3221506601460075Fungi0340
3231506901440078Fungi and Prokaryotes0173
3241496501430085Fungi0197
3251475501430071Fungi and Viridiplantae0406
3261446901420082Fungi and Viridiplantae0729
3271447201420067Fungi and Viridiplantae0182
3281436901380069Fungi0105
3291427701270073Fungi0137
33014140012811KH domain:
1
81Fungi0636
3311404601390077Eukaryota0245
3321367001330067Fungi and Prokaryotes0232
3331302801120077Fungi0152
3341192701040082Fungi0123
3351183601160079Fungi0326
3361156701030064Fungi0211
337110310107142HEAT repeat:
11
HEAT repeat associated with sister chromatid cohesion:
3
83Fungi0814
3381067601060056Fungi0398
3391022501020088Fungi and opisthokonts0169

Methods

Over 18 millions proteins encoded in 1282 fungal genomes from Mycocosm were clustered into families using cascaded MMseqs2 with default parameters (Steinegger et al, 2017). Our subset of 142 clusters have the following 3 properties. Each is:

An individual family member may have manual curations retrieved from MycoCosm or functional domains not shared with the rest of its family. Families as a whole may also have similarity to distant protein families in Uniprot or Protein Data Bank (PDB), as found by pairwise HMM-based HHblits searches (Steinnegger et al, 2019) against the non-redundant Uniprot20_2016 (defined by <20% sequence identity) and PDB70 (defined by <70% sequence identity) sets of protein sequences. Such distantly related proteins are presented in the list as "hints" (‘Uniprot HMM Hint’ and ‘PDB HMM Hint’ columns).

References

  1. Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol. 2017 Nov;35(11):1026-1028. doi: 10.1038/nbt.3988. Epub 2017 Oct 16. PMID: 29035372.
  2. Steinegger M, Meier M, Mirdita M, Vöhringer H, Haunsberger SJ, Söding J. HH-suite3 for fast remote homology detection and deep protein annotation. BMC Bioinformatics. 2019 Sep 14;20(1):473. doi: 10.1186/s12859-019-3019-7. PMID: 31521110; PMCID: PMC6744700.