CmaCh05G001100 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh05G001100
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: piRNA biogenesis protein EXD1-like isoform X2
Location: Cma_Chr05: 471827 .. 477622 (+)
RNA-Seq Expression: CmaCh05G001100
Synteny: CmaCh05G001100
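
The Location field above packs the sequence ID, start, end and strand into one string. The following is a minimal sketch of how such a string could be split and the genomic span computed. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates (the GFF3 convention), which this page does not state explicitly.

```python
import re

# Hypothetical helper, not part of any database API: matches strings of the
# form "seqid: start .. end (strand)" as shown in the Location field.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into seqid, start, end, strand and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive (GFF3-style),
    # so the span covers end - start + 1 bases.
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cma_Chr05: 471827 .. 477622 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 5796 bases on the plus strand of Cma_Chr05.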
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAAAAGCACCAAGTTCAGAGAGAAAATTGAAAGATTCGCTCAAATGGCAAACACTCCTTCTCACCAGACGTATATTCCTCTGCCTTCGGACTCAGGTCACTCTCTCTTTCCCCTTTCTTGCGTTCTTGGATTGAACTTCCATTTTCATTCATGATCAATTTCATATTGTTTTCTTTGTCTATCTTTCTTCCTCTTAATGAATGCTTCGTTATTGCAAGTTCGTGGTCTATGGATTGACGTGCGATTCTTCGTAGCAAGTTGTTTTCTCTCTTATCGAAGCTTTAATTGCTGTTTGTTTGGCCAATCAGAAATCGTGGAATATACTTGACTGATGTTGGAAACTGAATTCTGATGCAAGAGAATCACAATGATGTAGTTATCAATGGTTAGAGTAGAAGAATTTGTTTGGAAACTTCAAAAGACATGTTGCTCTAAGTATAGGCCTGTATTTTCTCTATTTCCATCGTTCTTTCACCTTTTTACGTCACTTGGATTTAGTTCCAGTTGCTGCGGATAGGTTTGGTTTTCTTGATTGCTTTTGCCGCGCACGGTTGCTTGATTACTTTACCCGCGCATGTTCCATCAAGTTTTTTAGTTAGCATCTTCATTTTCGCTTCGAGACGATGGAATCTTTTCTGTGGTTGATAGTTTGCGCAATGTTTGATATAAAATAGAACTGGAGATGTTAATTCTCCTCTCGTTCTCCAGCTCCTGTCCTCCATACGATCATTTCGCTCCAAGATTTTTATCATCTTCCAATGTTTTGGTTTTCTTGATTGCTTTTTCCGCGCACAGTTGCTTGATTGCTTTACCCGCGCATGCTCCATCAAGTTTTTTAGTTAGTATGTTTGTTTTCGCTTCGAGACGATGGAATATTTTCTGTGGTTGATAGTTTGCGCACTGTTTGATATAAAATAGAACTGGAGAGGTTAATTCTCCCATCGTTCCCCAGCTCCTGTCCTTCATACGTTCATTTCGCTCCAAGATTTTTATCATCTTCCAATGAGTTCCATTTCATGCGGATTTCAGGTGAAAATCAGAACGATCCTGAAGCCAACAAAACCTTGGTTCCTATTCATATTGTTACTCATGCATCTCAACTCCCTAACGAATTTGTTGATCCATCACCTGAAAAGCCTCTGGTAGTTGGCTTTGATTGTGAAGGTGTCAGCCTGTGTTGCCATGGAAGTCTTTGTATCATGCAGGTAAATAAAATGCTAAGAACAAATCGCAGTTTTACCATCCTTGTGGAGAGTGAAATTGTATGGCAGAGGCAGATGTGCTAGCTAACCTTCCCAACTGAGGGAGTCAATGGTTCTAACTTATAATTTCATTCTTTTAGTCATCTGGTGATACTGTCAATTCATAGTCCAAGCCGCTTTCTTGCAAGAAAACAGCTAATATTATAGCTTAAGAAATTTAATTAGCGCTCCTGGATAGTCCTCGACTTGTAGCTATAGTAACTCATCCTTTAAATTACAGGGGTTGATTACTGTCAAAGGCTTGGACATTCAAGAATGTTCAGTTTCTACTTTACTAAAAAACAGTGATTCAAATTAGTTGACATTATCTATAAATCTGTGTTGTTCATTTCTTGGCAGATTGCATTTCCAGATGCTATATATCTGGTTGATGCTGTTCAGGGAGGAGAGGAACTTGTGAAAGTCTGTAAGCCTGCCCTTGAGTCCACATATGTCACGAAAGTTATTCATGATTGTAAACGAGATAGTGAGGTCTCGTCTGAACTTTGTTTCAGTAACAGATTGCCGTCACAGAGACATGATATGTTTTAGCTAATGTTGACTCTATTTTTTTCCCTGTAGGCATTGTACTTTCAGTTTGGTATTAAGTTGAACAACGTTATTGATACACAGGTGATTATCAATTGTGAGAAGCCTGACACCTCTTTACTTGTTTACACCCTTTTCTCACGCATTATATTTTATCTTATTTTAGAGTGCTTAGCCACTTAGTGTGCCAATTTCATGTTACTAGAACAT
AAGAATATTTTTTGTGCTTTTCTTCTTCTATGTTTTCTTGGATAGAAAATGAAACTATCATATCAGGAAGATGGATTCTACTATTCCACTGATCGTTGTTCCATTATTTCTTATATAAGTTCCTAGAAGTTTTCATGATGTTAATTTTTCACATCTACTTTGCACTTGTACGATTTGAATCTACAGTCTGCCAGTATTTTTATAAATTTATAACTCTACGTACTCATTTTAAAGGTTTATGTAAATTTTCTCTCAATGAGCTCTAATCTTAACTTACTATGGATTATTAACTGATTGAAATCTGCTATCTGATAATTTTTAAGGTTGAGAGTCTGCATGTCTTCAAAAAGGAAAAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTGGTTATCTTGTGACACAAATGCCAATAAAGAGTATGATTTTCTTGATTTAAAAGAATGATCTGAACAATTCATTTTAAGTTTGAGTTACCTTGTAATTACTTTGGAGCCAAGTGTTTGTATGTGTAAGCTGCAATATGGAGGAAGGCTGAGTTATTTGGACCTTCCTGCTTTAATTTTCAATTGTATTACTTGATTCAGATTGCATATTCACTTATAGAGGAGCAAGAGGGATGGACAAAGACGCCAGATAACTATATCTCCTTTGTTGGTCTTCTTGCAGATTCACGTTATTGTGGTATTATACTAATCCTGGTATCTAGATATTCTCCATTCTTTATCATGGAATTTCATGCATTTTTCTTTTTCTTCTAAATATCTCGGATAAAATTACCTATACTCACTCAAAAATATGAACTGATATTTCTTATGAAACTTCAAAAGCAGCAAAGCTTGATTTATTGCTTGCCATCTTCATGTTAAGAAGCTCCATACTTACTACTTATTTTGAGGGTCGTGTTTCCATGAGCTGCATGGTTGGGGATGGTTAATTAACCAGAACCTCCTGCCTGAAAAGGAGAAAAAAAAGTATAAATTTAAATTTGGAAGCCATCACATGACTAATCCCTAGCGGTACGGTGGAGGGACAATAATTGCACTTGGTTAATTGGATCTGTGTCTCTAAGCGAAATTAATCGAGCTTGTGGGTGCTATAATAACATCAACATAAAGAACTTGATTTTAATTGGAAAGGGAGCTTCATAACACTTCTCGCCCCTTTCCATCTCAAAATATCGAGCTCTTAATTTCATAGACCAAGTACAAGCTTGAGAATTGTGATAAGATTATAATTGGAAACGTTATGTGGATTGCGCATTCATCTCTCGCATACTTTGTTTCTCATTCTTAATTTTGTGTAATGTTACAATTCTGGTACCAAGGAAATCAGAATGACCGACTAGGAGCCTAATTTTAGAGTCTAAAGGCATCGTACATTGATTGTGTACACGTAATGTTTCTCATTCTTCATTTTAGAGTCTTTGTAAGCACGAGGAGGTTAAGTAAAGTATAACCAAGGGGTTCAGTGCTCATGGGGCTGTGTAAGAGAATTGCATTTCATGCATTCTCCATTCAAAACTGTTGCTATTCGTGCTTGTGAAATGAGAGATTCTTGTTCTCATAAAGATGGCCGTATATTTTTCATTCCTCTTGGTTGCCGTTGTAATACCCTTTGTTTCATTTTTCTTTCTTTCCCTTGTTTGTTGCCTGGAAACGTTTCCTTCCTGGAGAAATCTACGTGAGCTTTTTTTTTTTTTCTTTTTTCTTGTTACTGAAAATGTTGGTGGATGTCGCTTGGATCTAAATTGTTTGGCATGCTAATTTAATTATCAGAATTTTCTGGAAGGAATAATCTAGATTCTACGTCTAAGTTCTTGTGGTCAGTGGAGAAGTTGACAAGCTTTTGTTATATGAATGCATTTCTTTATTATTTTAAAACTTAATGCTTTTTGCTGCTAATGCTTACATTTTTACTATATATTGAATTAATATGTAATCCTCGCTCATAAATGTAGGTGTATCATATGTGGAGAAGGAAGAGGTCCGCCTC
CTACTTAGGCAGGTTAGTTTTTTCATGGTCGTATTGCTGTGGATATTGTTGTATCTTGGCAAGAAGTGTTCTATATATTAAGCATTGTTTTTCGTTATGAGGTATTTTATAGGTCAGATGTGCTTGATTCCTAAATGTAGTATCCTTCTTTTGGAGTAGACAAGTATAAATATAATGTAGAGACAATTCAAGTCCATTGAGCTAACAAAAAGGACATGGAAACTAAGATTATTAGTCTGCAGGTACTGTACTTATGTCCATAATATGATAAGATTCAAAATGTTATCTCGTCATAAAATTATATGCAGGTATCGTGTAATTGTTCCACAAGTTTATGTATCTTTGCATAAAAACTAAATTCTAAATTTGTATTTGACAGGACCCGAAGTTTTGGACCTATAGACCATTGTCTGAATTGATGGTCCGTGCAGCTGCTGATGATGTACGCTTTCTGCTTTACATCTATCACAAGTTGATGGAGAAATTGAATCATCAATCGCTGTGGTACCTCGCAGTGCGTGGTGCTTTGTACTGCCGGTGTTTCTGCATCAGTGATAATGGATATGCTGACTGGCCTCCCCTCCCTTCTGTTCCAGGTTTTCTTTAAGAACCCACCTTACCATGTGCAGTCCATATAATGAGAATAATTCCAAATTTAACGTTCTAATATGCTAAGTTTATTAAACTATCATTGCAAACTTTTCAACATAATATGATTCTGTGGTGTTTCAATATTGTTGGCTTCTTTTATGTTATGATTTCACAATTGATTGGGTTTAAAATTTAAAGTGGCTATTTAGCAATGTGAGAAATGTTGGCTCACCATCTTTCATTCAATTATCCCAATGCTGCTGGAATCTGCAAAACATCAAATATGTTAAAATTTATAATGTTGTATTATCTAATTATTATTGACACTTGGTATGTGGTTTCTATCCTTCGGGTTCTTAGACTACGTTTCATTTCAAATCTTATTTTTTTCATTTGATGCGTCTAAAACCTTTTGTTACATGGTCTCAATTGATCACCTTTATTGGTCTTACATGTTTTTTCAGATAACCTTGTAGTAGAGGGCAAAGCTCCTGAGGAAGAAATTCTTTCAGTCTTAGACATTCCCCGTGGAATGATGGGTCGCGTAATTGGTAGGAGAGGAGCCTCAATTTTGGCAATAAAGGAATCTTGCAAGTAATTCATTATCGACAAATCCCTTTCGACTTCATTTATATCCTTTAGGCTAAAAAAATACCATAGCCGTTGTTTTTTCTCACCTTGGGTTCTTAAAATGCAGGGCGGAAATTCTCGTTGGCGGCGCTAAGGGCCCACCGGACAAGGTAAGTCTCTCTTTTCATTCACTTCAAAACAGACCACTATGAAGCCCAAAATCCTGTTGAATGTTGATGGATATTGAAAGTTGTTCTGTCAAGAAGATGAATGTAATTGTGGTGATGTAGGTTTTCGTCATTGGACCCGTGAGGCAGGTAAGGAAGGCAGAAGCTATGATACGAGGAAGCTTGCTTGAAATATGATTGAATGAATATGAAAACCGTGTTGTGATGAACATGAACGCAAAATCCTCAATATTAAGAATACAGTTTCAGCTGCTAATACTAGTAGATCCCGTATGGAGGGGATTACTTTACATACTTCACAATCTTTTGTATAGGTACAGTAGCTTTGAGAAGCAAATTTTATGTGAAAAATATTATCAAATCAAAATATGAAAATTGAAATAAAACTTTGTTTTTACCTTCAAACTTAATAAATCTAAAATAAATTCAAACCATTAATTCACCTGG

mRNA sequence

TAAAAGCACCAAGTTCAGAGAGAAAATTGAAAGATTCGCTCAAATGGCAAACACTCCTTCTCACCAGACGTATATTCCTCTGCCTTCGGACTCAGGTGAAAATCAGAACGATCCTGAAGCCAACAAAACCTTGGTTCCTATTCATATTGTTACTCATGCATCTCAACTCCCTAACGAATTTGTTGATCCATCACCTGAAAAGCCTCTGGTAGTTGGCTTTGATTGTGAAGGTGTCAGCCTGTGTTGCCATGGAAGTCTTTGTATCATGCAGATTGCATTTCCAGATGCTATATATCTGGTTGATGCTGTTCAGGGAGGAGAGGAACTTGTGAAAGTCTGTAAGCCTGCCCTTGAGTCCACATATGTCACGAAAGTTATTCATGATTGTAAACGAGATAGTGAGGCATTGTACTTTCAGTTTGGTATTAAGTTGAACAACGTTATTGATACACAGATTGCATATTCACTTATAGAGGAGCAAGAGGGATGGACAAAGACGCCAGATAACTATATCTCCTTTGTTGGTCTTCTTGCAGATTCACGTTATTGTGGTGTATCATATGTGGAGAAGGAAGAGGTCCGCCTCCTACTTAGGCAGGACCCGAAGTTTTGGACCTATAGACCATTGTCTGAATTGATGGTCCGTGCAGCTGCTGATGATGTACGCTTTCTGCTTTACATCTATCACAAGTTGATGGAGAAATTGAATCATCAATCGCTGTGGTACCTCGCAGTGCGTGGTGCTTTGTACTGCCGGTGTTTCTGCATCAGTGATAATGGATATGCTGACTGGCCTCCCCTCCCTTCTGTTCCAGATAACCTTGTAGTAGAGGGCAAAGCTCCTGAGGAAGAAATTCTTTCAGTCTTAGACATTCCCCGTGGAATGATGGGTCGCGTAATTGGTAGGAGAGGAGCCTCAATTTTGGCAATAAAGGAATCTTGCAAGGCGGAAATTCTCGTTGGCGGCGCTAAGGGCCCACCGGACAAGGTTTTCGTCATTGGACCCGTGAGGCAGGTAAGGAAGGCAGAAGCTATGATACGAGGAAGCTTGCTTGAAATATGATTGAATGAATATGAAAACCGTGTTGTGATGAACATGAACGCAAAATCCTCAATATTAAGAATACAGTTTCAGCTGCTAATACTAGTAGATCCCGTATGGAGGGGATTACTTTACATACTTCACAATCTTTTGTATAGGTACAGTAGCTTTGAGAAGCAAATTTTATGTGAAAAATATTATCAAATCAAAATATGAAAATTGAAATAAAACTTTGTTTTTACCTTCAAACTTAATAAATCTAAAATAAATTCAAACCATTAATTCACCTGG

Coding sequence (CDS)

ATGGCAAACACTCCTTCTCACCAGACGTATATTCCTCTGCCTTCGGACTCAGGTGAAAATCAGAACGATCCTGAAGCCAACAAAACCTTGGTTCCTATTCATATTGTTACTCATGCATCTCAACTCCCTAACGAATTTGTTGATCCATCACCTGAAAAGCCTCTGGTAGTTGGCTTTGATTGTGAAGGTGTCAGCCTGTGTTGCCATGGAAGTCTTTGTATCATGCAGATTGCATTTCCAGATGCTATATATCTGGTTGATGCTGTTCAGGGAGGAGAGGAACTTGTGAAAGTCTGTAAGCCTGCCCTTGAGTCCACATATGTCACGAAAGTTATTCATGATTGTAAACGAGATAGTGAGGCATTGTACTTTCAGTTTGGTATTAAGTTGAACAACGTTATTGATACACAGATTGCATATTCACTTATAGAGGAGCAAGAGGGATGGACAAAGACGCCAGATAACTATATCTCCTTTGTTGGTCTTCTTGCAGATTCACGTTATTGTGGTGTATCATATGTGGAGAAGGAAGAGGTCCGCCTCCTACTTAGGCAGGACCCGAAGTTTTGGACCTATAGACCATTGTCTGAATTGATGGTCCGTGCAGCTGCTGATGATGTACGCTTTCTGCTTTACATCTATCACAAGTTGATGGAGAAATTGAATCATCAATCGCTGTGGTACCTCGCAGTGCGTGGTGCTTTGTACTGCCGGTGTTTCTGCATCAGTGATAATGGATATGCTGACTGGCCTCCCCTCCCTTCTGTTCCAGATAACCTTGTAGTAGAGGGCAAAGCTCCTGAGGAAGAAATTCTTTCAGTCTTAGACATTCCCCGTGGAATGATGGGTCGCGTAATTGGTAGGAGAGGAGCCTCAATTTTGGCAATAAAGGAATCTTGCAAGGCGGAAATTCTCGTTGGCGGCGCTAAGGGCCCACCGGACAAGGTTTTCGTCATTGGACCCGTGAGGCAGGTAAGGAAGGCAGAAGCTATGATACGAGGAAGCTTGCTTGAAATATGA

Protein sequence

MANTPSHQTYIPLPSDSGENQNDPEANKTLVPIHIVTHASQLPNEFVDPSPEKPLVVGFDCEGVSLCCHGSLCIMQIAFPDAIYLVDAVQGGEELVKVCKPALESTYVTKVIHDCKRDSEALYFQFGIKLNNVIDTQIAYSLIEEQEGWTKTPDNYISFVGLLADSRYCGVSYVEKEEVRLLLRQDPKFWTYRPLSELMVRAAADDVRFLLYIYHKLMEKLNHQSLWYLAVRGALYCRCFCISDNGYADWPPLPSVPDNLVVEGKAPEEEILSVLDIPRGMMGRVIGRRGASILAIKESCKAEILVGGAKGPPDKVFVIGPVRQVRKAEAMIRGSLLEI
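
The protein sequence above corresponds to the CDS read codon by codon. As a quick consistency check, the sketch below translates a nucleotide string with the standard genetic code (NCBI translation table 1). The function name `translate` is illustrative, and the use of the standard table for this nuclear gene is an assumption; the page does not say which table was used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 bases and last 12 bases of the CDS listed above.
print(translate("ATGGCAAACACTCCTTCTCACCAGACG"))  # MANTPSHQT
print(translate("CTTGAAATATGA"))                 # LEI
```

Both fragments agree with the start (MANTPSHQT) and end (LEI) of the listed protein sequence. Passing the full CDS string to `translate` would check the whole sequence the same way.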
Homology
BLAST of CmaCh05G001100 vs. ExPASy Swiss-Prot
Match: Q8NHP7 (piRNA biogenesis protein EXD1 OS=Homo sapiens OX=9606 GN=EXD1 PE=1 SV=4)

HSP 1 Score: 84.0 bits (206), Expect = 3.7e-15
Identity = 52/172 (30.23%), Positives = 87/172 (50.58%), Query Frame = 0

Query: 52  EKPLVVGFDCEGVSLCCHGSLCIMQIAFPDAIYLVDA-VQGGEELVKVCKPALESTYVTK 111
           +K  V+    EG ++C HG LC +Q+A    +YL D  + G        +  LE   + K
Sbjct: 96  KKQNVLSVAAEGANVCRHGKLCWLQVATNCRVYLFDIFLLGSRAFHNGLQMILEDKRILK 155

Query: 112 VIHDCKRDSEALYFQFGIKLNNVIDTQIAYSLIEEQEGWTKTPDNYISFVGLLADSRYCG 171
           VIHDC+  S+ L  Q+GI LNNV DTQ+A  L    E     P+   +    L       
Sbjct: 156 VIHDCRWLSDCLSHQYGILLNNVFDTQVADVLQFSMETGGYLPNCITTLQESLIKHLQVA 215

Query: 172 VSYVE-KEEVRLLLRQDPKFWTYRPLSELMVRAAADDVRFLLYIYHKLMEKL 222
             Y+   E+ + L++++P+ W  RP+S  +++  A +  +LL +   L++++
Sbjct: 216 PKYLSFLEKRQKLIQENPEVWFIRPVSPSLLKILALEATYLLPLRLALLDEM 267
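
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length (gaps included). A minimal sketch of that arithmetic, using the counts from the hit above; the function name `percent` is illustrative only.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * count / aligned_length, 2)


# Counts taken from the HSP header for Q8NHP7: 52 identical and
# 87 positive-scoring positions over an alignment of 172 columns.
print(percent(52, 172))  # 30.23
print(percent(87, 172))  # 50.58
```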

BLAST of CmaCh05G001100 vs. ExPASy Swiss-Prot
Match: Q8CDF7 (piRNA biogenesis protein EXD1 OS=Mus musculus OX=10090 GN=Exd1 PE=1 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 2.4e-14
Identity = 54/173 (31.21%), Positives = 87/173 (50.29%), Query Frame = 0

Query: 52  EKPLVVGFDCEGVSLCCHGSLCIMQIAFPDAIYLVDA-VQGGEELVKVCKPALESTYVTK 111
           +K  V+    EG ++C HG LC +Q+A    +YL D  + G        +  LE   + K
Sbjct: 153 KKQSVLSVAAEGANVCRHGKLCWLQVATNSRVYLFDIFLLGSRAFNNGLQMILEDKRILK 212

Query: 112 VIHDCKRDSEALYFQFGIKLNNVIDTQIAYSLIEEQEGWTKTPDNYISFV--GLLADSRY 171
           VIHDC+  S+ L  Q+GI LNNV DTQ+A  L    E     P N IS +   L+   + 
Sbjct: 213 VIHDCRWLSDCLSHQYGIMLNNVFDTQVADVLQFSMETGGFLP-NCISTLQESLIRHLKV 272

Query: 172 CGVSYVEKEEVRLLLRQDPKFWTYRPLSELMVRAAADDVRFLLYIYHKLMEKL 222
                   EE +  ++++P+ W  RPL   +++  A +  +LL +   L++++
Sbjct: 273 APRYLFFLEERQKRIQENPEIWLTRPLPPSLLKILALETTYLLPLRLVLLDEV 324

BLAST of CmaCh05G001100 vs. ExPASy Swiss-Prot
Match: Q6NRD5 (piRNA biogenesis protein EXD1 OS=Xenopus laevis OX=8355 GN=exd1 PE=2 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 2.3e-09
Identity = 46/173 (26.59%), Positives = 84/173 (48.55%), Query Frame = 0

Query: 56  VVGFDCEGVSLCCHGSLCIMQIAFPDAIYLVDAVQGGEELVK-VCKPALESTYVTKVIHD 115
           V+     G ++C HG L  +Q A    +YL D +  G ++ K   +  LE   + KVIHD
Sbjct: 122 VISIGAVGQNICRHGKLSWLQFATRSRVYLFDVLVLGSKVFKNGLQMVLEDKGILKVIHD 181

Query: 116 CKRDSEALYFQFGIKLNNVIDTQI--AYSLIEEQEGW----TKTPDNYISFVGLLADSRY 175
           C+   + L  Q+GI LNNV DTQ+   Y    E  G+    T+T +  +     +  S+ 
Sbjct: 182 CRWLGDILSHQYGIILNNVFDTQVGDVYLFSMETGGFLPHGTRTLEECLIHHLSMLPSK- 241

Query: 176 CGVSYVEKEEVRLLLRQDPKFWTYRPLSELMVRAAADDVRFLLYIYHKLMEKL 222
             VS++   +   L ++    W  RP+   +++  + +V +L+ +   +++ +
Sbjct: 242 --VSFLAHRQT--LTKEYHDIWFDRPMDPTLLKLLSLEVTYLMPLRSAMLDAM 289

BLAST of CmaCh05G001100 vs. ExPASy Swiss-Prot
Match: Q0P3U3 (piRNA biogenesis protein EXD1 OS=Danio rerio OX=7955 GN=exd1 PE=2 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 5.7e-08
Identity = 45/170 (26.47%), Positives = 76/170 (44.71%), Query Frame = 0

Query: 56  VVGFDCEGVSLCCHGSLCIMQIAFPDAIYLVD-AVQGGEELVKVCKPALESTYVTKVIHD 115
           V+G   +         LC +Q+A    +YL D  + GG          LE+T++ KV+HD
Sbjct: 136 VIGIGADVYGQSGQERLCWLQVATKKVVYLFDILLLGGPAFKNGLSMILENTHILKVLHD 195

Query: 116 CKRDSEALYFQFGIKLNNVIDTQIAYSLIEEQEGWTKTPDNYISFVGLLADSRYCGVSYV 175
           C+  +  L  +F ++L NV DTQ+A  L+   E     PD   S   LL    +  ++  
Sbjct: 196 CRCITRCLRTEFRVQLTNVFDTQVAELLLFFNESGGFLPDRPASLPELL--QLHLRLTTA 255

Query: 176 EKEEV---RLLLRQDPKFWTYRPLSELMVRAAADDVRFLLYIYHKLMEKL 222
           E + +   +   R+  + W  RP    ++      V+ LL +   L++ L
Sbjct: 256 EIQPLCSKQQQSRECVQLWYVRPCPPDLLSLMCSSVQHLLSLRLLLLDAL 303

BLAST of CmaCh05G001100 vs. ExPASy Swiss-Prot
Match: P56960 (Exosome component 10 OS=Mus musculus OX=10090 GN=Exosc10 PE=1 SV=2)

HSP 1 Score: 58.5 bits (140), Expect = 1.7e-07
Identity = 45/159 (28.30%), Positives = 71/159 (44.65%), Query Frame = 0

Query: 70  GSLCIMQIAFPDAIYLVDAVQGGEELVKVCKPALESTYVTKVIHDCKRDSEALYFQFGIK 129
           G  C+MQI+     ++VD ++   ++  +   +L    + KV H    D E L   FG+ 
Sbjct: 324 GLTCLMQISTRTEDFIVDTLELRSDMY-ILNESLTDPAIVKVFHGADSDIEWLQKDFGLY 383

Query: 130 LNNVIDTQIAYSLIEEQEGWTKTPDNYISFVGLLADSRYCGVSYVEKEEVRLLLRQDPKF 189
           + N+ DT  A  L+        + D+ +          YCGV   ++ ++          
Sbjct: 384 VVNMFDTHQAARLLNLAR---HSLDHLLRL--------YCGVESNKQYQL--------AD 443

Query: 190 WTYRPLSELMVRAAADDVRFLLYIYHK----LMEKLNHQ 225
           W  RPL E M+  A DD  +LLYIY +    L E+ NHQ
Sbjct: 444 WRIRPLPEEMLSYARDDTHYLLYIYDRMRLELWERGNHQ 462

BLAST of CmaCh05G001100 vs. TAIR 10
Match: AT2G25910.1 (3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein )

HSP 1 Score: 508.1 bits (1307), Expect = 5.6e-144
Identity = 236/338 (69.82%), Positives = 288/338 (85.21%), Query Frame = 0

Query: 2   ANTPSHQTYIPLPSDSGENQNDPEANKTLVPIHIVTHASQLPNEFVDPSPEKPLVVGFDC 61
           +++P+   ++P+P + G      EAN+  VPI+IVT   QLP +F++PSPEK LV+GFDC
Sbjct: 3   SSSPTQLAHVPIPPEPGGRSPTQEANEPPVPIYIVTDPFQLPADFLNPSPEKKLVIGFDC 62

Query: 62  EGVSLCCHGSLCIMQIAFPDAIYLVDAVQGGEELVKVCKPALESTYVTKVIHDCKRDSEA 121
           EGV LC HG LCIMQIAF +AIYLVD ++GGE ++K CKPALES Y+TKVIHDCKRDSEA
Sbjct: 63  EGVDLCRHGKLCIMQIAFSNAIYLVDVIEGGEVIMKACKPALESNYITKVIHDCKRDSEA 122

Query: 122 LYFQFGIKLNNVIDTQIAYSLIEEQEGWTKTPDNYISFVGLLADSRYCGVSYVEKEEVRL 181
           LYFQFGI+L+NV+DTQIAYSLIEEQEG  +  D+YISFV LLAD RYCG+SY EKEEVR+
Sbjct: 123 LYFQFGIRLHNVVDTQIAYSLIEEQEGRRRPLDDYISFVSLLADPRYCGISYEEKEEVRV 182

Query: 182 LLRQDPKFWTYRPLSELMVRAAADDVRFLLYIYHKLMEKLNHQSLWYLAVRGALYCRCF- 241
           L+RQDPKFWTYRP++ELM+RAAADDVRFLLY+YHK+M KLN +SLW+LAVRGALYCRC  
Sbjct: 183 LMRQDPKFWTYRPMTELMIRAAADDVRFLLYLYHKMMGKLNQRSLWHLAVRGALYCRCLC 242

Query: 242 CISDNGYADWPPLPSVPDNLVVEGKAPEEEILSVLDIPRGMMGRVIGRRGASILAIKESC 301
           C++D  +ADWP +P +PDNL  E +  EEEILSVLD+P G MGRVIGR+GASILAIKE+C
Sbjct: 243 CMNDADFADWPTVPPIPDNLKSEDQCLEEEILSVLDVPPGKMGRVIGRKGASILAIKEAC 302

Query: 302 KAEILVGGAKGPPDKVFVIGPVRQVRKAEAMIRGSLLE 339
            AEIL+GGAKGPPDK+FVIGPVR+VRKAEA++RG +++
Sbjct: 303 NAEILIGGAKGPPDKIFVIGPVREVRKAEAILRGRMID 340

BLAST of CmaCh05G001100 vs. TAIR 10
Match: AT2G25910.2 (3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein )

HSP 1 Score: 503.4 bits (1295), Expect = 1.4e-142
Identity = 236/339 (69.62%), Positives = 288/339 (84.96%), Query Frame = 0

Query: 2   ANTPSHQTYIPLPSDSGENQNDPEANKTLVPIHIVTHASQLPNEFVDPSPEKPLVVGFDC 61
           +++P+   ++P+P + G      EAN+  VPI+IVT   QLP +F++PSPEK LV+GFDC
Sbjct: 3   SSSPTQLAHVPIPPEPGGRSPTQEANEPPVPIYIVTDPFQLPADFLNPSPEKKLVIGFDC 62

Query: 62  EGVSLCCHGSLCIMQIAFPDAIYLVDAVQGGEELVKVCKPALESTYVTKVIHDCKRDSEA 121
           EGV LC HG LCIMQIAF +AIYLVD ++GGE ++K CKPALES Y+TKVIHDCKRDSEA
Sbjct: 63  EGVDLCRHGKLCIMQIAFSNAIYLVDVIEGGEVIMKACKPALESNYITKVIHDCKRDSEA 122

Query: 122 LYFQFGIKLNNVIDTQIAYSLIEEQEGWTKTPDNYISFVGLLADSRYCGVSYVEKEEVRL 181
           LYFQFGI+L+NV+DTQIAYSLIEEQEG  +  D+YISFV LLAD RYCG+SY EKEEVR+
Sbjct: 123 LYFQFGIRLHNVVDTQIAYSLIEEQEGRRRPLDDYISFVSLLADPRYCGISYEEKEEVRV 182

Query: 182 LLRQDPKFWTYRPLSELMVRAAADDVRFLLYIYHKLMEKLNHQSLWYLAVRGALYCRCF- 241
           L+RQDPKFWTYRP++ELM+RAAADDVRFLLY+YHK+M KLN +SLW+LAVRGALYCRC  
Sbjct: 183 LMRQDPKFWTYRPMTELMIRAAADDVRFLLYLYHKMMGKLNQRSLWHLAVRGALYCRCLC 242

Query: 242 CISDNGYADWPPLPSVPDNLVVEGKAPEEEILSVLDIPRGMMGRVIGRRGASILAIKESC 301
           C++D  +ADWP +P +PDNL  E +  EEEILSVLD+P G MGRVIGR+GASILAIKE+C
Sbjct: 243 CMNDADFADWPTVPPIPDNLKSEDQCLEEEILSVLDVPPGKMGRVIGRKGASILAIKEAC 302

Query: 302 -KAEILVGGAKGPPDKVFVIGPVRQVRKAEAMIRGSLLE 339
             AEIL+GGAKGPPDK+FVIGPVR+VRKAEA++RG +++
Sbjct: 303 NSAEILIGGAKGPPDKIFVIGPVREVRKAEAILRGRMID 341

BLAST of CmaCh05G001100 vs. TAIR 10
Match: AT1G54440.1 (Polynucleotidyl transferase, ribonuclease H fold protein with HRDC domain )

HSP 1 Score: 57.8 bits (138), Expect = 2.0e-08
Identity = 45/153 (29.41%), Positives = 67/153 (43.79%), Query Frame = 0

Query: 70  GSLCIMQIAFPDAIYLVDAVQGGEELVKVCKPALESTYVTKVIHDCKRDSEALYFQFGIK 129
           G  C+MQI+     Y+VD  +  + +    +   +     KVIH   RD   L   FGI 
Sbjct: 151 GLTCLMQISTRTEDYIVDIFKLWDHIGPYLRELFKDPKKKKVIHGADRDIIWLQRDFGIY 210

Query: 130 LNNVIDTQIAYSLIEEQEGWTKTPDNYISFVGLLADSRYCGVSYVEKEEVRLLLRQDPKF 189
           + N+ DT  A  ++       K   N + F+       YCGV+   KE            
Sbjct: 211 VCNLFDTGQASRVL-------KLERNSLEFL----LKHYCGVA-ANKE-------YQKAD 270

Query: 190 WTYRPLSELMVRAAADDVRFLLYIYHKLMEKLN 223
           W  RPL ++M R A +D  +LLYIY  +  +L+
Sbjct: 271 WRIRPLPDVMKRYAREDTHYLLYIYDVMRMELH 284

BLAST of CmaCh05G001100 vs. TAIR 10
Match: AT1G54440.2 (Polynucleotidyl transferase, ribonuclease H fold protein with HRDC domain )

HSP 1 Score: 57.8 bits (138), Expect = 2.0e-08
Identity = 45/153 (29.41%), Positives = 67/153 (43.79%), Query Frame = 0

Query: 70  GSLCIMQIAFPDAIYLVDAVQGGEELVKVCKPALESTYVTKVIHDCKRDSEALYFQFGIK 129
           G  C+MQI+     Y+VD  +  + +    +   +     KVIH   RD   L   FGI 
Sbjct: 151 GLTCLMQISTRTEDYIVDIFKLWDHIGPYLRELFKDPKKKKVIHGADRDIIWLQRDFGIY 210

Query: 130 LNNVIDTQIAYSLIEEQEGWTKTPDNYISFVGLLADSRYCGVSYVEKEEVRLLLRQDPKF 189
           + N+ DT  A  ++       K   N + F+       YCGV+   KE            
Sbjct: 211 VCNLFDTGQASRVL-------KLERNSLEFL----LKHYCGVA-ANKE-------YQKAD 270

Query: 190 WTYRPLSELMVRAAADDVRFLLYIYHKLMEKLN 223
           W  RPL ++M R A +D  +LLYIY  +  +L+
Sbjct: 271 WRIRPLPDVMKRYAREDTHYLLYIYDVMRMELH 284

The following BLAST results are available for this feature:
Match Name   E-value    Identity  Description
Q8NHP7       3.7e-15    30.23     piRNA biogenesis protein EXD1 OS=Homo sapiens OX=9606 GN=EXD1 PE=1 SV=4
Q8CDF7       2.4e-14    31.21     piRNA biogenesis protein EXD1 OS=Mus musculus OX=10090 GN=Exd1 PE=1 SV=1
Q6NRD5       2.3e-09    26.59     piRNA biogenesis protein EXD1 OS=Xenopus laevis OX=8355 GN=exd1 PE=2 SV=1
Q0P3U3       5.7e-08    26.47     piRNA biogenesis protein EXD1 OS=Danio rerio OX=7955 GN=exd1 PE=2 SV=1
P56960       1.7e-07    28.30     Exosome component 10 OS=Mus musculus OX=10090 GN=Exosc10 PE=1 SV=2

Match Name   E-value    Identity  Description
AT2G25910.1  5.6e-144   69.82     3'-5' exonuclease domain-containing protein / K homology domain-containing prote...
AT2G25910.2  1.4e-142   69.62     3'-5' exonuclease domain-containing protein / K homology domain-containing prote...
AT1G54440.1  2.0e-08    29.41     Polynucleotidyl transferase, ribonuclease H fold protein with HRDC domain
AT1G54440.2  2.0e-08    29.41     Polynucleotidyl transferase, ribonuclease H fold protein with HRDC domain
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term    Source Description                            Alignment
IPR002562  3'-5' exonuclease domain               SMART        SM00474        35exoneu6                                     coord: 33..222; e-value: 4.8E-25; score: 99.2
IPR002562  3'-5' exonuclease domain               PFAM         PF01612        DNA_pol_A_exo1                                coord: 45..221; e-value: 1.3E-23; score: 83.7
IPR004087  K Homology domain                      SMART        SM00322        kh_6                                          coord: 269..337; e-value: 1.1E-7; score: 41.5
IPR036612  K Homology domain, type 1 superfamily  GENE3D       3.30.1370.10   K Homology domain, type 1                     coord: 260..336; e-value: 3.1E-10; score: 41.5
IPR036612  K Homology domain, type 1 superfamily  SUPERFAMILY  54791          Eukaryotic type KH-domain (KH-domain type I)  coord: 267..336
IPR004088  K Homology domain, type 1              PFAM         PF00013        KH_1                                          coord: 275..333; e-value: 1.7E-9; score: 37.4
IPR036397  Ribonuclease H superfamily             GENE3D       3.30.420.10    -                                             coord: 3..236; e-value: 8.5E-41; score: 142.1
None       No IPR available                       MOBIDB_LITE  mobidb-lite    disorder_prediction                           coord: 1..25
None       No IPR available                       PANTHER      PTHR46814:SF4  SUBFAMILY NOT NAMED                           coord: 1..336
None       No IPR available                       PANTHER      PTHR46814      EGALITARIAN, ISOFORM B                        coord: 1..336
None       No IPR available                       PROSITE      PS50084        KH_TYPE_1                                     coord: 270..332; score: 11.552393
None       No IPR available                       CDD          cd06148        Egl_like_exo                                  coord: 46..241; e-value: 1.42448E-76; score: 231.79
None       No IPR available                       CDD          cd00105        KH-I                                          coord: 275..332; e-value: 4.59771E-9; score: 50.2511
IPR012337  Ribonuclease H-like superfamily        SUPERFAMILY  53098          Ribonuclease H-like                           coord: 27..243

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh05G001100.1  CmaCh05G001100.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
biological_process  GO:0006139      nucleobase-containing compound metabolic process
molecular_function  GO:0008408      3'-5' exonuclease activity
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0003676      nucleic acid binding