CmaCh00G002740 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh00G002740
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Gag/pol protein
Location: Cma_Chr00: 22283196 .. 22290830 (+)
RNA-Seq Expression: CmaCh00G002740
Synteny: CmaCh00G002740
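The Location field can be split into sequence ID, start, end, and strand to derive the length of the gene span. The following is a minimal sketch; the helper name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates (the GFF convention), which this page does not state explicitly.

```python
import re

# Location string copied from the Overview above.
LOCATION = "Cma_Chr00: 22283196 .. 22290830 (+)"

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its four parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location(LOCATION)

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end - start + 1
print(seqid, strand, gene_length)
```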
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTAATCTTCCGAAGAGAAAGGAATCTCAAAACCCGGTACAAATCATCGAACATGACGATGGAAGCGTTGAAATACAATTCAACGAAGAGCCTTCGTCTGAATCAAAAGTAAGAGAATTCTTGAGTTCTAGACCAAGTACGTCGGGAGTTTCAAGATCAATGTATGACCCCTTAAAGGTAAAAGAAGTCAGTTATGATAAAAAAAGTGCATCAATTCACTATGAAGATGGCTCAAGATCTCCAACCCATACAGATATGGATACTCAATCCGTCTACGAAAGTCAGCTAAATGTTATTACAGCTAATTTCCAGATAGACAAAGAATTCTTGAAAACAGATTTTTTGTCAGTTGTCAACTCTTCAAAGAGAAAAGCTTTCTTCCAAACTTACAAAGAAGTTGAGCGAAGTGAATTAAGAGCTCAGTGGTATTCTCATATGGAAACCGTAGAGGAAAACATACCGTTTTGAGCGAAGTGAATTAAGAGCTCAGTGGTATTCTCATATGGAAACCGTAGAGGAAAACATACCGTTATTCGAATGGTTTGAAGAAAATATTTTGCAAATATGCACGTTGACTCAAAGAGGCTGGAAGACTACCAGAAGAGACAGGTATATTCAAAACATCCTCCATTGGAAGAAGTCGAATTTGACAATTACTACNAAGTCATCGTCTTTAGACTGACAATGCACAAGCAACCCAAGTTTGCTTAGCTCGTTAACCACATGAATGTAATATCTAGAAGGTCCTGGTGGTGGTAACGCGACCCTTGGAACCTCAGCCCCCACTAAAGCCACCGTGAAAAGAAAGAAAACGACAATATTCTTCATCATTTCCTAATCAACCTTTAGCTTAACTACCAATTTGTGTTGGACATGTGAATAAATATACATACTTATANNNNNNNNNNTCCATTGGAAGAAGTCGAATTTGACAATTACTACGGAGAAAAGGTCAAAGCTAGTCCTTTCAAAACTATCCCAGAAGAACTAGAAAAGGGTAATCCGACCTTAAAGGATATCAAGAATATCCAGCACCAGAATAATTATTCAAACAAGATTTTATCCACGATCTCCACCCAGCTTTAGAGTATCGAGGGAAAAATCTCGAAGAATTCGGGTACTCTGCAAGAACAAGCAGGCAGCTCTGTACCAAAGGTCGATGAGTCAATCCCAATACTCAGACAGGCAAACCTCGACGTATTCACAAAAAGGGTCTCGAAAGAATAAGCTGCAATAGCCAATATAAAAGAAAAGTTGGATAGAATTCTAAATCCTAGAATAATACATCAGGATTCATCTGTCAATGTCGTTAACAATGACGACGAGATCCAGGACTTCGAAGATGGATCATAAATGAGGCAGAGGAACCTCTCTACAATCGGATCGAAAGGATCTCAAGAAGAAGTCAAAATAATACCAGCAATCAGAAAAACTGGTATCCTCAACCGTTTTTTCCGGATATTCAATTGGAAGAAAAGACGCTACAAACTCAAGCCCACTATGATGACCTGACAATCTATGAATGGAGCATCGATGGATTGTCTAACTATCTTGTGATGAACGTTGTCAATGAAATGATGATGGCCGCAGGCGTGTATAAATTATAGGGCCATAAATCCGACCATCAAATAGCTCGTCTTAATAACCGGATTCACCAGGAAACTCAAAGACTGGTGGGACAAATATCTTGACGAATCAACCCGCTAACAGATATTAAACCACTATGTCATCAGGCCAACAATTCAAATCATCAAAGTAGAAGGTCCATCAACTAGGACCGAAGTACAACATGGAAGGGTAGAAGATGCTGTCAATACTCTATTCTATGCCCTCATAGAATTCTTCGTTGGCGACCCCCTAAAATACCGGAGAGATCCGCCGAAATACTCATGAACCTGAAATGCCCTACTCTAGAAGATTTTAGGTGGTACAAAGACATGTATTTCAACAAAGTTCTTATTAGAACAAATAGTTCGTTGGAATTCTGGAAAGAGAATT
TTGTCAATGGACGACTAAAGCACTTCTCAAGAAGGATCAAAGACGGTTTGAAGACAAAGTATAATGGAACAATTCCATGGTAGACTTTGTCATATGGATCAATAGCATCCTTCATCATAGAAGGACTCAGACTTTGTAATGAGTCAAAGATCCAAAACAAGCTCAATTCTTCAATATCAAACAAAAAAGAGCTTGGAAGATTTTGCGATCAATATGGATGCAAGGAAATAGAAGCCCCCTCAACTTCCAGAAGGAAGAAGATTAGGACGCATCCAAAGCTTTATAATGCGTACAGGCGTAGAGAAAAGTATCGGATTAAGCCTTTACAACCTCAAAAGTCAACGTATTCTAAGAAAAGGTACGCTCCCACAAAAATTCATAAAGGAAAGAAAAAGAAAACTTGCTTCAAATGTCGAGAAGAAGGACACCATGCTAAAAAGTGTCCAATCAGAGGAAAGATCAACGAGTTGGATATAGATGAAGAGCTGAAAAATCAGCTACTACGGCTAACTCTAACCGGCTCAGAACAATCAAGCGAGGGAGAAATCCTCGAACTTCAAGAAGGATCCGATTCATATTCTAGCACAGAGTACGAATCAGAACAAGAAGGAAGGACGTGCGAAGGATGCATAAACGTCCTTACAAAGGATCAAGAAATTCTACTTGAGGTGGTAGAGAAAATTCAAGATCCAGAAATTCAACAAAAAATTGCTCGACGACTTAGAGATGCTATGACAATCTCTAAACCACCTGAAAGGGAAAAAAGAAATCCCTACAGACTTCAATCAGTTCTCCAAAGGTTTGAAAAACTGAGAGAACTCACTACTCAAGACCTACAAAGAGAGATTAATAATCTCAAACAAGAAGTACAGGTATTAAGATCTGAAACCAGGTCAGAATCCTATATGCTTCGCCAAGAAATCCTCAACCTTCAGGAAAGGCGGCCCGAACAAAAGGAAACGACACAACCAAACCTGGACGAAGATGATTTTCAAAGTTCCTTCGTTGGAGCCATCACCACTTCTCAATATCAAAAATGGTACGCTCTCGTTACCTTGAAAATTTATGATTTCAAAATTACACTAAAGGCTCTCATAGATACTAGATCCGATCAAAATTGCATTCAAGAAGGCCTAATACCTACTAAGTATTTTGAAAAAACTACCGAAGGCCTTAGAGGTGCAAACAACAACAAACTCAAAACCAATTACAAACTATCAAAAGTCCATGTTTGTATTGACAGAATTTGCTTCGTAAATTCCTTCCTCTTAGTAAAGGACTTAGGACAGGAACTAATCCTAGGTACTCCTTTCATTACTCAATTATACCCTTTTAAAATAACTGAAAAAGGGTTAAAGTCAAGAGCCTTAGGAAAGAAGATAAAATTTAATTGTCTTTCCCCTATAAGAATTAACGAAATAAATAACCTTCAAAAGAATACAATTATTCAATCAATAAATGTATTAAATTATAAAGAAAAACAAATTTCTTTTCCGAAGGAAGAAATTTCATTCAAAAGAATAGAAGAAAAGCTTCAAAGCAAGATCGTCCAGTCTAAAATCAAAACATTACAAGCTCAAATAGAAAGGGAAATTTGTTCGGAAATTTGTTCAACTGTCCCTAATGCCTTTTGTAATAGGAAGCAACACATAGTTAATCTTCCTTATATCAAACCTTTCGAGGAGAAAAATATACCAACGAAGGCCCGACCTATTCAATTGAATACAGAATTACTAACATTCTGCAAAGCAGAAATTGATGATCTTCTGCAAAAAGGTCTTATTAAGGCCATCAAAGAGCCCCTGGTTATGTGCAGCTTTTTATGTCAACAACCAGGCAGAGAAGGAGCGTGGAGTTCCACGACTGGTAATAAACCACAAACCCTTAAATAAGGTTCTGGAATGGATTAGATATCCAATTCCAAATAAATCAGATTTAATGCAAAGAATTTCAAATGCGAAAGTATTTTCAAAACTTGATATGAAATCTTGATTTCAGTA
AAAGACAAGTATAAAACGGCTTTCAACGTCCCTTTCGGACAATACGAGTGGAACATTATGCCATTTGGGCTAAAAAACGCTCCATCTGAATTTCAAAAAATTATGAATCATATCTTCAATTCTTTTCAAGAATTTTCAATTGTCTATATTGTCGATGTTCTGATTTTCTCTCAAAGCATCGATCAACATTTTAAACATCTTCAAACATTTTTGGCCATAGTCAAAAGAAACCGTTGAGTTGTATCTAAACCAAAAATAAAACTTTTTCAGACAAAAATCCGCTTTCTTGGATTTAAAATTCATTTGGGCTTAATTAAACCCATCCAAAGGTCTATCGAGTTTGCCTCAAATTTTCCGGACCAAATCACGGATAAAACCCAACTACAAAGATTCTTGGGATGTCTCAATTATGTGTCTGATTTCCTCCAAAATATCAGACCCATTTGCAAACAACTTTTTGTGGGACTAAAAAAGAATCCCAAGCCTTGGACCGAAGAACATACCAAAACAGCCCAAAAAATCAAATCCCTTGTCAAATCAATTTCATGTTTATCTCTGATAAATCAAAGGGCTGGTTTAATAGTAGAGACCGACGCATCAGTTATCGAAGATGGAGGAATCCTGAAACAGTCCCTTGACAAGAAAGAATCCATTGTCCGATTCCATTCAGGAATATGGAATTCTACTACAAAAAAAATTACTCAACAGTAAAGAAAAAAATACTTTCAATAGTACTTTGTGTACAAAAATTTCAAAGTGATTTAGTTAACAAAAGGTTTGTTATCAATATAGATTCTAGGGCATCTAAATTTGTTTTAGAAAAAGATGTCAAGAATCTTGTATCTAAACCAATTTTTGCTAGATGGCAAGCCATTTTATCTTTCTTCGATTTCCAAATTTTGCCTATAAAAGGGGTCGAAAACCCTTTGGCTTACTACCTCTCAAGAGAGTTTTTACTCTCAAAGAAAAACTCTCCCTAGCCACTCTTCCATCGAGAGATGACCAGAAAGAAAGCGATAGAGAAAGGAAAAAAGCCCACTGCTTCTTCAAACATCCCTACTCCAATGACATCCAACAGCTATGCGATGGATGTTGGTTTCACACTGGTGACCAAGTCTAAAGCAAGGCTTTCTAAAAAACAGGCGGAGGTTATCTCTCTGATAAGGCCTTCTGCCTCTTCAACATGGCCATCTGCCGTTACTCCATCGAGACCTACTGTCTCTTCCACTTCAAAGGGACCTTTTGTCCCCTCGACCTACTCGGACGCAGTTTTTCCTATTTGATTTTCTCCAGTTCATGAAATCAGGACCTATTTTAAAAAATCTGTGTCTATACAAGAAGCTTTGGTAGAACCAGAATACGACGACACAAAAATCAGCCAGGTCGTCAAGAAAGCCTATCCATCGAGATTCTTTTATATCCCAGAAGATCTTCATAAAACTAGAAGGTTTTATGAATTCATTCTAGTCAACACAAAGTCTGTTGAAATTATTCATATTCCGAGTCGCGATGACCCTTCGAAAATCGCGTATTCAAAATTAAAAATTTTCAAAGTCATGAACCCGACGTATTGGAATCAAGATCCATACACGGAGAAGACTTGTTCAAACCCTTTTGTTCCTCATTCTTACTCATACAGAGATTATCAGATGGCATGGTTCAATGTCATTTGGTATCAAAATTACGAGCACTCATGGTTTATTCAATTCTGCCAGAATGCTCATAAAATTCATTACCTATTTTGGTTCATCATGAGCATTCTGGCAATGAGGTTTTATAAGCATTCTGGTTTTATGAGAAAGCAAGCTGAAAAGAAGAAGAAGATGGAAAAGGTGGTTCGAGTGTTTTTTTTTGCTATTGGGGTACGGTTCAACGTTTTTAGGGGTTGGTTCACCCAGTTCGTGATCCTCGGGAGCGGTTCAAGTTATTTTTAATTATTAATGTAATAATTTAGTTAATTGCTTGCTTTAAAATGCCAAAACCAATCCACATTAGCGTGA
TTAATTAATTATGCATTATGAATGTATGTTTTAATTAAATAGAAAATGTATATACATAAAGTATGTCATATAGATTTAAAATCCCACCATATGCTAAGCATGTTTCATGCATTCCAAGTATGTTATAAGTGTTATAATGTATAGTATGCATGTTGAGGGTTCCATGAATGATAGATATAAATTGTTATATTATTGCATGGAATAGATGATTATATAATTGTTATATAAAGCATGTTAGATGCATGTTATAGGTTATTTTCAACGTCCTAAAACAAGATAGACGTTAAAATCTATAATAAAGATAACTGCATGCTCACTTAGGGTTAAGAACCAAACACGATCAATTTTAATAGTTAAAATAGGTCGATGTCTTATAAAATCGCGGTTACAAGAAAGCTCACCTGGCACGATCTTGTCTAAGATTGGAGGTACTTAAGTTGACGGTTTACGGAACACCTACTACCGGAAGATTGGACCTGTACTTAAGCTTAGTTAGCCCAGTTTTATGAGCATGCTTGAGTGATATAAGTGTTAAAAAAATCACCTAGACTTAAGTTATATTTAATTAGCTGGAATATACCTAAGTAATGAGTATACCCAACCATAAAAAATACTTAGTATGAGGAAAAAGGTATATGAGATACATCGTTTTTCTTTCATACTTTCTGAAAAATTCACACCATGAGATTCATTCTCAGTCTTGTGTCGCCCTGGGAGCATCCTCCATTTGGAAGGTATTTGCATGAGTCAATAACGATGTGAATTGAGAGAGTATTTATAGTAAGTGGGAGAAGGAAGTATATCAACACGTCCTATGGTCTCCACCACTAGGTTGCATCGTGAGATTCCCATGTTCCATCTGCATGTTGCCTTAGAGCAACTCCCTATTCGGAGGGTCGTAACATGGGAGTCGAAACAATGCAAATTCAAGAAATGGATAAGATTTTTTAGGTCCATTACCAGCTTTGGCTTTTTTATTCGATAGCATTGTTAGGACCGATCTCTAAGGTCCGAAATTGATGGGTCACATTTATGAAGAATTACTAAGAGTTAGTATATTCTTGACCAAAATAGCGATAACTAAAACACTATAGGAACATGAGTTATTCTGAGATTAGTTTTTAAGGCAAGAATGTGTTGGTTTAGTTGAGTGATTGACTGCCCCTCGGTAGCAGTTGCTCTAACTCACTAAAGTGTCGTTGCAAATTAATAATTTTTGGGTACATTATTACATTTGCTAAACCTGACTGGATTAAAGAGGATTAAATAAAATAACTAATAGAATTTTCTTTATATCACAGCATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAATTACGCAACATGGAAATCAAACCTAAACATAATACTTGTAATTGATGATTTAAGGTTTGTTTTAACTGAGGAATGTCCTCCAAACCCCAGCTCAAATGCAAACCGAACAGTTCGGGATGCATATGACAGATGGACAAAAGCAAATGACAAAGCCCGAGTGTACATTTTAGCCAGCCTTTTTGATGTTTTGGCTAAGAAACACGATGCCATGGGTACTACTAAAGAGATTAGGGAATCTCTGAAAATGGATGTTTGGACAACCGTCCTTCTCCCTCCTTAG

mRNA sequence

ATGCTAATCTTCCGAAGAGAAAGGAATCTCAAAACCCGGTACAAATCATCGAACATGACGATGGAAGCGTTGAAATACAATTCAACGAAGAGCCTTCGTCTGAATCAAAAGCGTAGAGAAAAGTATCGGATTAAGCCTTTACAACCTCAAAAGTCAACGTATTCTAAGAAAAGGTACGCTCCCACAAAAATTCATAAAGGAAAGAAAAAGAAAACTTGCTTCAAATGTCGAGAAGAAGGACACCATGCTAAAAAGTGTCCAATCAGAGGAAAGATCAACGAGTTGGATATAGATGAAGAGCTGAAAAATCAGCTACTACGGCTAACTCTAACCGGCTCAGAACAATCAAGCGAGGGAGAAATCCTCGAACTTCAAGAAGGATCCGATTCATATTCTAGCACAGAGTACGAATCAGAACAAGAAGGAAGGACGTGCGAAGGATGCATAAACGTCCTTACAAAGGATCAAGAAATTCTACTTGAGGTGGTAGAGAAAATTCAAGATCCAGAAATTCAACAAAAAATTGCTCGACGACTTAGAGATGCTATGACAATCTCTAAACCACCTGAAAGGGAAAAAAGAAATCCCTACAGACTTCAATCAGTTCTCCAAAGGTTTGAAAAACTGAGAGAACTCACTACTCAAGACCTACAAAGAGAGATTAATAATCTCAAACAAGAAGTACAGGTATTAAGATCTGAAACCAGGTCAGAATCCTATATGCTTCGCCAAGAAATCCTCAACCTTCAGGAAAGGCGGCCCGAACAAAAGGAAACGACACAACCAAACCTGGACGAAGATGATTTTCAAAGTTCCTTCGTTGGAGCCATCACCACTTCTCAATATCAAAAATGGTTTGTTTTAACTGAGGAATGTCCTCCAAACCCCAGCTCAAATGCAAACCGAACAGTTCGGGATGCATATGACAGATGGACAAAAGCAAATGACAAAGCCCGAGTGTACATTTTAGCCAGCCTTTTTGATGTTTTGGCTAAGAAACACGATGCCATGGGTACTACTAAAGAGATTAGGGAATCTCTGAAAATGGATGTTTGGACAACCGTCCTTCTCCCTCCTTAG

Coding sequence (CDS)

ATGCTAATCTTCCGAAGAGAAAGGAATCTCAAAACCCGGTACAAATCATCGAACATGACGATGGAAGCGTTGAAATACAATTCAACGAAGAGCCTTCGTCTGAATCAAAAGCGTAGAGAAAAGTATCGGATTAAGCCTTTACAACCTCAAAAGTCAACGTATTCTAAGAAAAGGTACGCTCCCACAAAAATTCATAAAGGAAAGAAAAAGAAAACTTGCTTCAAATGTCGAGAAGAAGGACACCATGCTAAAAAGTGTCCAATCAGAGGAAAGATCAACGAGTTGGATATAGATGAAGAGCTGAAAAATCAGCTACTACGGCTAACTCTAACCGGCTCAGAACAATCAAGCGAGGGAGAAATCCTCGAACTTCAAGAAGGATCCGATTCATATTCTAGCACAGAGTACGAATCAGAACAAGAAGGAAGGACGTGCGAAGGATGCATAAACGTCCTTACAAAGGATCAAGAAATTCTACTTGAGGTGGTAGAGAAAATTCAAGATCCAGAAATTCAACAAAAAATTGCTCGACGACTTAGAGATGCTATGACAATCTCTAAACCACCTGAAAGGGAAAAAAGAAATCCCTACAGACTTCAATCAGTTCTCCAAAGGTTTGAAAAACTGAGAGAACTCACTACTCAAGACCTACAAAGAGAGATTAATAATCTCAAACAAGAAGTACAGGTATTAAGATCTGAAACCAGGTCAGAATCCTATATGCTTCGCCAAGAAATCCTCAACCTTCAGGAAAGGCGGCCCGAACAAAAGGAAACGACACAACCAAACCTGGACGAAGATGATTTTCAAAGTTCCTTCGTTGGAGCCATCACCACTTCTCAATATCAAAAATGGTTTGTTTTAACTGAGGAATGTCCTCCAAACCCCAGCTCAAATGCAAACCGAACAGTTCGGGATGCATATGACAGATGGACAAAAGCAAATGACAAAGCCCGAGTGTACATTTTAGCCAGCCTTTTTGATGTTTTGGCTAAGAAACACGATGCCATGGGTACTACTAAAGAGATTAGGGAATCTCTGAAAATGGATGTTTGGACAACCGTCCTTCTCCCTCCTTAG

Protein sequence

MLIFRRERNLKTRYKSSNMTMEALKYNSTKSLRLNQKRREKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDIDEELKNQLLRLTLTGSEQSSEGEILELQEGSDSYSSTEYESEQEGRTCEGCINVLTKDQEILLEVVEKIQDPEIQQKIARRLRDAMTISKPPEREKRNPYRLQSVLQRFEKLRELTTQDLQREINNLKQEVQVLRSETRSESYMLRQEILNLQERRPEQKETTQPNLDEDDFQSSFVGAITTSQYQKWFVLTEECPPNPSSNANRTVRDAYDRWTKANDKARVYILASLFDVLAKKHDAMGTTKEIRESLKMDVWTTVLLPP
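The protein sequence above appears to be the frame-1 translation of the CDS under the standard genetic code. The sketch below illustrates this on the first and last few codons of the CDS only; the `translate` helper is hypothetical, and the fragments are copied from the CDS listed on this page.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon (hypothetical helper)."""
    residues = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")  # X for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First 8 codons and last 5 codons of the CDS shown above.
cds_start = "ATGCTAATCTTCCGAAGAGAAAGG"
cds_end = "CTTCTCCCTCCTTAG"

n_term = translate(cds_start)
c_term = translate(cds_end)
print(n_term, c_term)
```

The two fragments translate to the first eight and last four residues of the listed protein; running the same function on the full CDS is a quick consistency check for the record.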
Homology
BLAST of CmaCh00G002740 vs. ExPASy TrEMBL
Match: A0A6J1EYM2 (uncharacterized protein LOC111439730 OS=Cucurbita moschata OX=3662 GN=LOC111439730 PE=4 SV=1)

HSP 1 Score: 263.1 bits (671), Expect = 1.7e-66
Identity = 139/162 (85.80%), Positives = 149/162 (91.98%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           RE YR KP+Q QK  YS+++Y PTK H GKKK+TCFKCREEGH+A KCPIRGKINELDID
Sbjct: 138 RETYRNKPVQSQKPAYSRRKYIPTKTHGGKKKQTCFKCREEGHYANKCPIRGKINELDID 197

Query: 99  EELKNQLLRLTLTGSEQSSEGEILELQEGSDSYSSTEYESEQEG-RTCEGCINVLTKDQE 158
           +ELKNQLL LTLT SEQSSEGEILELQE SDSYSSTEYESEQEG RTCEGCINVLTKDQE
Sbjct: 198 QELKNQLLPLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQE 257

Query: 159 ILLEVVEKIQDPEIQQKIARRLRDAMTISKPPEREKRNPYRL 200
           ILLEVVEK+QDPEIQQKIA+RLRDAMTISKPPERE+RNPYRL
Sbjct: 258 ILLEVVEKVQDPEIQQKIAQRLRDAMTISKPPEREERNPYRL 299
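The Identity and Positives figures in the HSP header can be recomputed from the alignment rows. The sketch below uses only the last alignment block above (query residues 159 to 200) and assumes the usual BLAST midline convention: a letter marks an identical pair, `+` marks a conservative substitution, and a space marks a non-positive pair or a gap. Variable names are illustrative.

```python
# Last alignment block of the HSP above: query, midline, subject.
query   = "ILLEVVEKIQDPEIQQKIARRLRDAMTISKPPEREKRNPYRL"
midline = "ILLEVVEK+QDPEIQQKIA+RLRDAMTISKPPERE+RNPYRL"
sbjct   = "ILLEVVEKVQDPEIQQKIAQRLRDAMTISKPPEREERNPYRL"

assert len(query) == len(midline) == len(sbjct)

# Identities: columns where query and subject carry the same residue.
identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))

# Positives: every non-blank midline column (identical or '+').
positives = sum(c != " " for c in midline)

aligned = len(query)
print(f"block identity  = {identities}/{aligned}")
print(f"block positives = {positives}/{aligned}")

# The header percentages are the whole-HSP counts divided by the alignment length.
hsp_identity_pct = f"{100 * 139 / 162:.2f}"
hsp_positive_pct = f"{100 * 149 / 162:.2f}"
print(hsp_identity_pct, hsp_positive_pct)
```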

BLAST of CmaCh00G002740 vs. ExPASy TrEMBL
Match: A0A6J1EW44 (uncharacterized protein LOC111436618 OS=Cucurbita moschata OX=3662 GN=LOC111436618 PE=4 SV=1)

HSP 1 Score: 151.8 bits (382), Expect = 5.6e-33
Identity = 79/101 (78.22%), Positives = 88/101 (87.13%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           RE Y  KP+Q QK  YS+++Y PTK H+GKKK+T FKCREEGH+  KCPIRGKINELDID
Sbjct: 138 RETYLNKPVQSQKPRYSRRKYIPTKTHRGKKKQTSFKCREEGHYVNKCPIRGKINELDID 197

Query: 99  EELKNQLLRLTLTGSEQSSEGEILELQEGSDSYSSTEYESE 140
           +ELKNQLLRLTLT SEQSSEGEILELQ+ SDSYSSTEYESE
Sbjct: 198 QELKNQLLRLTLTDSEQSSEGEILELQKESDSYSSTEYESE 238

BLAST of CmaCh00G002740 vs. ExPASy TrEMBL
Match: A0A6J1EZI3 (uncharacterized protein LOC111440658 OS=Cucurbita moschata OX=3662 GN=LOC111440658 PE=4 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 2.6e-30
Identity = 75/107 (70.09%), Positives = 89/107 (83.18%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           RE YR KP+Q QK TYS+++Y PTK H+GKKK+TCFKCREEGH+A KCPIRGKINELDID
Sbjct: 231 RETYRSKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDID 290

Query: 99  EELKNQLLRLTLTGSEQSSEGEILELQEGSDSY--SSTEYESEQEGR 144
           +ELKNQLLRLTLT SEQSS+GEIL+LQE SD     +T    +++GR
Sbjct: 291 QELKNQLLRLTLTDSEQSSKGEILKLQEESDYILAQNTNQNKKEKGR 337

BLAST of CmaCh00G002740 vs. ExPASy TrEMBL
Match: A0A6J1IFV1 (uncharacterized protein LOC111472952 OS=Cucurbita maxima OX=3661 GN=LOC111472952 PE=4 SV=1)

HSP 1 Score: 112.5 bits (280), Expect = 3.8e-21
Identity = 53/68 (77.94%), Positives = 61/68 (89.71%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           REKYR KP+Q QK TYSKKRY PTK H+GKK++ CFKCREEGH+A KCPI+GKINELDID
Sbjct: 186 REKYRSKPMQSQKPTYSKKRYTPTKTHRGKKRQACFKCREEGHYANKCPIKGKINELDID 245

Query: 99  EELKNQLL 107
           +ELK+QLL
Sbjct: 246 QELKDQLL 253

BLAST of CmaCh00G002740 vs. ExPASy TrEMBL
Match: A0A6J1EWB4 (uncharacterized protein LOC111436716 OS=Cucurbita moschata OX=3662 GN=LOC111436716 PE=4 SV=1)

HSP 1 Score: 101.7 bits (252), Expect = 6.7e-18
Identity = 51/65 (78.46%), Positives = 56/65 (86.15%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           RE YR KP+Q QK TYS++RY PTK HKGKK+KTCFKCREEGH+A KCPIRGKINELDI 
Sbjct: 250 REAYRNKPVQSQKPTYSRRRYTPTKPHKGKKRKTCFKCREEGHYANKCPIRGKINELDI- 309

Query: 99  EELKN 104
            ELKN
Sbjct: 310 -ELKN 312

BLAST of CmaCh00G002740 vs. NCBI nr
Match: XP_022933039.1 (uncharacterized protein LOC111439730 [Cucurbita moschata])

HSP 1 Score: 263.1 bits (671), Expect = 3.6e-66
Identity = 139/162 (85.80%), Positives = 149/162 (91.98%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           RE YR KP+Q QK  YS+++Y PTK H GKKK+TCFKCREEGH+A KCPIRGKINELDID
Sbjct: 138 RETYRNKPVQSQKPAYSRRKYIPTKTHGGKKKQTCFKCREEGHYANKCPIRGKINELDID 197

Query: 99  EELKNQLLRLTLTGSEQSSEGEILELQEGSDSYSSTEYESEQEG-RTCEGCINVLTKDQE 158
           +ELKNQLL LTLT SEQSSEGEILELQE SDSYSSTEYESEQEG RTCEGCINVLTKDQE
Sbjct: 198 QELKNQLLPLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQE 257

Query: 159 ILLEVVEKIQDPEIQQKIARRLRDAMTISKPPEREKRNPYRL 200
           ILLEVVEK+QDPEIQQKIA+RLRDAMTISKPPERE+RNPYRL
Sbjct: 258 ILLEVVEKVQDPEIQQKIAQRLRDAMTISKPPEREERNPYRL 299

BLAST of CmaCh00G002740 vs. NCBI nr
Match: XP_023520850.1 (uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 207.6 bits (527), Expect = 1.8e-49
Identity = 115/159 (72.33%), Positives = 132/159 (83.02%), Query Frame = 0

Query: 21  MEALKYNSTKSLRLNQK------RREKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCF 80
           +EA K +  K ++ + K       RE YR KP+Q QK TYS+++Y PTK H+GKKK+TCF
Sbjct: 544 IEAPKTSRQKKVKTHPKPYHSYRPREAYRSKPVQSQKPTYSRRKYIPTKTHRGKKKQTCF 603

Query: 81  KCREEGHHAKKCPIRGKINELDIDEELKNQLLRLTLTGSEQSSEGEILELQEGSDSYSST 140
           KCR EGH+AKKCPI+GKINELDID+ELKNQLLRLTLT SEQS EGEIL+LQE SDS SST
Sbjct: 604 KCRVEGHYAKKCPIKGKINELDIDQELKNQLLRLTLTNSEQSGEGEILKLQEESDSNSST 663

Query: 141 EYESEQEG-RTCEGCINVLTKDQEILLEVVEKIQDPEIQ 173
           +YESEQEG RTCEGCINVLTKDQEILLEVVEK+QD EIQ
Sbjct: 664 KYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDQEIQ 702

BLAST of CmaCh00G002740 vs. NCBI nr
Match: XP_023521035.1 (uncharacterized protein LOC111784623 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 206.1 bits (523), Expect = 5.2e-49
Identity = 109/137 (79.56%), Positives = 121/137 (88.32%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           REKYR KP+Q QK TYS+++Y PTK H+GKKK+TCFKCREEGH+A +CPIRGKINELDID
Sbjct: 220 REKYRNKPVQSQKPTYSRRKYTPTKTHRGKKKQTCFKCREEGHYANECPIRGKINELDID 279

Query: 99  EELKNQLLRLTLTGSEQSSEGEILELQEGSDSYSSTEYESEQEG-RTCEGCINVLTKDQE 158
           +ELKNQLLRL LT SEQSSEGEILELQE SDSYSSTEYES QEG RTCEGCINVLTKDQE
Sbjct: 280 QELKNQLLRLALTDSEQSSEGEILELQEESDSYSSTEYESGQEGKRTCEGCINVLTKDQE 339

Query: 159 ILLEVVEKIQDPEIQQK 175
           ILLEVVEK +  +  +K
Sbjct: 340 ILLEVVEKFKIQKFNRK 356

BLAST of CmaCh00G002740 vs. NCBI nr
Match: XP_023552915.1 (uncharacterized protein LOC111810441 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 170.2 bits (430), Expect = 3.1e-38
Identity = 86/112 (76.79%), Positives = 99/112 (88.39%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           RE YR KP+Q QK TYS+++Y PTK HKGKK++TCFKCREEGH+A KCPIRGKINEL+ID
Sbjct: 259 RETYRNKPVQSQKPTYSRRKYTPTKPHKGKKRQTCFKCREEGHYANKCPIRGKINELEID 318

Query: 99  EELKNQLLRLTLTGSEQSSEGEILELQEGSDSYSSTEYESEQEGRTCEGCIN 151
           +ELKNQLLRL LT SEQSS+GEILELQE SDSYS+TEYESEQEG+  +GC N
Sbjct: 319 QELKNQLLRLALTDSEQSSKGEILELQEESDSYSNTEYESEQEGKRTKGCPN 370

BLAST of CmaCh00G002740 vs. NCBI nr
Match: XP_023522280.1 (uncharacterized protein LOC111786173, partial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 166.8 bits (421), Expect = 3.5e-37
Identity = 85/105 (80.95%), Positives = 95/105 (90.48%), Query Frame = 0

Query: 39  REKYRIKPLQPQKSTYSKKRYAPTKIHKGKKKKTCFKCREEGHHAKKCPIRGKINELDID 98
           REKYR KP+Q QK TYS+++Y PTK H+GKKK+TCFKCREEGH+A +CPIRGKINELDID
Sbjct: 249 REKYRNKPVQSQKPTYSRRKYTPTKTHRGKKKQTCFKCREEGHYANECPIRGKINELDID 308

Query: 99  EELKNQLLRLTLTGSEQSSEGEILELQEGSDSYSSTEYESEQEGR 144
           +ELKNQLLRL LT SEQSSEGEILELQE SDSYSSTEYES QEG+
Sbjct: 309 QELKNQLLRLALTDSEQSSEGEILELQEESDSYSSTEYESGQEGK 353

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name  E-value  Identity (%)  Description
A0A6J1EYM2  1.7e-66  85.80         uncharacterized protein LOC111439730 OS=Cucurbita moschata OX=3662 GN=LOC1114397...
A0A6J1EW44  5.6e-33  78.22         uncharacterized protein LOC111436618 OS=Cucurbita moschata OX=3662 GN=LOC1114366...
A0A6J1EZI3  2.6e-30  70.09         uncharacterized protein LOC111440658 OS=Cucurbita moschata OX=3662 GN=LOC1114406...
A0A6J1IFV1  3.8e-21  77.94         uncharacterized protein LOC111472952 OS=Cucurbita maxima OX=3661 GN=LOC111472952...
A0A6J1EWB4  6.7e-18  78.46         uncharacterized protein LOC111436716 OS=Cucurbita moschata OX=3662 GN=LOC1114367...

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022933039.1  3.6e-66  85.80         uncharacterized protein LOC111439730 [Cucurbita moschata]
XP_023520850.1  1.8e-49  72.33         uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo]
XP_023521035.1  5.2e-49  79.56         uncharacterized protein LOC111784623 [Cucurbita pepo subsp. pepo]
XP_023552915.1  3.1e-38  76.79         uncharacterized protein LOC111810441 [Cucurbita pepo subsp. pepo]
XP_023522280.1  3.5e-37  80.95         uncharacterized protein LOC111786173, partial [Cucurbita pepo subsp. pepo]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source       Source Term  Source Description                   Alignment
None       No IPR available                    COILS        Coil         Coil                                 coord: 214..234
None       No IPR available                    GENE3D       4.10.60.10   -                                    coord: 65..116; e-value: 1.1E-5; score: 27.7
IPR001878  Zinc finger, CCHC-type              PFAM         PF00098      zf-CCHC                              coord: 71..87; e-value: 1.2E-4; score: 22.0
IPR001878  Zinc finger, CCHC-type              PROSITE      PS50158      ZF_CCHC                              coord: 73..87; score: 9.817547
IPR036875  Zinc finger, CCHC-type superfamily  SUPERFAMILY  57756        Retrovirus zinc finger-like domains  coord: 58..91
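The CCHC-type zinc finger hits reported above (residues 71..87 and 73..87) can be cross-checked against the protein sequence with a simple pattern search. The sketch below assumes the commonly cited zinc knuckle consensus C-X2-C-X4-H-X4-C; this regular expression is a simplification and is not the Pfam HMM or PROSITE profile that InterPro actually used. Only the first 90 residues of the protein on this page are included.

```python
import re

# First 90 residues of the CmaCh00G002740 protein, in blocks of 10 for easy counting.
PROTEIN_N_TERM = (
    "MLIFRRERNL" "KTRYKSSNMT" "MEALKYNSTK" "SLRLNQKRRE" "KYRIKPLQPQ"
    "KSTYSKKRYA" "PTKIHKGKKK" "KTCFKCREEG" "HHAKKCPIRG"
)

# Assumed consensus for a CCHC zinc knuckle: C-X2-C-X4-H-X4-C.
CCHC = re.compile(r"C.{2}C.{4}H.{4}C")

match = CCHC.search(PROTEIN_N_TERM)
if match is None:
    raise SystemExit("no CCHC-like motif found")

# Convert Python's 0-based half-open span to 1-based inclusive coordinates.
motif_start = match.start() + 1
motif_end = match.end()
motif = match.group()
print(motif, motif_start, motif_end)
```

The motif found this way lies inside both the PFAM and PROSITE coordinate ranges listed in the table, which is what one would expect if those hits describe the same zinc knuckle.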

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh00G002740.1  CmaCh00G002740.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding