CmaCh18G003900 (gene) Cucurbita maxima (Rimu)

Name: CmaCh18G003900
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Myosin heavy chain-like protein, putative
Location: Cma_Chr18 : 2127780 .. 2132785 (-)
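
The Location field combines a chromosome name, a start and end coordinate, and a strand. As a minimal sketch, the Python below splits that string and computes the length of the gene span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and `parse_location` is a hypothetical helper name introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts.

    Hypothetical helper; the pattern is inferred from the Location field above.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cma_Chr18 : 2127780 .. 2132785 (-)")
# Assumption: 1-based, inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 5006 bp on the minus strand of Cma_Chr18.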
The following sequences are available for this feature:

Gene sequence (with intron)

ATCATCATCATTAAGATTTTCATCGTCCAACGTCTCTTCTCAAATAAAGTACGTCTATTCTTAATCGAGATTCGACTTCTTTCTCTTTTGAGTCTTTGTTAGATATTTGAGAATTTACCGATCTATTGACATTACTAAGTTTAGGCATACCCTTTAGTACCAGCTCTGGCTTTGAGATAGTATTATAATTTAATATTTTTCTAATTTAAATTTCATAACTTTTAATTAAAAAATTTATTAAAAAAACTTTATTTATTAATCAAAGATTATTTAAAAAACAAAATTTTAAAATAATAATATTACATAAAAAATTATAATTTTAATAACATTTATAAAAAATAAATAATTAATATTGATTATTATTATTATTATTATTATTTATAGAGGAAATGATTTGCAGGGTGGCTCGACGTTGCAGAAGAGTTTTTGTGTGAAAGCTCTGCATAAATGGCGGCTCATCCCATTAAGATCTTGTATCTCTTTCCCGCCTTATTCTTACCTCTCTTCGCTGTTTGCGAAATCGCTTCGTTCGCTCGTTCAAGTAACGATGTGTTGATCGATGATTTGCTGGAAATCAAGCTCAGGATCTCTCGCTTAGGTGCATATTGTTTACTGTTTTCATTTCACATGACATTGAGAGGTTTTTAATTCTTCATATGATTTAGCTAATTCTAGTGTCTGTCAATCTAGACTTTAGGCGTTGCATTTATTACGTAAGTTTTCTGTGGTGTCGCTATCTTTTACTTAATAAAATCGAATGCAGAATCCGTTCTGGAGGGAAGCAAGCAAAATTTAACCGAGAAAGGCAATGAAATTTTGGCGCAGGAAAGGTTAATTGAGGCTATGTCTCATAAGATTCAATACTTGGAGTCTGCTCTATTTGATATGAAGGTTTGATTCGTGCATTATAATCAAGAAAGTGTTGCTTCATCTGTGCATTTTCAGAGATTCATTCTGTCTCTTGTTTTTTGATATTTTTCGTACATCAGAATATGTTTGATGTGTTCAATTACAGACTTTGGAGTGGGCTATTAGCCATTCACACGTCCGAATTTACTAAATGATTGCAAAATATTGTCTGATTATAGGTTAAGTTTTCTGTTTCAACAATTGCTTGTCAATAGGGAGAAAATTTACAGTCAGACATCACCTGAATCTTTAATTCTTTGAGGAAATGTCGAAATTTAGTATTGAGACTGATATTAGTTTAGTTATTTTGCAGAGGAAAACATTAAGTCACGATGAAAGGATTGCTTCTCTGGAGGATGAGGTTTCTTTCTACTTTTTCCTATTTAATTTTTTTATTTGAAAAGTTGCATCGGGAGATTTTCTGGTCCAGGAAACAGTTATCTGGTAAATTTTGTTGAGTTACTTTCAAAGGTTCAATTGATCCGTGTAGTTGGCTTGGGCATGTTAGGTACGACTTCTCTGGGCTGTATTAAGAAAGAACAACTTTGATATTCATCTTTTGGAAGTCAAGGTACATGAAGCTGAGGAAAAACTTGAAGAGGTTACCTCACAAGTTGAGAAGGTATAAATTTAGACTAGGGGTTTTTTTACTTTTCATAGCAGACATCGTTTGAATGTTTCGCGCAACATGCTCACATATTTGTCATGGAAAATGTATAAAACTAAGACAAATATTTTATTATTATGTTCCATGTTTATGTGACACAATATATATAATTTTTTTTTTTTTTTGGCCTATACACTGTGGTATCTAGTAGATTCCCTTGTCATGCAATGATAATTGTAGATTTAGCCAGACACTTTCGTTTGCCTAGCGTGTCCTTGTCTGATACGAGTTGGCACTTGGACACTCCTAACACTTTTTTGGATACATCTTAGACACTTATTAGCACAAAAGATGTATTAGACACTAGTTTTACAAAGCCAATATAGGTCAACATTTATTAGACACGTATCAAACTCTTGTTAACTATACTCTATAGTTTGAGAGTGAAATACATCAAACTCATATTTTTCAGCATATAAATTTG
TGAACTTTGAATTTCTTTGTGTAGAAATGACTTGTCCTTGCCATGTCCGTGTGCTAAATTTTAAAAAATAACGTGTCAATTTGAATGCATGCTTCTTAAATTGAAGACAATAAAAAATGAGAAATCCTTCGACATCCTACATTTTTGAAGTTTTTCATGTTTTCCTACCATTAGTTCACGATTTACCTAACGGAAGAAATGCGGGGGATGCACTTGTAAATGTTCTGGACTCTGATTGAGAATGAGTTCATTTCTCTTTATAATTAACATATCTTCTCCTCAATCAACTTATATCCATGGATAAAATGTCTCTTTATGCAAACATTCCTTTGTTAGTGTGCTTACTTTGTGACCATTTTATTTTCCAGATGTCTAGCATAGTATCTGAACAGTGGATTCATATTCGGCATCTAGAGCAGGCCCTTGAAATAACAAAAGTATGTGTTACGGGAGTGGGATCAATGATTGAAAATTTTATTGTTGAGCTGTTGTGCTGGTGTATAAACTAACTTTATTGGCGTGTGACTAGATAAATGCTTTGAAGGTCCGCCAGCAAGTTGCCCTAACCAGATGCACCTTTTTAAAGGTAATTTTTTGACAAGCCTGGTCGAACATTTTCTTGTCATTCACTCTGGCTCACCCGTGCATTATTTGTTGGCTGCACCTTCATAGAGTGCTGCACTATAAGTTATAGCCAATGTTAGAATCAAATAACGTTAACCATTTTAGATTTTTTCCAAATAATTTCACTTGTAAGCTTTATCAAATGTTTAATTTTGTTTATAGTTGGTCAACACACGTTTTATCAATCAACTTCAGAAGGCCCTACAGACATTGAACTATCATCTGTTCAGCAAAGTGCCTACCTTGAGTTCCATGGTTACAGGAGCCATCCAATACTTCCAAGGAGTCTATGAAGAAGCCACAAAGTATCACCATGAGGTATGTGTTTGTGAACGGAAAGATTAAAGTCAATCTAATAACTTCTGCAAAGACATGGGAACGAAACCAATGGCCCACCGTATAATTCTTTTAGAATTTTTTACCTCCATCCCTTGATCTTTGACATTTGTATATCATACGGGAATATAATTGTTTCAGCTGAATATGCATTGCCAATCTCAGCTATATCTCTTCGTCATAATGACTATAAAGTTTGAATATTCATATCTGCAGTTACAAAGACTGATTAAACAAGAAATGGAAAGAAGGGACTATGCAGCATATCTTGTAAACAGGGAAGTTATTTTCTTTCTGGTAAGCTGCAAAAATCTCGATGCTTCATCTTGAATCTCTTGCCTGTTTTTCTGCGTACTTGCATAATCTAATCATGTTTTTTTATGGATTTTTAGGCGTCTGCTATAGTTACCTTCCCAATTTTTGGTGCTTGGATGTTCCTTTCTTCATGGTTTTCCAGGTGAAAGGGTTATCCGTGAGAAAAGAGGGCTAGGTTATACTCGAGAACCTTTTTTTTTCATGACTATCCTGATAGTTTCGCTTAGCCTGGAGATCAGTTTTCCAATATTAGATGTGAATTCCTTCATGAACAAGTCTGTATATAGTTGCAACTTTTAGCATTCAAATGAAATATTCATACTGTGTTAAAAGTCAAGTTTTTTGTTGCATAACACTTCATATAAATATACATGAAAAAGAAAATGCTGGATAACAGAACTGATGAGTATCAATTCCCACAAAATGCATAGATTTATAATGGGGGACTATATAGTTGGGCAACCAGATATGAAAGAAAGCTGTGGCACCAGAAATTCTCACCCCCAAAAAATTGTTCCAAAAAATGCTTAATACAGCAGAGATTTACATTCTAAAAATACTGTTTGTTTCACTATGAAATAAGAATGGTGTTTTGTGGCTGACTATATAGGAATAATATACAAAACACATTTACATCAGAAAAATAGTTGTCTACTCACTGCTTAGTTTTCTCTTTATCAGATCATAGAATCCTCTGACAATATGGTTAACAATGAGTATTTCAC
TTTGATAGAGATCCGGGCACCTTCAACACAAAGCTTGGCTCCAGTGACCACCCAGTATCCAGGTGAGTTCTCAGGGCCACGCACCATTTCTTTCGTATCGATGATGTTCAACATTTTCATTGCTCTCACAGTCAAAGGCGGACCATCGGGATAAACAGCCGAGTTCAATTCAATTTTTGGTTGCGCTTCTGGCTGCTGCAGCCCTGTACTAAACCTTGTGCTGATAAAGGTGGATATGAGTCCAGATTTTCGGGATGACGTTGAAGGTCCTTCCCATTCTGATCGACGAATCATTGCTGATGCCACCGTCGAAAACCCAAGCCTCAGGAACAATACCTTTCTCATTCCAATAACCTTAACTTCAAACCAAGCCTTTGTGACGATAGAAGCAGTGTCGTCATCAATTAGTGAACCATTGTACTGCACTGGAGCAGTGCAGACATGTGAAAATATGCTCCATTTAACTGGTTGGAAGTAGCCCTTCTCCTCTGGCTCGTCAATGGGTCCATAACTCAGATCATCTGAGAGTTGGAGAGTCTTGGGAAGAGTTGATAGATGCTGAAGATGAATTGCAAGATGGTCACTTTTCTTTCCTTCCAGAAACAACCGAACTCCCGTTACTGGACGACTGTTCGAATCAACCTGCAGTTGCAAAATTGTGATTATCACGTGTTCAATCTCGGCATCGCCATTTTGACAAATCCGTCTTCAAGTCAGAAGATTACCTTAGTAGTATTTACATACAGCTTGGGGCCCATCAAAGTGAACTTCAGAGATGGAGAGGCTTGTTTCCTATGCCGAGTAGCAAGAGGAAGATCGGCGTACACCGGTGCCCATTGTCGAGGCAGCTGAAATTCCAGAAACTGCTGAAGCTCCTCAATTGGAGGTTTATCTGCACAAATTTTCAATGATCAAAATCTATCACTAATGGTAAATTCAATGTGAATTAGAACTCTTGCTGGCTTGAATCTGGTGTCAACATTTTAATGGCTTCTCAAGTACTAAG

mRNA sequence

ATCATCATCATTAAGATTTTCATCGTCCAACGTCTCTTCTCAAATAAAGGTGGCTCGACGTTGCAGAAGAGTTTTTGTGTGAAAGCTCTGCATAAATGGCGGCTCATCCCATTAAGATCTTGTATCTCTTTCCCGCCTTATTCTTACCTCTCTTCGCTGTTTGCGAAATCGCTTCGTTCGCTCGTTCAAGTAACGATGTGTTGATCGATGATTTGCTGGAAATCAAGCTCAGGATCTCTCGCTTAGAATCCGTTCTGGAGGGAAGCAAGCAAAATTTAACCGAGAAAGGCAATGAAATTTTGGCGCAGGAAAGGTTAATTGAGGCTATGTCTCATAAGATTCAATACTTGGAGTCTGCTCTATTTGATATGAAGAGGAAAACATTAAGTCACGATGAAAGGATTGCTTCTCTGGAGGATGAGGTACGACTTCTCTGGGCTGTATTAAGAAAGAACAACTTTGATATTCATCTTTTGGAAGTCAAGGTACATGAAGCTGAGGAAAAACTTGAAGAGGTTACCTCACAAGTTGAGAAGATGTCTAGCATAGTATCTGAACAGTGGATTCATATTCGGCATCTAGAGCAGGCCCTTGAAATAACAAAAATAAATGCTTTGAAGGTCCGCCAGCAAGTTGCCCTAACCAGATGCACCTTTTTAAAGTTGGTCAACACACGTTTTATCAATCAACTTCAGAAGGCCCTACAGACATTGAACTATCATCTGTTCAGCAAAGTGCCTACCTTGAGTTCCATGGTTACAGGAGCCATCCAATACTTCCAAGGAGTCTATGAAGAAGCCACAAAGTATCACCATGAGTTACAAAGACTGATTAAACAAGAAATGGAAAGAAGGGACTATGCAGCATATCTTGTAAACAGGGAAGTTATTTTCTTTCTGGCGTCTGCTATAGTTACCTTCCCAATTTTTGGTGCTTGGATGTTCCTTTCTTCATGGTTTTCCAGGTGAAAGGGTTATCCGTGAGAAAAGAGGGCTAGAGATCCGGGCACCTTCAACACAAAGCTTGGCTCCAGTGACCACCCAGTATCCAGTCAAAGGCGGACCATCGGGATAAACAGCCGAGTTCAATTCAATTTTTGGTTGCGCTTCTGGCTGCTGCAGCCCTGTACTAAACCTTGTGCTGATAAAGGTGGATATGAGTCCAGATTTTCGGGATGACGTTGAAGGTCCTTCCCATTCTGATCGACGAATCATTGCTGATGCCACCGTCGAAAACCCAAGCCTCAGGAACAATACCTTTCTCATTCCAATAACCTTAACTTCAAACCAAGCCTTTGTGACGATAGAAGCAGTGTCGTCATCAATTAGTGAACCATTGTACTGCACTGGAGCAGTGCAGACATGTGAAAATATGCTCCATTTAACTGGTTGGAAGTAGCCCTTCTCCTCTGGCTCGTCAATGGGTCCATAACTCAGATCATCTGAGAGTTGGAGAGTCTTGGGAAGAGTTGATAGATGCTGAAGATGAATTGCAAGATGGTCACTTTTCTTTCCTTCCAGAAACAACCGAACTCCCGTTACTGGACGACTGTTCGAATCAACCTGCAGTTGCAAAATTGTGATTATCACGTGTTCAATCTCGGCATCGCCATTTTGACAAATCCGTCTTCAAGTCAGAAGATTACCTTAGTAGTATTTACATACAGCTTGGGGCCCATCAAAGTGAACTTCAGAGATGGAGAGGCTTGTTTCCTATGCCGAGTAGCAAGAGGAAGATCGGCGTACACCGGTGCCCATTGTCGAGGCAGCTGAAATTCCAGAAACTGCTGAAGCTCCTCAATTGGAGGTTTATCTGCACAAATTTTCAATGATCAAAATCTATCACTAATGGTAAATTCAATGTGAATTAGAACTCTTGCTGGCTTGAATCTGGTGTCAACATTTTAATGGCTTCTCAAGTACTAAG

Coding sequence (CDS)

ATGGCGGCTCATCCCATTAAGATCTTGTATCTCTTTCCCGCCTTATTCTTACCTCTCTTCGCTGTTTGCGAAATCGCTTCGTTCGCTCGTTCAAGTAACGATGTGTTGATCGATGATTTGCTGGAAATCAAGCTCAGGATCTCTCGCTTAGAATCCGTTCTGGAGGGAAGCAAGCAAAATTTAACCGAGAAAGGCAATGAAATTTTGGCGCAGGAAAGGTTAATTGAGGCTATGTCTCATAAGATTCAATACTTGGAGTCTGCTCTATTTGATATGAAGAGGAAAACATTAAGTCACGATGAAAGGATTGCTTCTCTGGAGGATGAGGTACGACTTCTCTGGGCTGTATTAAGAAAGAACAACTTTGATATTCATCTTTTGGAAGTCAAGGTACATGAAGCTGAGGAAAAACTTGAAGAGGTTACCTCACAAGTTGAGAAGATGTCTAGCATAGTATCTGAACAGTGGATTCATATTCGGCATCTAGAGCAGGCCCTTGAAATAACAAAAATAAATGCTTTGAAGGTCCGCCAGCAAGTTGCCCTAACCAGATGCACCTTTTTAAAGTTGGTCAACACACGTTTTATCAATCAACTTCAGAAGGCCCTACAGACATTGAACTATCATCTGTTCAGCAAAGTGCCTACCTTGAGTTCCATGGTTACAGGAGCCATCCAATACTTCCAAGGAGTCTATGAAGAAGCCACAAAGTATCACCATGAGTTACAAAGACTGATTAAACAAGAAATGGAAAGAAGGGACTATGCAGCATATCTTGTAAACAGGGAAGTTATTTTCTTTCTGGCGTCTGCTATAGTTACCTTCCCAATTTTTGGTGCTTGGATGTTCCTTTCTTCATGGTTTTCCAGGTGA

Protein sequence

MAAHPIKILYLFPALFLPLFAVCEIASFARSSNDVLIDDLLEIKLRISRLESVLEGSKQNLTEKGNEILAQERLIEAMSHKIQYLESALFDMKRKTLSHDERIASLEDEVRLLWAVLRKNNFDIHLLEVKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHLEQALEITKINALKVRQQVALTRCTFLKLVNTRFINQLQKALQTLNYHLFSKVPTLSSMVTGAIQYFQGVYEEATKYHHELQRLIKQEMERRDYAAYLVNREVIFFLASAIVTFPIFGAWMFLSSWFSR
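
The protein sequence corresponds to the coding sequence read three bases at a time. The sketch below illustrates this on the first and last few codons of the CDS shown above. It assumes the standard genetic code (NCBI translation table 1), which this page does not name explicitly, and `translate` is an illustrative helper rather than the tool used to build the record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), with codons
# ordered TTT, TTC, TTA, TTG, TCT, ... to match the string below.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: translation ends here
            break
        protein.append(aa)
    return "".join(protein)

# First 24 and last 15 bases of the CDS above (both fragments are in frame).
cds_start = "ATGGCGGCTCATCCCATTAAGATC"
cds_end = "TGGTTTTCCAGGTGA"
print(translate(cds_start))  # MAAHPIKI
print(translate(cds_end))    # WFSR
```

The two results match the first eight and last four residues of the protein sequence, and the final TGA codon is the stop.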
BLAST of CmaCh18G003900 vs. TrEMBL
Match: A0A0A0LSE6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G075570 PE=4 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 1.2e-114
Identity = 230/291 (79.04%), Positives = 247/291 (84.88%), Query Frame = 1

Query: 1   MAAHPIKIL-YLFPALFLPLFAVCEIASFARSSNDVLIDDLLEIKLRISRLESVLEGSKQ 60
           MA   IKIL YLFPA  LPLFAVCEI S  +S ND LI +LL IKLRIS LESVLE SKQ
Sbjct: 1   MAVCSIKILLYLFPAFVLPLFAVCEILSPGQSGNDALIHELLGIKLRISHLESVLEESKQ 60

Query: 61  NLTEKGNEILAQERLIEAMSHKIQYLESALFDMKRKTLSHDERIASLEDEVRLLWAVLRK 120
           NLTEK NE+ AQE+LIE +SHKIQYLESA+ DMKRK  S DERIA LEDEVR LW   RK
Sbjct: 61  NLTEKSNELKAQEKLIEDVSHKIQYLESAISDMKRKISSDDERIAVLEDEVRRLWDAKRK 120

Query: 121 NNFDIHLLEVKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHLEQALEITKINALKVRQQ 180
           NNFDIHLL+ KV EAEEKLEEVTSQVEK SSI+SEQWI IRHLEQALE++KI ALKVRQQ
Sbjct: 121 NNFDIHLLKAKVQEAEEKLEEVTSQVEKKSSIISEQWIQIRHLEQALEMSKIQALKVRQQ 180

Query: 181 VALTRCTFLKLVNTRFINQLQKALQTLNYHLFSKVPTLSSMVTGAIQYFQGVYEEATKYH 240
            ALTRCTF+KLVNTRF NQLQKA QTLN+H+FSKVPTLSS VTGAI YFQ VYEEA KYH
Sbjct: 181 FALTRCTFVKLVNTRFANQLQKAFQTLNHHVFSKVPTLSSRVTGAIHYFQRVYEEAKKYH 240

Query: 241 HELQRLIKQEMERRDYAAYLVNREVIFFLASAIVTFPIFGAWMFLSSWFSR 291
           HELQRLIKQEMER +YAA+L N E+IFFLASA+  FPIFGAWMFLSSWFSR
Sbjct: 241 HELQRLIKQEMERNEYAAHLANPELIFFLASALAIFPIFGAWMFLSSWFSR 291
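
The percentages in each HSP header are the match counts divided by the alignment length. A small sketch, using the counts from the HSP above (230 identical and 247 positive positions over 291 aligned columns); the function name `percent` is an illustrative choice:

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the first TrEMBL HSP (A0A0A0LSE6_CUCSA).
identities, positives, aligned_length = 230, 247, 291
print(percent(identities, aligned_length))  # 79.04
print(percent(positives, aligned_length))   # 84.88
```

The alignment length includes gap columns, so it can differ from the length of either sequence.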

BLAST of CmaCh18G003900 vs. TrEMBL
Match: A0A061GFK7_THECC (Epidermal growth factor receptor substrate 15-like 1, putative OS=Theobroma cacao GN=TCM_029808 PE=4 SV=1)

HSP 1 Score: 231.1 bits (588), Expect = 1.7e-57
Identity = 133/281 (47.33%), Positives = 186/281 (66.19%), Query Frame = 1

Query: 9   LYLFPALFLPLFAVCEIASFARSSNDVLIDDLLEIKLRISRLESVLEGSKQNLTEKGNEI 68
           L L PA+F   F      +   S++++LI +L + KL+ISRLESVLE S QN+  K   +
Sbjct: 5   LILIPAIFSCFFFSLASQNEQNSNHNLLIRELHDAKLKISRLESVLEESVQNMNAKTLYL 64

Query: 69  LAQERLIEAMSHKIQYLESALFDMKRKTLSHDERIASLEDEVRLLWAVLRKNNFDIHLLE 128
             +E+L+E M+ KI YL+S L  +K  +L  DER+  LE+EVRLLWAV RKNNF++H+LE
Sbjct: 65  KEREKLLEEMADKITYLQSTLSSLKDDSLLADERLKDLEEEVRLLWAVSRKNNFELHVLE 124

Query: 129 VKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHLEQALEITKINALKVRQQVALTRCTFL 188
            K  EAE+KLE VT QVEKM+ +V+EQWI I+HLEQAL+I ++ A + R++  + RCTFL
Sbjct: 125 SKAQEAEDKLEVVTLQVEKMAEVVTEQWIQIQHLEQALQIAEMRASQARRERNM-RCTFL 184

Query: 189 KLVNTRFINQLQKALQTLNYHLFSKVPTLSSMVTGAIQYFQGVYEEATKYHHELQRLIKQ 248
           K ++      + K    L  +  SK   ++  ++ A+Q  + ++    KYHHELQ  IKQ
Sbjct: 185 KFISDLSERHVPKMFGALGSNSLSKGSAINYYISQALQQLRRLFSAIKKYHHELQGFIKQ 244

Query: 249 EMERRDYAAYLVNREVIFFLASAIVTFPIFGAWMFLSSWFS 290
           EM R ++ A  VN E++FFLASA++TFPI  AWM L S FS
Sbjct: 245 EMRRNEFTAAFVNDELVFFLASALITFPILSAWMLLLSQFS 284

BLAST of CmaCh18G003900 vs. TrEMBL
Match: D7TJD6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g00240 PE=4 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 3.8e-57
Identity = 120/246 (48.78%), Positives = 175/246 (71.14%), Query Frame = 1

Query: 43  IKLRISRLESVLEGSKQNLTEKGNEILAQERLIEAMSHKIQYLESALFDMKRKTLSHDER 102
           +KLRI++LE+V+E   QN  EK   +  +E+LIE  SHKI +L+S L+ +K  +   +ER
Sbjct: 1   MKLRITQLETVMEEIVQNFNEKYLYLKQREKLIEEFSHKIHHLQSVLYSIKGDSSHANER 60

Query: 103 IASLEDEVRLLWAVLRKNNFDIHLLEVKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHL 162
           +A+LE+EVRLLWA  RKNNFD+H LE K  +AE++L  V+ QVE+++ +V+EQWI I+ L
Sbjct: 61  LAALEEEVRLLWAASRKNNFDLHTLESKAQDAEDRLNVVSKQVEQLADVVTEQWIQIQQL 120

Query: 163 EQALEITKINALKVRQQVALTRCTFLKLVNTRFINQLQKALQTLNYHLFSKVPTLSSMVT 222
           EQAL++ ++ ALK ++QV++ RCTFLK +N  F N L+K    L+ +LF +  TLSS  +
Sbjct: 121 EQALQMAELRALKAKRQVSMMRCTFLKFINNLFGNHLEKVFGMLDPYLFGRGSTLSSYKS 180

Query: 223 GAIQYFQGVYEEATKYHHELQRLIKQEMERRDYAAYLVNREVIFFLASAIVTFPIFGAWM 282
             +   + ++  A  YHHELQ  IKQEME+ ++ A L N E++FF+ASA++TFPI GAWM
Sbjct: 181 RFLHQLKRMWSAAKAYHHELQGFIKQEMEKYEFTAALANDELVFFVASALITFPIMGAWM 240

Query: 283 FLSSWF 289
            +SS F
Sbjct: 241 LVSSQF 246

BLAST of CmaCh18G003900 vs. TrEMBL
Match: A5AJH7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_005069 PE=4 SV=1)

HSP 1 Score: 224.6 bits (571), Expect = 1.6e-55
Identity = 118/246 (47.97%), Positives = 173/246 (70.33%), Query Frame = 1

Query: 37  IDDLLEIKLRISRLESVLEGSKQNLTEKGNEILAQERLIEAMSHKIQYLESALFDMKRKT 96
           I +L E+KLRI++LE+V+E   QN  EK   +  +E+LIE  SHKI +L+S L+ +K  +
Sbjct: 43  ICELQEMKLRITQLETVMEEIVQNFNEKYLYLKQREKLIEEFSHKIHHLQSVLYSIKGDS 102

Query: 97  LSHDERIASLEDEVRLLWAVLRKNNFDIHLLEVKVHEAEEKLEEVTSQVEKMSSIVSEQW 156
              +ER+A+LE+EVRLLWA  RKNNFD+H LE K  +AE++L  V+ QVE+++  V+EQW
Sbjct: 103 SHANERLAALEEEVRLLWAASRKNNFDLHTLESKAQDAEDRLNMVSKQVEQLADXVTEQW 162

Query: 157 IHIRHLEQALEITKINALKVRQQVALTRCTFLKLVNTRFINQLQKALQTLNYHLFSKVPT 216
           I I+ LEQAL++ ++ ALK ++QV++ RCTFLK +N  F N L+K    L+ +LF +  T
Sbjct: 163 IQIQQLEQALQMAELRALKAKRQVSMMRCTFLKFINNLFGNHLEKVFGMLDPYLFGRGST 222

Query: 217 LSSMVTGAIQYFQGVYEEATKYHHELQRLIKQEMERRDYAAYLVNREVIFFLASAIVTFP 276
           LSS  +  +   + ++  A  YHHELQ  IKQE E+ ++ A L N E++FF+ASA++TFP
Sbjct: 223 LSSYKSRFLHQLKRMWSAAKAYHHELQGFIKQEXEKYEFTAALANDELVFFVASALITFP 282

Query: 277 IFGAWM 283
           I GAW+
Sbjct: 283 IMGAWI 288

BLAST of CmaCh18G003900 vs. TrEMBL
Match: A0A0D2UN96_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G030100 PE=4 SV=1)

HSP 1 Score: 215.7 bits (548), Expect = 7.5e-53
Identity = 121/282 (42.91%), Positives = 183/282 (64.89%), Query Frame = 1

Query: 11  LFPALFLPLFAVCEIAS---FARSSNDVLIDDLLEIKLRISRLESVLEGSKQNLTEKGNE 70
           LFPA+F   F    + S      + +++LI +L + KL++SRLESVLE + Q++  K   
Sbjct: 5   LFPAIFTGFFFFISLTSQDQLKNNDHNLLIRELDDAKLKLSRLESVLEETIQSIDAKTLL 64

Query: 71  ILAQERLIEAMSHKIQYLESALFDMKRKTLSHDERIASLEDEVRLLWAVLRKNNFDIHLL 130
           +  +E+L+E M +KI +L+S +  +K  +L  DE++ +L++EVRLLW   RKNNF++H++
Sbjct: 65  LKEREKLLEGMENKITHLQSVISTLKDDSLLADEKLKALQEEVRLLWDASRKNNFELHVM 124

Query: 131 EVKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHLEQALEITKINALKVRQQVALTRCTF 190
           E +  + E+++E V  +VEKM+ +V+EQWI I+HLEQAL + +  AL+ ++Q    RC+F
Sbjct: 125 ESEAQDTEDRVEAVNLKVEKMAEVVTEQWIQIQHLEQALHLAQRRALQDQRQ-RYMRCSF 184

Query: 191 LKLVNTRFINQLQKALQTLNYHLFSKVPTLSSMVTGAIQYFQGVYEEATKYHHELQRLIK 250
           LK  N      L K L  L Y+ F K  T+   ++ A+Q  +  Y    KYHH+LQ  IK
Sbjct: 185 LKFFNDLSERHLPKMLGALEYYSFGKGSTIKYYMSQALQQLRRFYSAIKKYHHQLQGFIK 244

Query: 251 QEMERRDYAAYLVNREVIFFLASAIVTFPIFGAWMFLSSWFS 290
           QEM R ++ A  VN E++FFLASA++TFP+ GAWM LSS FS
Sbjct: 245 QEMRRNEFTAAFVNDELVFFLASALITFPVLGAWMVLSSQFS 285

BLAST of CmaCh18G003900 vs. TAIR10
Match: AT1G28410.1 (AT1G28410.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 180.3 bits (456), Expect = 1.8e-45
Identity = 115/281 (40.93%), Positives = 166/281 (59.07%), Query Frame = 1

Query: 9   LYLFPALFL--PLFAVCEIASFARSSNDVLIDDLLEIKLRISRLESVLEGSKQNLTEKGN 68
           L+L    F+  PLF           SN  LI+DL   KLRIS+LE+VLE + Q L  K  
Sbjct: 3   LHLAALFFIISPLFVFSSQVQQQIGSNQNLINDLDSAKLRISQLEAVLEATIQKLDGKTL 62

Query: 69  EILAQERLIEAMSHKIQYLESALFDMKRKTLSHDERIASLEDEVRLLWAVLRKNNFDIHL 128
            +  +E+LI+    +I  L+SA +  K       +RI+ LE+EV+LLWA LR  NF++H+
Sbjct: 63  YLKEREKLIQVAETQILDLQSASYIDKSGLPLVQKRISELEEEVKLLWAALRTTNFELHV 122

Query: 129 LEVKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHLEQALEITKINALKVRQQVALTRCT 188
           LE K  EA+ KL+    +VE+M+ +V+EQWI ++HLEQ  E         R+    +RC 
Sbjct: 123 LEDKAREAKNKLKAKALEVEQMTEVVTEQWIQVQHLEQMREFNN------RRHHTPSRCP 182

Query: 189 FLKLVNTRFINQLQKALQTLNYH-LFSKVPTLSSMVTGAIQYFQGVYEEATKYHHELQRL 248
           F+KL++      L K  +  + H    KV ++   +T A+   + ++   TKYHH+LQ  
Sbjct: 183 FVKLMSDIQRKHLPKVDEAFDIHWKGKKVLSVQPYLTKALSQLKSLWAAVTKYHHQLQGF 242

Query: 249 IKQEMERRDYAAYLVNREVIFFLASAIVTFPIFGAWMFLSS 287
           I+ EMER +  A L NREV+FF+ASA++TFP+FGAWM LSS
Sbjct: 243 IEHEMERTEITAALANREVVFFMASALITFPVFGAWMLLSS 277

BLAST of CmaCh18G003900 vs. TAIR10
Match: AT2G24420.1 (AT2G24420.1 DNA repair ATPase-related)

HSP 1 Score: 58.5 bits (140), Expect = 7.7e-09
Identity = 41/149 (27.52%), Positives = 78/149 (52.35%), Query Frame = 1

Query: 39  DLLEIKLRISRLESVLEGSKQNLTEKGNEILAQERLIEAMSHKIQYLESALFDMKRKTLS 98
           +L ++  +I  LES ++   + L  +   +  +E+L++    K+  LE+ +  +++K  S
Sbjct: 45  ELDQLNAKIRALESQIDDKTKELKGREELVTEKEKLLQERQDKVASLETEVSSLRKKGSS 104

Query: 99  HDERIAS--------LEDEVRLLWAVLRKNNFDIHLLEVKVHEAEEKLEEVTSQVEKMSS 158
               + S        LE +V +L   L + N +  L+E +  E E+KL E+ S+VEK+  
Sbjct: 105 DSVELLSKAQARATELEKQVEVLKKFLEQKNKEKELIEAQTSETEKKLNELNSRVEKLHK 164

Query: 159 IVSEQWIHIRHLEQALEITKINALKVRQQ 180
              EQ   IR LE+AL+I++   L+ + +
Sbjct: 165 TNEEQKNKIRKLERALKISEEEMLRTKHE 193

BLAST of CmaCh18G003900 vs. NCBI nr
Match: gi|449457763|ref|XP_004146617.1| (PREDICTED: uncharacterized protein LOC101213056 isoform X1 [Cucumis sativus])

HSP 1 Score: 421.0 bits (1081), Expect = 1.7e-114
Identity = 230/291 (79.04%), Positives = 247/291 (84.88%), Query Frame = 1

Query: 1   MAAHPIKIL-YLFPALFLPLFAVCEIASFARSSNDVLIDDLLEIKLRISRLESVLEGSKQ 60
           MA   IKIL YLFPA  LPLFAVCEI S  +S ND LI +LL IKLRIS LESVLE SKQ
Sbjct: 1   MAVCSIKILLYLFPAFVLPLFAVCEILSPGQSGNDALIHELLGIKLRISHLESVLEESKQ 60

Query: 61  NLTEKGNEILAQERLIEAMSHKIQYLESALFDMKRKTLSHDERIASLEDEVRLLWAVLRK 120
           NLTEK NE+ AQE+LIE +SHKIQYLESA+ DMKRK  S DERIA LEDEVR LW   RK
Sbjct: 61  NLTEKSNELKAQEKLIEDVSHKIQYLESAISDMKRKISSDDERIAVLEDEVRRLWDAKRK 120

Query: 121 NNFDIHLLEVKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHLEQALEITKINALKVRQQ 180
           NNFDIHLL+ KV EAEEKLEEVTSQVEK SSI+SEQWI IRHLEQALE++KI ALKVRQQ
Sbjct: 121 NNFDIHLLKAKVQEAEEKLEEVTSQVEKKSSIISEQWIQIRHLEQALEMSKIQALKVRQQ 180

Query: 181 VALTRCTFLKLVNTRFINQLQKALQTLNYHLFSKVPTLSSMVTGAIQYFQGVYEEATKYH 240
            ALTRCTF+KLVNTRF NQLQKA QTLN+H+FSKVPTLSS VTGAI YFQ VYEEA KYH
Sbjct: 181 FALTRCTFVKLVNTRFANQLQKAFQTLNHHVFSKVPTLSSRVTGAIHYFQRVYEEAKKYH 240

Query: 241 HELQRLIKQEMERRDYAAYLVNREVIFFLASAIVTFPIFGAWMFLSSWFSR 291
           HELQRLIKQEMER +YAA+L N E+IFFLASA+  FPIFGAWMFLSSWFSR
Sbjct: 241 HELQRLIKQEMERNEYAAHLANPELIFFLASALAIFPIFGAWMFLSSWFSR 291

BLAST of CmaCh18G003900 vs. NCBI nr
Match: gi|778658813|ref|XP_011653316.1| (PREDICTED: uncharacterized protein LOC101213056 isoform X2 [Cucumis sativus])

HSP 1 Score: 310.8 bits (795), Expect = 2.4e-81
Identity = 163/199 (81.91%), Positives = 174/199 (87.44%), Query Frame = 1

Query: 92  MKRKTLSHDERIASLEDEVRLLWAVLRKNNFDIHLLEVKVHEAEEKLEEVTSQVEKMSSI 151
           MKRK  S DERIA LEDEVR LW   RKNNFDIHLL+ KV EAEEKLEEVTSQVEK SSI
Sbjct: 1   MKRKISSDDERIAVLEDEVRRLWDAKRKNNFDIHLLKAKVQEAEEKLEEVTSQVEKKSSI 60

Query: 152 VSEQWIHIRHLEQALEITKINALKVRQQVALTRCTFLKLVNTRFINQLQKALQTLNYHLF 211
           +SEQWI IRHLEQALE++KI ALKVRQQ ALTRCTF+KLVNTRF NQLQKA QTLN+H+F
Sbjct: 61  ISEQWIQIRHLEQALEMSKIQALKVRQQFALTRCTFVKLVNTRFANQLQKAFQTLNHHVF 120

Query: 212 SKVPTLSSMVTGAIQYFQGVYEEATKYHHELQRLIKQEMERRDYAAYLVNREVIFFLASA 271
           SKVPTLSS VTGAI YFQ VYEEA KYHHELQRLIKQEMER +YAA+L N E+IFFLASA
Sbjct: 121 SKVPTLSSRVTGAIHYFQRVYEEAKKYHHELQRLIKQEMERNEYAAHLANPELIFFLASA 180

Query: 272 IVTFPIFGAWMFLSSWFSR 291
           +  FPIFGAWMFLSSWFSR
Sbjct: 181 LAIFPIFGAWMFLSSWFSR 199

BLAST of CmaCh18G003900 vs. NCBI nr
Match: gi|731404498|ref|XP_003633019.2| (PREDICTED: uncharacterized protein LOC100853536 [Vitis vinifera])

HSP 1 Score: 233.4 bits (594), Expect = 5.0e-58
Identity = 123/252 (48.81%), Positives = 179/252 (71.03%), Query Frame = 1

Query: 37  IDDLLEIKLRISRLESVLEGSKQNLTEKGNEILAQERLIEAMSHKIQYLESALFDMKRKT 96
           I +L E+KLRI++LE+V+E   QN  EK   +  +E+LIE  SHKI +L+S L+ +K  +
Sbjct: 44  ICELQEMKLRITQLETVMEEIVQNFNEKYLYLKQREKLIEEFSHKIHHLQSVLYSIKGDS 103

Query: 97  LSHDERIASLEDEVRLLWAVLRKNNFDIHLLEVKVHEAEEKLEEVTSQVEKMSSIVSEQW 156
              +ER+A+LE+EVRLLWA  RKNNFD+H LE K  +AE++L  V+ QVE+++ +V+EQW
Sbjct: 104 SHANERLAALEEEVRLLWAASRKNNFDLHTLESKAQDAEDRLNVVSKQVEQLADVVTEQW 163

Query: 157 IHIRHLEQALEITKINALKVRQQVALTRCTFLKLVNTRFINQLQKALQTLNYHLFSKVPT 216
           I I+ LEQAL++ ++ ALK ++QV++ RCTFLK +N  F N L+K    L+ +LF +  T
Sbjct: 164 IQIQQLEQALQMAELRALKAKRQVSMMRCTFLKFINNLFGNHLEKVFGMLDPYLFGRGST 223

Query: 217 LSSMVTGAIQYFQGVYEEATKYHHELQRLIKQEMERRDYAAYLVNREVIFFLASAIVTFP 276
           LSS  +  +   + ++  A  YHHELQ  IKQEME+ ++ A L N E++FF+ASA++TFP
Sbjct: 224 LSSYKSRFLHQLKRMWSAAKAYHHELQGFIKQEMEKYEFTAALANDELVFFVASALITFP 283

Query: 277 IFGAWMFLSSWF 289
           I GAWM +SS F
Sbjct: 284 IMGAWMLVSSQF 295

BLAST of CmaCh18G003900 vs. NCBI nr
Match: gi|590624225|ref|XP_007025546.1| (Epidermal growth factor receptor substrate 15-like 1, putative [Theobroma cacao])

HSP 1 Score: 231.1 bits (588), Expect = 2.5e-57
Identity = 133/281 (47.33%), Positives = 186/281 (66.19%), Query Frame = 1

Query: 9   LYLFPALFLPLFAVCEIASFARSSNDVLIDDLLEIKLRISRLESVLEGSKQNLTEKGNEI 68
           L L PA+F   F      +   S++++LI +L + KL+ISRLESVLE S QN+  K   +
Sbjct: 5   LILIPAIFSCFFFSLASQNEQNSNHNLLIRELHDAKLKISRLESVLEESVQNMNAKTLYL 64

Query: 69  LAQERLIEAMSHKIQYLESALFDMKRKTLSHDERIASLEDEVRLLWAVLRKNNFDIHLLE 128
             +E+L+E M+ KI YL+S L  +K  +L  DER+  LE+EVRLLWAV RKNNF++H+LE
Sbjct: 65  KEREKLLEEMADKITYLQSTLSSLKDDSLLADERLKDLEEEVRLLWAVSRKNNFELHVLE 124

Query: 129 VKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHLEQALEITKINALKVRQQVALTRCTFL 188
            K  EAE+KLE VT QVEKM+ +V+EQWI I+HLEQAL+I ++ A + R++  + RCTFL
Sbjct: 125 SKAQEAEDKLEVVTLQVEKMAEVVTEQWIQIQHLEQALQIAEMRASQARRERNM-RCTFL 184

Query: 189 KLVNTRFINQLQKALQTLNYHLFSKVPTLSSMVTGAIQYFQGVYEEATKYHHELQRLIKQ 248
           K ++      + K    L  +  SK   ++  ++ A+Q  + ++    KYHHELQ  IKQ
Sbjct: 185 KFISDLSERHVPKMFGALGSNSLSKGSAINYYISQALQQLRRLFSAIKKYHHELQGFIKQ 244

Query: 249 EMERRDYAAYLVNREVIFFLASAIVTFPIFGAWMFLSSWFS 290
           EM R ++ A  VN E++FFLASA++TFPI  AWM L S FS
Sbjct: 245 EMRRNEFTAAFVNDELVFFLASALITFPILSAWMLLLSQFS 284

BLAST of CmaCh18G003900 vs. NCBI nr
Match: gi|297740426|emb|CBI30608.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 229.9 bits (585), Expect = 5.5e-57
Identity = 120/246 (48.78%), Positives = 175/246 (71.14%), Query Frame = 1

Query: 43  IKLRISRLESVLEGSKQNLTEKGNEILAQERLIEAMSHKIQYLESALFDMKRKTLSHDER 102
           +KLRI++LE+V+E   QN  EK   +  +E+LIE  SHKI +L+S L+ +K  +   +ER
Sbjct: 1   MKLRITQLETVMEEIVQNFNEKYLYLKQREKLIEEFSHKIHHLQSVLYSIKGDSSHANER 60

Query: 103 IASLEDEVRLLWAVLRKNNFDIHLLEVKVHEAEEKLEEVTSQVEKMSSIVSEQWIHIRHL 162
           +A+LE+EVRLLWA  RKNNFD+H LE K  +AE++L  V+ QVE+++ +V+EQWI I+ L
Sbjct: 61  LAALEEEVRLLWAASRKNNFDLHTLESKAQDAEDRLNVVSKQVEQLADVVTEQWIQIQQL 120

Query: 163 EQALEITKINALKVRQQVALTRCTFLKLVNTRFINQLQKALQTLNYHLFSKVPTLSSMVT 222
           EQAL++ ++ ALK ++QV++ RCTFLK +N  F N L+K    L+ +LF +  TLSS  +
Sbjct: 121 EQALQMAELRALKAKRQVSMMRCTFLKFINNLFGNHLEKVFGMLDPYLFGRGSTLSSYKS 180

Query: 223 GAIQYFQGVYEEATKYHHELQRLIKQEMERRDYAAYLVNREVIFFLASAIVTFPIFGAWM 282
             +   + ++  A  YHHELQ  IKQEME+ ++ A L N E++FF+ASA++TFPI GAWM
Sbjct: 181 RFLHQLKRMWSAAKAYHHELQGFIKQEMEKYEFTAALANDELVFFVASALITFPIMGAWM 240

Query: 283 FLSSWF 289
            +SS F
Sbjct: 241 LVSSQF 246

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LSE6_CUCSA  1.2e-114  79.04         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G075570 PE=4 SV=1
A0A061GFK7_THECC  1.7e-57   47.33         Epidermal growth factor receptor substrate 15-like 1, putative OS=Theobroma caca...
D7TJD6_VITVI      3.8e-57   48.78         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g00240 PE=4 SV=...
A5AJH7_VITVI      1.6e-55   47.97         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_005069 PE=4 SV=1
A0A0D2UN96_GOSRA  7.5e-53   42.91         Uncharacterized protein OS=Gossypium raimondii GN=B456_011G030100 PE=4 SV=1

BLAST vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT1G28410.1  1.8e-45  40.93         FUNCTIONS IN: molecular_function unknown
AT2G24420.1  7.7e-09  27.52         DNA repair ATPase-related

BLAST vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449457763|ref|XP_004146617.1|  1.7e-114  79.04         PREDICTED: uncharacterized protein LOC101213056 isoform X1 [Cucumis sativus]
gi|778658813|ref|XP_011653316.1|  2.4e-81   81.91         PREDICTED: uncharacterized protein LOC101213056 isoform X2 [Cucumis sativus]
gi|731404498|ref|XP_003633019.2|  5.0e-58   48.81         PREDICTED: uncharacterized protein LOC100853536 [Vitis vinifera]
gi|590624225|ref|XP_007025546.1|  2.5e-57   47.33         Epidermal growth factor receptor substrate 15-like 1, putative [Theobroma cacao]
gi|297740426|emb|CBI30608.3|      5.5e-57   48.78         unnamed protein product [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh18G003900.1  CmaCh18G003900.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  unknown  Coil           Coil                 coord: 124..151; score: ...
None      No IPR available  PANTHER  PTHR34360      FAMILY NOT NAMED     coord: 14..289; score: 1.4
None      No IPR available  PANTHER  PTHR34360:SF2  SUBFAMILY NOT NAMED  coord: 14..289; score: 1.4

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh18G003900  CmaCh13G006260  Cucurbita maxima (Rimu)  cmacmaB216