Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
TTTCTTAGTTGAATCTCTGTGGCTTTAGTTCGTGTTTATCTGTTGGATGGTTTCTCATATTAAAATATAAATTGGTTCTTCGTTCGAATCTGTTGATTCTTGCCGCTCCACCGTTCTTTTGGCTTTTTTAAAGGTAACTTCCCTTTTCCTTACGCGTTATTCTCTATGGATGAGCTTGCTGGAATTAGTTTACTCGTTGTTCCATTACAGATTACGAAGTGATTGTTTTGCTGTAATTATCCGTTCTTGCTGGTTCTTTTACGCTGTTTTTATGCTTTATTGCTAGTTACAGTGCTGGATTTAGGGTTTTTAACATTTTCTGGTCCTGCTTTTGGGGTTTTAGATTGTGGATTTCCTATTTCATCTGTTAGAGTGTTTAAATGATCGAGAAAAAGTTTGAGGAGTTTGAGATTTGAGGTTTTTTTTTTTATACGTAATGGTGGAGATGTGAGATTGTTCTCAATTTTGGGGAAAATGGATTCATTCAAAGGATTGGGAATTGTGTTGAGCTTTGTTTTTGGATTCCTTCTATTTGCTCTCTTTGCAGAGCTTTTCTATTTTCTTTGGTGGAAGAAAAGAAGATTCACTATAGTAGATGAAGATAAATTCACCAAATATGTAAAGAAATCCTTTCGCCATATTTGTTTGAATTGGAAAAGGAGTTCTTCTTCTTCTTCTTCTTCTTCTCCTCTGCAATTCAACGATTCTAGAGGAAATCAAATGAATTCAGAGCTGAGAAATCAAGAATCTGAGATTGAAAATGGCTGCAACAAGGATTTGTTGTTGTTGAAGTCAGGTGGAGGAGATGGGGCGGCGGGGGGGTTAAATCCGGCGAGGCTGCATAATCTTTTAGGGCCACAGAGCTTTCTCTGGTCCATTAAAGAGGAAACCAAAGAAGATTTAGAGTCTGAAGATAGAAGTAGGAAGGGATCGAGAGCAAGAAGCTTAAGTGATATGTCTCATTTTGTGTCCCCTTTGCCTTCTCCTCCATTGAAAGCACCATGTCCATTAAACCCTTTCAGTTCTTTCAAACACCCTGAATTAAATATAAACACTCTGACTCTCTTTGAATCATCAGCAGAATTAGATTTGAACATGGGATTACCTTCATCTCCTTCAAAATTTAGGTTCCTAAGGGAGGCAGAAGAGAAACTGTACAGAAGATTAACTGAAGAAGCTCTGCATAAGGCCTGTAAAAATGGTGGGTCAGCTCAGAATTCTGAAACTAAAGCCATTTCCAATCCCAAAATGATAGTTGTAGACGATGACGACGATAACGACAACAACCGGTTTTGGTGTACAAATGAGAAAAAGCCTCAATATCATCAAATGTCACAGTGTTCTTCAAGTTCTTCACAGGTTCTTCCTCTAGCTTCTTCTCCTTCAGGGCATAGATTTGATAATGTTTCATTACATAGGTCTTAAACTTATCTTAAAGCTTAATTGTAAGTTAGTAAATCAAAATCTTCCTTTCACGTTTTTGTCTTTTTTTTCAGGGCGGAGATTGAATCACAGGTATTGATAATTGGTCTTTTATCCACTATGCTCGACTTCTACTAAAAAGAAAAAAACATTTTTTTTCTTTAGTCTATGTCCACAAGTGTTACTTGAACATAGTGGATTATAAAATATTTGATGTAAATTTTGTATTTAGCATAGAAG
mRNA sequence
TTTCTTAGTTGAATCTCTGTGGCTTTAGTTCGTGTTTATCTGTTGGATGGTTTCTCATATTAAAATATAAATTGGTTCTTCGTTCGAATCTGTTGATTCTTGCCGCTCCACCGTTCTTTTGGCTTTTTTAAAGGTAACTTCCCTTTTCCTTACGCGTTATTCTCTATGGATGAGCTTGCTGGAATTAGTTTACTCGTTGTTCCATTACAGATTACGAAGTGATTGTTTTGCTGTAATTATCCGTTCTTGCTGGTTCTTTTACGCTGTTTTTATGCTTTATTGCTAGTTACAGTGCTGGATTTAGGGTTTTTAACATTTTCTGGTCCTGCTTTTGGGGTTTTAGATTGTGGATTTCCTATTTCATCTGTTAGAGTGTTTAAATGATCGAGAAAAAGTTTGAGGAGTTTGAGATTTGAGGTTTTTTTTTTTATACGTAATGGTGGAGATGTGAGATTGTTCTCAATTTTGGGGAAAATGGATTCATTCAAAGGATTGGGAATTGTGTTGAGCTTTGTTTTTGGATTCCTTCTATTTGCTCTCTTTGCAGAGCTTTTCTATTTTCTTTGGTGGAAGAAAAGAAGATTCACTATAGTAGATGAAGATAAATTCACCAAATATGTAAAGAAATCCTTTCGCCATATTTGTTTGAATTGGAAAAGGAGTTCTTCTTCTTCTTCTTCTTCTTCTCCTCTGCAATTCAACGATTCTAGAGGAAATCAAATGAATTCAGAGCTGAGAAATCAAGAATCTGAGATTGAAAATGGCTGCAACAAGGATTTGTTGTTGTTGAAGTCAGGTGGAGGAGATGGGGCGGCGGGGGGGTTAAATCCGGCGAGGCTGCATAATCTTTTAGGGCCACAGAGCTTTCTCTGGTCCATTAAAGAGGAAACCAAAGAAGATTTAGAGTCTGAAGATAGAAGTAGGAAGGGATCGAGAGCAAGAAGCTTAAGTGATATGTCTCATTTTGTGTCCCCTTTGCCTTCTCCTCCATTGAAAGCACCATGTCCATTAAACCCTTTCAGTTCTTTCAAACACCCTGAATTAAATATAAACACTCTGACTCTCTTTGAATCATCAGCAGAATTAGATTTGAACATGGGATTACCTTCATCTCCTTCAAAATTTAGGTTCCTAAGGGAGGCAGAAGAGAAACTGTACAGAAGATTAACTGAAGAAGCTCTGCATAAGGCCTGTAAAAATGGTGGGTCAGCTCAGAATTCTGAAACTAAAGCCATTTCCAATCCCAAAATGATAGTTGTAGACGATGACGACGATAACGACAACAACCGGTTTTGGTGTACAAATGAGAAAAAGCCTCAATATCATCAAATGTCACAGTGTTCTTCAAGTTCTTCACAGGGCGGAGATTGAATCACAGGTATTGATAATTGGTCTTTTATCCACTATGCTCGACTTCTACTAAAAAGAAAAAAACATTTTTTTTCTTTAGTCTATGTCCACAAGTGTTACTTGAACATAGTGGATTATAAAATATTTGATGTAAATTTTGTATTTAGCATAGAAG
Coding sequence (CDS)
ATGGATTCATTCAAAGGATTGGGAATTGTGTTGAGCTTTGTTTTTGGATTCCTTCTATTTGCTCTCTTTGCAGAGCTTTTCTATTTTCTTTGGTGGAAGAAAAGAAGATTCACTATAGTAGATGAAGATAAATTCACCAAATATGTAAAGAAATCCTTTCGCCATATTTGTTTGAATTGGAAAAGGAGTTCTTCTTCTTCTTCTTCTTCTTCTCCTCTGCAATTCAACGATTCTAGAGGAAATCAAATGAATTCAGAGCTGAGAAATCAAGAATCTGAGATTGAAAATGGCTGCAACAAGGATTTGTTGTTGTTGAAGTCAGGTGGAGGAGATGGGGCGGCGGGGGGGTTAAATCCGGCGAGGCTGCATAATCTTTTAGGGCCACAGAGCTTTCTCTGGTCCATTAAAGAGGAAACCAAAGAAGATTTAGAGTCTGAAGATAGAAGTAGGAAGGGATCGAGAGCAAGAAGCTTAAGTGATATGTCTCATTTTGTGTCCCCTTTGCCTTCTCCTCCATTGAAAGCACCATGTCCATTAAACCCTTTCAGTTCTTTCAAACACCCTGAATTAAATATAAACACTCTGACTCTCTTTGAATCATCAGCAGAATTAGATTTGAACATGGGATTACCTTCATCTCCTTCAAAATTTAGGTTCCTAAGGGAGGCAGAAGAGAAACTGTACAGAAGATTAACTGAAGAAGCTCTGCATAAGGCCTGTAAAAATGGTGGGTCAGCTCAGAATTCTGAAACTAAAGCCATTTCCAATCCCAAAATGATAGTTGTAGACGATGACGACGATAACGACAACAACCGGTTTTGGTGTACAAATGAGAAAAAGCCTCAATATCATCAAATGTCACAGTGTTCTTCAAGTTCTTCACAGGGCGGAGATTGA
Protein sequence
MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNWKRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPARLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLNPFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKACKNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD
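The protein sequence above appears to be the frame-0 translation of the CDS under the standard genetic code. The sketch below is one way to check that, not the database's own pipeline. The `translate` helper and the two short excerpts (`cds_start`, `cds_end`, copied from the first and last codons of the CDS) are illustrative names introduced here.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts from the CDS listed above (hypothetical variable names).
cds_start = "ATGGATTCATTCAAAGGATTG"  # first seven codons
cds_end = "CAGGGCGGAGATTGA"          # last four codons plus the stop codon

print(translate(cds_start))  # MDSFKGL
print(translate(cds_end))    # QGGD
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above if the annotation is consistent.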
Homology
BLAST of CmaCh20G004130 vs. ExPASy TrEMBL
Match:
A0A6J1J8W7 (uncharacterized protein LOC111484511 OS=Cucurbita maxima OX=3661 GN=LOC111484511 PE=4 SV=1)
HSP 1 Score: 584.7 bits (1506), Expect = 2.2e-163
Identity = 298/298 (100.00%), Positives = 298/298 (100.00%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW
Sbjct: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
Query: 61 KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA 120
KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA
Sbjct: 61 KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA 120
Query: 121 RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN 180
RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN
Sbjct: 121 RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN 180
Query: 181 PFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC 240
PFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC
Sbjct: 181 PFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC 240
Query: 241 KNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 299
KNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD
Sbjct: 241 KNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 298
BLAST of CmaCh20G004130 vs. ExPASy TrEMBL
Match:
A0A6J1FYI2 (uncharacterized protein LOC111448700 OS=Cucurbita moschata OX=3662 GN=LOC111448700 PE=4 SV=1)
HSP 1 Score: 552.4 bits (1422), Expect = 1.2e-153
Identity = 284/299 (94.98%), Positives = 291/299 (97.32%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFT+ DEDKFTKYVKK FRHICLNW
Sbjct: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTLEDEDKFTKYVKKFFRHICLNW 60
Query: 61 KRSSSSSSS-SSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNP 120
KRSSSSS S SSPLQFNDSRGNQ+NSELRNQESEIENGCNKDLLLLKSGGGDGAAG L+P
Sbjct: 61 KRSSSSSPSFSSPLQFNDSRGNQLNSELRNQESEIENGCNKDLLLLKSGGGDGAAGALDP 120
Query: 121 ARLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL 180
ARL+NLLGP SFL SIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL
Sbjct: 121 ARLYNLLGPPSFLCSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL 180
Query: 181 NPFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA 240
NPFSSFKHPE NINT+TLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA
Sbjct: 181 NPFSSFKHPEFNINTVTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA 240
Query: 241 CKNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 299
CKNGGSAQ+SETKA+SNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD
Sbjct: 241 CKNGGSAQDSETKALSNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 299
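The Identity figure in each HSP header is the number of alignment columns in which query and subject carry the same residue, divided by the alignment length. As a rough illustration, the sketch below counts identities over the first 60-column block of the Cucurbita moschata alignment above. `count_identities` is a hypothetical helper written for this example, not part of any BLAST distribution.

```python
from typing import Tuple


def count_identities(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # A column counts as identical only when both rows hold the same residue;
    # gap characters ('-') never count.
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return identical, len(query)


# First block of the alignment against A0A6J1FYI2 (columns 1-60).
query_row = "MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW"
sbjct_row = "MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTLEDEDKFTKYVKKFFRHICLNW"

identical, length = count_identities(query_row, sbjct_row)
print(identical, length)  # 57 60

# The header percentage for the whole HSP follows from its reported counts.
print(f"{100 * 284 / 299:.2f}")  # 94.98
```

Summing the per-block counts over all five blocks of an HSP should give the numerator reported in its header.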
BLAST of CmaCh20G004130 vs. ExPASy TrEMBL
Match:
A0A6J1D323 (uncharacterized protein LOC111017117 OS=Momordica charantia OX=3673 GN=LOC111017117 PE=4 SV=1)
HSP 1 Score: 294.3 bits (752), Expect = 5.8e-76
Identity = 185/301 (61.46%), Positives = 214/301 (71.10%), Query Frame = 0
Query: 3 SFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTI-VDEDKFTKYVKKSFRHICLNWK 62
SF GLGI LS VFG LLFAL AEL+Y LWWKK F+ V++D+FT Y K+ F IC W+
Sbjct: 4 SFSGLGIGLSLVFGCLLFALVAELYYLLWWKKSSFSSEVEDDEFTNYAKEFFHLIC--WE 63
Query: 63 RSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPAR 122
R+ +SSSS LQ N+SR N+ N +LRNQE ++E G +KDLLL SGG DG L R
Sbjct: 64 RAPASSSS---LQPNNSRPNETNPQLRNQEPDMEIGSSKDLLLKSSGGEDGVEVEL--MR 123
Query: 123 LHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDM-----SHFVSPLPSPPLKAP 182
LHNL GP FL++IKEETKEDLESEDRSRKGSR RSLSD+ + F++PLPSPPLKAP
Sbjct: 124 LHNLAGPPRFLFTIKEETKEDLESEDRSRKGSRTRSLSDLILTADTPFLTPLPSPPLKAP 183
Query: 183 CPLNPFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEAL 242
PLNPF S+KH NIN LFESS EL+LN L S P KFRFLREAEEKLYRRL EE
Sbjct: 184 SPLNPFGSYKHHGFNIN--PLFESSTELELNRLLSSPPPKFRFLREAEEKLYRRLMEETQ 243
Query: 243 HKACK--NGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSS 296
KA + N GSAQNSETKAIS PK ++++ F T EK+PQ H MSQ SSSSS
Sbjct: 244 KKASRSNNDGSAQNSETKAISKPKTAAEEEEEG-----FCFTEEKEPQNHHMSQFSSSSS 290
BLAST of CmaCh20G004130 vs. ExPASy TrEMBL
Match:
A0A6J1KZH4 (uncharacterized protein LOC111499670 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111499670 PE=4 SV=1)
HSP 1 Score: 290.0 bits (741), Expect = 1.1e-74
Identity = 183/300 (61.00%), Positives = 206/300 (68.67%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
M SF GLGI LS VFG LLFAL AEL+Y LWWKKR F ED+FT Y K+ F IC W
Sbjct: 1 MISFSGLGIGLSLVFGCLLFALVAELYYLLWWKKRSFNSEVEDEFTSYAKEFFHLIC--W 60
Query: 61 KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA 120
+RSSSSSSS +Q N+S N E+RNQE +IE G +KDLLL SGG DG L
Sbjct: 61 RRSSSSSSS---IQANNS---LRNPEIRNQEPDIEIGSSKDLLLKSSGGEDGVE--LELM 120
Query: 121 RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDM-----SHFVSPLPSPPLKA 180
RLHNL GP FL++IKEETKEDLESEDRSRKGSR RSLSD+ + F++PLPSPP
Sbjct: 121 RLHNLAGPPRFLFTIKEETKEDLESEDRSRKGSRTRSLSDLILALDTPFLTPLPSPPFNP 180
Query: 181 PCPLNPFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEA 240
P +PFSSFKH NIN LFESS + DLN + S P KFRFLREAEEKLYRRL EE+
Sbjct: 181 P---SPFSSFKHHGFNIN--PLFESSTDFDLNRLISSPPPKFRFLREAEEKLYRRLMEES 240
Query: 241 LHKACKNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQ 296
KACKNGGS QN ET AISNP D++DD F C EK+ Q H +SQ SSSSSQ
Sbjct: 241 QKKACKNGGSIQNPETPAISNPIAAAEDEEDD----EFCCRKEKELQNHYVSQYSSSSSQ 281
BLAST of CmaCh20G004130 vs. ExPASy TrEMBL
Match:
A0A6J1KXJ9 (uncharacterized protein LOC111499670 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111499670 PE=4 SV=1)
HSP 1 Score: 290.0 bits (741), Expect = 1.1e-74
Identity = 183/300 (61.00%), Positives = 206/300 (68.67%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
M SF GLGI LS VFG LLFAL AEL+Y LWWKKR F ED+FT Y K+ F IC W
Sbjct: 1 MISFSGLGIGLSLVFGCLLFALVAELYYLLWWKKRSFNSEVEDEFTSYAKEFFHLIC--W 60
Query: 61 KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA 120
+RSSSSSSS +Q N+S N E+RNQE +IE G +KDLLL SGG DG L
Sbjct: 61 RRSSSSSSS---IQANNS---LRNPEIRNQEPDIEIGSSKDLLLKSSGGEDGVE--LELM 120
Query: 121 RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDM-----SHFVSPLPSPPLKA 180
RLHNL GP FL++IKEETKEDLESEDRSRKGSR RSLSD+ + F++PLPSPP
Sbjct: 121 RLHNLAGPPRFLFTIKEETKEDLESEDRSRKGSRTRSLSDLILALDTPFLTPLPSPPFNP 180
Query: 181 PCPLNPFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEA 240
P +PFSSFKH NIN LFESS + DLN + S P KFRFLREAEEKLYRRL EE+
Sbjct: 181 P---SPFSSFKHHGFNIN--PLFESSTDFDLNRLISSPPPKFRFLREAEEKLYRRLMEES 240
Query: 241 LHKACKNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQ 296
KACKNGGS QN ET AISNP D++DD F C EK+ Q H +SQ SSSSSQ
Sbjct: 241 QKKACKNGGSIQNPETPAISNPIAAAEDEEDD----EFCCRKEKELQNHYVSQYSSSSSQ 281
BLAST of CmaCh20G004130 vs. NCBI nr
Match:
XP_022986917.1 (uncharacterized protein LOC111484511 [Cucurbita maxima])
HSP 1 Score: 584.7 bits (1506), Expect = 4.4e-163
Identity = 298/298 (100.00%), Positives = 298/298 (100.00%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW
Sbjct: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
Query: 61 KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA 120
KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA
Sbjct: 61 KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA 120
Query: 121 RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN 180
RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN
Sbjct: 121 RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN 180
Query: 181 PFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC 240
PFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC
Sbjct: 181 PFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC 240
Query: 241 KNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 299
KNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD
Sbjct: 241 KNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 298
BLAST of CmaCh20G004130 vs. NCBI nr
Match:
XP_022944170.1 (uncharacterized protein LOC111448700 [Cucurbita moschata])
HSP 1 Score: 552.4 bits (1422), Expect = 2.4e-153
Identity = 284/299 (94.98%), Positives = 291/299 (97.32%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFT+ DEDKFTKYVKK FRHICLNW
Sbjct: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTLEDEDKFTKYVKKFFRHICLNW 60
Query: 61 KRSSSSSSS-SSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNP 120
KRSSSSS S SSPLQFNDSRGNQ+NSELRNQESEIENGCNKDLLLLKSGGGDGAAG L+P
Sbjct: 61 KRSSSSSPSFSSPLQFNDSRGNQLNSELRNQESEIENGCNKDLLLLKSGGGDGAAGALDP 120
Query: 121 ARLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL 180
ARL+NLLGP SFL SIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL
Sbjct: 121 ARLYNLLGPPSFLCSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL 180
Query: 181 NPFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA 240
NPFSSFKHPE NINT+TLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA
Sbjct: 181 NPFSSFKHPEFNINTVTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA 240
Query: 241 CKNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 299
CKNGGSAQ+SETKA+SNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD
Sbjct: 241 CKNGGSAQDSETKALSNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 299
BLAST of CmaCh20G004130 vs. NCBI nr
Match:
KAG7010561.1 (hypothetical protein SDJN02_27355, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 552.0 bits (1421), Expect = 3.2e-153
Identity = 284/299 (94.98%), Positives = 292/299 (97.66%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFT+ DEDKFTKYVKK FRHICLNW
Sbjct: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTLEDEDKFTKYVKKFFRHICLNW 60
Query: 61 KRSSSSSSS-SSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNP 120
KRSSSSSSS SSPLQFNDSRGNQ+NSELRNQESEIENGCNKDLLLLK+GGGDGAAG L+P
Sbjct: 61 KRSSSSSSSFSSPLQFNDSRGNQLNSELRNQESEIENGCNKDLLLLKAGGGDGAAGALDP 120
Query: 121 ARLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL 180
ARL+NLLGP SFL SIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL
Sbjct: 121 ARLYNLLGPPSFLCSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL 180
Query: 181 NPFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA 240
NPFSSFKHPE NINT+TLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA
Sbjct: 181 NPFSSFKHPEFNINTVTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA 240
Query: 241 CKNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 299
CKNGGSAQ+SETKAISNPKMIVVDDDDD+DNNRFWCTNEKKPQYHQMSQCSSSSSQGGD
Sbjct: 241 CKNGGSAQDSETKAISNPKMIVVDDDDDSDNNRFWCTNEKKPQYHQMSQCSSSSSQGGD 299
BLAST of CmaCh20G004130 vs. NCBI nr
Match:
KAG6570715.1 (hypothetical protein SDJN03_29630, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 545.0 bits (1403), Expect = 3.9e-151
Identity = 281/295 (95.25%), Positives = 287/295 (97.29%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFT+ DEDKFTKYVKK FRHICLNW
Sbjct: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTLEDEDKFTKYVKKFFRHICLNW 60
Query: 61 KRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNPA 120
KR SSSSS SSPLQFNDSRGNQ+NSELRNQESEIENGCNKDLLLLKSGGGDGAAG L+PA
Sbjct: 61 KR-SSSSSFSSPLQFNDSRGNQLNSELRNQESEIENGCNKDLLLLKSGGGDGAAGALDPA 120
Query: 121 RLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN 180
RL+NLLGP SFL SIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN
Sbjct: 121 RLYNLLGPPSFLCSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPLN 180
Query: 181 PFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC 240
PFSSFKHPE NINT+TLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC
Sbjct: 181 PFSSFKHPEFNINTVTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKAC 240
Query: 241 KNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQ 296
KNGGSAQ+SETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQ
Sbjct: 241 KNGGSAQDSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQ 294
BLAST of CmaCh20G004130 vs. NCBI nr
Match:
XP_023512262.1 (uncharacterized protein LOC111777053 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 535.4 bits (1378), Expect = 3.1e-148
Identity = 279/296 (94.26%), Positives = 286/296 (96.62%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDEDKFTKYVKKSFRHICLNW 60
MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFT DEDKFTKYVKK FRHICLNW
Sbjct: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTSEDEDKFTKYVKKFFRHICLNW 60
Query: 61 KRSSSSSSS-SSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGLNP 120
KR+SSSSSS SSPLQFNDSRGNQ+NSELRNQESEIENGCNKDLLLLKSGGGDGAAG L+P
Sbjct: 61 KRNSSSSSSFSSPLQFNDSRGNQLNSELRNQESEIENGCNKDLLLLKSGGGDGAAGALDP 120
Query: 121 ARLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL 180
ARL+NLLGP SFL SIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL
Sbjct: 121 ARLYNLLGPPSFLCSIKEETKEDLESEDRSRKGSRARSLSDMSHFVSPLPSPPLKAPCPL 180
Query: 181 NPFSSFKHPELNINTLTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA 240
NPFSSFKHPE NINT+TLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA
Sbjct: 181 NPFSSFKHPEFNINTVTLFESSAELDLNMGLPSSPSKFRFLREAEEKLYRRLTEEALHKA 240
Query: 241 CKNGGSAQNSETKAISNPKMIVVDDDDDNDNNRFWCTNEKKPQYHQMSQCSSSSSQ 296
CKNGGSAQ+SETKAISNPKMIVV DD+DNDNNRFWCTNEKKPQYH MSQCSSSSSQ
Sbjct: 241 CKNGGSAQDSETKAISNPKMIVV-DDNDNDNNRFWCTNEKKPQYHHMSQCSSSSSQ 295
BLAST of CmaCh20G004130 vs. TAIR 10
Match:
AT5G59350.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 91.7 bits (226), Expect = 1.1e-18
Identity = 92/254 (36.22%), Positives = 126/254 (49.61%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIV---------DEDKFTKYVKK 60
M + GLGI LS +FGFLL AL E++Y L KK + ++ +E + Y K+
Sbjct: 1 MKTISGLGIGLSLMFGFLLLALVGEVYYLLRCKKHKKRVISQESEEEKEEEQQQNGYAKE 60
Query: 61 SFRHICLNWKRSSSSSSSSSPLQFNDSRGNQMNSELRNQES--EIENGCNKDLLLLKSGG 120
+ C P + G + N++ ++E G K L +GG
Sbjct: 61 LIQLFCF-----------KKPQSLQQNNGGREGEVSMNEDGNPDLELGLMKHL----NGG 120
Query: 121 GDGAAGGLNPARLHNLLGPQSFLWSIKEETKEDLESED--RSRKGSRA--RSLSDMSHFV 180
G L +LHN Q FL++I EETK DLESED +SR GSR+ RSLSD+ +
Sbjct: 121 DLGFEAEL--MKLHN----QRFLFTIMEETKADLESEDGGKSRLGSRSRRRSLSDVPNDC 180
Query: 181 SPLPSPPLKAP-CPLNPFSSFKHPELNINTLTLFESSAELDLNMGLPSS---PSKFRFLR 236
+ PL +P +P S+ H N LFES EL+ N SS P KF+F+R
Sbjct: 181 NTPGFTPLASPKKSSSPLESYPHHGFN----PLFESDGELEFNKFFRSSSSPPPKFKFMR 229
BLAST of CmaCh20G004130 vs. TAIR 10
Match:
AT2G39560.1 (Putative membrane lipoprotein )
HSP 1 Score: 72.0 bits (175), Expect = 9.0e-13
Identity = 84/259 (32.43%), Positives = 125/259 (48.26%), Query Frame = 0
Query: 1 MDSFKGLGIVLSFVFGFLLFALFAELFYFLWWKKRRFTIVDE---DKFTKYVKKSFRHIC 60
M S +G+ LS VFG LL AL AEL+Y LW KKR T + D T ++ C
Sbjct: 1 MRSLSSVGLALSIVFGCLLLALLAELYYLLWCKKRSTTRRPDFRNDYSTPGTRELLFIFC 60
Query: 61 LNWKRSSSSSSSSSPLQFNDSRGNQMNSELRNQESEIENGCNKDLLLLKSGGGDGAAGGL 120
+ SS++ SSSSP + S ++++ Q+ + NG ++ GG G
Sbjct: 61 CS---SSTNPSSSSPSSSSFSNPKPIDTQ---QQCPLNNG-------FENVGGPG----- 120
Query: 121 NPARLHNLLGPQSFLWSIKEETKEDLESEDRSRKGSRARSLSDM----------SHFVSP 180
L P+ FL++I EET E++ESED ++ +SL+D+ +++P
Sbjct: 121 --------LVPR-FLFTIMEETVEEMESED--VVSTKGKSLNDLFLNMESGVITPPYLTP 180
Query: 181 LPSPPLKAPCPLNPFSSFKHPELNINTLTLFESSAELDLNMGLPSSP------------S 235
SP L P PL P +LFESS++ + N + SSP S
Sbjct: 181 RASPSLFTP-PLTPLLMESCNGRKEEISSLFESSSDAEFNRLVRSSPLSSSHSPSSSPLS 229
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1J8W7 | 2.2e-163 | 100.00 | uncharacterized protein LOC111484511 OS=Cucurbita maxima OX=3661 GN=LOC111484511...
A0A6J1FYI2 | 1.2e-153 | 94.98 | uncharacterized protein LOC111448700 OS=Cucurbita moschata OX=3662 GN=LOC1114487...
A0A6J1D323 | 5.8e-76 | 61.46 | uncharacterized protein LOC111017117 OS=Momordica charantia OX=3673 GN=LOC111017...
A0A6J1KZH4 | 1.1e-74 | 61.00 | uncharacterized protein LOC111499670 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1KXJ9 | 1.1e-74 | 61.00 | uncharacterized protein LOC111499670 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022986917.1 | 4.4e-163 | 100.00 | uncharacterized protein LOC111484511 [Cucurbita maxima]
XP_022944170.1 | 2.4e-153 | 94.98 | uncharacterized protein LOC111448700 [Cucurbita moschata]
KAG7010561.1 | 3.2e-153 | 94.98 | hypothetical protein SDJN02_27355, partial [Cucurbita argyrosperma subsp. argyro...
KAG6570715.1 | 3.9e-151 | 95.25 | hypothetical protein SDJN03_29630, partial [Cucurbita argyrosperma subsp. sorori...
XP_023512262.1 | 3.1e-148 | 94.26 | uncharacterized protein LOC111777053 [Cucurbita pepo subsp. pepo]
TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G59350.1 | 1.1e-18 | 36.22 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G39560.1 | 9.0e-13 | 32.43 | Putative membrane lipoprotein
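The summary rows above are pipe-delimited, so they can be loaded and filtered with a few lines of code. The sketch below is a minimal example under that assumption. The function names `parse_hits` and `filter_by_evalue` and the cut-off of 1e-50 are illustrative choices, not values taken from this page.

```python
from typing import Dict, List

# A few rows copied from the summary tables above.
SUMMARY = """\
Match Name | E-value | Identity | Description
A0A6J1J8W7 | 2.2e-163 | 100.00 | uncharacterized protein LOC111484511 OS=Cucurbita maxima OX=3661 GN=LOC111484511...
A0A6J1D323 | 5.8e-76 | 61.46 | uncharacterized protein LOC111017117 OS=Momordica charantia OX=3673 GN=LOC111017...
AT2G39560.1 | 9.0e-13 | 32.43 | Putative membrane lipoprotein
"""


def parse_hits(text: str) -> List[Dict[str, object]]:
    """Parse pipe-delimited hit rows, skipping header and blank lines."""
    hits = []
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 4 or fields[0] in ("", "Match Name"):
            continue
        hits.append({
            "name": fields[0],
            "evalue": float(fields[1]),    # e.g. "2.2e-163"
            "identity": float(fields[2]),  # percent identity
            "description": fields[3],
        })
    return hits


def filter_by_evalue(hits: List[Dict[str, object]], max_evalue: float) -> List[str]:
    """Return the names of hits whose E-value is at or below the cut-off."""
    return [hit["name"] for hit in hits if hit["evalue"] <= max_evalue]


hits = parse_hits(SUMMARY)
print(filter_by_evalue(hits, 1e-50))  # ['A0A6J1J8W7', 'A0A6J1D323']
```

Only the first four fields of each row are read, so rows that carry extra trailing columns parse the same way.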