Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
TCAGCATAGGCGAATTATCGTCCCCATTGCAGCATAGGCGAGAGCGGATAAGCCAACGAAAGGAAGAAATGTCTGCAACAATTCAGCACTCATTTCTTTCTCCTTCAAATCCAATTCATCTCTCCGCAATCTCTACAAACTTTCTCAAATCACACCCTTCCTCCATGATCATCTCGCACAAACCTTTAGGTTTTTCAACACCGCGTCTCAAAATCGTGAAATGTATCTCGGAATCGAACGATGAAGCTCAAAACAATGTCAATTTGAATAATGCTTTGTCCAGTATGGTCGGTGAGCAGATTGAAGAACTTTTGAACAAGGAAGAGAATAGGAGTTTGCTCGATGGCTTGGAGAAGGCGTCGATGAGAGTTGAAATCGCCAAAAGGCAGTTGGCTGAGATCGAGAAACAGGAACTGGAGCTCAAACGATTCAAGGATTATATTAACCAGCTCGAAAGTAGAGCATCGGAGGTATTTAATTAATCTCTAATTAATTTCTTGGTGCGACAAAGCATTCCTTTGTTATCTGTTTAAGTGCAAACCGAGAACCTGAGATGCTATTAGATTCAATGATTATATTAGCCAGCTTGAAAACATAGCTTCGGAGGTTTGTTAATCTCTTGATACTCCTGTTATGCAATGAAGTGTTCTTTCTTTGATTATCTATCTTAATGCAAACCAAGAAATCAAGGATGCTCTAGGATTCAAGGATTATATTAACCAACTCGCGGTTTGTTTAATTAATCTGTTACATATTCTTGTTGTGTGATAAAACACTCTTTTGTAATTTGCCTGTGAGGATGCTCTAGGAATCAAGGATTATATTAACTCCCTGAAAACGAAGCTTCAGAAGTTTGTTAATCAATCTCTTTCGTACTCTTGTTGAATCTGTTTGAATGCAAACCAAGAAATTGAGGTTGCTGTAGGGTGTGCATTTTGCTAATTTCTTTTGTTCGTGGCTGTTTTGAATTGTATTTAGTGAAATGGGAATCCTTGCTTGTATGGTATGTATGCGTTAACAATTATTATAGCAAATTTGTAACAGCCCAAGCCCACCGCTAGCGGATATTGTCCTCTTTTGGCTTTCTCTTTCGGGCTTTCCCTCGAGGTTTTTAGAACGCATGTGCTAGGGAGAGGTTTCCACACTTATAAAGAATGTTTCGTTCTCCTCCCCAACCGATGTGTAATCTCACAATCCATCCCCCTTCGGGGCCAGCGTCCTTGCTGGCACACCGCTTAGTGTCTAGCTCTGATACTATTTGTAACCACCCAAGCCCACCGCTAGCAGATATGGTCCTCTTTAGGCTTTCTCTTTCGGGCTTCCTCTCAAGGTTTTTAAAACGCATCTGCTAGGGGAGGTTTCCACACACTTATAAAGAATGTTTTGTTCTCCTCCCTAACCGATGTGGAATCTCACAATCCACCCCCCTTCGGGGCCCAGTGTCCTTGCTAGCACACCGCCTCATGTCCACCCTCCTTGCTGGCACATCGCCCGATGTATAGCTCTGATACCATTTGTAACTGCCCAAGCTTAACGCTAGCAGATATTGTCTTCTTTTGGTTTTCTCTTTCGGGCTTCCCCTCGAGGTTTTTAAAACGCGTCTGCTAGAGAGAAGTTTCCACACACTTATAAAGAAAGTTCTGTTCTCCTCCCCAACCGATGTGGGATCTCACAAAATTTCAGCTGAGAGACCGTGAACTCGTGGTTGATTCAAGTGAGGAAAAGGAAATGGTCAACCATATTTGATCCCGTGCAAAATTAAGCCAATTTTTATTACCTTTTCATTTGAGGAAGGACCCAATAGCCTAGTAACCCTCCTCAATCGACCATTTTGAACTGTTTTATAACCTATAGGCAGGGTAGCAACCAAAATCCATTTTGAACTGTTTTATAACCTATAGGCTGGGTAGCAACCAAAATCCATCACTTATCGAAGTGAATTATAAGTGCTGTTTCCAAGTGACAATAATATTGTTGTCAACTACAAATAACAGTAACAT
CTGGGTCACTTTGCTCATTCAAGATTGAAGAATGTCAAAAGGAGATATTGGAGGCAAGAGGGATGATCGAGGAGGCCGAGCGCTCTCTCTCACAAAGCGAGGGTGGAAATGCAAAAAGAGACGAGGAAGACGGAGGAATTGACAGAGATGAAGAGAGATTGGAATCTGTAAAAGCAGCGTCCATATCAGCCATTGTTGGCACACTTGCAGGGCTGCCTATCTTTCTTAATCAAGTGACCAGCATTTCTCAGCTTGCACTCCCTACGGCAATCACCTTTATTAGCTGTGCCTTGTTTGGAGTTACATTCCGATACACAATAAGGAGAGACTTGGATAATATTCAGCTCAAGACTGGAACATCTGCCGCTTTCGGTTTCGTTAAAGGTATGAAAAATAGCGTCTTGAATAAAGAATTGAGTTATCTAAATCCATTGAGGAAGTTATATTGTTGATAAATGCAGGTCTTGCAACACTAGATGGTGGAGTACCTCTGGAATTTAATGCTGAGAGTTTCTCATCACATGTCACTGATGCTGCTGTGTATGTATCTGAAAACCTTTATATATTTATTTCTGCTGCTGTTGCACTGGAATTCTGTTTCAGGATGAGGTTATTAAGCCCTTTTCCTATAAGAAAATCAGTATAAGGGTCATTTGGATATGAACTTCATCTCAAGAAGATTAAACTTAGTTCATGTTATCCAACTCCCTGGGCCATCGGGCTACGATAACAACACGACCTATTATTTACATGAAACATGTCTTAGAAATCTGATCAGTAAGCATTGTGTTATTCTTGAAATGT
mRNA sequence
TCAGCATAGGCGAATTATCGTCCCCATTGCAGCATAGGCGAGAGCGGATAAGCCAACGAAAGGAAGAAATGTCTGCAACAATTCAGCACTCATTTCTTTCTCCTTCAAATCCAATTCATCTCTCCGCAATCTCTACAAACTTTCTCAAATCACACCCTTCCTCCATGATCATCTCGCACAAACCTTTAGGTTTTTCAACACCGCGTCTCAAAATCGTGAAATGTATCTCGGAATCGAACGATGAAGCTCAAAACAATGTCAATTTGAATAATGCTTTGTCCAGTATGGTCGGTGAGCAGATTGAAGAACTTTTGAACAAGGAAGAGAATAGGAGTTTGCTCGATGGCTTGGAGAAGGCGTCGATGAGAGTTGAAATCGCCAAAAGGCAGTTGGCTGAGATCGAGAAACAGGAACTGGAGCTCAAACGATTCAAGGATTATATTAACCAGCTCGAAAGTAGAGCATCGGAGATTGAAGAATGTCAAAAGGAGATATTGGAGGCAAGAGGGATGATCGAGGAGGCCGAGCGCTCTCTCTCACAAAGCGAGGGTGGAAATGCAAAAAGAGACGAGGAAGACGGAGGAATTGACAGAGATGAAGAGAGATTGGAATCTGTAAAAGCAGCGTCCATATCAGCCATTGTTGGCACACTTGCAGGGCTGCCTATCTTTCTTAATCAAGTGACCAGCATTTCTCAGCTTGCACTCCCTACGGCAATCACCTTTATTAGCTGTGCCTTGTTTGGAGTTACATTCCGATACACAATAAGGAGAGACTTGGATAATATTCAGCTCAAGACTGGAACATCTGCCGCTTTCGGTTTCGTTAAAGGTCTTGCAACACTAGATGGTGGAGTACCTCTGGAATTTAATGCTGAGAGTTTCTCATCACATGTCACTGATGCTGCTGTGTATGTATCTGAAAACCTTTATATATTTATTTCTGCTGCTGTTGCACTGGAATTCTGTTTCAGGATGAGGTTATTAAGCCCTTTTCCTATAAGAAAATCAGTATAAGGGTCATTTGGATATGAACTTCATCTCAAGAAGATTAAACTTAGTTCATGTTATCCAACTCCCTGGGCCATCGGGCTACGATAACAACACGACCTATTATTTACATGAAACATGTCTTAGAAATCTGATCAGTAAGCATTGTGTTATTCTTGAAATGT
Coding sequence (CDS)
ATGTCTGCAACAATTCAGCACTCATTTCTTTCTCCTTCAAATCCAATTCATCTCTCCGCAATCTCTACAAACTTTCTCAAATCACACCCTTCCTCCATGATCATCTCGCACAAACCTTTAGGTTTTTCAACACCGCGTCTCAAAATCGTGAAATGTATCTCGGAATCGAACGATGAAGCTCAAAACAATGTCAATTTGAATAATGCTTTGTCCAGTATGGTCGGTGAGCAGATTGAAGAACTTTTGAACAAGGAAGAGAATAGGAGTTTGCTCGATGGCTTGGAGAAGGCGTCGATGAGAGTTGAAATCGCCAAAAGGCAGTTGGCTGAGATCGAGAAACAGGAACTGGAGCTCAAACGATTCAAGGATTATATTAACCAGCTCGAAAGTAGAGCATCGGAGATTGAAGAATGTCAAAAGGAGATATTGGAGGCAAGAGGGATGATCGAGGAGGCCGAGCGCTCTCTCTCACAAAGCGAGGGTGGAAATGCAAAAAGAGACGAGGAAGACGGAGGAATTGACAGAGATGAAGAGAGATTGGAATCTGTAAAAGCAGCGTCCATATCAGCCATTGTTGGCACACTTGCAGGGCTGCCTATCTTTCTTAATCAAGTGACCAGCATTTCTCAGCTTGCACTCCCTACGGCAATCACCTTTATTAGCTGTGCCTTGTTTGGAGTTACATTCCGATACACAATAAGGAGAGACTTGGATAATATTCAGCTCAAGACTGGAACATCTGCCGCTTTCGGTTTCGTTAAAGGTCTTGCAACACTAGATGGTGGAGTACCTCTGGAATTTAATGCTGAGAGTTTCTCATCACATGTCACTGATGCTGCTGTGTATGTATCTGAAAACCTTTATATATTTATTTCTGCTGCTGTTGCACTGGAATTCTGTTTCAGGATGAGGTTATTAAGCCCTTTTCCTATAAGAAAATCAGTATAA
Protein sequence
MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEAQNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKRFKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERLESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNIQLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFCFRMRLLSPFPIRKSV
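The protein sequence above is the frame-0 translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the two short test fragments (the first seven and last five codons of the CDS) are illustrative choices, not part of the database record.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # "X" marks any codon not in the table (e.g. one containing N).
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
cds_start = "ATGTCTGCAACAATTCAGCAC"
cds_end = "AGAAAATCAGTATAA"

print(translate(cds_start))  # MSATIQH
print(translate(cds_end))    # RKSV
```

Passing the full CDS string to `translate` should give the 315-residue protein sequence, with the terminal TAA codon read as the stop.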
Homology
BLAST of Cp4.1LG14g10010 vs. NCBI nr
Match:
XP_023551900.1 (uncharacterized protein LOC111809732 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 580 bits (1496), Expect = 4.11e-208
Identity = 315/315 (100.00%), Positives = 315/315 (100.00%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA
Sbjct: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR
Sbjct: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL
Sbjct: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
Query: 301 FRMRLLSPFPIRKSV 315
FRMRLLSPFPIRKSV
Sbjct: 301 FRMRLLSPFPIRKSV 315
BLAST of Cp4.1LG14g10010 vs. NCBI nr
Match:
KAG7015805.1 (hypothetical protein SDJN02_23443 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 573 bits (1478), Expect = 2.28e-205
Identity = 311/315 (98.73%), Positives = 313/315 (99.37%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
MSATI+HSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKC SESNDEA
Sbjct: 1 MSATIKHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCSSESNDEA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR
Sbjct: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL
Sbjct: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVKAASISAIVGTLAGLPIFLNQVTSISQLALP AITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPAAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
Query: 301 FRMRLLSPFPIRKSV 315
F+MRLLSPFPIRKSV
Sbjct: 301 FKMRLLSPFPIRKSV 315
BLAST of Cp4.1LG14g10010 vs. NCBI nr
Match:
XP_022923575.1 (uncharacterized protein LOC111431219 [Cucurbita moschata])
HSP 1 Score: 573 bits (1476), Expect = 4.59e-205
Identity = 311/315 (98.73%), Positives = 313/315 (99.37%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
MSATI+HSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKC SESNDEA
Sbjct: 1 MSATIKHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCSSESNDEA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR
Sbjct: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRD EDGGIDRDEERL
Sbjct: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDVEDGGIDRDEERL 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
Query: 301 FRMRLLSPFPIRKSV 315
F+MRLLSPFPIRKSV
Sbjct: 301 FKMRLLSPFPIRKSV 315
BLAST of Cp4.1LG14g10010 vs. NCBI nr
Match:
XP_022965244.1 (uncharacterized protein LOC111465167 [Cucurbita maxima])
HSP 1 Score: 565 bits (1455), Expect = 7.28e-202
Identity = 307/315 (97.46%), Positives = 311/315 (98.73%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
MSATI+HSFLSPSNPIHLSAISTNFLKSHPSSMIISH PL FSTPRLKIVKC SESNDEA
Sbjct: 1 MSATIKHSFLSPSNPIHLSAISTNFLKSHPSSMIISHIPLSFSTPRLKIVKCSSESNDEA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR
Sbjct: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKD+INQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL
Sbjct: 121 FKDHINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVKAASISAIVGTLAGLPIFLNQVTSISQLALP AITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPLAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
Query: 301 FRMRLLSPFPIRKSV 315
F++RLLSPFPIRKSV
Sbjct: 301 FKLRLLSPFPIRKSV 315
BLAST of Cp4.1LG14g10010 vs. NCBI nr
Match:
XP_008448570.1 (PREDICTED: uncharacterized protein LOC103490708 [Cucumis melo] >ADN33930.1 hypothetical protein [Cucumis melo subsp. melo] >KAA0052973.1 uncharacterized protein E6C27_scaffold344G00830 [Cucumis melo var. makuwa] >TYK11429.1 uncharacterized protein E5676_scaffold139G00840 [Cucumis melo var. makuwa])
HSP 1 Score: 513 bits (1320), Expect = 3.10e-181
Identity = 276/315 (87.62%), Positives = 294/315 (93.33%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
M+ATI+HSFLS SNPIHL ISTNFLKSHPSS I HK LGFSTPRLKI+KC SESND+A
Sbjct: 1 MAATIKHSFLSSSNPIHLPPISTNFLKSHPSSTKICHKRLGFSTPRLKILKCTSESNDQA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QN+ NL NALSSMVGEQ+EELLN+EENRSLLDGLEKASMRVEIAK+QLAEIEKQELELKR
Sbjct: 61 QNDFNLKNALSSMVGEQVEELLNREENRSLLDGLEKASMRVEIAKKQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKDY++QLE+RASEIEECQKEILEARGMIEEAERSL+QSEGGNA RD EDGG+DRDEER
Sbjct: 121 FKDYVSQLENRASEIEECQKEILEARGMIEEAERSLAQSEGGNAIRDGEDGGLDRDEERF 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVK ASISAIVGTLAGLPIFLNQV S SQL LPTAITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKVASISAIVGTLAGLPIFLNQVNSTSQLLLPTAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLE +AESFSSHV DAAVYVSENLYIFI AAVAL++C
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLELSAESFSSHVIDAAVYVSENLYIFICAAVALDYC 300
Query: 301 FRMRLLSPFPIRKSV 315
F+M LLSPFPIRKS+
Sbjct: 301 FKMSLLSPFPIRKSI 315
BLAST of Cp4.1LG14g10010 vs. ExPASy TrEMBL
Match:
A0A6J1EC82 (uncharacterized protein LOC111431219 OS=Cucurbita moschata OX=3662 GN=LOC111431219 PE=4 SV=1)
HSP 1 Score: 573 bits (1476), Expect = 2.22e-205
Identity = 311/315 (98.73%), Positives = 313/315 (99.37%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
MSATI+HSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKC SESNDEA
Sbjct: 1 MSATIKHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCSSESNDEA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR
Sbjct: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRD EDGGIDRDEERL
Sbjct: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDVEDGGIDRDEERL 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
Query: 301 FRMRLLSPFPIRKSV 315
F+MRLLSPFPIRKSV
Sbjct: 301 FKMRLLSPFPIRKSV 315
BLAST of Cp4.1LG14g10010 vs. ExPASy TrEMBL
Match:
A0A6J1HQG8 (uncharacterized protein LOC111465167 OS=Cucurbita maxima OX=3661 GN=LOC111465167 PE=4 SV=1)
HSP 1 Score: 565 bits (1455), Expect = 3.53e-202
Identity = 307/315 (97.46%), Positives = 311/315 (98.73%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
MSATI+HSFLSPSNPIHLSAISTNFLKSHPSSMIISH PL FSTPRLKIVKC SESNDEA
Sbjct: 1 MSATIKHSFLSPSNPIHLSAISTNFLKSHPSSMIISHIPLSFSTPRLKIVKCSSESNDEA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR
Sbjct: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKD+INQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL
Sbjct: 121 FKDHINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVKAASISAIVGTLAGLPIFLNQVTSISQLALP AITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPLAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
Query: 301 FRMRLLSPFPIRKSV 315
F++RLLSPFPIRKSV
Sbjct: 301 FKLRLLSPFPIRKSV 315
BLAST of Cp4.1LG14g10010 vs. ExPASy TrEMBL
Match:
A0A5A7UAK6 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G00840 PE=4 SV=1)
HSP 1 Score: 513 bits (1320), Expect = 1.50e-181
Identity = 276/315 (87.62%), Positives = 294/315 (93.33%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
M+ATI+HSFLS SNPIHL ISTNFLKSHPSS I HK LGFSTPRLKI+KC SESND+A
Sbjct: 1 MAATIKHSFLSSSNPIHLPPISTNFLKSHPSSTKICHKRLGFSTPRLKILKCTSESNDQA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QN+ NL NALSSMVGEQ+EELLN+EENRSLLDGLEKASMRVEIAK+QLAEIEKQELELKR
Sbjct: 61 QNDFNLKNALSSMVGEQVEELLNREENRSLLDGLEKASMRVEIAKKQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKDY++QLE+RASEIEECQKEILEARGMIEEAERSL+QSEGGNA RD EDGG+DRDEER
Sbjct: 121 FKDYVSQLENRASEIEECQKEILEARGMIEEAERSLAQSEGGNAIRDGEDGGLDRDEERF 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVK ASISAIVGTLAGLPIFLNQV S SQL LPTAITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKVASISAIVGTLAGLPIFLNQVNSTSQLLLPTAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLE +AESFSSHV DAAVYVSENLYIFI AAVAL++C
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLELSAESFSSHVIDAAVYVSENLYIFICAAVALDYC 300
Query: 301 FRMRLLSPFPIRKSV 315
F+M LLSPFPIRKS+
Sbjct: 301 FKMSLLSPFPIRKSI 315
BLAST of Cp4.1LG14g10010 vs. ExPASy TrEMBL
Match:
E5RDC3 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)
HSP 1 Score: 513 bits (1320), Expect = 1.50e-181
Identity = 276/315 (87.62%), Positives = 294/315 (93.33%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
M+ATI+HSFLS SNPIHL ISTNFLKSHPSS I HK LGFSTPRLKI+KC SESND+A
Sbjct: 1 MAATIKHSFLSSSNPIHLPPISTNFLKSHPSSTKICHKRLGFSTPRLKILKCTSESNDQA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QN+ NL NALSSMVGEQ+EELLN+EENRSLLDGLEKASMRVEIAK+QLAEIEKQELELKR
Sbjct: 61 QNDFNLKNALSSMVGEQVEELLNREENRSLLDGLEKASMRVEIAKKQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKDY++QLE+RASEIEECQKEILEARGMIEEAERSL+QSEGGNA RD EDGG+DRDEER
Sbjct: 121 FKDYVSQLENRASEIEECQKEILEARGMIEEAERSLAQSEGGNAIRDGEDGGLDRDEERF 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVK ASISAIVGTLAGLPIFLNQV S SQL LPTAITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKVASISAIVGTLAGLPIFLNQVNSTSQLLLPTAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLE +AESFSSHV DAAVYVSENLYIFI AAVAL++C
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLELSAESFSSHVIDAAVYVSENLYIFICAAVALDYC 300
Query: 301 FRMRLLSPFPIRKSV 315
F+M LLSPFPIRKS+
Sbjct: 301 FKMSLLSPFPIRKSI 315
BLAST of Cp4.1LG14g10010 vs. ExPASy TrEMBL
Match:
A0A1S3BK07 (uncharacterized protein LOC103490708 OS=Cucumis melo OX=3656 GN=LOC103490708 PE=4 SV=1)
HSP 1 Score: 513 bits (1320), Expect = 1.50e-181
Identity = 276/315 (87.62%), Positives = 294/315 (93.33%), Query Frame = 0
Query: 1 MSATIQHSFLSPSNPIHLSAISTNFLKSHPSSMIISHKPLGFSTPRLKIVKCISESNDEA 60
M+ATI+HSFLS SNPIHL ISTNFLKSHPSS I HK LGFSTPRLKI+KC SESND+A
Sbjct: 1 MAATIKHSFLSSSNPIHLPPISTNFLKSHPSSTKICHKRLGFSTPRLKILKCTSESNDQA 60
Query: 61 QNNVNLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKR 120
QN+ NL NALSSMVGEQ+EELLN+EENRSLLDGLEKASMRVEIAK+QLAEIEKQELELKR
Sbjct: 61 QNDFNLKNALSSMVGEQVEELLNREENRSLLDGLEKASMRVEIAKKQLAEIEKQELELKR 120
Query: 121 FKDYINQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERL 180
FKDY++QLE+RASEIEECQKEILEARGMIEEAERSL+QSEGGNA RD EDGG+DRDEER
Sbjct: 121 FKDYVSQLENRASEIEECQKEILEARGMIEEAERSLAQSEGGNAIRDGEDGGLDRDEERF 180
Query: 181 ESVKAASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNI 240
ESVK ASISAIVGTLAGLPIFLNQV S SQL LPTAITFISCALFGVTFRYTIRRDLDNI
Sbjct: 181 ESVKVASISAIVGTLAGLPIFLNQVNSTSQLLLPTAITFISCALFGVTFRYTIRRDLDNI 240
Query: 241 QLKTGTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFC 300
QLKTGTSAAFGFVKGLATLDGGVPLE +AESFSSHV DAAVYVSENLYIFI AAVAL++C
Sbjct: 241 QLKTGTSAAFGFVKGLATLDGGVPLELSAESFSSHVIDAAVYVSENLYIFICAAVALDYC 300
Query: 301 FRMRLLSPFPIRKSV 315
F+M LLSPFPIRKS+
Sbjct: 301 FKMSLLSPFPIRKSI 315
BLAST of Cp4.1LG14g10010 vs. TAIR 10
Match:
AT4G24090.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; Has 144 Blast hits to 142 proteins in 73 species: Archae - 3; Bacteria - 62; Metazoa - 7; Fungi - 13; Plants - 44; Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink). )
HSP 1 Score: 273.1 bits (697), Expect = 2.8e-73
Identity = 150/246 (60.98%), Positives = 187/246 (76.02%), Query Frame = 0
Query: 65 NLNNALSSMVGEQIEELLNKEENRSLLDGLEKASMRVEIAKRQLAEIEKQELELKRFKDY 124
+L N+LS +VG Q+EELL++EEN+ LLDGLEKAS+RVEIAKR+L +IE+QE+E K +DY
Sbjct: 60 DLKNSLSGIVGNQVEELLSREENKGLLDGLEKASLRVEIAKRELEDIERQEIEAKLLQDY 119
Query: 125 INQLESRASEIEECQKEILEARGMIEEAERSLSQSEGGNAKRDEEDGGIDRDEERLESVK 184
INQLESRA+EI ECQ+EI AR M+EEAERSLS ++ E+ ID+D+ERLES K
Sbjct: 120 INQLESRAAEIAECQQEIDAARSMVEEAERSLSLADNSTIGSSEKGYSIDKDKERLESAK 179
Query: 185 AASISAIVGTLAGLPIFLNQVTSISQLALPTAITFISCALFGVTFRYTIRRDLDNIQLKT 244
AA I+A VGT+A LP L+QV S+ QL LP I F SCALFGVTFRY +RRDLD+ LK+
Sbjct: 180 AAVIAAAVGTIAELPFALSQVASMEQLVLPLGIAFASCALFGVTFRYAVRRDLDDNHLKS 239
Query: 245 GTSAAFGFVKGLATLDGGVPLEFNAESFSSHVTDAAVYVSENLYIFISAAVALEFCFRMR 304
G AAFGFVKGL L G PLE + ES SH D AV VS+++ IF A++ L+FCF+M+
Sbjct: 240 GAVAAFGFVKGLGMLSRGPPLELSWESLFSHGIDGAVLVSQSVLIFAFASIGLDFCFKMK 299
Query: 305 LLSPFP 311
LL PFP
Sbjct: 300 LLRPFP 305
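The identity percentages in the HSP headers above are the number of matching positions divided by the aligned length, rounded to two decimals. The sketch below reproduces that arithmetic and counts identities for one aligned segment. The helper names `percent` and `count_identities` are hypothetical, and the example segment is the final Query/Sbjct row of the Cucurbita moschata alignment.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


def count_identities(query: str, subject: str) -> int:
    """Count positions where two equal-length aligned segments carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned segments must have the same length")
    # Gap characters ("-") never count as identities.
    return sum(q == s and q != "-" for q, s in zip(query, subject))


# Header values from the reports above: identities and aligned length.
print(percent(311, 315))  # 98.73
print(percent(150, 246))  # 60.98

# Last aligned row (query 301-315) against the Cucurbita moschata hit.
print(count_identities("FRMRLLSPFPIRKSV", "FKMRLLSPFPIRKSV"))  # 14
```

The "Positives" value additionally counts conservative substitutions (those scoring above zero in the substitution matrix), so it cannot be derived from the two aligned strings alone without that matrix.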
The following BLAST results are available for this feature:
BLAST of Cp4.1LG14g10010 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023551900.1 | 4.11e-208 | 100.00 | uncharacterized protein LOC111809732 [Cucurbita pepo subsp. pepo]
KAG7015805.1 | 2.28e-205 | 98.73 | hypothetical protein SDJN02_23443 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022923575.1 | 4.59e-205 | 98.73 | uncharacterized protein LOC111431219 [Cucurbita moschata]
XP_022965244.1 | 7.28e-202 | 97.46 | uncharacterized protein LOC111465167 [Cucurbita maxima]
XP_008448570.1 | 3.10e-181 | 87.62 | PREDICTED: uncharacterized protein LOC103490708 [Cucumis melo] >ADN33930.1 hypot...
BLAST of Cp4.1LG14g10010 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1EC82 | 2.22e-205 | 98.73 | uncharacterized protein LOC111431219 OS=Cucurbita moschata OX=3662 GN=LOC1114312...
A0A6J1HQG8 | 3.53e-202 | 97.46 | uncharacterized protein LOC111465167 OS=Cucurbita maxima OX=3661 GN=LOC111465167...
A0A5A7UAK6 | 1.50e-181 | 87.62 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
E5RDC3 | 1.50e-181 | 87.62 | Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
A0A1S3BK07 | 1.50e-181 | 87.62 | uncharacterized protein LOC103490708 OS=Cucumis melo OX=3656 GN=LOC103490708 PE=...
BLAST of Cp4.1LG14g10010 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G24090.1 | 2.8e-73 | 60.98 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...