Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide
TGAATAAACCGGTTGGTGATCTTCCCCCTGCCCGTCTTTTACTAAGATCCCTGGCTTCCACCTCCTGCAGTCCTCTCGCTCTGTGTTCGACAAATGCCTGTAAGTCTCTCCAACTTTAAGATCAGATCTCGTAGAACATCAACTGAATCTCAAATTCTAGGGTTCTTCTTTCTATATTCATTTTGTTTTCTTATGCAATGGAATTTGTGAGTTTCTGATCCGGGAACATGATTGCGTTGGCCGCAGTTCACGACGTTCATTGAAGTGGAACCACCCGGTCCACTGCGATACATAATTGGGGCCGTTATAATGATGATCGGAGTTGTATTGCCCCTCGGATATATGGTGTTCCGGAACAAGCGTGGTTCTTCTTCTTCTTCTTACTCCAAACAGACGTAGGTGTGCCTATGAAATTTACTTGGTTTCGATTCTTCCTTTTCGCTTTTTTGTTAATTAGTTCTTTCTTCTTGATCTTTCGCGTTATCTGTATAATTTTATCATTCCTGTGATCTGTTTATGACTGGAAATCAAGGTCTTCTTCCTGTTGTTTTATATGAAATTGGATTTGTTCTTTCATTCTTCAAATCGCTTTGACCTGACTTATGGACTTTCGGAGCTCGGTTTTTACAGGAAGAAATCGGACTTGGGTACAAATGGTAGTTGAACATTTACTTCTAGTATTGGTTATTAGGATTATGCGGTCTGATGCACCTTGTAAATTTCCAAATGAGGTTTTGGTTGGATTTCTGTCTAAAAGATGAAGGGAATTTCTCGTTGTGCAGTGAATTATTTCTTAGGCCTGTTGAGAAAAGAGTGATTATAATAAGAAAGCAAAAATACTTGACTCTAGCGTAGGAACTCCTTGTTTCCTTACCTCTATGGCTCCTTTTGTTTAGGCTCTCTTCTTGAACTTCTCGTGGAAATGTCAGAAATCTGGCAACTGATCATATGAACCAAAATGAGTTTTCTGGATGGAGGTAGAGGGCGGGTGAGATATTTTTTTAGCAACTAAAATTAGTGTTTTGGATGGCCAGGATATTGTGAATTAGTATTTGTGTTTTAGATATCATTTGAATTGGCACATGTTGGAAGTTTCGGTATGCATTTAGTCATTCGACTTACAAATCTTCATACTTCTTTTTCGAAATTAAAACTACCTCAAACAGAACATCGTCTTCTCCTTTCCTTCGACCTTGCTGTATCCGCTTTTCCTTCACTTTCCTGTACTACATTAGTTCATCATCTGCTGGTACATCCATTGTTAGCCATGCCCTCGTGTTGGAAAGGCAATGTCAAAAAGTAGGAAGCAATAGATGGAGGGGCAGTGTGCGTGGGGTTTTCATTTTCCTAATAAGTGATAGGGGAGAGTATATTAGAACATTCAATTTGCAAAATGTAATAATATGTGAAGAAATATGAAGTATTAATTGTGCTTTTATGATGAAATCTTGAAACAGAGCTACTCAGAAAGTGAAATATGTATAGGAACTTTCATCGGTTACACACTTTTCTATATCTCTATGTTTAGGGTGAAATGAACTATTTATTTGAATGATGCCCCCGTTGTTCTTTGTGTCTTTCCATTGTACTCCTAATTCTTTTTTCCTTCTGAAATATTAGTCCTTGTAGCCCTTTTTTTGTTAAGTCAAACTTCATTGGAGATTGTACATACACGTTGTATATGTCATTAAAGTAGGCGTTTCATAGCTCATAAAGATAAAGTTCCCTCTTTTCTTTATGGAGGAGCATCATCTTGGATTTTTTTAGGGAAGAAGATTATCCTGTGAGTTGGACGTAGGAAGATTATTCCATATTTCCTCAAGCATATAGGACTAAGATACTCCTCTCCTATCTGTACCATTTCAGAGCTATCCATGTAAAGGTTACAGGATTCATATGCATAACTTTCGATAAGTTTAATGCAAGGATGGAACTGTAGTTTAAAAAAGGGCAGTGTAAGCGCTTGTATGCATGTGTAAGCGGATCATCGATAGGAACAA
GTAGCAAGTAACGAGTTTAGTCTTTTGTAGTACATTGATGTTTTTTTATGATCGACTCTAATGGGAATTCTGCAATTTGGAATTTCAGGATCAAGGTTTTGATTTAGTTGCATAGAGGAAGAGACACATCTACTTCAGTTTTGGGCTCAGCTAATGGAGGATGCTATTCACAGGGTAATGTAAATGCATTGAGATTTAAAATTTCTGTTGAAACCGACACGATTATGGCCCAATTATTGACCTTAAGCAATGCATTTCATCGTTGCTTTTTCTTGAAATCAATGGCATACAACTGCATCTTGTCATTTGTATTGTTGAAGGAGTTATGTGTGATTACAGTTGACCCACACTTAGTATTATTGAGATACAACCACAAAGGCATGAAAGTACTCTTTGTTGGTCTCCTATGGGTATCTTTCCTCCCAGTGACTTTGAATTGAAAGCTTTAGGTTTAGTCGATTATGACTTTGATCAACATTTTATTGCTACTAATTTCGAGTTGAAAGCCGCCATGTGCAAGGATTGGACACCAAAGTAACCAAAGAACCAATGAACGAGAAACCACTTGTCATGATGATGCTTGATAGTTAAAAGCCAAAGCAGGCAAGGGCATGTTTGAAAGTAATTTTAAAAGATTAAAAATTACTTTTTTATGTTTCAAAATAATTTCAAAACGTGTATTTAGTCACTAAAAATTAATTTAATGTTGATTTTCTATGTTTAAACACGACTTTTATACTATCAAAATTGATATTGAATGATCAAACATATGTTTTGATGTGATATGTACAATTTAGACTCGTTTCAAAATCACTTCTAAACATGACCTGTTAAATCTTTCGGGCTAATGTCACTATTTCTATTCTATTGCCATTTCGTTTTTTTTTTAATAATAATAATTATTATTATTATTATTATGGGCTTATCAGTAAATTAACCACTAAGCTTCTTCATTATTCTTAAGCTTTTTTTTTTATCTTGTTATAAAGATAAACCACTTGTTCTATTTTAGTAAGTACTTAAATGTACAAAATCAGTCTATCAACTTCAAACAGTTTCACACTTTGTATTTTGTTCATATGCATTATTTAACTATTTTGCAAAGTTCAGTAAAAACTACCATCAAATAAACATTATTAAGAGTAGATGACTAAATTTTAATATTTAAAAGTTGTATGAACTAAAATGATATTTTAGTTGACTTTTTTTTTTAACATTTTTTAATTTAATATGCAGAGTGAAATATTCGAATCTCTAA
mRNA sequence
TGAATAAACCGGTTGGTGATCTTCCCCCTGCCCGTCTTTTACTAAGATCCCTGGCTTCCACCTCCTGCAGTCCTCTCGCTCTGTGTTCGACAAATGCCTTTCACGACGTTCATTGAAGTGGAACCACCCGGTCCACTGCGATACATAATTGGGGCCGTTATAATGATGATCGGAGTTGTATTGCCCCTCGGATATATGGTGTTCCGGAACAAGCGTGGTTCTTCTTCTTCTTCTTACTCCAAACAGACTTGCATAGAGGAAGAGACACATCTACTTCAGTTTTGGGCTCAGCTAATGGAGGATGCTATTCACAGGGTAATAGTGAAATATTCGAATCTCTAA
Coding sequence (CDS)
ATGCCTTTCACGACGTTCATTGAAGTGGAACCACCCGGTCCACTGCGATACATAATTGGGGCCGTTATAATGATGATCGGAGTTGTATTGCCCCTCGGATATATGGTGTTCCGGAACAAGCGTGGTTCTTCTTCTTCTTCTTACTCCAAACAGACTTGCATAGAGGAAGAGACACATCTACTTCAGTTTTGGGCTCAGCTAATGGAGGATGCTATTCACAGGGTAATAGTGAAATATTCGAATCTCTAA
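The relationship between the mRNA and the CDS above can be checked by locating the CDS inside the mRNA; whatever precedes it is the five_prime_UTR named in the legend. The following is a minimal sketch, assuming both sequences are copied verbatim from this record. The helper name `split_utrs` is hypothetical and is not part of any database tool.

```python
# Sequences copied verbatim from the "mRNA sequence" and
# "Coding sequence (CDS)" entries of this record.
MRNA = (
    "TGAATAAACCGGTTGGTGATCTTCCCCCTGCCCGTCTTTTACTAAGATCCCTGGCTTCCACCTCCTGCAG"
    "TCCTCTCGCTCTGTGTTCGACAA"
    "ATGCCTTTCACGACGTTCATTGAAGTGGAACCACCCGGTCCACTGCGATACATAATTGGGGCCGTTATAA"
    "TGATGATCGGAGTTGTATTGCCCCTCGGATATATGGTGTTCCGGAACAAGCGTGGTTCTTCTTCTTCTTC"
    "TTACTCCAAACAGACTTGCATAGAGGAAGAGACACATCTACTTCAGTTTTGGGCTCAGCTAATGGAGGAT"
    "GCTATTCACAGGGTAATAGTGAAATATTCGAATCTCTAA"
)
CDS = (
    "ATGCCTTTCACGACGTTCATTGAAGTGGAACCACCCGGTCCACTGCGATACATAATTGGGGCCGTTATAA"
    "TGATGATCGGAGTTGTATTGCCCCTCGGATATATGGTGTTCCGGAACAAGCGTGGTTCTTCTTCTTCTTC"
    "TTACTCCAAACAGACTTGCATAGAGGAAGAGACACATCTACTTCAGTTTTGGGCTCAGCTAATGGAGGAT"
    "GCTATTCACAGGGTAATAGTGAAATATTCGAATCTCTAA"
)


def split_utrs(mrna: str, cds: str) -> tuple[str, str]:
    """Return (five_prime_utr, three_prime_utr) flanking the CDS in the mRNA.

    Hypothetical helper: assumes the CDS occurs as one contiguous
    substring of the spliced mRNA.
    """
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return mrna[:start], mrna[start + len(cds):]


five_prime_utr, three_prime_utr = split_utrs(MRNA, CDS)
print("5' UTR length:", len(five_prime_utr))
print("3' UTR length:", len(three_prime_utr))
```

For this record the CDS runs to the last base of the mRNA, so no three-prime UTR is annotated.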
Protein sequence
MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRGSSSSSYSKQTCIEEETHLLQFWAQLMEDAIHRVIVKYSNL
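The protein sequence above should be the conceptual translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the function name `translate` is illustrative only.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

# Copied verbatim from the "Coding sequence (CDS)" and
# "Protein sequence" entries of this record.
CDS = (
    "ATGCCTTTCACGACGTTCATTGAAGTGGAACCACCCGGTCCACTGCGATACATAATTGGGGCCGTTATAA"
    "TGATGATCGGAGTTGTATTGCCCCTCGGATATATGGTGTTCCGGAACAAGCGTGGTTCTTCTTCTTCTTC"
    "TTACTCCAAACAGACTTGCATAGAGGAAGAGACACATCTACTTCAGTTTTGGGCTCAGCTAATGGAGGAT"
    "GCTATTCACAGGGTAATAGTGAAATATTCGAATCTCTAA"
)
PROTEIN = (
    "MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRGSSSSSYSKQTCIEEETHL"
    "LQFWAQLMEDAIHRVIVKYSNL"
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


print(translate(CDS) == PROTEIN)
```

The comparison covers only the residues before the terminal stop codon, which the record omits from the protein sequence.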
Homology
BLAST of Cp4.1LG13g02470.1 vs. NCBI nr
Match:
KAG7016882.1 (hypothetical protein SDJN02_21993 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 151 bits (381), Expect = 6.48e-45
Identity = 73/75 (97.33%), Positives = 75/75 (100.00%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRGSSSSSYSKQTCIEEETHL 60
MPFTTFIEVEPPGPLRYIIGAVIM+IGVVLPLGYMVFRNKRGSSSSSYSKQTCIEEETHL
Sbjct: 1 MPFTTFIEVEPPGPLRYIIGAVIMIIGVVLPLGYMVFRNKRGSSSSSYSKQTCIEEETHL 60
Query: 61 LQFWAQLMEDAIHRV 75
LQFWAQLMEDAIHR+
Sbjct: 61 LQFWAQLMEDAIHRI 75
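The Identity and Positives figures reported for this HSP can be recomputed from the three alignment rows. The sketch below assumes the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative (positively scoring) substitution, and a space marks anything else. The variable and function names are illustrative.

```python
# Alignment rows for HSP 1 of KAG7016882.1, copied from the record
# with the two wrapped segments joined.
QUERY = (
    "MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRGSSSSSYSKQTCIEEETHL"
    "LQFWAQLMEDAIHRV"
)
MIDLINE = (
    "MPFTTFIEVEPPGPLRYIIGAVIM+IGVVLPLGYMVFRNKRGSSSSSYSKQTCIEEETHL"
    "LQFWAQLMEDAIHR+"
)
SBJCT = (
    "MPFTTFIEVEPPGPLRYIIGAVIMIIGVVLPLGYMVFRNKRGSSSSSYSKQTCIEEETHL"
    "LQFWAQLMEDAIHRI"
)


def hsp_stats(query: str, midline: str, sbjct: str) -> tuple[int, int, int]:
    """Return (identities, positives, alignment_length) for one HSP."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # Identities: columns where query and subject carry the same residue.
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # Positives: every non-blank midline column (identities plus '+').
    positives = sum(ch != " " for ch in midline)
    return identities, positives, len(query)


ident, pos, length = hsp_stats(QUERY, MIDLINE, SBJCT)
print(f"Identity = {ident}/{length} ({100 * ident / length:.2f}%)")
print(f"Positives = {pos}/{length} ({100 * pos / length:.2f}%)")
```

Gapped HSPs further down the page would need their midline spacing preserved exactly for the positives count to be meaningful.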
BLAST of Cp4.1LG13g02470.1 vs. NCBI nr
Match:
XP_008437061.1 (PREDICTED: uncharacterized protein LOC103482600 isoform X2 [Cucumis melo])
HSP 1 Score: 95.5 bits (236), Expect = 5.66e-24
Identity = 49/53 (92.45%), Positives = 51/53 (96.23%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQT 52
MPF+TFIEVEPP PLRYIIGAVIMMIGVVLPLGYM+FRNKRG SSSSSYSKQT
Sbjct: 1 MPFSTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT 53
BLAST of Cp4.1LG13g02470.1 vs. NCBI nr
Match:
XP_008437060.1 (PREDICTED: uncharacterized protein LOC103482600 isoform X1 [Cucumis melo])
HSP 1 Score: 95.5 bits (236), Expect = 6.40e-24
Identity = 49/53 (92.45%), Positives = 51/53 (96.23%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQT 52
MPF+TFIEVEPP PLRYIIGAVIMMIGVVLPLGYM+FRNKRG SSSSSYSKQT
Sbjct: 1 MPFSTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT 53
BLAST of Cp4.1LG13g02470.1 vs. NCBI nr
Match:
KAG7036117.1 (hypothetical protein SDJN02_02917, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 97.4 bits (241), Expect = 7.01e-24
Identity = 62/102 (60.78%), Positives = 66/102 (64.71%), Query Frame = 0
Query: 3 FTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQTCIE-----E 62
F TF+EVEPP PLRYIIGAVIMMIGVVLPLGYM+FRNKRG SSSSSYSKQT E E
Sbjct: 24 FGTFVEVEPPSPLRYIIGAVIMMIGVVLPLGYMMFRNKRGPSSSSSYSKQTFSEHMRNLE 83
Query: 63 ETHLLQ--FW--------------------AQLMEDAIHRVI 76
H Q FW +QLMEDAIHRV+
Sbjct: 84 NDHDPQRVFWFEDQSFDLVRGRDASVLVLGSQLMEDAIHRVM 125
BLAST of Cp4.1LG13g02470.1 vs. NCBI nr
Match:
XP_022154814.1 (uncharacterized protein LOC111021976 isoform X2 [Momordica charantia])
HSP 1 Score: 95.1 bits (235), Expect = 8.05e-24
Identity = 48/53 (90.57%), Positives = 51/53 (96.23%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQT 52
MPF+TFIEVEPP PLRYIIGAV+MMIGVVLPLGYM+FRNKRG SSSSSYSKQT
Sbjct: 1 MPFSTFIEVEPPSPLRYIIGAVVMMIGVVLPLGYMMFRNKRGPSSSSSYSKQT 53
BLAST of Cp4.1LG13g02470.1 vs. ExPASy TrEMBL
Match:
A0A1S3ATP1 (uncharacterized protein LOC103482600 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103482600 PE=4 SV=1)
HSP 1 Score: 95.5 bits (236), Expect = 2.74e-24
Identity = 49/53 (92.45%), Positives = 51/53 (96.23%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQT 52
MPF+TFIEVEPP PLRYIIGAVIMMIGVVLPLGYM+FRNKRG SSSSSYSKQT
Sbjct: 1 MPFSTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT 53
BLAST of Cp4.1LG13g02470.1 vs. ExPASy TrEMBL
Match:
A0A1S3AT37 (uncharacterized protein LOC103482600 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103482600 PE=4 SV=1)
HSP 1 Score: 95.5 bits (236), Expect = 3.10e-24
Identity = 49/53 (92.45%), Positives = 51/53 (96.23%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQT 52
MPF+TFIEVEPP PLRYIIGAVIMMIGVVLPLGYM+FRNKRG SSSSSYSKQT
Sbjct: 1 MPFSTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT 53
BLAST of Cp4.1LG13g02470.1 vs. ExPASy TrEMBL
Match:
A0A6J1DNA4 (uncharacterized protein LOC111021976 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111021976 PE=4 SV=1)
HSP 1 Score: 95.1 bits (235), Expect = 3.90e-24
Identity = 48/53 (90.57%), Positives = 51/53 (96.23%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQT 52
MPF+TFIEVEPP PLRYIIGAV+MMIGVVLPLGYM+FRNKRG SSSSSYSKQT
Sbjct: 1 MPFSTFIEVEPPSPLRYIIGAVVMMIGVVLPLGYMMFRNKRGPSSSSSYSKQT 53
BLAST of Cp4.1LG13g02470.1 vs. ExPASy TrEMBL
Match:
A0A6J1DLB6 (uncharacterized protein LOC111021976 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021976 PE=4 SV=1)
HSP 1 Score: 95.1 bits (235), Expect = 4.19e-24
Identity = 48/53 (90.57%), Positives = 51/53 (96.23%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQT 52
MPF+TFIEVEPP PLRYIIGAV+MMIGVVLPLGYM+FRNKRG SSSSSYSKQT
Sbjct: 1 MPFSTFIEVEPPSPLRYIIGAVVMMIGVVLPLGYMMFRNKRGPSSSSSYSKQT 53
BLAST of Cp4.1LG13g02470.1 vs. ExPASy TrEMBL
Match:
A0A0A0KPA4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G162600 PE=4 SV=1)
HSP 1 Score: 94.0 bits (232), Expect = 1.12e-23
Identity = 48/53 (90.57%), Positives = 50/53 (94.34%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRG-SSSSSYSKQT 52
MPF+TFIEVEPP PLRYI GAVIMMIGVVLPLGYM+FRNKRG SSSSSYSKQT
Sbjct: 1 MPFSTFIEVEPPSPLRYIFGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT 53
BLAST of Cp4.1LG13g02470.1 vs. TAIR 10
Match:
AT4G16695.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 80.1 bits (196), Expect = 9.1e-16
Identity = 39/52 (75.00%), Positives = 44/52 (84.62%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRGSSSSSYSKQT 53
MPF T IEVEPP LRY+IG+ +MMIGVVLP+GYM+FRNKR SSSYSKQT
Sbjct: 1 MPFKTVIEVEPPSLLRYLIGSAVMMIGVVLPVGYMMFRNKRVPFSSSYSKQT 52
BLAST of Cp4.1LG13g02470.1 vs. TAIR 10
Match:
AT4G16695.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; Has 21 Blast hits to 21 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 21; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 80.1 bits (196), Expect = 9.1e-16
Identity = 39/52 (75.00%), Positives = 44/52 (84.62%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRGSSSSSYSKQT 53
MPF T IEVEPP LRY+IG+ +MMIGVVLP+GYM+FRNKR SSSYSKQT
Sbjct: 1 MPFKTVIEVEPPSLLRYLIGSAVMMIGVVLPVGYMMFRNKRVPFSSSYSKQT 52
BLAST of Cp4.1LG13g02470.1 vs. TAIR 10
Match:
AT4G16695.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; Has 21 Blast hits to 21 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 21; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 80.1 bits (196), Expect = 9.1e-16
Identity = 39/52 (75.00%), Positives = 44/52 (84.62%), Query Frame = 0
Query: 1 MPFTTFIEVEPPGPLRYIIGAVIMMIGVVLPLGYMVFRNKRGSSSSSYSKQT 53
MPF T IEVEPP LRY+IG+ +MMIGVVLP+GYM+FRNKR SSSYSKQT
Sbjct: 1 MPFKTVIEVEPPSLLRYLIGSAVMMIGVVLPVGYMMFRNKRVPFSSSYSKQT 52
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
KAG7016882.1 | 6.48e-45 | 97.33 | hypothetical protein SDJN02_21993 [Cucurbita argyrosperma subsp. argyrosperma]
XP_008437061.1 | 5.66e-24 | 92.45 | PREDICTED: uncharacterized protein LOC103482600 isoform X2 [Cucumis melo]
XP_008437060.1 | 6.40e-24 | 92.45 | PREDICTED: uncharacterized protein LOC103482600 isoform X1 [Cucumis melo]
KAG7036117.1 | 7.01e-24 | 60.78 | hypothetical protein SDJN02_02917, partial [Cucurbita argyrosperma subsp. argyro...
XP_022154814.1 | 8.05e-24 | 90.57 | uncharacterized protein LOC111021976 isoform X2 [Momordica charantia]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A1S3ATP1 | 2.74e-24 | 92.45 | uncharacterized protein LOC103482600 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3AT37 | 3.10e-24 | 92.45 | uncharacterized protein LOC103482600 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1DNA4 | 3.90e-24 | 90.57 | uncharacterized protein LOC111021976 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1DLB6 | 4.19e-24 | 90.57 | uncharacterized protein LOC111021976 isoform X1 OS=Momordica charantia OX=3673 G...
A0A0A0KPA4 | 1.12e-23 | 90.57 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G162600 PE=4 SV=1
TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G16695.2 | 9.1e-16 | 75.00 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G16695.1 | 9.1e-16 | 75.00 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G16695.3 | 9.1e-16 | 75.00 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...