Sgr021751.1 (mRNA) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr021751.1
Type: mRNA
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: (thale cress) hypothetical protein
Location: tig00153826: 345786 .. 350458 (+)
Sequence length: 891
RNA-Seq Expression: Sgr021751.1
Synteny: Sgr021751.1
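
The Location field packs the contig, the coordinates, and the strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be parsed and the genomic span computed in Python. `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a 'contig: start .. end (strand)' string as shown in the Overview."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return loc.end - loc.start + 1


loc = parse_location("tig00153826: 345786 .. 350458 (+)")
print(loc.contig, loc.strand, span_length(loc))
```

Under that assumption the genomic span is 4673 bp, much longer than the listed sequence length of 891, which is consistent with the introns visible in the gene sequence.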
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCGTCCGCTGTGTCGTCGTCTTTGGAGATCCTCATCCTCAGGCCCGCCCGTCATGTCGTCGAAATCTGGTAATTTCTGGAGTTTTACAATGCTCAGATTCCGAACAAGCTTGCTTCAACTTTTCCAGAACACAGAAAACTGCGCCGGCCGTGGCGGTGGCGGTGGCGGTGGAGGCCGTCGTCGTTCTTCTCTGCCGCCGCAGGCCTCCGTCGCCGTGCTTCCTTACGCGCGCTCCTACAACCAATCAATCTATCCAATGTTTCGCTTCTTTGCTTCCAGATCCACAACAGATACATCCACTTCTTCCTATTTCACCAGGTAGTTTGAGGTTATTTCATTATATTTTTTAAATGATAAATTTCTTTTTTACTAATATTACTTTGTTTTGGTATTTATTTCTTAAAAGAAAAAATTAAATGTTGAAACCTTAGGTAGATTTTAATTATAGGTTAAATTATAAATCTAGTCTCTAAACTTTGAGAGTTACACCTAATTGATTTCTAAACTTTAAAATGTGTCTAACGAGTAAGTCTCTAAAATTTTAAATTGTGTGTAATAAATCTATAAATTTTTAATTTTGTGTTCAATAGTCTCCTGAATTTTAAAAAGTGTCTATTAGATACAATGACCTATTAGATATAACATTAAATCTTTAAGGATTTATTAGACGATATTTTTTTTAAGTTTAAGAATCTATTAGACACAACTTTTATAATTTAATATTTCAAAAGTGATTTTCTTAAGGTCCATTTAGTATTTATTTCATATTTTCTTTCTTTCTGTCTTTCTTTTTTTAAATGCTAGGGTGAAGCTTAACAAGGAAAAGGACCAGCCACCAACAGCTCATTTTTCTTTATTCCCTGGTTGGTAAGTAAAACCATGGAATTTAAAATGAAAAATGATCTTCTATTTTTCTATTAAATTGCATTTACTTTTTTCTTTTCGTTTTTGCTTTAATGAATTTAGGGCAAAATGGATTTTTGGCTCCTTATTGTCTCTCTTGATTCCCACTTGGAAGCAAAGTTGGAATAAACTGCAAACTCTTGAAGGTAAAAAAATAATTAAAAAAAAAATTCATCATTAAAAAAAAAAAAAACAAATTAGTAATGAATTTTTTTTTTTTTTTTTTCAGGAGAAGCAGAAATGGTAATTGAAGAGGCAGAAAGTGTAGCAAAGGTAGTGGAAAAGGTAGCAGAATTAACAGAGAAGGTATCAGCAGAAATTGCTGAGAAGTTTCCTGAGAAGAGTAAACTCAAAGAAGCAGCTCAAGTGCTGGAAAACTACTCCAAGGAAATTGCCCATGATGCCCACTTAACACAAGATATCCTCCACAAGGTTTTTATATATATATTATTTAATTTTTTTTATTAATATTACAAATGTGGTCTCGGAATTCGAAAGTTCAACCTATAATGAGATAGGTTATTAATATTTTAATTGTTGAATTATGATCAAGTTGGCCTTCACAGAATATCTAAACATTAGATTTCACAAACTTTTTTATATTTTCATCTAATCTTAAGCACATTCATTATTAACTTAATAACCGGTTGGTTCTTGTGTTTTTATTTATTTTTGGTTAAAAAATATTTTTGGTCTCTATACTTTCATTGAAATAACAATTTAGTCCTTGAACTTTAATATGTAACAATTTGGTCCCTATATTTTTAAATTTGTAACAATTTAGTCTCTAAACTTTCATATGTAATAATTTAGTCGTTGTAGTTTAAAATTTGTAACAATTTAGTCTCGACCATGAAAAATCTCATCAAGGTAAAATATCAATTTTTATTATGTAACGATTTAGTCCCTGTAGTTTACAAACACATTTGGTCTTTAAGCGATTTATTGATCTACTTATGTAAGAATCCCCATTAAATCTTAACAAAATTTTACATGGTAGAGATTAAATTGTTACAAAATTTAAACTATATGAACTAAATTGCTACATATTTTGTTCAAGGACTAAATTATTACAAAATTAAAAATACAGAAA
CTAAATCGTTACATACTAAAGTTCATGGACTAAATTGTTACTTCAATGAAAATATAGAGACAAAAAACATTTTTTAACCTTTATTTTTTAAAGATATTAATTTAGAAATATAATATATTAAATTCAAATTTATATTCTTTTTTAATTTCACGTTGGTCTCTAAACTTTTAGGCTTATACTCTATTTTAGTTCTTGAAACTATATATAAAAAAAAATAGTCAATATGGTTCTTATTGTTATTCAATTGTCAATATCCGTCCTGCCAAATACCCATCTTAGCAAGTAGGGGTGTCAATTTTTCTTGTGAGGACCCGCCCCGTTCCCCATTTAAGGTAGGGGTAGGGGCAGGGACGAGAGGAAAATTTTCTCCATATTTATTTTGGGGATGGGGACGGGGACCCTTGCTCCGTTCCCCATTTAAATATATATATTTTTTATTTTTTTATTATATATATTATATATCCTAATATATATATTAAAAATTTTTAAAAATTATTATGTTTTCTCAGTTAGCCCACTCCCATTCATTTCTTTTTTATTTTTTCTTCCTAATAGCTATAAGTAAAGAAAGATTATTTTACTAAAAGAAAAAAAAAAAGAATTTGTAAACAGAGAAAAATTTCCTGTGGAGAATCCGACCCCAAATTTTTTCAAAAATTTTATAGGAACGGAAAATGATATATCGGGGATGGGGATGGGGACGGAAAAGCCATCCCCGTCATGCCCCACCCCATTGATACCCCTATTAGCTAGTGTGTGTGGCTTAATAAATGCCCACAACAATGCTTCACTACACATATTTAAATCACGCAATACACACGAATACAACAAAATATGTTTTTTTTTAAAAAAAAAAACTTCTATCAAATAAATCTTCTGCCTCCTTTTTTTTATTCCATCAACTTTTGTTTTTATCTGTCATTATAGGTGGAAGAATGGAAGCAAAAGCCAGAGAAGCCAGAGACAACTATTAATGAACAGGTCAAGAAGAAAGAAGGCATAGCAAACAAGTGAAAAAGGATATTATATGGATCAATATAACATGTAAAAATGAAAACATAAAGAAATAACTTTGATTTGTTTTTATCAATTTTTTTATTTTTTAAAAATGTATTTAATGTATAACCAAAAGAGAAGGGTTAGAATTCCTATATAGTATCATATAAATGATTGCTCAAATCGGAATTTTGATAAGTAGACTATTTTATTGAACTCTTGTACAATAGATAGATGCCTATACAAGTTCACTATGGGCTAGAAGCCATATTCGAATCAAATAAGTAATTGCAAATCTAATTTAATGTAGACAAAATTAGAACACGTGTAATAATTATCATAGTTAATCAATATAAAAGTTAATTATCAATTTTTGTGAATTAATAACTGTTATATAAAATTAAAAAACTATTTTACTAGTATTAAAGTTAATCAATAATTAATAATTTTTTTAAAAAAAATACAATATTTCCAGTTATTTTTTCTAGAAAAAGGCTATATTTATAACTCTTTTTATAAAGGTAGGTTGTATTTGTAATTATTTTTATAAAAAATATTTATATATTTATGAATATTTTAGAAAATCATGCTATTTTTCCTATTTAAATTGGTGACAATTTTTTAATCCTTTGCCTTATTAACTAAAAAAATGTTTCATCTTCAAGCTAAATAAATGACCAAACACAAACCATAAAAATATTCACACTTTTTTTTTGTTGAATACATAAAATATTCATACTCAATCATAAAAGCTAATAGAAAAAAATATCCAAACTCACACAATTAAATGCCCAAATTTAGCAACAAAAAAAAAAAAGAAAAAGAAAAAATCAACACAACTCTAAAAGAAACCAACATTCACAAAAAATAGTCTGATCCAATTAAAAAAAAAACTCTCGTTTAATCACAAATGTAACTTTAAAGTTTTAACGCCTTACTTTAAAGAGTAAAAAACTTGAGCTAGTTAATTATCAGCTGAGTAACTTTTTCCATAAATTGAGAGT
TCAAAATTCTATCCTATATTTATAATATAACATTATCTTTTAAAAAATAAGTAGTAAAAAACCAATTTTACCCTCATCTTTATGACACATTTCGCCTTAGTTATATGGAGACAGCAGAGATTTTGTCCACAAGGTACATAGAAACTGACACCTCATTATGGGAATTCCAGAAATAATTTTTTTATTTAAATTATAATTACGTGAATGTGATATTTAATTTTTTTAGTACAACAACAGGAATAACGAAGAAATTCAAACATCTAATTTTTACTGAAATTAGTGAGTGCTTTAAAAAATTAAACTATAAATAAAAGATAAGAAAAAAATAAAAAGAATAAAGGAAAACTAGGGACAGAATTGTAATTTAGTACATCAAAATAATTCGCTCAGACCGAAAGAGTACGAGCTCACGGATCTACATCACGAAAGCAGCGGCGCAGGCGGCGTCGAAGAGAGAACCGGCGGCGGCGAAAGTATTGAGTGCAGCATTGTTTCCGGCGGGAGCAGGCGCGTCGGCTGGAGGAGAGGAGATGAAGGATGGAGGAACGGCCCGCGAAGGAGAAGTAGCATGAGGAGGCTTACCGGTGGCCGGAGACGGCGTGGATTTAGGAGAAAGCTTAGGCGAGTCGGCGGCCGGCGAGGGAGAAGCTGCGTGAGAAGATTTCTTAGGTGA

mRNA sequence

ATGTTCGTCCGCTGTGTCGTCGTCTTTGGAGATCCTCATCCTCAGGCCCGCCCGTCATGTCGTCGAAATCTGGTAATTTCTGGAGTTTTACAATGCTCAGATTCCGAACAAGCTTGCTTCAACTTTTCCAGAACACAGAAAACTGCGCCGGCCGTGGCGGTGGCGGTGGCGGTGGAGGCCGTCGTCGTTCTTCTCTGCCGCCGCAGGCCTCCGTCGCCGTGCTTCCTTACGCGCGCTCCTACAACCAATCAATCTATCCAATGTTTCGCTTCTTTGCTTCCAGATCCACAACAGATACATCCACTTCTTCCTATTTCACCAGGAGAAGCAGAAATGGTAATTGAAGAGGCAGAAAGTGTAGCAAAGGTAGTGGAAAAGGTAGCAGAATTAACAGAGAAGGTATCAGCAGAAATTGCTGAGAAGTTTCCTGAGAAGAGTAAACTCAAAGAAGCAGCTCAAGTGCTGGAAAACTACTCCAAGGAAATTGCCCATGATGCCCACTTAACACAAGATATCCTCCACAAGGTGGAAGAATGGAAGCAAAAGCCAGAGAAGCCAGAGACAACTATTAATGAACAGGTCAAGAAGAAAGAAGGCATAGCAAACAAACCGAAAGAGTACGAGCTCACGGATCTACATCACGAAAGCAGCGGCGCAGGCGGCGTCGAAGAGAGAACCGGCGGCGGCGAAAGTATTGAGTGCAGCATTGTTTCCGGCGGGAGCAGGCGCGTCGGCTGGAGGAGAGGAGATGAAGGATGGAGGAACGGCCCGCGAAGGAGAAGTAGCATGAGGAGGCTTACCGGTGGCCGGAGACGGCGTGGATTTAGGAGAAAGCTTAGGCGAGTCGGCGGCCGGCGAGGGAGAAGCTGCGTGAGAAGATTTCTTAGGTGA

Coding sequence (CDS)

ATGTTCGTCCGCTGTGTCGTCGTCTTTGGAGATCCTCATCCTCAGGCCCGCCCGTCATGTCGTCGAAATCTGGTAATTTCTGGAGTTTTACAATGCTCAGATTCCGAACAAGCTTGCTTCAACTTTTCCAGAACACAGAAAACTGCGCCGGCCGTGGCGGTGGCGGTGGCGGTGGAGGCCGTCGTCGTTCTTCTCTGCCGCCGCAGGCCTCCGTCGCCGTGCTTCCTTACGCGCGCTCCTACAACCAATCAATCTATCCAATGTTTCGCTTCTTTGCTTCCAGATCCACAACAGATACATCCACTTCTTCCTATTTCACCAGGAGAAGCAGAAATGGTAATTGAAGAGGCAGAAAGTGTAGCAAAGGTAGTGGAAAAGGTAGCAGAATTAACAGAGAAGGTATCAGCAGAAATTGCTGAGAAGTTTCCTGAGAAGAGTAAACTCAAAGAAGCAGCTCAAGTGCTGGAAAACTACTCCAAGGAAATTGCCCATGATGCCCACTTAACACAAGATATCCTCCACAAGGTGGAAGAATGGAAGCAAAAGCCAGAGAAGCCAGAGACAACTATTAATGAACAGGTCAAGAAGAAAGAAGGCATAGCAAACAAACCGAAAGAGTACGAGCTCACGGATCTACATCACGAAAGCAGCGGCGCAGGCGGCGTCGAAGAGAGAACCGGCGGCGGCGAAAGTATTGAGTGCAGCATTGTTTCCGGCGGGAGCAGGCGCGTCGGCTGGAGGAGAGGAGATGAAGGATGGAGGAACGGCCCGCGAAGGAGAAGTAGCATGAGGAGGCTTACCGGTGGCCGGAGACGGCGTGGATTTAGGAGAAAGCTTAGGCGAGTCGGCGGCCGGCGAGGGAGAAGCTGCGTGAGAAGATTTCTTAGGTGA

Protein sequence

MFVRCVVVFGDPHPQARPSCRRNLVISGVLQCSDSEQACFNFSRTQKTAPAVAVAVAVEAVVVLLCRRRPPSPCFLTRAPTTNQSIQCFASLLPDPQQIHPLLPISPGEAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHLTQDILHKVEEWKQKPEKPETTINEQVKKKEGIANKPKEYELTDLHHESSGAGGVEERTGGGESIECSIVSGGSRRVGWRRGDEGWRNGPRRRSSMRRLTGGRRRRGFRRKLRRVGGRRGRSCVRRFLR
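
The coding sequence and the protein sequence above are related by translation. The sketch below assumes the standard genetic code and checks only the first and last codons of the CDS, copied from the record, against the start and end of the protein. `translate` is an illustrative helper, not a function of any particular library.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 27 nt of the CDS shown above.
first = translate("ATGTTCGTCCGCTGTGTCGTCGTCTTTGGA")
last = translate("AGCTGCGTGAGAAGATTTCTTAGGTGA")

# 891 nt = 297 codons; the final codon is a stop, leaving 296 residues.
expected_residues = 891 // 3 - 1
print(first, last, expected_residues)
```

The two fragments translate to `MFVRCVVVFG` and `SCVRRFLR*`, matching the start and end of the protein sequence, and a CDS of 891 nt corresponds to 296 residues plus a stop codon.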
Homology
BLAST of Sgr021751.1 vs. NCBI nr
Match: XP_022141966.1 (uncharacterized protein LOC111012212 [Momordica charantia])

HSP 1 Score: 146.0 bits (367), Expect = 5.2e-31
Identity = 82/112 (73.21%), Positives = 93/112 (83.04%), Query Frame = 0

Query: 92  LLPDPQQIHPLLPISPGEAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEA 151
           L+P  +Q    L    GEAEMVIEEAESVA+VVEK AE+ EK SAEIA+K PEKSKLKEA
Sbjct: 109 LIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEA 168

Query: 152 AQVLENYSKEIAHDAHLTQDILHKVEEWKQKPEKPETTINEQVKKKEGIANK 204
           A+V+E YSK+IAHDAHLTQDILHKVEEWKQK +K ET INEQ++KKEG ANK
Sbjct: 169 AEVVETYSKQIAHDAHLTQDILHKVEEWKQKLDKSETAINEQIRKKEGPANK 220
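
The percentages on each HSP summary line follow directly from the residue counts: matches divided by alignment length. A small sketch of that arithmetic, using the counts reported for the hit above, is given here. The regular expression is an assumption about the line layout of this page, and it accepts any label that starts with "Pos" for the second ratio.

```python
import re

# Matches e.g. "Identity = 82/112 (73.21%), Positives = 93/112 (83.04%), ..."
HSP_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


line = "Identity = 82/112 (73.21%), Positives = 93/112 (83.04%), Query Frame = 0"
print(hsp_percentages(line))
```

Recomputing the values this way is a cheap consistency check when the summary lines are scraped from a page rather than read from the original BLAST output.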

BLAST of Sgr021751.1 vs. NCBI nr
Match: XP_038888803.1 (uncharacterized protein LOC120078589 [Benincasa hispida])

HSP 1 Score: 141.7 bits (356), Expect = 9.9e-30
Identity = 77/89 (86.52%), Positives = 84/89 (94.38%), Query Frame = 0

Query: 109 EAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHL 168
           EAEMVIEEAE+VAKVVE+VAELTEKVSAEIAEK PEKSKLKEAAQV+ENYSKE+AHDAHL
Sbjct: 113 EAEMVIEEAENVAKVVEEVAELTEKVSAEIAEKLPEKSKLKEAAQVVENYSKEVAHDAHL 172

Query: 169 TQDILHKVEEWKQKPEKPETTINEQVKKK 198
           TQDILHKVEEWKQK +  ET +NEQ+KKK
Sbjct: 173 TQDILHKVEEWKQKLDTSETIVNEQIKKK 201

BLAST of Sgr021751.1 vs. NCBI nr
Match: XP_008455165.1 (PREDICTED: uncharacterized protein LOC103495399 [Cucumis melo])

HSP 1 Score: 127.5 bits (319), Expect = 1.9e-25
Identity = 70/89 (78.65%), Positives = 80/89 (89.89%), Query Frame = 0

Query: 109 EAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHL 168
           EAEMV+EE E+VA+VVEKVAELTEKVS EI+EK PEKSKLKEAAQV+ENYSKEIAHDAHL
Sbjct: 119 EAEMVMEEVENVAEVVEKVAELTEKVSTEISEKLPEKSKLKEAAQVVENYSKEIAHDAHL 178

Query: 169 TQDILHKVEEWKQKPEKPETTINEQVKKK 198
           TQDILHKVEEWKQK +K +  +NE  K++
Sbjct: 179 TQDILHKVEEWKQKIDKSKIDMNESNKER 207

BLAST of Sgr021751.1 vs. NCBI nr
Match: XP_011658811.1 (uncharacterized protein LOC105436091 [Cucumis sativus] >KAE8645815.1 hypothetical protein Csa_017132 [Cucumis sativus])

HSP 1 Score: 121.3 bits (303), Expect = 1.4e-23
Identity = 68/89 (76.40%), Positives = 78/89 (87.64%), Query Frame = 0

Query: 109 EAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHL 168
           EAEMVIEEAE+VA+VVEKVAELTEKVS +I EK PEKSKLKEAA+V+E+YSKEIAHDAHL
Sbjct: 122 EAEMVIEEAENVAEVVEKVAELTEKVSTKICEKLPEKSKLKEAAEVVESYSKEIAHDAHL 181

Query: 169 TQDILHKVEEWKQKPEKPETTINEQVKKK 198
           TQDILHKVEEWK K +K +   NE  K++
Sbjct: 182 TQDILHKVEEWKLKVDKSKIDTNEPNKEE 210

BLAST of Sgr021751.1 vs. NCBI nr
Match: KAG7011540.1 (hypothetical protein SDJN02_26446, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 117.9 bits (294), Expect = 1.5e-22
Identity = 68/90 (75.56%), Positives = 77/90 (85.56%), Query Frame = 0

Query: 109 EAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHL 168
           EAE +IEEAE+VA+VVEKVAELTEKVSAEI EK  E+SK+KEAA+V+E YSKEIAH A L
Sbjct: 97  EAEKMIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALL 156

Query: 169 TQDILHKVEEWKQKPEKPETTINEQVKKKE 199
            Q ILHKVEEWKQK +K E  INEQ+KKKE
Sbjct: 157 AQHILHKVEEWKQKLDKSEADINEQIKKKE 186

BLAST of Sgr021751.1 vs. ExPASy TrEMBL
Match: A0A6J1CKS7 (uncharacterized protein LOC111012212 OS=Momordica charantia OX=3673 GN=LOC111012212 PE=4 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 2.5e-31
Identity = 82/112 (73.21%), Positives = 93/112 (83.04%), Query Frame = 0

Query: 92  LLPDPQQIHPLLPISPGEAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEA 151
           L+P  +Q    L    GEAEMVIEEAESVA+VVEK AE+ EK SAEIA+K PEKSKLKEA
Sbjct: 109 LIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEA 168

Query: 152 AQVLENYSKEIAHDAHLTQDILHKVEEWKQKPEKPETTINEQVKKKEGIANK 204
           A+V+E YSK+IAHDAHLTQDILHKVEEWKQK +K ET INEQ++KKEG ANK
Sbjct: 169 AEVVETYSKQIAHDAHLTQDILHKVEEWKQKLDKSETAINEQIRKKEGPANK 220

BLAST of Sgr021751.1 vs. ExPASy TrEMBL
Match: A0A1S3C099 (uncharacterized protein LOC103495399 OS=Cucumis melo OX=3656 GN=LOC103495399 PE=4 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 9.3e-26
Identity = 70/89 (78.65%), Positives = 80/89 (89.89%), Query Frame = 0

Query: 109 EAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHL 168
           EAEMV+EE E+VA+VVEKVAELTEKVS EI+EK PEKSKLKEAAQV+ENYSKEIAHDAHL
Sbjct: 119 EAEMVMEEVENVAEVVEKVAELTEKVSTEISEKLPEKSKLKEAAQVVENYSKEIAHDAHL 178

Query: 169 TQDILHKVEEWKQKPEKPETTINEQVKKK 198
           TQDILHKVEEWKQK +K +  +NE  K++
Sbjct: 179 TQDILHKVEEWKQKIDKSKIDMNESNKER 207

BLAST of Sgr021751.1 vs. ExPASy TrEMBL
Match: A0A0A0K622 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G064570 PE=3 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 6.7e-24
Identity = 68/89 (76.40%), Positives = 78/89 (87.64%), Query Frame = 0

Query: 109 EAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHL 168
           EAEMVIEEAE+VA+VVEKVAELTEKVS +I EK PEKSKLKEAA+V+E+YSKEIAHDAHL
Sbjct: 386 EAEMVIEEAENVAEVVEKVAELTEKVSTKICEKLPEKSKLKEAAEVVESYSKEIAHDAHL 445

Query: 169 TQDILHKVEEWKQKPEKPETTINEQVKKK 198
           TQDILHKVEEWK K +K +   NE  K++
Sbjct: 446 TQDILHKVEEWKLKVDKSKIDTNEPNKEE 474

BLAST of Sgr021751.1 vs. ExPASy TrEMBL
Match: A0A6J1I3G6 (uncharacterized protein LOC111470651 OS=Cucurbita maxima OX=3661 GN=LOC111470651 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 1.6e-22
Identity = 69/95 (72.63%), Positives = 77/95 (81.05%), Query Frame = 0

Query: 109 EAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHL 168
           EAE  IEEAE VA+VVEKVAELTEKVSAEI EK PEKS++K+AA+ +E YSKEIAHDA L
Sbjct: 97  EAEKGIEEAEHVAEVVEKVAELTEKVSAEIGEKLPEKSRMKDAAEAVEKYSKEIAHDALL 156

Query: 169 TQDILHKVEEWKQKPEKPETTINEQVKKKEGIANK 204
            Q ILHKVEEWKQK +K E  INEQ+KK   I NK
Sbjct: 157 AQHILHKVEEWKQKLDKSEADINEQMKK---IVNK 188

BLAST of Sgr021751.1 vs. ExPASy TrEMBL
Match: A0A6J1GN56 (uncharacterized protein LOC111455450 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111455450 PE=4 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 3.7e-22
Identity = 67/90 (74.44%), Positives = 77/90 (85.56%), Query Frame = 0

Query: 109 EAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAHL 168
           EAE +IEEAE+VA+VVEKVAELTEKVSAEI EK  E+SK+KEAA+V+E YSKEIAH A L
Sbjct: 135 EAEKMIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALL 194

Query: 169 TQDILHKVEEWKQKPEKPETTINEQVKKKE 199
            Q ILHKVEEWKQK +K +  INEQ+KKKE
Sbjct: 195 AQHILHKVEEWKQKLDKSKADINEQMKKKE 224

BLAST of Sgr021751.1 vs. TAIR 10
Match: AT2G14095.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; EXPRESSED IN: 7 plant structures; EXPRESSED DURING: 4 anthesis, C globular stage, petal differentiation and expansion stage; Has 106 Blast hits to 103 proteins in 21 species: Archae - 0; Bacteria - 5; Metazoa - 0; Fungi - 4; Plants - 87; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 87.0 bits (214), Expect = 2.7e-17
Identity = 52/109 (47.71%), Positives = 75/109 (68.81%), Query Frame = 0

Query: 108 GEAEMVIEEAESVAKVVEKVAELTEKVSAEIAEKFPEKSKLKEAAQVLENYSKEIAHDAH 167
           GEAE+V+E  E+VA++VEKVA  T++++ E+AEK PEK+KLK+ A VLE+ S+  AH+AH
Sbjct: 122 GEAELVVEGVEAVAEMVEKVATATDEMAEEMAEKLPEKNKLKQVALVLEHISEVAAHEAH 181

Query: 168 LTQDILHKVEEWKQKPEKPETTINEQVKKKEGIANKPKEYELTDLHHES 217
           LTQD LHKVE+  Q  +  E  I   + KK   A   ++ +  + +HES
Sbjct: 182 LTQDFLHKVEKVTQDIDDLEAMIKPLIDKKVANAETKQQTKEEEANHES 230

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
XP_022141966.1  5.2e-31  73.21         uncharacterized protein LOC111012212 [Momordica charantia]
XP_038888803.1  9.9e-30  86.52         uncharacterized protein LOC120078589 [Benincasa hispida]
XP_008455165.1  1.9e-25  78.65         PREDICTED: uncharacterized protein LOC103495399 [Cucumis melo]
XP_011658811.1  1.4e-23  76.40         uncharacterized protein LOC105436091 [Cucumis sativus] >KAE8645815.1 hypothetica...
KAG7011540.1    1.5e-22  75.56         hypothetical protein SDJN02_26446, partial [Cucurbita argyrosperma subsp. argyro...

Match Name      E-value  Identity (%)  Description
A0A6J1CKS7      2.5e-31  73.21         uncharacterized protein LOC111012212 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A1S3C099      9.3e-26  78.65         uncharacterized protein LOC103495399 OS=Cucumis melo OX=3656 GN=LOC103495399 PE=...
A0A0A0K622      6.7e-24  76.40         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G064570 PE=3 SV=1
A0A6J1I3G6      1.6e-22  72.63         uncharacterized protein LOC111470651 OS=Cucurbita maxima OX=3661 GN=LOC111470651...
A0A6J1GN56      3.7e-22  74.44         uncharacterized protein LOC111455450 isoform X2 OS=Cucurbita moschata OX=3662 GN...

Match Name      E-value  Identity (%)  Description
AT2G14095.1     2.7e-17  47.71         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term     Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 248..270
None      No IPR available  PANTHER      PTHR33735:SF10  EXPRESSED PROTEIN    coord: 93..203
None      No IPR available  PANTHER      PTHR33735       EXPRESSED PROTEIN    coord: 93..203
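
The Alignment column gives each annotated region as a residue range on the protein. A minimal sketch for turning such a range into a length or a subsequence follows, assuming the coordinates are 1-based and inclusive (an assumption about this page's convention). `parse_coord`, `region_length`, and `extract_region` are hypothetical helper names.

```python
import re


def parse_coord(text: str) -> tuple[int, int]:
    """Parse a 'coord: start..end' entry into (start, end)."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range in: {text!r}")
    return int(m.group(1)), int(m.group(2))


def region_length(start: int, end: int) -> int:
    # Assumption: 1-based, inclusive residue positions.
    return end - start + 1


def extract_region(protein: str, start: int, end: int) -> str:
    """Slice the protein with 1-based inclusive coordinates."""
    return protein[start - 1:end]


panther = parse_coord("coord: 93..203")
disorder = parse_coord("coord: 248..270")
print(region_length(*panther), region_length(*disorder))
```

Under that assumption the PANTHER match covers 111 residues and the predicted disordered region covers 23. Passing the protein sequence from the Sequences section to `extract_region` would return the corresponding residues.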

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Sgr021751     Sgr021751    gene


The following exon feature(s) are a part of this mRNA:

Feature Name       Unique Name        Type
Sgr021751.1.exon1  Sgr021751.1.exon1  exon
Sgr021751.1.exon2  Sgr021751.1.exon2  exon
Sgr021751.1.exon3  Sgr021751.1.exon3  exon
Sgr021751.1.exon4  Sgr021751.1.exon4  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name     Unique Name        Type
cds.Sgr021751.1  cds.Sgr021751.1    CDS
cds.Sgr021751.1  cds.Sgr021751.1_2  CDS
cds.Sgr021751.1  cds.Sgr021751.1_3  CDS
cds.Sgr021751.1  cds.Sgr021751.1_4  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name  Unique Name          Type
Sgr021751.1   Sgr021751.1-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane