Sgr011656 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr011656
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionReverse transcriptase domain-containing protein
Locationtig00153024: 188889 .. 195178 (-)
RNA-Seq ExpressionSgr011656
SyntenySgr011656
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGAAAGATCAGGAGATGTCCTTTCTGGATATCCTTTCAACAGTGGGAGCTGGAGTCCAACCCCACCGAGAGCCATTAGTACTTTATTTTGGAATGTTGGAAAGGTGGGGTCCCTGAGAACAGTTCAACATTTGTACCAATTGGTGCAGGATAAAAAACCCCAATTCTTGTATTTTTTTGAAGCGGAAGCTGTAAGAAGTAGAATAGAAAAGATTAAACAAAGGCTTGGTTTTTCTTTCTGTTTTATTGTTGAGTGCTGGGGAAAGAGTGGCGGGCTTGCTCTTCTTTGGTAGGAGGCATCAAAATTTAATCTCATTTCCTTCTCAATTAATCACATTGATGGTTGGGTGGAACATGAGGGAAAAACGTGGAGACTCATTAGCTTCTATGGAAATCCAAGACATGAATCACAAAAGATTTCATGGAGTCTCTTGAAAAGTTTAAAGGGAAGTGATAGTACTCCTTGGGTGGTGGGAGGGGGGGGGGGGGGGGAGGGAGTTGGATATTAATGAGATTCTTTCCCATGGTGAGAAGGATGGAGGTAGATTAAGCGATGATAATGCTATAACTAAATTTCGAGAAGCAGTAGATGTGTGTGGATTAATGGATTTGTGCTATGAGGGTGATGATTTCACCTGGTCAAACAAATGCCCAAACGAGGGGATTATTTTGAACGTTTGAATAGAGTGTTGTGCAATCTGGCTTGGCGTACAATGTTCTAGTCCAGTTTCGTTCATAATTTGGAGTATAATTTCTCTGATCATCGTCCTATGGAATTGTGCATGGAATTGGCTGCTAGACTTATTGGGAAGAAAAAATAGAGAATTTTTCCATTTGAAAAAGTATGGTTGAGTTCGTTGGAGTGTGAAAATGTTGTGGAGAATAGTTGGGATACTCGGCACGAAACTAATTGGATTTCTACTATTGTGAAAAAATCTAGAAATTGCTTGAGCACACTTATGGATTGGGGAAAGGACAAATTTGGGGGATATCCAAAAAAGATCAAAGAGGTAGATTTGAAGTTTGGGGGAAGCAGACACAAGTTGATTTCCGCAGAGACTTTAGAGAAAATATTAATGAAAGAAGAGATTTATTGGCAACAAAGATCTCAAGAGCATTGGGTAAAATGGGATGACAGGAATACTCGGTGGTTTCATAATAAAGCGTCTTATAGAAGGAAGATAAATGAAATAATAGGCTTGGAGGGTGAAAATGGGCAGTGGACGCAAAATAAGAAGCCTATAGCAGATATTAGTTCTAACTACTTTGTTAAGATTTATTCTTCTTCTGTACTTGGTGATGCTGATTTTGGTGAGATTCTTAAGGATGTCCCTTCTTCAGTCTCTATTTTCTGCATTATAATGATAACACAGGGCTTGTTCTCGTCACTCAAATTCTCACTGGTGAGAATTACTCTTCATGGAGTAGATCGATGCTCATAGCCTTGTCAGTGAAGAAAATTGGCTTCACAGATGGATCGATCGAGCGTCCTACTTGTGAACTTCTCCCTGTGTGGATTTGGAACAATCACGTGGTCATCGCTTGGATACTTAATTCGGTGTTTAAAGAAATTTCTACGAGCATGATTTTCTCTAATTCTGCTCGTGACATCTGAACCGATTTAAAGGAGCGCTTCCAAAGTAAGAACGGTCCAAGGATTTTCCAACTCCGATGAGAACTAGCTACTCTTGCTCAGGATCAGCAGTCGGTGAGCATTAACCATCTGGGATGAATTAGGATCCTATCGTCCTACATGTACTTATGGATTGTGTACGTGTGGAGGTATGAAAGCAATTGCAAATTACTTTTAGTATGAGTATATGATGTGCTTCCTTATGAGACTCAATGACTTATTTGCCCAAACCCATACTCAATTGCTGCTTATGGAGCCTCCTCCCACTATCAACAAGGCTTCTCGCTAGTCACGAAAGAGGAACAACACACAATTGGCATTCTTCCTACGCCTACACCAATTGCTCTTGTTGTGCAAGGTAATCCGAGATCAAACAAAAGCAAACGAGATCGCCCTGTCTGCACACATTGTGGAATTTAGAGTCATATGATGGACAAGTGCTACAAGTTACACAGTTATCCTCCTGGATACAAGTCACGGAATCAGCGCTCCCTTCCATCACAAATTCAGACACAGTCTACGAATGTTATTGGCCCTCAAGATCAGCAACAGTCCAAGCTTCTGATTCTTAAGCCCTGGCTGTTACTCATTATGCTGCTGACATTGAGGCATTGGCAGTGCAATGTCAGCAACTCTTTCTCAAGTTACAGTCTCAACTCTCTCTGTTGCTAAAGCTTCTCCTGACTGTGAAGCTTCAACCTTATATATGGATGGTATCTCTCCTCTTCTGCCCTTTTCGATGGGGCTTTGGATTTTGGATTCATGGGCTTTGTCTCACATTTTCTTTATGAAGTCTTCTTTTGAGTCTCTTTCACGCATCTTTCCTAAGTTTGTTAATCTACTAAATCAGTTCTTTATTGTTGTGGAGTTTGTTGAAATTATTCGGTTCACTGGCGATATTGTTCTTCGTGATGTCCTGTACATTCCTCAGTTCCGTTTCAACCTTATTTCTCTCAGTGTTTTATTAAAGGACATGCCTAATGTTTTGGTGGAATTCTCTAATTCTCAATGCATTATTCAGGACAAGTTCTCCTTGAAGAAGATTGGCAAGGGTAGCCTTCAACATGGACTCTATGTCTTTGAGCGTGCTGTCTCCCATACTGGTGCTCCAAATCAGACTGATATTTGCTTGACCTCTACAAAGTTGAATAAAATCTCTTTTACTACTTGGCATGCTAGGCTTGGTCATCCATCGTTTAAAAGACTTTCTATTTTAAGAGATCTTTTGCAGTTTGATTCTTCTATTCATAAGATTTCTTCGCCATACATGATTTGCCCCTTGGCTAAATAGAAACAATTGTCTTTTGATTCACACAATAATATGTCTTTCCATGCTTTTGACCTTATACACGTTGATATCTGGGGTCCTTACACTACAACTATGTACAATGGCTATAAATATTTTCTTACTATTGTCGATGACTGCATGAGGTTCACATGAGTTTATATGCTTCAAGCCAAGTCTAATGTGTTGCATGTCATACCTCAATTTTTTGACCTTGTTTATACTCAATTTGGCAAAATCATCAAAGTTTTCAGGTCTAATAATGCCCCTGTGTTACTGTTTAAGGAGTTTTTTGCTTCAAAAGGGGTTGTGCATCAGTTTTCTTGTGTGGAGAGGCCCGAACAGAACTCTATGGTGGAAAGGAAGCATCAACATATTTTAAACATTGCTAGGTCTCTTTTCTTTCAATCTAGAGTACTAATCACTTTTTGGGGAGATTGTATCTGAACTGCTGTATATTTGATTAACCGAACACCATCACAAGTATTAGAATGGAAATCTCCTTTTCAGGTTCTTTACGACATTGTCCCTGATTATGGTTTGTTGCGAGTGTTTGGATCACTCTGCTTTGCTTCCACATTGAAGCATAACAGGCATAAGTTTCATCCTCAAACCATTGTTTCTATTTTTGTGGATATCTGCCTGATGTCAAAGGTTACCGTCTGTATGATATTGAGCAAAAACAGTTCTTTATTTCAAGAGACGTCTTTCATGAGTCAATTTTTCCATTTCATTCAGTCACTAATCTTTCTGTTCAGCCTGATCCACTTTCAGATCTTGTCTTGCCAAGGTCCTTTGACTTGCCTGAGGATAGTCCTCAAGTTACTTCTAGAGCTCCAGTGCAGGAAACTGTTGCAAGTGCACCTGACATTGCTGACTCTTCCCTAGGCACTCGTTTAGCTAATATAGATCATGTTGAGGTAGTTTTGTTTGCTGCAGATCAGGTTAATGTTCCTCCCACGACAATACCAGTCACAAGGAAGTCTACTCGAATTTCCAACCTCCATCTTATCTGCGTGACTACCACTGTAGTCTTTTGACCATTACTGACATTCCCTCTATTTTTTCTAAGCATCTGCTGGCTCACTATGTTTCCTATTCTCATTTATCTCCTTCTTATCGCTTATTTGTTTTGAATGTTAGTAATGCTTTTGAACCTCAGTTTTACTATTAAGAGATTGGATTCCAGCACTGGCATGATGCTATGAATGCTAAGTTAGAGACCATGGAGACAAATCACACTTGGACAGTGGTTCTTCTTCCCCCCGGGCATCATATTGTGGGCTGTAAGTGGATCTATAGGATTAGGCATCGATCTGACAGATCCATTGAACGATATAAAGTGAGGCTTGTGGCCAAAGGGTTCAAGCAACAAGAGGGTCTTGACTTCATTGAGGCATTCTCGCCCATAGCCAAACTTGTCATGGTCAAAGTCATCCTCACCATTGCTGTTTCCAACAATTGGCCCTTGATACAGTTAGATGTCAATAACACATTCCTACATGGGGACTTCTTGGAGGAAGTATACATGGACTTACCTTTGGGATACAAACCTAAGTCTGTTGTTCATGGATAGAGGGAGAAGCTGGTTTGTCATCTTAAGAAGTCCATTTATGGCTTGCGGCAAGCATGAAGGTAGTAGTTCAACAAATTTTCAATTGCTTTGTTGGCTCTTGGGTTTATGCAATCCAAGTTTGATTGCTCTTTCTTTGTCAAAGGGTCGGGCACCTCTTTCATTGCATTGCTAGTTTATGTCGACGACATCATAATCAATGGTGCAAGTTCCTCTGCTATTCACACACCTTAAGTCCCTTCTCAATAGTCGCTCCAAGCTTAAAGATTTGAGTTCATTGAAGTACTTCCTTGGCTTAGAGCTTGCACGAAATTCCTAAGGGATTTGTCTAACATAGAGGAATTACACTCTAAATCTTCTTAAGGGGGCTGGTTTACTAGGGTGCAAGCCTGCTATGCTTCCAATGGATCCAAACCTCAAGGTTCGGTCCAATTATGGTGAGGTTCTTTCTGACCCCTCTATTTTTAGACACCTTGTTGGTCGTTTGTTGTACCTAACCATCTCTAGGCCACACATCACTTTCGTTGTTCATAAATTGAGTCAATATGTATCCAAGGCATGTAAATCTCATTTATCTGCCGCACAACACCTGCTGCGCTACTTGAAAAATAGCCCTAGCAAGGGAATTTTCCTTTTTGCCTCTTCATCATTTTAGTTACATGCCTTTGCAGATGCTGATTGGGGCTCCTGTTTGGATACACGACGTTCCATCACCAGCTTTTGTGTATTTTTTAGAGACTCAATGATTTCGTGGAAAGCAAAGAAACAAACGACAGTTTCTCATTCCTCAGTAGAGGCTGAATATCGCTCACTTGCTACTGTCACTTGTGAACTCCTTTGGCTGGTGAGTCTTTTATGCGATCAGCAAGTGCCCTTTCAGCCTCCTGCTCTCCTCTTTTGTGACAACCAAGCTGCTATAGATATTGCTTCAAATCCAATCTTCCATGAGAGGACTAAGCATATAGAGCTCGACTGTCACTTTATTAGAGATAAGGTCTCCGATGGCTTCCTTCGATTGCTGCCCGTTCGTTTAAAGGATCAGCTTGCTGATATTTTTACTAAGTCGTTGCCTGCCACTTCTCTTTTTCCTCTTGTTTCTAGGATGGGCCTTCTGGATATCCATGCTCCATCTTGAGGGTGAGTATTAGATATTAGTTAGATGTATATAAGTTGTTAGTTAGTTTAGTCAGTTGATTATTGTTCAATTGTTAGTCACTTGGATATTTTTCTATTTATTGGAGGTTTGTAATTGAACTTCGATGAATGAGAAAGAACTCACTTTTCATTCCTCCACTTTCTGGTTCAATATGGTATCATTCGCATAGGAATTTCTGCCTCCCTGCCTGCCTCTTTTTTTTTCCCTTTCCTGTCAAGTTCTTCTTCTCTGTTCTTCATCTCTTTCTAATTTCTTCTCGTCAGTATGCAAACTAATCTCACTTCGATCTCAAATCTTCGTCGATGAATCACAATTCCGGTGAGAACTCTTCAAATCCTTATTTTCTGCATCATAATGATAACACAGGGCTTATGAGACAGATTCTTTGAGGATTTTCCATGTGCTTATCACAGAGAATGAGGATATTTCAGAGGTGGGGAATATAATTTCAAAGGTAAAGCATAAGATTGTTGGTTCTTCTTCTACCTTCTTCACTTTCACGAGGCCAGAAGGCAATGTTGTTGCTCATTTGCTTTCAAAGATGGCTTTGGAGAAACGATGGACGAACGTGTGGTTGGAAAGCTGGCCAGATTCTTTTATGTCATGTCTGGTAGCTGAGTGTGCGGATGTGTTGTCCTAA

mRNA sequence

ATGTCGAAAGATCAGGAGATGTCCTTTCTGGATATCCTTTCAACAGTGGGAGCTGGAGTCCAACCCCACCGAGAGCCATTAGTACTTTATTTTGGAATGTTGGAAAGAAATTGCTTGAGCACACTTATGGATTGGGGAAAGGACAAATTTGGGGGATATCCAAAAAAGATCAAAGAGGTAGATTTGAAGTTTGGGGGAAGCAGACACAAGTTGATTTCCGCAGAGACTTTAGAGAAAATATTAATGAAAGAAGAGATTTATTGGCAACAAAGATCTCAAGAGCATTGGGTAAAATGGGATGACAGGAATACTCGGTGGTTTCATAATAAAGCGTCTTATAGAAGGAAGATAAATGAAATAATAGGCTTGGAGGGTGAAAATGGGCAGTGGACGCAAAATAAGAAGCCTATAGCAGATATTAGTTCTAACTACTTTGTTAAGATTTATTCTTCTTCTGTACTTGGTGATGCTGATTTTGTGTTTGGATCACTCTGCTTTGCTTCCACATTGAAGCATAACAGGCATAAGTTTCATCCTCAAACCATTGTTTCTATTTTTGTGGATATCTGCCTGATGTCAAAGGTTACCGTCTTCACTAATCTTTCTGTTCAGCCTGATCCACTTTCAGATCTTGTCTTGCCAAGGTCCTTTGACTTGCCTGAGGATAGTCCTCAAGTTACTTCTAGAGCTCCAGTGCAGGAAACTGTTGCAAGTGCACCTGACATTGCTGACTCTTCCCTAGGCACTCGTTTAGCTAATATAGATCATGTTGAGGTAGTTTTGTTTGCTGCAGATCAGGTTAATGTTCCTCCCACGACAATACCAGTCACAAGGAAGGCTTATGAGACAGATTCTTTGAGGATTTTCCATGTGCTTATCACAGAGAATGAGGATATTTCAGAGGTGGGGAATATAATTTCAAAGGTAAAGCATAAGATTGTTGGTTCTTCTTCTACCTTCTTCACTTTCACGAGGCCAGAAGGCAATGTTGTTGCTCATTTGCTTTCAAAGATGGCTTTGGAGAAACGATGGACGAACGTGTGGTTGGAAAGCTGGCCAGATTCTTTTATGTCATGTCTGGTAGCTGAGTGTGCGGATGTGTTGTCCTAA

Coding sequence (CDS)

ATGTCGAAAGATCAGGAGATGTCCTTTCTGGATATCCTTTCAACAGTGGGAGCTGGAGTCCAACCCCACCGAGAGCCATTAGTACTTTATTTTGGAATGTTGGAAAGAAATTGCTTGAGCACACTTATGGATTGGGGAAAGGACAAATTTGGGGGATATCCAAAAAAGATCAAAGAGGTAGATTTGAAGTTTGGGGGAAGCAGACACAAGTTGATTTCCGCAGAGACTTTAGAGAAAATATTAATGAAAGAAGAGATTTATTGGCAACAAAGATCTCAAGAGCATTGGGTAAAATGGGATGACAGGAATACTCGGTGGTTTCATAATAAAGCGTCTTATAGAAGGAAGATAAATGAAATAATAGGCTTGGAGGGTGAAAATGGGCAGTGGACGCAAAATAAGAAGCCTATAGCAGATATTAGTTCTAACTACTTTGTTAAGATTTATTCTTCTTCTGTACTTGGTGATGCTGATTTTGTGTTTGGATCACTCTGCTTTGCTTCCACATTGAAGCATAACAGGCATAAGTTTCATCCTCAAACCATTGTTTCTATTTTTGTGGATATCTGCCTGATGTCAAAGGTTACCGTCTTCACTAATCTTTCTGTTCAGCCTGATCCACTTTCAGATCTTGTCTTGCCAAGGTCCTTTGACTTGCCTGAGGATAGTCCTCAAGTTACTTCTAGAGCTCCAGTGCAGGAAACTGTTGCAAGTGCACCTGACATTGCTGACTCTTCCCTAGGCACTCGTTTAGCTAATATAGATCATGTTGAGGTAGTTTTGTTTGCTGCAGATCAGGTTAATGTTCCTCCCACGACAATACCAGTCACAAGGAAGGCTTATGAGACAGATTCTTTGAGGATTTTCCATGTGCTTATCACAGAGAATGAGGATATTTCAGAGGTGGGGAATATAATTTCAAAGGTAAAGCATAAGATTGTTGGTTCTTCTTCTACCTTCTTCACTTTCACGAGGCCAGAAGGCAATGTTGTTGCTCATTTGCTTTCAAAGATGGCTTTGGAGAAACGATGGACGAACGTGTGGTTGGAAAGCTGGCCAGATTCTTTTATGTCATGTCTGGTAGCTGAGTGTGCGGATGTGTTGTCCTAA

Protein sequence

MSKDQEMSFLDILSTVGAGVQPHREPLVLYFGMLERNCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAETLEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSVLGDADFVFGSLCFASTLKHNRHKFHPQTIVSIFVDICLMSKVTVFTNLSVQPDPLSDLVLPRSFDLPEDSPQVTSRAPVQETVASAPDIADSSLGTRLANIDHVEVVLFAADQVNVPPTTIPVTRKAYETDSLRIFHVLITENEDISEVGNIISKVKHKIVGSSSTFFTFTRPEGNVVAHLLSKMALEKRWTNVWLESWPDSFMSCLVAECADVLS
Homology
BLAST of Sgr011656 vs. NCBI nr
Match: XP_022150918.1 (uncharacterized protein LOC111018954 [Momordica charantia])

HSP 1 Score: 103.2 bits (256), Expect = 4.9e-18
Identity = 53/131 (40.46%), Postives = 81/131 (61.83%), Query Frame = 0

Query: 32  GMLERNCLSTLMDWGKDKFGGYPKKIK--EV-------DLKFGGSRHKLISAET-LEKIL 91
           GM    C+ +L+ WG++K G +  ++K  EV       DL F  +R     A T + ++L
Sbjct: 169 GMKLNQCVLSLVHWGRNKTGNFRNRLKVAEVMLQSAIHDLPFAPNREAFQQAITNMNQLL 228

Query: 92  MKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADIS 151
            +EEI+W+QRS++ W K  DRNT+WFH KAS+RR+ NEI GL  + G W +NK  +  + 
Sbjct: 229 KEEEIFWRQRSRDLWHKHGDRNTKWFHTKASHRRRTNEIKGLLDQQGTWEENKFKVVGMI 288

Query: 152 SNYFVKIYSSS 153
            +YF +++SSS
Sbjct: 289 ESYFTELFSSS 299

BLAST of Sgr011656 vs. NCBI nr
Match: XP_022150918.1 (uncharacterized protein LOC111018954 [Momordica charantia])

HSP 1 Score: 62.0 bits (149), Expect = 1.2e-05
Identity = 29/72 (40.28%), Postives = 40/72 (55.56%), Query Frame = 0

Query: 297  EDISEVGNIISKVKHKIVGSSSTFFTFTRPEGNVVAHLLSKMALEKRWTNVWLESWPDSF 356
            ED+SE G I+ K K+    S    F F + EGN  AH+L++ AL     ++W+E WP   
Sbjct: 1066 EDLSETGEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLHEFSIWMEDWPLEL 1125

Query: 357  MSCLVAECADVL 369
             SCL  EC + L
Sbjct: 1126 KSCLEMECLEEL 1137


HSP 2 Score: 93.2 bits (230), Expect = 5.0e-15
Identity = 42/135 (31.11%), Postives = 81/135 (60.00%), Query Frame = 0

Query: 28  VLYFGMLERNCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------L 87
           V  F  + +  ++ L++W K +F G  K+++++  +  G + + +  E+          +
Sbjct: 239 VYLFRKIAKESMARLLNWSKREFRGREKQLEQLQKQLQGWKQRGVQYESRKEIKMVENQI 298

Query: 88  EKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPI 147
           + I++ EEIYW+QRS+  W+K  D+NT++FH+KAS R+K N I G+E  +G W +  K +
Sbjct: 299 QNIIIDEEIYWKQRSRADWLKEGDKNTKFFHHKASNRKKKNRIWGIENSSGDWLKRAKDV 358

Query: 148 ADISSNYFVKIYSSS 153
            D   NYF ++++++
Sbjct: 359 EDEFCNYFTELFTTT 373

BLAST of Sgr011656 vs. NCBI nr
Match: OMO99000.1 (reverse transcriptase [Corchorus capsularis])

HSP 1 Score: 92.8 bits (229), Expect = 6.6e-15
Identity = 55/142 (38.73%), Postives = 80/142 (56.34%), Query Frame = 0

Query: 55  KKIKEVDLKFGGSRHKLIS-AETLEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASY 114
           K + E D ++G  R K     E L ++L +EE YW QRS+ +W+   DRNT +FH +AS 
Sbjct: 645 KVVAETD-RWGNMRKKEADLREELNRLLEEEETYWMQRSRVNWLSDGDRNTSFFHAQASK 704

Query: 115 RRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSVLGDADFVFGSLCFASTLKHN 174
           RRK N I GLEG++GQWT +   I +I+SNYF K++ SS     D +  ++  + T + N
Sbjct: 705 RRKKNSIEGLEGDDGQWTNDLADIQEITSNYFKKLFDSSGFLQFDEILEAVNPSITAEMN 764

Query: 175 RHKFHPQTIVSIFVDICLMSKV 196
            H     T   IF  +  M  +
Sbjct: 765 EHLLTEFTAEEIFTALKQMHPI 785

BLAST of Sgr011656 vs. NCBI nr
Match: CAB4316864.1 (unnamed protein product [Prunus armeniaca])

HSP 1 Score: 92.0 bits (227), Expect = 1.1e-14
Identity = 48/132 (36.36%), Postives = 73/132 (55.30%), Query Frame = 0

Query: 37  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEI 96
           NC S L  W  +K G  PKK+KE+ L+    +    S +T          L+K L +EEI
Sbjct: 154 NCASNLSRWSAEKGGQVPKKVKELRLRLASLQSDEPSTQTFHTRSLIETELDKCLEQEEI 213

Query: 97  YWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFV 156
           YW QRS+ HW++  DRNT +FH +A+ RRK N ++G+  EN +W +    I  +   +F 
Sbjct: 214 YWHQRSRVHWLQHGDRNTSFFHKQATSRRKKNALVGILDENDRWQREYDKIGGVFVEFFT 273

Query: 157 KIYSSSVLGDAD 159
            +++   +G AD
Sbjct: 274 NLFTLD-MGVAD 284

BLAST of Sgr011656 vs. NCBI nr
Match: XP_008237273.1 (PREDICTED: uncharacterized protein LOC103336015 [Prunus mume])

HSP 1 Score: 92.0 bits (227), Expect = 1.1e-14
Identity = 48/132 (36.36%), Postives = 73/132 (55.30%), Query Frame = 0

Query: 37  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEI 96
           NC S L  W  +K G  PKK+KE+ L+    +    S +T          L+  L +EEI
Sbjct: 675 NCASNLSRWSAEKGGQVPKKVKELRLRLASLQSDEPSTQTFHNRSLIETELDTCLEQEEI 734

Query: 97  YWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFV 156
           YW QRS+ HW++  DRNT +FH +A+ RRK N ++G+  EN +W +    I  +   +F 
Sbjct: 735 YWHQRSRVHWLQHGDRNTSFFHKQATSRRKKNALVGILDENDRWQREYDKIGGVFVEFFT 794

Query: 157 KIYSSSVLGDAD 159
            +++S  +G AD
Sbjct: 795 NLFTSD-MGVAD 805

BLAST of Sgr011656 vs. ExPASy TrEMBL
Match: A0A6J1DAR4 (uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018954 PE=4 SV=1)

HSP 1 Score: 103.2 bits (256), Expect = 2.4e-18
Identity = 53/131 (40.46%), Postives = 81/131 (61.83%), Query Frame = 0

Query: 32  GMLERNCLSTLMDWGKDKFGGYPKKIK--EV-------DLKFGGSRHKLISAET-LEKIL 91
           GM    C+ +L+ WG++K G +  ++K  EV       DL F  +R     A T + ++L
Sbjct: 169 GMKLNQCVLSLVHWGRNKTGNFRNRLKVAEVMLQSAIHDLPFAPNREAFQQAITNMNQLL 228

Query: 92  MKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADIS 151
            +EEI+W+QRS++ W K  DRNT+WFH KAS+RR+ NEI GL  + G W +NK  +  + 
Sbjct: 229 KEEEIFWRQRSRDLWHKHGDRNTKWFHTKASHRRRTNEIKGLLDQQGTWEENKFKVVGMI 288

Query: 152 SNYFVKIYSSS 153
            +YF +++SSS
Sbjct: 289 ESYFTELFSSS 299

BLAST of Sgr011656 vs. ExPASy TrEMBL
Match: A0A6J1DAR4 (uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018954 PE=4 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 6.0e-06
Identity = 29/72 (40.28%), Postives = 40/72 (55.56%), Query Frame = 0

Query: 297  EDISEVGNIISKVKHKIVGSSSTFFTFTRPEGNVVAHLLSKMALEKRWTNVWLESWPDSF 356
            ED+SE G I+ K K+    S    F F + EGN  AH+L++ AL     ++W+E WP   
Sbjct: 1066 EDLSETGEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLHEFSIWMEDWPLEL 1125

Query: 357  MSCLVAECADVL 369
             SCL  EC + L
Sbjct: 1126 KSCLEMECLEEL 1137


HSP 2 Score: 92.8 bits (229), Expect = 3.2e-15
Identity = 55/142 (38.73%), Postives = 80/142 (56.34%), Query Frame = 0

Query: 55  KKIKEVDLKFGGSRHKLIS-AETLEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASY 114
           K + E D ++G  R K     E L ++L +EE YW QRS+ +W+   DRNT +FH +AS 
Sbjct: 645 KVVAETD-RWGNMRKKEADLREELNRLLEEEETYWMQRSRVNWLSDGDRNTSFFHAQASK 704

Query: 115 RRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSVLGDADFVFGSLCFASTLKHN 174
           RRK N I GLEG++GQWT +   I +I+SNYF K++ SS     D +  ++  + T + N
Sbjct: 705 RRKKNSIEGLEGDDGQWTNDLADIQEITSNYFKKLFDSSGFLQFDEILEAVNPSITAEMN 764

Query: 175 RHKFHPQTIVSIFVDICLMSKV 196
            H     T   IF  +  M  +
Sbjct: 765 EHLLTEFTAEEIFTALKQMHPI 785

BLAST of Sgr011656 vs. ExPASy TrEMBL
Match: A0A6J5Y0D5 (Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=ORAREDHAP_LOCUS43248 PE=4 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 5.4e-15
Identity = 48/132 (36.36%), Postives = 73/132 (55.30%), Query Frame = 0

Query: 37  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEI 96
           NC S L  W  +K G  PKK+KE+ L+    +    S +T          L+K L +EEI
Sbjct: 154 NCASNLSRWSAEKGGQVPKKVKELRLRLASLQSDEPSTQTFHTRSLIETELDKCLEQEEI 213

Query: 97  YWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFV 156
           YW QRS+ HW++  DRNT +FH +A+ RRK N ++G+  EN +W +    I  +   +F 
Sbjct: 214 YWHQRSRVHWLQHGDRNTSFFHKQATSRRKKNALVGILDENDRWQREYDKIGGVFVEFFT 273

Query: 157 KIYSSSVLGDAD 159
            +++   +G AD
Sbjct: 274 NLFTLD-MGVAD 284

BLAST of Sgr011656 vs. ExPASy TrEMBL
Match: A0A6J5TIF9 (Reverse transcriptase domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS4077 PE=4 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 1.6e-14
Identity = 48/132 (36.36%), Postives = 72/132 (54.55%), Query Frame = 0

Query: 37  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEI 96
           NC S L  W  +K G  PKK+KE+ L+    +    S +T          L+K L +EEI
Sbjct: 237 NCASNLSRWCAEKGGQVPKKVKELRLRLASLQSDEPSTQTFHTRSLIETELDKCLEQEEI 296

Query: 97  YWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFV 156
           YW QRS+  W++  DRNT +FH +A+ RRK N ++G+  EN +W      I  +   +F 
Sbjct: 297 YWHQRSRVQWLQHGDRNTSFFHKQATSRRKKNALVGILDENDRWQSENDKIGGVFVEFFT 356

Query: 157 KIYSSSVLGDAD 159
            +++S  +G AD
Sbjct: 357 NLFTSD-MGVAD 367

BLAST of Sgr011656 vs. ExPASy TrEMBL
Match: A0A803QF96 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 1.6e-14
Identity = 55/146 (37.67%), Postives = 79/146 (54.11%), Query Frame = 0

Query: 22  PHREPLVLYFGMLERNCLSTLMDWGKDKFGGYPKKIKEVDLK----------FGGSRHKL 81
           P  +P+V+    LE  C S L  W  DK+G   KKI +  LK             + + L
Sbjct: 16  PSLDPIVVVLANLE-ECASNLQKWHIDKYGNMKKKIIDAQLKVETLNNAPYRIAEAMNSL 75

Query: 82  ISAET-LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQW 141
            ++E  L+++L +EEIYWQQRSQ  W+   DRNT++FH KAS R+  N I  +  ENG  
Sbjct: 76  KNSEAFLDELLEQEEIYWQQRSQVDWLNEGDRNTKFFHAKASARKSNNTIKFMHVENGTR 135

Query: 142 TQNKKPIADISSNYFVKIYSSSVLGD 157
              K  IA    +YF +I+++S L +
Sbjct: 136 VTAKHEIAAAIHDYFAEIFTASTLDE 160

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022150918.14.9e-1840.46uncharacterized protein LOC111018954 [Momordica charantia][more]
XP_022150918.11.2e-0540.28uncharacterized protein LOC111018954 [Momordica charantia][more]
OMO99000.16.6e-1538.73reverse transcriptase [Corchorus capsularis][more]
CAB4316864.11.1e-1436.36unnamed protein product [Prunus armeniaca][more]
XP_008237273.11.1e-1436.36PREDICTED: uncharacterized protein LOC103336015 [Prunus mume][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1DAR42.4e-1840.46uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018... [more]
A0A6J1DAR46.0e-0640.28uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018... [more]
A0A6J5Y0D55.4e-1536.36Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=ORAREDHAP_LOCUS43248 PE=... [more]
A0A6J5TIF91.6e-1436.36Reverse transcriptase domain-containing protein OS=Prunus armeniaca OX=36596 GN=... [more]
A0A803QF961.6e-1437.67Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 364..369
NoneNo IPR availablePANTHERPTHR33710BNAC02G09200D PROTEINcoord: 61..152
NoneNo IPR availablePANTHERPTHR33710:SF24SUBFAMILY NOT NAMEDcoord: 61..152

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr011656.1Sgr011656.1mRNA