Sgr024234 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr024234
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Tyrosine-specific transport protein, putative
Location: tig00001047: 4367859 .. 4369850 (+)
RNA-Seq Expression: Sgr024234
Synteny: Sgr024234
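
The Location field above packs contig, start, end, and strand into one string. A minimal sketch of parsing it is given here; the names `Location`, `parse_location`, and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig: start .. end (strand)' field."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a location string such as 'tig00001047: 4367859 .. 4369850 (+)'."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("tig00001047: 4367859 .. 4369850 (+)")
print(loc.contig, loc.strand, span_length(loc))
```

Under that inclusive-coordinate assumption, the gene spans 1992 bases on the plus strand of contig tig00001047.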
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTTGTTCTGATCCGGGTACAGGACACTCCAGGCCCAATTGGGGTGTACCTGATATGCTGTGGATCACGGCAAAGGCCCATGTTTGGTTCTGATCCGAACACCAACATATTTGGAATCGAAAGGGAATCCAATCTTTTGGAATTGGACCCCATGGAATCTTCTCCAACATATTTGGAAACTTCCATTCACAATTGGATCCATTTCCAGCCACTACTTTCAAGCTTCAACTGACCCTCTTCGCCTCTGCATTTGGCTTCCATGAACTTGCACTCCATCATCATCTCTTCCACCTTATTGTGCCCCAAAAGCCACGAACGACGCCTGCGAACTCTTCCGCCATCGCCATGTTCAAGGATCGCTCTACAAAAGGGTAGAATCGGTGACAGATCAAAGAAGTAACATGCCTCACTTTGGTGTTTCTTTTTCGTCTCTTTAATCCGCCAATTTAGCTGCGTTTTCATGCTCTGTTTGATATGAATCTTGAGTTTCGCTTGCTCTTACCAGCACCTACAACCGTCGCTTTCTCTGCTACAACCAGAAAGAGAAAAGCCTTCAATCAAGAGAAGAGCTACAGCCTGTAGGGCGCCTGAAAAAAAGGAAATGTAGCAGGAGCCATGGCTCTCGTAATTGGCACCAGTATCGGATCAGGGTTTCTTGCACTTCCAGAGAAAGCATCTCCGGCTGTAACTCTCTTTCCCTCTCTTTCTCTCTCTCGGCTTTAGCAAATTTATTCGAAATATGTAGAAAACAGAGGTGCCGATAGATTATTCTTTTGTTTTCGACGTTTATTTCTTTTTCAGAAAAGGCCGTGTAATTCTGTGGCAGGGACTTTTCCCAGTTCGATATCTATAATGCTATGTTGGGGTTTCTTCTAGTAGAAGCACTCGTGCTCATTGAAATTAATGTGGTTCTGTGGAGGAAGAAGAAGAAGAAGAATGAAGAGGGAGAGACGGGGATGGAGGTGATTTCCGTCAGGACTATGGCGCAGGAGACGCTAGGGGACTGCGGTGGAACCCTGGCCACTGTTGCCTATGTTTTCTCGGGCTCACTTCCATGGTTGCCTATATTTCCAAGTCCGGAGAGATCCTTCTCCATTCATTCAATCTTCCATCTCCACTTTCAGGCTTCCTCTTCACTTTAATTTTTACTCTGCTTATCTCCATAGGTAGGACCATAGCCATAGATCAAGTCAACCAATGGCTTACAGCTTGTATGATAGGTAACTTCAGGACTCGAACCAGTATTATTCCCTTCACCTGGATAATTAAAAGCTGTTCTATGGATAATCTGTTTCGAGAAGACGGCAGTTGTTATTACTTCTCAATTATGCACCTGTAAATAATGGTTTCAAATGTAGAATAATCTTCCACTTGATAATGGTTTTTAGAACAGTTTTATAAACAGAACTATAAAACTGTTTTGAAACAGCCTCCCAAATATGCCATAAGTTTTCTATACAAATGTAGATAAAAATGAATCAGTAATACTTTAAAATGAATCTCATATCTTCCCTGTTGCAAAAGCAAGCAGTTGACTCCACATTCTTGAATGTCTTCGTGGAACAGGTTTACTACTGGGAATTGAGGTGATAGCGGTTCAATTTGGAGGATGGTTTGCAATGGACGGTGGAGGAGACTGGGGAAGGTCCCAACTACAGTACCTGTCATAATCTTCGCTTTGGTATATCATGATGTAATACCAGGTAAAATTATATTAGAAATTTGATTTATTTGTTATCATTACTTTGTTAACATGATTTGTGAAAATATGAAGCATACACCCCCCCCCCCCCCCCCCCATTAGGTTGATAAAATCTTGTACTCTTTGCAGTTCTTTGTGCTTATTTGGGTGGTGACCTTCCTCGCCTAAGAGTTTCAGTTTTGCTTGGTAGCTTTATTCCATTGCTAGCATTGCTTGTTTGGGATGCAGTTGCGCTTAGCTTGTCAGCGCAAGCTGATCAAGTGGTTGATCCTGTGAATTGCTCTTGA

mRNA sequence

ATGGTTTGTTCTGATCCGGGTACAGGACACTCCAGGCCCAATTGGGGTGTACCTGATATGCTGTGGATCACGGCAAAGGCCCATGGAATCCAATCTTTTGGAATTGGACCCCATGGAATCTTCTCCAACATATTTGGAAACTTCCATTCACAATTGGATCCATTTCCAGCCACTACTTTCAAGCTTCAACTGACCCTCTTCGCCTCTGCATTTGGCTTCCATGAACTTGCACTCCATCATCATCTCTTCCACCTTATTGTGCCCCAAAAGCCACGAACGACGCCTGCGAACTCTTCCGCCATCGCCATGTTCAAGGATCGCTCTACAAAAGGCACCTACAACCGTCGCTTTCTCTGCTACAACCAGAAAGAGAAAAGCCTTCAATCAAGAGAAGAGCTACAGCCTGTAGGGCGCCTGAAAAAAAGGAAATGTAGCAGGAGCCATGGCTCTCGTAATTGGCACCAGTATCGGATCAGGGTTTCTTGCACTTCCAGAGAAAGCATCTCCGGCTGTAACTCTCTTTCCCTCTCTTTCTCTCTCTCGGCTTTAGCAAATTTATTCGAAATATGTAGAAAACAGAGGGACTTTTCCCAGTTCGATATCTATAATGCTATGTTGGGGTTTCTTCTAGTAGAAGCACTCGTGCTCATTGAAATTAATGTGGTTCTGTGGAGGAAGAAGAAGAAGAAGAATGAAGAGGGAGAGACGGGGATGGAGGTGATTTCCGTCAGGACTATGGCGCAGGAGACGCTAGGGGACTGCGGTGGAACCCTGGCCACTGTTGCCTATGTTTTCTCGGGCTCACTTCCATGGATGGTTTGCAATGGACGGTGGAGGAGACTGGGGAAGGTCCCAACTACAGTACCTGTCATAATCTTCGCTTTGGTATATCATGATGTAATACCAGTTCTTTGTGCTTATTTGGGTGGTGACCTTCCTCGCCTAAGAGTTTCAGTTTTGCTTGGTAGCTTTATTCCATTGCTAGCATTGCTTGTTTGGGATGCAGTTGCGCTTAGCTTGTCAGCGCAAGCTGATCAAGTGGTTGATCCTGTGAATTGCTCTTGA

Coding sequence (CDS)

ATGGTTTGTTCTGATCCGGGTACAGGACACTCCAGGCCCAATTGGGGTGTACCTGATATGCTGTGGATCACGGCAAAGGCCCATGGAATCCAATCTTTTGGAATTGGACCCCATGGAATCTTCTCCAACATATTTGGAAACTTCCATTCACAATTGGATCCATTTCCAGCCACTACTTTCAAGCTTCAACTGACCCTCTTCGCCTCTGCATTTGGCTTCCATGAACTTGCACTCCATCATCATCTCTTCCACCTTATTGTGCCCCAAAAGCCACGAACGACGCCTGCGAACTCTTCCGCCATCGCCATGTTCAAGGATCGCTCTACAAAAGGCACCTACAACCGTCGCTTTCTCTGCTACAACCAGAAAGAGAAAAGCCTTCAATCAAGAGAAGAGCTACAGCCTGTAGGGCGCCTGAAAAAAAGGAAATGTAGCAGGAGCCATGGCTCTCGTAATTGGCACCAGTATCGGATCAGGGTTTCTTGCACTTCCAGAGAAAGCATCTCCGGCTGTAACTCTCTTTCCCTCTCTTTCTCTCTCTCGGCTTTAGCAAATTTATTCGAAATATGTAGAAAACAGAGGGACTTTTCCCAGTTCGATATCTATAATGCTATGTTGGGGTTTCTTCTAGTAGAAGCACTCGTGCTCATTGAAATTAATGTGGTTCTGTGGAGGAAGAAGAAGAAGAAGAATGAAGAGGGAGAGACGGGGATGGAGGTGATTTCCGTCAGGACTATGGCGCAGGAGACGCTAGGGGACTGCGGTGGAACCCTGGCCACTGTTGCCTATGTTTTCTCGGGCTCACTTCCATGGATGGTTTGCAATGGACGGTGGAGGAGACTGGGGAAGGTCCCAACTACAGTACCTGTCATAATCTTCGCTTTGGTATATCATGATGTAATACCAGTTCTTTGTGCTTATTTGGGTGGTGACCTTCCTCGCCTAAGAGTTTCAGTTTTGCTTGGTAGCTTTATTCCATTGCTAGCATTGCTTGTTTGGGATGCAGTTGCGCTTAGCTTGTCAGCGCAAGCTGATCAAGTGGTTGATCCTGTGAATTGCTCTTGA

Protein sequence

MVCSDPGTGHSRPNWGVPDMLWITAKAHGIQSFGIGPHGIFSNIFGNFHSQLDPFPATTFKLQLTLFASAFGFHELALHHHLFHLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLWRKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSGSLPWMVCNGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPVNCS
Homology
BLAST of Sgr024234 vs. NCBI nr
Match: KAG7033001.1 (Tyrosine-specific transport protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 231.9 bits (590), Expect = 8.7e-57
Identity = 165/400 (41.25%), Positives = 211/400 (52.75%), Query Frame = 0

Query: 35  IGPHGIFSNIFGNFHSQLDPFPATTFKLQLTLFASAFGFHELALH--------HHLFHLI 94
           +GPHGIFSNIFGN+HS+ D FPA TF+   +   S+     + +         H     I
Sbjct: 10  VGPHGIFSNIFGNYHSKWDSFPAITFQSPSSTLLSSMNLLSVVVSSTLSCPKTHERCLQI 69

Query: 95  VPQKPRTTPANSSAI---AMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRK 154
           +P+ P + P + + +       +RS +  Y RRFLCY QKE+ ++SREELQPV       
Sbjct: 70  LPRSPWSLPCSRTTLQNRRRIGERSKR--YKRRFLCYEQKEERVESREELQPV------- 129

Query: 155 CSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYN 214
                            S   + +I+G  +  +  S+   + +  + +K      F    
Sbjct: 130 ----------------TSPEKKGTIAGAVAFIIGTSVG--SGILALPQKASPAGFFPSSI 189

Query: 215 AML---GFLLVEALVLIEINVVLWRKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLAT 274
           +++   GFLLVEAL+L+EI+VV+   ++KK E GETGM+VISVRTMAQETLGD GGTLAT
Sbjct: 190 SIILCWGFLLVEALLLVEISVVM---RRKKTERGETGMKVISVRTMAQETLGDFGGTLAT 249

Query: 275 VAYVFSG-------------------SLP-------------WMVCNGRWRRLG------ 334
           VAYVF G                   +LP              ++  GR R +       
Sbjct: 250 VAYVFLGYISMVAYISKSEEILLQSFNLPAPLSGLFFTLVFTLLISVGRTRTVDQVNQWL 309

Query: 335 -------------------------------KVPTTVPVIIFALVYHDVIPVLCAYLGGD 352
                                          KVPTT+PVIIFALVYHDVIPVLCAYL GD
Sbjct: 310 TACMIGLLLGIEVLAVQFGGWFAMEGGGDWTKVPTTIPVIIFALVYHDVIPVLCAYLEGD 369

BLAST of Sgr024234 vs. NCBI nr
Match: KAG6602318.1 (hypothetical protein SDJN03_07551, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 230.7 bits (587), Expect = 1.9e-56
Identity = 165/400 (41.25%), Positives = 212/400 (53.00%), Query Frame = 0

Query: 35  IGPHGIFSNIFGNFHSQLDPFPATTFKLQLTLFASAFGFHELALH--------HHLFHLI 94
           +GPHGIFSNIFGN+HS+ D FPA TF+   +   S+     + +         H     I
Sbjct: 10  VGPHGIFSNIFGNYHSKWDSFPAITFQSPSSTLLSSMNLLSVVVSSTLSCPKTHDRCLQI 69

Query: 95  VPQKPRTTPANSSAI---AMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRK 154
           +P+ P + P + + +       +RS +  Y RRFLCY QKE+ ++SREELQPV   +K  
Sbjct: 70  LPRSPWSLPCSRTTLQNRRRIGERSKR--YKRRFLCYEQKEERVESREELQPVTLPEK-- 129

Query: 155 CSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYN 214
                                + +I+G  +  +  S+   + +  + +K      F    
Sbjct: 130 ---------------------KGTIAGAVAFIIGTSVG--SGILALPQKASPAGFFPSSI 189

Query: 215 AML---GFLLVEALVLIEINVVLWRKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLAT 274
           +++   GFLLVEAL+L+EI+VV+   ++KK E GETGM+VISVRTMAQETLGD GGTLAT
Sbjct: 190 SIILCWGFLLVEALLLVEISVVM---RRKKTERGETGMKVISVRTMAQETLGDFGGTLAT 249

Query: 275 VAYVFSG-------------------SLP-------------WMVCNGRWRRLG------ 334
           VAYVF G                   +LP              ++  GR R +       
Sbjct: 250 VAYVFLGYISMVAYISKSEEILLQSFNLPAPLSGLFFTLVFTLLISVGRTRTVDQVNQWL 309

Query: 335 -------------------------------KVPTTVPVIIFALVYHDVIPVLCAYLGGD 352
                                          KVPTT+PVIIFALVYHDVIPVLCAYL GD
Sbjct: 310 TACMIGLLLGIEVLAVQFGGWFAMEGGGDWTKVPTTIPVIIFALVYHDVIPVLCAYLEGD 369

BLAST of Sgr024234 vs. NCBI nr
Match: XP_022133501.1 (uncharacterized protein LOC111006064 isoform X3 [Momordica charantia])

HSP 1 Score: 224.2 bits (570), Expect = 1.8e-54
Identity = 168/387 (43.41%), Positives = 200/387 (51.68%), Query Frame = 0

Query: 44  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTP 103
           + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+  
Sbjct: 1   MLGNFSFPMDFISSHYFQSLSSPLNFASMVMGLHSIIISSALFCPKSHERRLQTFPRSPS 60

Query: 104 ANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQ 163
             S  I    DRS K + NRR LC+ QKE+SLQSREELQPV   +K              
Sbjct: 61  PCSRIIRRIFDRS-KNSSNRRLLCFKQKEESLQSREELQPVRASEK-------------- 120

Query: 164 YRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEAL 223
                    + +++G  +L +  S+ S +  L E       F          GFLL+EAL
Sbjct: 121 ---------KGTVAGAMALVIGTSIGSGILALPEKASPAGFFPSSITIILCWGFLLLEAL 180

Query: 224 VLIEINVVLW-RKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG------ 283
           +LIEINVVLW R+KKKK EEGETGMEVISVRTM QETLGDCGGTLA+VAYVF G      
Sbjct: 181 LLIEINVVLWRRRKKKKKEEGETGMEVISVRTMVQETLGDCGGTLASVAYVFLGYTSMVA 240

Query: 284 -------------SLP-----------------------------WM------------- 343
                        +LP                             W+             
Sbjct: 241 YISKSGEILLHSFNLPTPLSGFLFTLTFTMLISVGRTIAIDQVNQWLTACMIGLLLGIEV 300

Query: 344 --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSF 352
             V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSF
Sbjct: 301 LAVQSGGWFAMEGGGDWGKAPTTIPVIIFALVYHDVIPVLCAYLEGDLHRLRVSVLLGSF 360

BLAST of Sgr024234 vs. NCBI nr
Match: XP_022133499.1 (uncharacterized protein LOC111006064 isoform X1 [Momordica charantia])

HSP 1 Score: 224.2 bits (570), Expect = 1.8e-54
Identity = 168/387 (43.41%), Positives = 200/387 (51.68%), Query Frame = 0

Query: 44  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTP 103
           + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+  
Sbjct: 1   MLGNFSFPMDFISSHYFQSLSSPLNFASMVMGLHSIIISSALFCPKSHERRLQTFPRSPS 60

Query: 104 ANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQ 163
             S  I    DRS K + NRR LC+ QKE+SLQSREELQPV   +K              
Sbjct: 61  PCSRIIRRIFDRS-KNSSNRRLLCFKQKEESLQSREELQPVRASEK-------------- 120

Query: 164 YRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEAL 223
                    + +++G  +L +  S+ S +  L E       F          GFLL+EAL
Sbjct: 121 ---------KGTVAGAMALVIGTSIGSGILALPEKASPAGFFPSSITIILCWGFLLLEAL 180

Query: 224 VLIEINVVLW-RKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG------ 283
           +LIEINVVLW R+KKKK EEGETGMEVISVRTM QETLGDCGGTLA+VAYVF G      
Sbjct: 181 LLIEINVVLWRRRKKKKKEEGETGMEVISVRTMVQETLGDCGGTLASVAYVFLGYTSMVA 240

Query: 284 -------------SLP-----------------------------WM------------- 343
                        +LP                             W+             
Sbjct: 241 YISKSGEILLHSFNLPTPLSGFLFTLTFTMLISVGRTIAIDQVNQWLTACMIGLLLGIEV 300

Query: 344 --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSF 352
             V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSF
Sbjct: 301 LAVQSGGWFAMEGGGDWGKAPTTIPVIIFALVYHDVIPVLCAYLEGDLHRLRVSVLLGSF 360

BLAST of Sgr024234 vs. NCBI nr
Match: XP_022133500.1 (uncharacterized protein LOC111006064 isoform X2 [Momordica charantia])

HSP 1 Score: 221.9 bits (564), Expect = 9.0e-54
Identity = 167/387 (43.15%), Positives = 198/387 (51.16%), Query Frame = 0

Query: 44  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTP 103
           + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+  
Sbjct: 1   MLGNFSFPMDFISSHYFQSLSSPLNFASMVMGLHSIIISSALFCPKSHERRLQTFPRSPS 60

Query: 104 ANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQ 163
             S  I    DRS     NRR LC+ QKE+SLQSREELQPV   +K              
Sbjct: 61  PCSRIIRRIFDRSKNS--NRRLLCFKQKEESLQSREELQPVRASEK-------------- 120

Query: 164 YRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEAL 223
                    + +++G  +L +  S+ S +  L E       F          GFLL+EAL
Sbjct: 121 ---------KGTVAGAMALVIGTSIGSGILALPEKASPAGFFPSSITIILCWGFLLLEAL 180

Query: 224 VLIEINVVLW-RKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG------ 283
           +LIEINVVLW R+KKKK EEGETGMEVISVRTM QETLGDCGGTLA+VAYVF G      
Sbjct: 181 LLIEINVVLWRRRKKKKKEEGETGMEVISVRTMVQETLGDCGGTLASVAYVFLGYTSMVA 240

Query: 284 -------------SLP-----------------------------WM------------- 343
                        +LP                             W+             
Sbjct: 241 YISKSGEILLHSFNLPTPLSGFLFTLTFTMLISVGRTIAIDQVNQWLTACMIGLLLGIEV 300

Query: 344 --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSF 352
             V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSF
Sbjct: 301 LAVQSGGWFAMEGGGDWGKAPTTIPVIIFALVYHDVIPVLCAYLEGDLHRLRVSVLLGSF 360

BLAST of Sgr024234 vs. ExPASy TrEMBL
Match: A0A6J1BVA3 (uncharacterized protein LOC111006064 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111006064 PE=4 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 8.8e-55
Identity = 168/387 (43.41%), Positives = 200/387 (51.68%), Query Frame = 0

Query: 44  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTP 103
           + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+  
Sbjct: 1   MLGNFSFPMDFISSHYFQSLSSPLNFASMVMGLHSIIISSALFCPKSHERRLQTFPRSPS 60

Query: 104 ANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQ 163
             S  I    DRS K + NRR LC+ QKE+SLQSREELQPV   +K              
Sbjct: 61  PCSRIIRRIFDRS-KNSSNRRLLCFKQKEESLQSREELQPVRASEK-------------- 120

Query: 164 YRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEAL 223
                    + +++G  +L +  S+ S +  L E       F          GFLL+EAL
Sbjct: 121 ---------KGTVAGAMALVIGTSIGSGILALPEKASPAGFFPSSITIILCWGFLLLEAL 180

Query: 224 VLIEINVVLW-RKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG------ 283
           +LIEINVVLW R+KKKK EEGETGMEVISVRTM QETLGDCGGTLA+VAYVF G      
Sbjct: 181 LLIEINVVLWRRRKKKKKEEGETGMEVISVRTMVQETLGDCGGTLASVAYVFLGYTSMVA 240

Query: 284 -------------SLP-----------------------------WM------------- 343
                        +LP                             W+             
Sbjct: 241 YISKSGEILLHSFNLPTPLSGFLFTLTFTMLISVGRTIAIDQVNQWLTACMIGLLLGIEV 300

Query: 344 --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSF 352
             V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSF
Sbjct: 301 LAVQSGGWFAMEGGGDWGKAPTTIPVIIFALVYHDVIPVLCAYLEGDLHRLRVSVLLGSF 360

BLAST of Sgr024234 vs. ExPASy TrEMBL
Match: A0A6J1BW54 (uncharacterized protein LOC111006064 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111006064 PE=4 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 8.8e-55
Identity = 168/387 (43.41%), Positives = 200/387 (51.68%), Query Frame = 0

Query: 44  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTP 103
           + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+  
Sbjct: 1   MLGNFSFPMDFISSHYFQSLSSPLNFASMVMGLHSIIISSALFCPKSHERRLQTFPRSPS 60

Query: 104 ANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQ 163
             S  I    DRS K + NRR LC+ QKE+SLQSREELQPV   +K              
Sbjct: 61  PCSRIIRRIFDRS-KNSSNRRLLCFKQKEESLQSREELQPVRASEK-------------- 120

Query: 164 YRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEAL 223
                    + +++G  +L +  S+ S +  L E       F          GFLL+EAL
Sbjct: 121 ---------KGTVAGAMALVIGTSIGSGILALPEKASPAGFFPSSITIILCWGFLLLEAL 180

Query: 224 VLIEINVVLW-RKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG------ 283
           +LIEINVVLW R+KKKK EEGETGMEVISVRTM QETLGDCGGTLA+VAYVF G      
Sbjct: 181 LLIEINVVLWRRRKKKKKEEGETGMEVISVRTMVQETLGDCGGTLASVAYVFLGYTSMVA 240

Query: 284 -------------SLP-----------------------------WM------------- 343
                        +LP                             W+             
Sbjct: 241 YISKSGEILLHSFNLPTPLSGFLFTLTFTMLISVGRTIAIDQVNQWLTACMIGLLLGIEV 300

Query: 344 --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSF 352
             V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSF
Sbjct: 301 LAVQSGGWFAMEGGGDWGKAPTTIPVIIFALVYHDVIPVLCAYLEGDLHRLRVSVLLGSF 360

BLAST of Sgr024234 vs. ExPASy TrEMBL
Match: A0A6J1BVF1 (uncharacterized protein LOC111006064 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111006064 PE=4 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 4.4e-54
Identity = 167/387 (43.15%), Positives = 198/387 (51.16%), Query Frame = 0

Query: 44  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTP 103
           + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+  
Sbjct: 1   MLGNFSFPMDFISSHYFQSLSSPLNFASMVMGLHSIIISSALFCPKSHERRLQTFPRSPS 60

Query: 104 ANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQ 163
             S  I    DRS     NRR LC+ QKE+SLQSREELQPV   +K              
Sbjct: 61  PCSRIIRRIFDRSKNS--NRRLLCFKQKEESLQSREELQPVRASEK-------------- 120

Query: 164 YRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEAL 223
                    + +++G  +L +  S+ S +  L E       F          GFLL+EAL
Sbjct: 121 ---------KGTVAGAMALVIGTSIGSGILALPEKASPAGFFPSSITIILCWGFLLLEAL 180

Query: 224 VLIEINVVLW-RKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG------ 283
           +LIEINVVLW R+KKKK EEGETGMEVISVRTM QETLGDCGGTLA+VAYVF G      
Sbjct: 181 LLIEINVVLWRRRKKKKKEEGETGMEVISVRTMVQETLGDCGGTLASVAYVFLGYTSMVA 240

Query: 284 -------------SLP-----------------------------WM------------- 343
                        +LP                             W+             
Sbjct: 241 YISKSGEILLHSFNLPTPLSGFLFTLTFTMLISVGRTIAIDQVNQWLTACMIGLLLGIEV 300

Query: 344 --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSF 352
             V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSF
Sbjct: 301 LAVQSGGWFAMEGGGDWGKAPTTIPVIIFALVYHDVIPVLCAYLEGDLHRLRVSVLLGSF 360

BLAST of Sgr024234 vs. ExPASy TrEMBL
Match: A0A1S4E1A6 (tyrosine-specific transport protein 1-like isoform X3 OS=Cucumis melo OX=3656 GN=LOC103496418 PE=4 SV=1)

HSP 1 Score: 213.0 bits (541), Expect = 2.0e-51
Identity = 148/343 (43.15%), Positives = 178/343 (51.90%), Query Frame = 0

Query: 86  IVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCS 145
           ++P+ P  +P   + +   +       YNRR LC+ QKE+ LQS EELQPV   +K    
Sbjct: 24  LLPRSPLPSPCPRTTLQNLQTSKRAKRYNRRLLCFEQKEEGLQSTEELQPVSSSEK---- 83

Query: 146 RSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYNAM 205
                              + +++G  +  +  S+   + +  I  K      F    ++
Sbjct: 84  -------------------KGTVAGAMAFIIGTSIG--SGILAIPEKASPAGFFPSSISI 143

Query: 206 L---GFLLVEALVLIEINVVLWR--KKKKKNEEGETGMEVISVRTMAQETLGDCGGTLAT 265
           +   GFLLVEALVL+EI+VVLWR  KKKKK EEGETGMEVISVRTMAQETLGD GGTLAT
Sbjct: 144 IICWGFLLVEALVLVEISVVLWRRKKKKKKGEEGETGMEVISVRTMAQETLGDFGGTLAT 203

Query: 266 VAYVFSG-------------------SLP-----------------------------WM 325
           V YVF G                   +LP                             W+
Sbjct: 204 VTYVFLGYTSMVAYISKSGEILLQSFNLPSPLSGFLFTLFFSLLISVGRTRAVDQVNQWL 263

Query: 326 VC------------------------NGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYL 352
                                      G WR   KVPTT+PVIIFALVYHDVIPVLCAYL
Sbjct: 264 TACMIGLLLGIEVLAVQFGGWSAMDGGGDWR---KVPTTIPVIIFALVYHDVIPVLCAYL 323

BLAST of Sgr024234 vs. ExPASy TrEMBL
Match: A0A1S4E201 (tyrosine-specific transport protein 1-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496418 PE=4 SV=1)

HSP 1 Score: 213.0 bits (541), Expect = 2.0e-51
Identity = 148/343 (43.15%), Positives = 178/343 (51.90%), Query Frame = 0

Query: 86  IVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCS 145
           ++P+ P  +P   + +   +       YNRR LC+ QKE+ LQS EELQPV   +K    
Sbjct: 24  LLPRSPLPSPCPRTTLQNLQTSKRAKRYNRRLLCFEQKEEGLQSTEELQPVSSSEK---- 83

Query: 146 RSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYNAM 205
                              + +++G  +  +  S+   + +  I  K      F    ++
Sbjct: 84  -------------------KGTVAGAMAFIIGTSIG--SGILAIPEKASPAGFFPSSISI 143

Query: 206 L---GFLLVEALVLIEINVVLWR--KKKKKNEEGETGMEVISVRTMAQETLGDCGGTLAT 265
           +   GFLLVEALVL+EI+VVLWR  KKKKK EEGETGMEVISVRTMAQETLGD GGTLAT
Sbjct: 144 IICWGFLLVEALVLVEISVVLWRRKKKKKKGEEGETGMEVISVRTMAQETLGDFGGTLAT 203

Query: 266 VAYVFSG-------------------SLP-----------------------------WM 325
           V YVF G                   +LP                             W+
Sbjct: 204 VTYVFLGYTSMVAYISKSGEILLQSFNLPSPLSGFLFTLFFSLLISVGRTRAVDQVNQWL 263

Query: 326 VC------------------------NGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYL 352
                                      G WR   KVPTT+PVIIFALVYHDVIPVLCAYL
Sbjct: 264 TACMIGLLLGIEVLAVQFGGWSAMDGGGDWR---KVPTTIPVIIFALVYHDVIPVLCAYL 323

BLAST of Sgr024234 vs. TAIR 10
Match: AT5G19500.1 (Tryptophan/tyrosine permease )

HSP 1 Score: 68.2 bits (165), Expect = 1.5e-11
Identity = 40/100 (40.00%), Positives = 56/100 (56.00%), Query Frame = 0

Query: 257 TLATVAYVFSGSLPWMVCNGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLR 316
           + A +  V SG L W            VP +VP+I  + VY +V+PVLC  L GDLPR+R
Sbjct: 261 SFAALVAVASGDLHWEAL--LKANFEAVPMSVPIIALSFVYQNVVPVLCTDLEGDLPRVR 320

Query: 317 VSVLLGSFIPLLALLVWDAVAL-----SLSAQADQVVDPV 352
            +++LG+ IPL   LVWDAV L           +++VDP+
Sbjct: 321 TAIVLGTAIPLGLFLVWDAVILGSFPVDTGVAVEKMVDPL 358

BLAST of Sgr024234 vs. TAIR 10
Match: AT2G33260.1 (Tryptophan/tyrosine permease )

HSP 1 Score: 42.7 bits (99), Expect = 7.0e-04
Identity = 21/78 (26.92%), Positives = 40/78 (51.28%), Query Frame = 0

Query: 280 RLGKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALS 339
           ++  V   VPV++  L +H + P +C   G  +   R ++L+G  +PL  +L W+ + L 
Sbjct: 194 KVSMVLPAVPVMVLTLGFHVITPFICNLAGDSVSDARRAILVGGVVPLAMVLSWNLIVLG 253

Query: 340 LS-----AQADQVVDPVN 353
           L+     A     +DP++
Sbjct: 254 LARITVPAAPSSTIDPIS 271
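
Each HSP header above reports identities and positives as a fraction followed by a percentage. As a sanity check, the percentages can be recomputed from the fractions. The sketch below is illustrative only: `parse_hsp_stats` is a hypothetical helper, not part of any BLAST toolkit, and it assumes the header layout shown on this page.

```python
import re

# Matches e.g. "Identity = 165/400 (41.25%), Positives = 211/400 (52.75%)".
# The optional "i" also accepts the misspelling "Postives" that appears in
# some BLAST report exports.
_HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return counts plus reported and recomputed percentages for one header."""
    match = _HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no identity/positives statistics in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_computed": round(100 * int(pos) / int(pos_len), 2),
        "alignment_length": int(ident_len),
    }


stats = parse_hsp_stats(
    "Identity = 165/400 (41.25%), Positives = 211/400 (52.75%), Query Frame = 0"
)
print(stats)
```

The denominator is the alignment length including gap columns, so hits with long gapped stretches can show a lower percentage than their conserved blocks alone would suggest.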

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity (%)  Description
KAG7033001.1    8.7e-57   41.25         Tyrosine-specific transport protein, partial [Cucurbita argyrosperma subsp. argy...
KAG6602318.1    1.9e-56   41.25         hypothetical protein SDJN03_07551, partial [Cucurbita argyrosperma subsp. sorori...
XP_022133501.1  1.8e-54   43.41         uncharacterized protein LOC111006064 isoform X3 [Momordica charantia]
XP_022133499.1  1.8e-54   43.41         uncharacterized protein LOC111006064 isoform X1 [Momordica charantia]
XP_022133500.1  9.0e-54   43.15         uncharacterized protein LOC111006064 isoform X2 [Momordica charantia]

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1BVA3      8.8e-55   43.41         uncharacterized protein LOC111006064 isoform X3 OS=Momordica charantia OX=3673 G...
A0A6J1BW54      8.8e-55   43.41         uncharacterized protein LOC111006064 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1BVF1      4.4e-54   43.15         uncharacterized protein LOC111006064 isoform X2 OS=Momordica charantia OX=3673 G...
A0A1S4E1A6      2.0e-51   43.15         tyrosine-specific transport protein 1-like isoform X3 OS=Cucumis melo OX=3656 GN...
A0A1S4E201      2.0e-51   43.15         tyrosine-specific transport protein 1-like isoform X1 OS=Cucumis melo OX=3656 GN...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G19500.1     1.5e-11   40.00         Tryptophan/tyrosine permease
AT2G33260.1     7.0e-04   26.92         Tryptophan/tyrosine permease
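
The summary tables above can be screened with an E-value cutoff. The following is a rough sketch rather than a recommended pipeline: the `Hit` type, the `filter_hits` helper, and the `1e-5` default cutoff are all assumptions made for illustration, and the values are copied from the tables on this page. E-values depend on database size, so ranking hits from different databases together is only a coarse comparison.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    """Hypothetical record for one row of the BLAST summary tables."""
    match_name: str
    e_value: float
    identity_pct: float


# Values transcribed from the summary tables on this page.
HITS = [
    Hit("KAG7033001.1", 8.7e-57, 41.25),
    Hit("KAG6602318.1", 1.9e-56, 41.25),
    Hit("XP_022133501.1", 1.8e-54, 43.41),
    Hit("XP_022133499.1", 1.8e-54, 43.41),
    Hit("XP_022133500.1", 9.0e-54, 43.15),
    Hit("A0A6J1BVA3", 8.8e-55, 43.41),
    Hit("A0A6J1BW54", 8.8e-55, 43.41),
    Hit("A0A6J1BVF1", 4.4e-54, 43.15),
    Hit("A0A1S4E1A6", 2.0e-51, 43.15),
    Hit("A0A1S4E201", 2.0e-51, 43.15),
    Hit("AT5G19500.1", 1.5e-11, 40.00),
    Hit("AT2G33260.1", 7.0e-04, 26.92),
]


def filter_hits(hits: List[Hit], max_e_value: float = 1e-5) -> List[Hit]:
    """Keep hits at or below the cutoff, most significant first.

    The default cutoff is an arbitrary illustrative choice.
    """
    kept = [h for h in hits if h.e_value <= max_e_value]
    return sorted(kept, key=lambda h: h.e_value)


for hit in filter_hits(HITS):
    print(f"{hit.match_name}\t{hit.e_value:.1e}\t{hit.identity_pct:.2f}")
```

With this illustrative cutoff, only the weaker Arabidopsis hit AT2G33260.1 (E-value 7.0e-04) is dropped.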
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                     Source   Source Term     Source Description            Alignment
IPR018227  Amino acid/polyamine transporter 2  PFAM     PF03222         Trp_Tyr_perm                  coord: 286..338; e-value: 2.9E-9; score: 36.5
None       No IPR available                    PANTHER  PTHR32195:SF24  TRYPTOPHAN/TYROSINE PERMEASE  coord: 207..266
None       No IPR available                    PANTHER  PTHR32195       FAMILY NOT NAMED              coord: 281..352
None       No IPR available                    PANTHER  PTHR32195:SF24  TRYPTOPHAN/TYROSINE PERMEASE  coord: 281..352
None       No IPR available                    PANTHER  PTHR32195       FAMILY NOT NAMED              coord: 207..266

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr024234.1   Sgr024234.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0003333      amino acid transmembrane transport