Sgr014740 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr014740
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00001047: 552403 .. 555819 (+)
RNA-Seq Expression: Sgr014740
Synteny: Sgr014740
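
The location string above packs the contig name, the start and end coordinates, and the strand into one field. The following is a minimal sketch of how it could be split apart, assuming the coordinates are 1-based and inclusive (an assumption, since the page does not state the convention). The names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genomic location (hypothetical helper type)."""
    contig: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Matches strings such as "tig00001047: 552403 .. 555819 (+)".
LOCATION_RE = re.compile(r"^\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a 'contig: start .. end (strand)' string into its parts."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("tig00001047: 552403 .. 555819 (+)")
print(loc.contig, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3417 positions on the forward strand of contig tig00001047.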
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGATCGAGGCAGCTTGTCCGGAGATTTCTGTTGCGACAATGAGCCCTAGAATTTCATTTTCTCACGACTTCTGCCAGACTGAAGCTATTCCGGTAGAACAACGGCCTAATTCCTCCCGATCCAATTCTTCCGGTTTGAATTCCAGCATTGATTTCGACTTCTGCATTCGTGAGTGTTCCGATCAGGAGTCGTCTTCCGCGGATGAAATTTTCTCCCACGGAAAAATTCTGCCGCTCGAAATCAAGAAGAAACCTGAAGAGCCTCCTGTGCGAGTCGATCAGTCTTCTTCTACTCATGCTCCATTGACGCGAACACAATCTCTTGATGTTAACGCCGAAAAATGTTTGAAAGAAGATAGATCGTCAAAGGAAACCAAGGCAGCGAATAGCGACTCCGAAGAGAAGCAAAGTTCCAAGTCCTTTTGGCGTTTCAAAAGAAGCAGCAGCTGTGGCTCTGGATACACTCATAGCTTATGTCCTTTGCCGCTTCTATCACGAAGCAATTCAACTGGCTCTGCACCGAACATTAAGCGAACGCCATTGTCCAAGGACGGTGTAAATCACAAGCAGAGCTCCCATAGAAATGCCACCAAAACTTCATCACAGTGTTCGTCTTCAATGGGATATCAGAAACCTCCATTGAAGAAGGTACATGGTTCGTACCGTAACGGAGTTCAAGTAACTCCTATTCTAAATGTTCATTCGGGGAATCTTTTTGGTTTGGGCTCAATATTCTCCTCTGGCAAGGATAGAAGCAAGAAAAAGTGATTGGTTTTTTTTTTTTTTTTCTTAATAATCCGATTGTCTGATTTGCAAAAAAGTTGTGGAATCTGTAATTTTTTTTTCCTTTTCCCTTTTGATCTCGAACATGATTCTTAAAACTTAAAATAAAAGTTTGGAAGAGTTGCTTTTCTGCCCTGATTCACTGATTGATTGATGTTAAACAAAACCTCTGTTGTTAATTTAACAAAGTCAAAACAAAACCCATCTTGAAATATTAAAAAAATGGCTTATGGGGTTGAATTTAGACAGAGAGAAGAAGAAGATGATGAAGGTGGATAATGTATGGTAGGAGGTGACAGAGTGGTGGTATAATTCAAAGGCACATGCAGGCTTTTGGAGCAAAAAAGGACAGGTTAGGCATTTGCTTTTCATTGTTTGTCTATTGCCTTCTTCTTTTTCTGCTTCTGGTATTTTCTGATTTAAATTAAATTTAGACTTAACTTTTAAAGTTGTGTCTAAAGCTTTAAACTTTAAAAATTATTAATAAGTCTTAAATATTTAATTTTATGTCTATTTGATCTCTAAAATTTAAAAATATCTTATAGGTTCTTGCGTCTAATAAATCTCTGATTCGTTAGAAAATTTTAAAATCAATATATCTATTATACACAAAATTTAATTTTGTATCTAATAAGTCTTGTTTTTCAATCTTGTATTTAAAAGGTTCGTAAATTTTAAAAAATGTCTAATAAGTCAGGAATCTATTAGACACAAAATTCAAAGTTTATGGGCTATTAGACACTTTTTAAAGTTCAGTGACCTATTTGACACAACTTTGAAAGTTCAGGAATTAAACTTGTAATTTAACCTACTCTCTCACACTCTTCGACTCAGAAAACATGGACTACACTGAACGTTCAGAAACTTGTAATTTAATATATCATTTTTTTCTACACAAAAAACATTGTACTTTGTGTGTGGTCTCAAGTGTTTGTATTTTGGCTTCCAGGGAATATTCATGAGTGTGAATCTATTATTGGGATCTTTTTCTAATTTCTAGTTTTACTTTTAAATATTATGTAAAAAATAATGTATTTTCCAAGTATAAGATTTCGAGTTTGAGATAGACTTTTGAAAATAGTATTGTTTTAGTGAGATTTTTTTTTTATAAATACTTTTTGGAGAAGCACTAATCAAACTTTTCTTTAAGTATTTTTAAAATTTTAAAATTACTTTTAATTATATACTCAAACGTTAAATTTTTTTTTAAAAAA
AACTTTTTATTGACAAAAACACGTTCACTCTTCAAAAGTCATATTAAACTCACTTTTAAAATTACTATTCATAAATCATATCAAACCCTGTACAATAGTTCAAATAGAGATTTCTTTCTATAAATTAAGTCTTTTTTTTAATTTGAGAATGTTAAATTATTTATAGAATAATAAAGTTTCATTAACAAAATAAACCATTAATGTTGTATGGAATATTTTGTTTTGTTATCTTCTTGTATTATATTGTAGAAGGAAAAAAAATATGTAAAACTTAAAGGTAACTATAAATTACATGCCTTTATTTTGTGAAATGGGAACTTTTAGTTTATTTTATTTTGATCTAAAAAATTTTAAAAGCCTATTTTAGTCTTTGAGCTTTAAAAAATAATAATTTTAATTATCGTTATTTTGTAGGTGTGTGTTAAAGCTATATAAGTGGACTAAGGTGGGTATTGAATGAGGTGAAAATTGACAACTTGAAAGTTTAGAGACTAAACTGAAATTTAAACATAAATTATATTCTTCTATTTAAACAATAGTTTTAAATATTCTTGTGATTTAAATCCTGTTTTGGTCCCAAAACTTTATGCTTTTTTTCTTAGTACTTTAGTTCAAACTTTCAAACATTTTGTTTTAATCTCTATACTTGGTATAAAAAACCATTTTAATTATTGTTATTAATGTTTTAGGCAAAGTATTTAATGAAAACATGACATAAGCATGCCTTTTGTTAGTTTACCTCATGAATAGATGTTTACATAAAAAAAATGTATGAACTCCCATTTAGTTGCTCAACTATTAAATTTCGAAATGAATCTGTTTTTCTGTTAAATGATTAGTATAAAATTAACGATAATGACTAAAATAATTTGTAGAATATTTGAAAGTTAAAGAACTAAAATATAATATTTAAAAATTTAAAAATGAAAATAAATAAATATAAAAGTTGATGTCAATCGATCAAAATATGATTTAAAGCAACTCTGAGAGCATGTCTGTCTATGAGTTATGAGTACAACCTTTTCTTTTCTTTAAATTATTTATATGTAAATTCGTCTCGACTAGAGGAAATTCAACTTTTCTTTTCTTTTCTTTTTATTAAAAAAAAAACAGTTTCGAGTTTCCACTAAAATAATAATAAACATTATGGATCGTAATTGGGAAAGCCCAAGTCAAGGCTTACCAGGCCCAAGACTAAGTTGTAAATCCAGGCTTAAACTGCGGTGACCGGAAACAGCATCATCTCCACAAATCGACCGGCCGCGACAAAACTCCGGCGTGCCGCCGAACCGCTTCCGACAAAGATCTCATCACGCACCGTCTTAACCTCCCTGCAGATTCAGATGTGGCCTCCCCGGAGAACCACGCACCTGCACCGGCACCGCCGTGGGACAGAGGTGGCCGAGACATAGCGTAA

mRNA sequence

ATGGCGATCGAGGCAGCTTGTCCGGAGATTTCTGTTGCGACAATGAGCCCTAGAATTTCATTTTCTCACGACTTCTGCCAGACTGAAGCTATTCCGGTAGAACAACGGCCTAATTCCTCCCGATCCAATTCTTCCGGTTTGAATTCCAGCATTGATTTCGACTTCTGCATTCGTGAGTGTTCCGATCAGGAGTCGTCTTCCGCGGATGAAATTTTCTCCCACGGAAAAATTCTGCCGCTCGAAATCAAGAAGAAACCTGAAGAGCCTCCTGTGCGAGTCGATCAGTCTTCTTCTACTCATGCTCCATTGACGCGAACACAATCTCTTGATGTTAACGCCGAAAAATGTTTGAAAGAAGATAGATCGTCAAAGGAAACCAAGGCAGCGAATAGCGACTCCGAAGAGAAGCAAAGTTCCAAGTCCTTTTGGCGTTTCAAAAGAAGCAGCAGCTGTGGCTCTGGATACACTCATAGCTTATGTCCTTTGCCGCTTCTATCACGAAGCAATTCAACTGGCTCTGCACCGAACATTAAGCGAACGCCATTGTCCAAGGACGGTGTAAATCACAAGCAGAGCTCCCATAGAAATGCCACCAAAACTTCATCACAGTGTTCGTCTTCAATGGGATATCAGAAACCTCCATTGAAGAAGGTACATGGTTCGTACCGCTTAAACTGCGGTGACCGGAAACAGCATCATCTCCACAAATCGACCGGCCGCGACAAAACTCCGGCGTGCCGCCGAACCGCTTCCGACAAAGATCTCATCACGCACCGTCTTAACCTCCCTGCAGATTCAGATGTGGCCTCCCCGGAGAACCACGCACCTGCACCGGCACCGCCGTGGGACAGAGGTGGCCGAGACATAGCGTAA

Coding sequence (CDS)

ATGGCGATCGAGGCAGCTTGTCCGGAGATTTCTGTTGCGACAATGAGCCCTAGAATTTCATTTTCTCACGACTTCTGCCAGACTGAAGCTATTCCGGTAGAACAACGGCCTAATTCCTCCCGATCCAATTCTTCCGGTTTGAATTCCAGCATTGATTTCGACTTCTGCATTCGTGAGTGTTCCGATCAGGAGTCGTCTTCCGCGGATGAAATTTTCTCCCACGGAAAAATTCTGCCGCTCGAAATCAAGAAGAAACCTGAAGAGCCTCCTGTGCGAGTCGATCAGTCTTCTTCTACTCATGCTCCATTGACGCGAACACAATCTCTTGATGTTAACGCCGAAAAATGTTTGAAAGAAGATAGATCGTCAAAGGAAACCAAGGCAGCGAATAGCGACTCCGAAGAGAAGCAAAGTTCCAAGTCCTTTTGGCGTTTCAAAAGAAGCAGCAGCTGTGGCTCTGGATACACTCATAGCTTATGTCCTTTGCCGCTTCTATCACGAAGCAATTCAACTGGCTCTGCACCGAACATTAAGCGAACGCCATTGTCCAAGGACGGTGTAAATCACAAGCAGAGCTCCCATAGAAATGCCACCAAAACTTCATCACAGTGTTCGTCTTCAATGGGATATCAGAAACCTCCATTGAAGAAGGTACATGGTTCGTACCGCTTAAACTGCGGTGACCGGAAACAGCATCATCTCCACAAATCGACCGGCCGCGACAAAACTCCGGCGTGCCGCCGAACCGCTTCCGACAAAGATCTCATCACGCACCGTCTTAACCTCCCTGCAGATTCAGATGTGGCCTCCCCGGAGAACCACGCACCTGCACCGGCACCGCCGTGGGACAGAGGTGGCCGAGACATAGCGTAA

Protein sequence

MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTRTQSLDVNAEKCLKEDRSSKETKAANSDSEEKQSSKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSYRLNCGDRKQHHLHKSTGRDKTPACRRTASDKDLITHRLNLPADSDVASPENHAPAPAPPWDRGGRDIA
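
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates this on the first and last few codons of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is a hypothetical helper, and only short excerpts of the CDS are used to keep the example readable.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS shown above.
cds_start = "ATGGCGATCGAGGCAGCTTGTCCGGAGATT"
cds_end = "GACATAGCGTAA"

print(translate(cds_start))  # first ten residues of the protein
print(translate(cds_end))    # last three residues before the stop codon
```

The two excerpts translate to MAIEAACPEI and DIA, which match the beginning and the end of the protein sequence listed above.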
Homology
BLAST of Sgr014740 vs. NCBI nr
Match: XP_022133535.1 (uncharacterized protein LOC111006095 [Momordica charantia])

HSP 1 Score: 372.1 bits (954), Expect = 4.4e-99
Identity = 198/227 (87.22%), Positives = 209/227 (92.07%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA CP+ISV  MSPRISFSHDFCQ+EAIPVEQRP  SRSNSSGLNSSIDFDFCIREC
Sbjct: 1   MAIEAVCPDISVPAMSPRISFSHDFCQSEAIPVEQRP-KSRSNSSGLNSSIDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTRTQSLDVNAEKCLKED 120
           SDQESSSADEIFSHG+ILPLEIKKKPE+PPV +DQSSS  APL RT+SLD + EKCLK+D
Sbjct: 61  SDQESSSADEIFSHGRILPLEIKKKPEDPPVLIDQSSSAPAPLARTRSLDADVEKCLKKD 120

Query: 121 RSSKETKAANSDSEEKQS--SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIK 180
           RSSKE KAANSDSEEKQS  SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIK
Sbjct: 121 RSSKEIKAANSDSEEKQSSNSKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIK 180

Query: 181 RTPLSKDGVNHKQSSHRNATKTS---SQCSSSMGYQKPPLKKVHGSY 223
           RTPLSKDG +HKQSSHRN++KTS   SQCSSSMGYQKPPLKKVHGSY
Sbjct: 181 RTPLSKDGASHKQSSHRNSSKTSSSHSQCSSSMGYQKPPLKKVHGSY 226
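
Each HSP summary reports identities and positives as a count over the alignment length, followed by the same ratio as a percentage. The sketch below shows one way to read such a line and recompute the percentages. The name `parse_hsp_summary` and the returned dictionary keys are hypothetical, and the pattern only covers the summary layout used on this page.

```python
import re

# Matches the identity/positives part of an HSP summary line.
# "Posi?tives" also tolerates the "Postives" spelling found in some exported reports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line: str) -> dict:
    """Extract the counts from an HSP summary line and recompute the percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, length, positives, length_again = map(int, match.groups())
    if length != length_again:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


hsp = parse_hsp_summary(
    "Identity = 198/227 (87.22%), Positives = 209/227 (92.07%), Query Frame = 0"
)
print(hsp)
```

For the Momordica charantia hit, 198 identical positions out of 227 aligned columns gives 87.22% identity, and 209 positive-scoring positions gives 92.07%, in agreement with the reported values.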

BLAST of Sgr014740 vs. NCBI nr
Match: KAG6602701.1 (hypothetical protein SDJN03_07934, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 324.3 bits (830), Expect = 1.1e-84
Identity = 184/227 (81.06%), Positives = 192/227 (84.58%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIREC
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPN-SRSSSSAFNSSFDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-STHA-PLTRTQSLDVNAEKCLK 120
           S QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S H+ PLTR +SLD NAEKCLK
Sbjct: 61  SHQESSSADEIFSHGKILPLEIKKKLEEPHLRVDQSSFSNHSPPLTRAKSLDSNAEKCLK 120

Query: 121 EDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAP 180
           +DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAP
Sbjct: 121 KDRSPKEIKDAVSSDSEEKQSSNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNSTGSAP 180

Query: 181 NIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSY 223
           NIKRT LSKDGV  KQSSHRNA K S  CSSSMGYQKPPLKKVHGSY
Sbjct: 181 NIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSY 226

BLAST of Sgr014740 vs. NCBI nr
Match: XP_022953381.1 (uncharacterized protein LOC111455949 isoform X1 [Cucurbita moschata])

HSP 1 Score: 323.2 bits (827), Expect = 2.3e-84
Identity = 183/227 (80.62%), Positives = 192/227 (84.58%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIREC
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPN-SRSSSSAFNSSFDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-STHA-PLTRTQSLDVNAEKCLK 120
           S QESSSADEIFSHGKILPLEIKKK EEP +R+DQSS S H+ PLTR +SLD NAEKCLK
Sbjct: 61  SHQESSSADEIFSHGKILPLEIKKKLEEPHLRLDQSSFSNHSPPLTRAKSLDSNAEKCLK 120

Query: 121 EDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAP 180
           +DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAP
Sbjct: 121 KDRSPKEIKDAVSSDSEEKQSSNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNSTGSAP 180

Query: 181 NIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSY 223
           NIKRT LSKDGV  KQSSHRNA K S  CSSSMGYQKPPLKKVHGSY
Sbjct: 181 NIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSY 226

BLAST of Sgr014740 vs. NCBI nr
Match: KAG7033387.1 (hypothetical protein SDJN02_07443, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 322.4 bits (825), Expect = 4.0e-84
Identity = 183/227 (80.62%), Positives = 192/227 (84.58%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIREC
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPN-SRSSSSAFNSSFDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-STHA-PLTRTQSLDVNAEKCLK 120
           S QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S H+ PLTR +SLD NAEKCLK
Sbjct: 61  SHQESSSADEIFSHGKILPLEIKKKLEEPHLRVDQSSFSNHSPPLTRAKSLDSNAEKCLK 120

Query: 121 EDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAP 180
           +DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAP
Sbjct: 121 KDRSPKEIKDAVSSDSEEKQSSNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNSTGSAP 180

Query: 181 NIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSY 223
           NIKRT LSKDGV  KQSSHRNA K S  CSSSMG+QKPPLKKVHGSY
Sbjct: 181 NIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGHQKPPLKKVHGSY 226

BLAST of Sgr014740 vs. NCBI nr
Match: XP_022990991.1 (uncharacterized protein LOC111487716 isoform X1 [Cucurbita maxima])

HSP 1 Score: 322.0 bits (824), Expect = 5.2e-84
Identity = 183/227 (80.62%), Positives = 191/227 (84.14%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA  P+I V  +SPRISFSHDF   EAIPVEQR N SRS+SS  NSS DFDFCIREC
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRSN-SRSSSSAFNSSFDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-STHA-PLTRTQSLDVNAEKCLK 120
           S QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S H+ PLTR +SLD NAEKCLK
Sbjct: 61  SHQESSSADEIFSHGKILPLEIKKKSEEPHLRVDQSSFSNHSPPLTRAKSLDSNAEKCLK 120

Query: 121 EDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAP 180
           +DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAP
Sbjct: 121 KDRSPKEIKEAVSSDSEEKQSSNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNSTGSAP 180

Query: 181 NIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSY 223
           NIKRT LSKDGV  KQSSHRNA K S  CSSSMGYQKPPLKKVHGSY
Sbjct: 181 NIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSY 226

BLAST of Sgr014740 vs. ExPASy TrEMBL
Match: A0A6J1BVI8 (uncharacterized protein LOC111006095 OS=Momordica charantia OX=3673 GN=LOC111006095 PE=4 SV=1)

HSP 1 Score: 372.1 bits (954), Expect = 2.1e-99
Identity = 198/227 (87.22%), Positives = 209/227 (92.07%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA CP+ISV  MSPRISFSHDFCQ+EAIPVEQRP  SRSNSSGLNSSIDFDFCIREC
Sbjct: 1   MAIEAVCPDISVPAMSPRISFSHDFCQSEAIPVEQRP-KSRSNSSGLNSSIDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTRTQSLDVNAEKCLKED 120
           SDQESSSADEIFSHG+ILPLEIKKKPE+PPV +DQSSS  APL RT+SLD + EKCLK+D
Sbjct: 61  SDQESSSADEIFSHGRILPLEIKKKPEDPPVLIDQSSSAPAPLARTRSLDADVEKCLKKD 120

Query: 121 RSSKETKAANSDSEEKQS--SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIK 180
           RSSKE KAANSDSEEKQS  SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIK
Sbjct: 121 RSSKEIKAANSDSEEKQSSNSKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIK 180

Query: 181 RTPLSKDGVNHKQSSHRNATKTS---SQCSSSMGYQKPPLKKVHGSY 223
           RTPLSKDG +HKQSSHRN++KTS   SQCSSSMGYQKPPLKKVHGSY
Sbjct: 181 RTPLSKDGASHKQSSHRNSSKTSSSHSQCSSSMGYQKPPLKKVHGSY 226

BLAST of Sgr014740 vs. ExPASy TrEMBL
Match: A0A6J1GMV2 (uncharacterized protein LOC111455949 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111455949 PE=4 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 1.1e-84
Identity = 183/227 (80.62%), Positives = 192/227 (84.58%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIREC
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPN-SRSSSSAFNSSFDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-STHA-PLTRTQSLDVNAEKCLK 120
           S QESSSADEIFSHGKILPLEIKKK EEP +R+DQSS S H+ PLTR +SLD NAEKCLK
Sbjct: 61  SHQESSSADEIFSHGKILPLEIKKKLEEPHLRLDQSSFSNHSPPLTRAKSLDSNAEKCLK 120

Query: 121 EDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAP 180
           +DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAP
Sbjct: 121 KDRSPKEIKDAVSSDSEEKQSSNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNSTGSAP 180

Query: 181 NIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSY 223
           NIKRT LSKDGV  KQSSHRNA K S  CSSSMGYQKPPLKKVHGSY
Sbjct: 181 NIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSY 226

BLAST of Sgr014740 vs. ExPASy TrEMBL
Match: A0A6J1JPH3 (uncharacterized protein LOC111487716 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487716 PE=4 SV=1)

HSP 1 Score: 322.0 bits (824), Expect = 2.5e-84
Identity = 183/227 (80.62%), Positives = 191/227 (84.14%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA  P+I V  +SPRISFSHDF   EAIPVEQR N SRS+SS  NSS DFDFCIREC
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRSN-SRSSSSAFNSSFDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-STHA-PLTRTQSLDVNAEKCLK 120
           S QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S H+ PLTR +SLD NAEKCLK
Sbjct: 61  SHQESSSADEIFSHGKILPLEIKKKSEEPHLRVDQSSFSNHSPPLTRAKSLDSNAEKCLK 120

Query: 121 EDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAP 180
           +DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAP
Sbjct: 121 KDRSPKEIKEAVSSDSEEKQSSNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNSTGSAP 180

Query: 181 NIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSY 223
           NIKRT LSKDGV  KQSSHRNA K S  CSSSMGYQKPPLKKVHGSY
Sbjct: 181 NIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSY 226

BLAST of Sgr014740 vs. ExPASy TrEMBL
Match: A0A6J1GN78 (uncharacterized protein LOC111455949 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111455949 PE=4 SV=1)

HSP 1 Score: 312.0 bits (798), Expect = 2.6e-81
Identity = 178/222 (80.18%), Positives = 187/222 (84.23%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIREC
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPN-SRSSSSAFNSSFDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-STHA-PLTRTQSLDVNAEKCLK 120
           S QESSSADEIFSHGKILPLEIKKK EEP +R+DQSS S H+ PLTR +SLD NAEKCLK
Sbjct: 61  SHQESSSADEIFSHGKILPLEIKKKLEEPHLRLDQSSFSNHSPPLTRAKSLDSNAEKCLK 120

Query: 121 EDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAP 180
           +DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAP
Sbjct: 121 KDRSPKEIKDAVSSDSEEKQSSNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNSTGSAP 180

Query: 181 NIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKK 218
           NIKRT LSKDGV  KQSSHRNA K S  CSSSMGYQKPPLKK
Sbjct: 181 NIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKK 221

BLAST of Sgr014740 vs. ExPASy TrEMBL
Match: A0A6J1JKF9 (uncharacterized protein LOC111487716 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111487716 PE=4 SV=1)

HSP 1 Score: 310.8 bits (795), Expect = 5.8e-81
Identity = 178/222 (80.18%), Positives = 186/222 (83.78%), Query Frame = 0

Query: 1   MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIREC 60
           MAIEA  P+I V  +SPRISFSHDF   EAIPVEQR N SRS+SS  NSS DFDFCIREC
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRSN-SRSSSSAFNSSFDFDFCIREC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-STHA-PLTRTQSLDVNAEKCLK 120
           S QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S H+ PLTR +SLD NAEKCLK
Sbjct: 61  SHQESSSADEIFSHGKILPLEIKKKSEEPHLRVDQSSFSNHSPPLTRAKSLDSNAEKCLK 120

Query: 121 EDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAP 180
           +DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAP
Sbjct: 121 KDRSPKEIKEAVSSDSEEKQSSNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNSTGSAP 180

Query: 181 NIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKK 218
           NIKRT LSKDGV  KQSSHRNA K S  CSSSMGYQKPPLKK
Sbjct: 181 NIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKK 221

BLAST of Sgr014740 vs. TAIR 10
Match: AT1G67050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G38320.1); Has 617 Blast hits to 318 proteins in 80 species: Archae - 0; Bacteria - 16; Metazoa - 141; Fungi - 62; Plants - 128; Viruses - 2; Other Eukaryotes - 268 (source: NCBI BLink). )

HSP 1 Score: 161.8 bits (408), Expect = 8.4e-40
Identity = 111/222 (50.00%), Positives = 138/222 (62.16%), Query Frame = 0

Query: 13  ATMSPRISFSHDFCQTEAIPVEQRP-NSSRSNSSGLNSSIDFDFCI------RECSDQES 72
           + MSPRISFS DFCQ++AIP+E+RP  SS S  S LNSSIDFDFCI       E  DQ S
Sbjct: 10  SNMSPRISFSRDFCQSDAIPIEKRPLRSSNSKPSSLNSSIDFDFCIPGGVNSGESFDQGS 69

Query: 73  SSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTRTQSLDVNAEKCLKEDRSSKE 132
            SADE+FS+GKILP EIKKKPE      +       P +R Q    N E+  +ED     
Sbjct: 70  WSADELFSNGKILPTEIKKKPEPGKKEPEPKPVKSKPDSRKQRKQPNEEQ--QEDDVIIT 129

Query: 133 TKAANSDSEEKQSSKSFWRFKRSSS--CGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLS 192
           T       EEK ++KSFW FKRSSS  CGS Y  SLCPLPLL+RSNSTGS  + ++   S
Sbjct: 130 T-------EEKTNTKSFWGFKRSSSLNCGSTYGRSLCPLPLLNRSNSTGSTSSKQKQSSS 189

Query: 193 KDGVNH---KQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSY 223
           +    H   +QSS  +++ ++S   S+ G+ KPPLKK +G Y
Sbjct: 190 RKHNEHVKLQQSSSLSSSSSASSSLSNNGFSKPPLKKSYGGY 222

BLAST of Sgr014740 vs. TAIR 10
Match: AT1G48780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G18300.1); Has 89 Blast hits to 89 proteins in 11 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 79.3 bits (194), Expect = 5.5e-15
Identity = 83/213 (38.97%), Positives = 108/213 (50.70%), Query Frame = 0

Query: 18  RISFSHDFCQTEAIP---VEQRPNSSRSNSSGLNSSIDFDFCIRECSDQ-ESSSADEIFS 77
           RISFS D  Q++  P   +E      R  +   +S+ DF+F I    D  +SS ADEIF+
Sbjct: 9   RISFSSDLGQSDKAPPPVIEPSGLIRRDETLLDSSNSDFEFHISNSFDPGDSSPADEIFA 68

Query: 78  HGKILPLEIKK---------KPEEPPVRVDQSSSTHAPLTRTQSLDVNAEKCLKEDRSSK 137
            G ILP  +           K E PP+    SS + +PL+       ++EK      ++ 
Sbjct: 69  DGMILPFHVTAASTVPKRLYKYELPPI---TSSLSPSPLSPQPLPTKHSEK-----ETNG 128

Query: 138 ETKAANSDSEEKQSSKSFWRFKRSSSCGSGYTHSL-CPLPLLSRSNSTGSAPNIKRTPLS 197
               ANSDSE ++SSKSFW FKRSSS       SL C  P L+RSNSTGS  N KR  L 
Sbjct: 129 RASGANSDSEAEKSSKSFWSFKRSSSLNCDIKKSLICSFPRLTRSNSTGSVTNSKRAML- 188

Query: 198 KDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLK 217
           +D  NH+ SS       SS C++   YQ  P K
Sbjct: 189 RDVNNHRPSSR------SSCCNA---YQFRPQK 203

BLAST of Sgr014740 vs. TAIR 10
Match: AT3G18300.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48780.1); Has 69 Blast hits to 69 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 72.4 bits (176), Expect = 6.7e-13
Identity = 85/241 (35.27%), Positives = 117/241 (48.55%), Query Frame = 0

Query: 18  RISFSHDFCQTE-AIPVEQRPNSSRSNSSGL--NSSIDFDFCIRECSDQ-ESSSADEIFS 77
           R SF+ D  Q++   P+EQ+P+      + L  +S+ DF+F I    D  +SS ADEIF+
Sbjct: 10  RFSFAGDLGQSDKGTPMEQQPSGPVRRDTTLLDSSNSDFEFHISSNFDPGDSSPADEIFA 69

Query: 78  HGKILPL------------EIKKKPEEPPVRVDQSSSTHAPLTRTQSLDVNAEKCLKEDR 137
            G ILP+            +   K E PP+    + S++ P       + + +  +KE R
Sbjct: 70  DGMILPVLPFQVTATSTMPKRLYKYELPPIVSAPTLSSYLPPLPLPLPEHSRKYSVKETR 129

Query: 138 SSKETK--AANSDSEEKQSSKSFWRFKRSSSCGSGYTHSL-CPLPLLSRSNSTGSAPNIK 197
            S   +   ANSDSE ++SSKSFW FKRSSS       SL C  P L+RSNSTGS    K
Sbjct: 130 GSLNGRGSGANSDSEAEKSSKSFWSFKRSSSLNCDIKKSLICSFPRLTRSNSTGSVAISK 189

Query: 198 RTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHGSYRLNCGDRKQHHLHKST 240
           R  L      +K SS R+         SS  + +PP      SY+     R Q H  K+ 
Sbjct: 190 REMLRD---INKHSSQRHGVPRPGVNPSS--HMRPPSSFCCSSYQF----RPQKHAGKNG 241

BLAST of Sgr014740 vs. TAIR 10
Match: AT5G38320.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: shoot apex, sepal, pedicel; EXPRESSED DURING: 4 anthesis; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G67050.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 64.3 bits (155), Expect = 1.8e-10
Identity = 37/77 (48.05%), Positives = 48/77 (62.34%), Query Frame = 0

Query: 13 ATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRE--CSDQESSSADE 72
          A  SPRISFS+DFC  E+IP+EQR + S  + S        +F I     S + S SA+E
Sbjct: 5  ANESPRISFSNDFCHHESIPIEQRTSQSPYDISNFYWGFPLEFSIPRGAISGESSWSAEE 64

Query: 73 IFSHGKILPLEIKKKPE 88
           F+ GKILP+E+KK PE
Sbjct: 65 FFNDGKILPIEMKKIPE 81

BLAST of Sgr014740 vs. TAIR 10
Match: AT5G38320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G67050.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 54.7 bits (130), Expect = 1.5e-07
Identity = 52/161 (32.30%), Positives = 82/161 (50.93%), Query Frame = 0

Query: 13  ATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRE--CSDQESSSADE 72
           A  SPRISFS+DFC  E+IP+EQR + S  + S        +F I     S + S SA+E
Sbjct: 5   ANESPRISFSNDFCHHESIPIEQRTSQSPYDISNFYWGFPLEFSIPRGAISGESSWSAEE 64

Query: 73  IFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTRTQSLDV-NAEKCLK-EDRSSKETKA 132
            F+ GKILP+E+KK PE  P+   ++      L R + + + + E  L+ E+   +E + 
Sbjct: 65  FFNDGKILPIEMKKIPE--PIYRSKTDKYKTGLPRPEIIPIEDFEPVLEIEEIGDQEYEV 124

Query: 133 ANSDSEEKQSSKSFWRFKRSSSCGSGYTHSLCPLPLLSRSN 170
                    +  +  + + SSS  S +  S  P P+L  ++
Sbjct: 125 KLPLLPYNSTGSNSIKSQVSSSSSSSFNGSF-PKPILKNNH 162

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_022133535.1  4.4e-99  87.22     uncharacterized protein LOC111006095 [Momordica charantia]
KAG6602701.1    1.1e-84  81.06     hypothetical protein SDJN03_07934, partial [Cucurbita argyrosperma subsp. sorori...
XP_022953381.1  2.3e-84  80.62     uncharacterized protein LOC111455949 isoform X1 [Cucurbita moschata]
KAG7033387.1    4.0e-84  80.62     hypothetical protein SDJN02_07443, partial [Cucurbita argyrosperma subsp. argyro...
XP_022990991.1  5.2e-84  80.62     uncharacterized protein LOC111487716 isoform X1 [Cucurbita maxima]

Match Name      E-value  Identity  Description
A0A6J1BVI8      2.1e-99  87.22     uncharacterized protein LOC111006095 OS=Momordica charantia OX=3673 GN=LOC111006...
A0A6J1GMV2      1.1e-84  80.62     uncharacterized protein LOC111455949 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JPH3      2.5e-84  80.62     uncharacterized protein LOC111487716 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1GN78      2.6e-81  80.18     uncharacterized protein LOC111455949 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JKF9      5.8e-81  80.18     uncharacterized protein LOC111487716 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...

Match Name      E-value  Identity  Description
AT1G67050.1     8.4e-40  50.00     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G48780.1     5.5e-15  38.97     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G18300.1     6.7e-13  35.27     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G38320.2     1.8e-10  48.05     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G38320.1     1.5e-07  32.30     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term     Source Description    Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 166..210
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 79..140
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 112..140
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 92..111
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 260..290
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 193..210
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 228..247
None      No IPR available  PANTHER      PTHR31722       OS06G0675200 PROTEIN  coord: 1..219
None      No IPR available  PANTHER      PTHR31722:SF34  F1O19.11 PROTEIN      coord: 1..219
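
Several of the mobidb-lite rows overlap or nest inside one another, so the distinct disordered regions are easier to see once the coordinate ranges are merged. The sketch below is one possible way to do that, assuming the coordinates are 1-based, inclusive residue positions and that all mobidb-lite rows can be pooled regardless of subtype. The helper names `parse_coord` and `merge_intervals` are hypothetical.

```python
import re

# Coordinate fields of the mobidb-lite rows in the InterPro table.
MOBIDB_COORDS = [
    "coord: 166..210",
    "coord: 79..140",
    "coord: 112..140",
    "coord: 92..111",
    "coord: 260..290",
    "coord: 193..210",
    "coord: 228..247",
]

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple:
    """Turn 'coord: 166..210' into the tuple (166, 210)."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


regions = merge_intervals(parse_coord(c) for c in MOBIDB_COORDS)
covered = sum(end - start + 1 for start, end in regions)
print(regions)
print(covered)
```

Under these assumptions the seven rows collapse into four regions (79..140, 166..210, 228..247 and 260..290) that together cover 158 residues.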

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr014740.1   Sgr014740.1  mRNA