Sgr026097 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr026097
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Descriptionprotein HEAT INTOLERANT 4-like
Locationtig00153031: 1705487 .. 1711170 (+)
RNA-Seq ExpressionSgr026097
SyntenySgr026097
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGGTCACAACTCACAAACACAAAGTGTAGCCGCACGTGCAGTACACAGACAGGCAAGCGTGCTTTTTTGTCCTCCCGGACCACCGCCGCGCGTCTTGTCTTGTCCCTCCCTCCTTGGTCCACGATGACCACGTGGGGGACCTCTACCCTCCTCTACACTCTACCCTATCAATATTCATTCATTCTGCCACCCACCAATAATTTAAATATATCTATTTAATGTTTTACATAAATATTTACTTTATGGGCTTTTTTATTGGGTCTCTTACAGGCCTGAGCCCATGTGTCTGGAAGGCCCAATTGTCATGTGGGCCGAGCCCAATGGCCCGATCGATCAGATTTCAAACCATTCCTGGAGCTCTGCGCACGCCGCCCGCCTTTCTCTGTCTCTTTCAAATTCGGACCGTCCGCGTTCATTTCACGCTCATTGACCCCGAATTTGAATCCCTTCGTCTCTCTCTCTCATCCTTCGCCTTTCCAATCCCCGGATCTGCCGCTTCTTCCTCTACACACTGTTTTGTATCTGCTCAGTTTCCAACACGATGAGGAAAGCAACTAAGAGGAAAGCTAGAAACAAGGAAGAGGCTAACTCTGCGGAGAAGGAGAACCGTAAAGAATCAACCACAGCCACCGCCACAGCACCTACTCGAGCCAAGCGAGTCAAGGCTTCCAAACCTCATTCCGAACCGGAGTTCTTCGAGGATAAGCGCAATTTGGTACCCAATTAACCATTACATTATTCGCATTCGCTCTGCATTGTTTTCCTTCAGCATTCTGCATTTTGCATTTCAAGTCTTGGGTGTTTCATGTTTGATCCCAGTCTGGAAGTTTTTTTTTTGGTTACTGGGAAGTATAGAGGATTTGGGAATAAATAAACCTTGGATTGCTTCATGCTCGGACACAGCAGTATCACTTCAGAGCACAACTTCGCTAAGCTACTAAAACAGTGGGCAACTCGTCCGGAATTCGCCTCTGTTTAAAATTCTGCATTTCACTCCTTTCTGATTACTGATAACCAAGTATTTTTATTTTTTCCAACCTCTGTAAGAGTGCTGTTAATAATAATAATAATAATGAAATGTTATGACTTCCGAGGATTTAGCCTCTCATTACAAATGAGGTAGATGTATGCGCAAGGCGCGAGAGTAAAAAGAAGGGGAATGCCCTCTTTGGCTTTTTTTTACTTTTGTGCTTCGGTGTGTTATTTCTAGTGATAATGTCCTCTTCTACTGATCGAGCATACCTACTTGTCTTATTCAGGAAGACCTGTGGAAGGCAGCATTCCCTGTGGGAACAGAGGTTTGTAATGTGTTGTATTTGTTCTCTTTTGAGTCGACAATTGTATTTTGCCACTAACGCTAATGATTTTATAACACCAGTGGGATCAACTGGATTTTGTTTATCAGTACAACTGGAATTTCTCAAATCTTGAAGTGAGTTATAGTTCTGTTGTATCTTTGTAATTTCATCCCTCTATTTCTCTTTGACTATAATGTTGAGTTGGATATATTATGCTTTTCAAGTTGTTACAACGATTAAAAGTAAAGTTTTAATCATGATCAGAATGCATTTGAAGAAGGGGGAAAACTATACGGGGAGAAAGTTTACCTATTTGGTTCTACAGAGCGTAAGTAACTTAAAACTCCAGCTCCCTCCCCCCCCACAAAATCTTTGAGAGAACCTGAAAACTCCTCACATCCCTTTTTATATGATTTTATATCATTGCAGCACAACTTGTCTCTTTCAAGGGTGAAAGTAAAGTTATCTGCATTCCTGTAGTGGTGGCTGTAAGTTGTTTGAAATATTTGTTTTACTTGGTCTTGTAGTAAATTAACATGTGATTACTTTCCTGCATTAGCTGTTGTAGTTAATTGATTTTATGTTATATAGGTTGTCTCACCTTTCCCACCTTCCGATAAAATTGGGATTAACTCTGTTCAACGAGAGGCTGAAGAGATCATACCCATGAAACAAATGAAAATGGGTTGGGTTCCCTACATTCCTCTCGAGGATAGGTACTCAACTTCCAATGAATTTCTTGTTTGTTTGTGATTGGACACCTTTAGGATTCCTTGCATAACTTTTGTTTAATGGCTTAATTCAATTCTTCTACAGAGATAGCCGAGTTGATAAGCTGAAATCCCAAATATTTATATTAAGTTGCACTCAAAGAAGGTAAATGGCTTTACCGCAGACTATCATCTCCCATCACACCACATGCCCTCTAAATAATAACCAAAAACAAAAGAGAAAGACCAATATTTATTGAGACCTGTAGTATGCCAATGCCAATTCTTAATCATGTGTGATATGTCGAACCTGTATTGATTGATGCCCCTAGTAAATATTGCAGATACAAATTGCCAACTAATGAATTACATTTGTTTCTTTTATTTGCTGGAGTAATACTGACTTTCAATTTCGTTTTCAAATTTACGACACTTGATGCTTTTATAGGGCTGCTCTGAAGCATCTGAAGATAGATCGTGTCAAGAAATATGAATACTGCTTGCCTTGTGAGTATTCCGTACTTCTTTCTTTTTATTATGTCAAGTGTTTGCATACCAATTAGTACTGTTTGATCCTGATAGAAGGGAAAAGTGGTTCCATTTACTTCTCTGTGGGGAGAAGGTCCGATTAGGAGAAATGGGCTAGCAGCTAGCTCAGGACTTGAATCGTTGATCTTTTCTGGTTCTCAGAGCTCTCTAGCTTTAGAATAAGCTGGTCAAGATAGACAAATACCTTTTCTTTTGACAGATTTCTACCAGCCCTTCAAGGAAGATGAGCTGGAGCAAAGCACTGAGGTCCAAATAATATTTCCCGCAGAACCAAAGCCAGTAAGATTTCAGCCTGTGATATTATGGGGAGAAATATAATGATTGTGATTGTTAGGAAGATGATTGAACGGAAACTTTAGCTACTGTGCTGTTCTTTTTGGTGTCTTTACCCTCTCAGTGATGACTTGATTCAGATGATTGAGGCTCTGCCTTAGTTTAGTATATTTATTTTTAATATAAGATCTCCTTTGCTAATTCAAAGTTTTATGTTGGGTACTGCAGATTTTCTGTGAATTTGATTGGGAACTAGATGAACTTCAGGTACTGGAATTGGCCTTCTGATTTTAGTTTTTGAGCTTTCTCGTAGACTTATTCATGACTCTTCCAGTTCTCTTTTTCAAATACATTATTTGAAGGTAACAGTTAAAATTTTGCAATGAACCTTCTCAAAATCTTTTGTAGGAGTTCACAGATAAGCTGATTGAAGAGGAGGAATTATCTGAAGATCAGAAGGATGCCTTCAAGGTAGTATTTGATGGGAAAAAAGCGCTCACTAATTTAAGTTTTGGACCCTATGACTGAATGTTTGAGCATGTCTTCCTTAGGAGTTTGTTAAGGAGAAAGTTAGAGAAGCAAAGAAAGCTAATCGAGAGGCAAGTTATAGTGAATATTTTTTGGCGGTTCTCTTTATTTGATTTTTTTTTTTACAACTCTTCTGATGGAGTTTTGGTGGTTGTAATTTTTACAGGCAAGGGAAGCACGTAAAAAAGTGCTCCAAGAAATGAGCGAGGAAACAAAAGCAGCATTTGAGAAAATGAGGTTCTATAAGTTCTACCCAGTTCAAACACAAGATAGCCCTGATATTTCCAACGTTAAGGTAACACTTAACACTTTCACCATTGTTTTTGGATAGTCCAAGGAATCTGATGAGATCTTTAGGCAATCTTCAATACTGTTAATCGGTTCCTTGAGATACAATATCTTGTGTTTAAATTTGAAATCAAATTTTTCAATGCATGTACATATCTTACTGTTGGGGATGACTTTTAACCATTTACCTTGTGCTGTCTTGTATTGAGTTCATTGGCCCATAATTTTTCAGTGAAGATTTAACCTTGAACCAAAGCTTTAAAAGGTTTTGACTTGAGAAAGAGACTAAAGCTACTACTACTAGTGGTGGTACTATATATCGTTAATGTGAAAGAGCCCAAGTACAATATATAAACATTACACAACCACTTCCCTGCATGTATGTCTCAGCAGATTCAGTGAGGTTATCTTTGTCTGGACTGTTGACACTAGCTGAGAAACTGCTCTTAATCCTTTGTGTTAATTCTTTTTCTTTTCTGGTGTTGCCTTCAGGCTCCATTCATAAACAGGTATTACGGAAAGGCTCACGAGGTTCTATGATTTGGTATGGTCATGGACTGGAAGGAAAGAAGTTTCTTACACTGTCAACCAACCAGCGCACCAAATGTTCAAGGAGTGCTTAGTTCTTGCTAAGAACTCTTGGAGGCAAGCCCCACTCCCACCGGTCTCGCACATTCCTTGATCGAGGATGTTAAATCAGGCTGTTGCCACTTTAAGATTAGGTCCAGGCACCATTATTGCCAGTCATAAGGAATGTGTAAAATTGAATCTGTACGCCCAGTTTTTAACTTTTTGCCATCAGCTACAGAGATGGAGGCTTGCTGCTACTCAAGTTTGGGGGAACCCTTCAAAATTTTACTTAATCTTTGACAATTGGATCTCACTTAATTTATGGAAACATTAGATTTCAGTTGTAGAATCTGATTACTAATTTACTCTCTGCATGACCATTCTATTTGCCGACCGACCCTTCAATGATAGTGATTTTTACATTAGAAGAATCAATAATTGATTTTTTGGACTGATGATGTTTTTATAGTTCTGTAAAGTTGATGTTAAGAGTCCAATTTTAGCACTCTTAAGATATCAATTGTTTCGTGCTTATTATAGAGAGTGACTTTTACAATATTGCAAGCGCCTAAAATAAAAGGGTAAGATCCCATCTGATACCCTAAAGATCCCATCTGATAACTACTTTGTTTTTAACCTTTAAATACGTCTTTAGAACATTTTACTATAATCTTAACCCCCCCCCCCCCCAACCCAAAAAAAAAAAAAACCAGTAATTTTTGAAATTTGGCTAACAAACGCATTATAAAACAAATTATATTTACTTTTTTTTTCTCATCAATGCGTACATCTCTTAAAGAAAAGAAGCAAAATATTGACACAATATAATGTATTAATACTATAATAATAAGAAAATAACTTAGCAGTTTTAACTCCATTGCTATCTCAAAGTTAAAAAAAAAGATAAAAATTAAGGGAAAAAAAAAGGCTCCTATAAAAACTAATATACAATAAATAATCCAATAAATTGGATTCGTCAGAAAACTCTGTTAAATAATAAACTGTGCAACTCGGCTCGAAACTAAAGCAGCAGCATCCCGCAGCCACATCCTTGCTTCAGTAGCCGCCCGCTGCAGACATATCCGTCACCAGAAGAAAAAAATGATCTTTCCATTCACCAACACCTGCAAATGGAAGCAGCCCACACAGCAATGGCAGAAATACAAGCAGAAAGGTGTAGAATACAATGCGGCGGATCCAATATGGAAAGATCGTGTGCCGATTTTTCCCTTGAGAAACCCCGACCGGGCCATGTAGGATCAGAAACAGTGCAGATGTAAAAAGGTTGGCAAATAAGTGCAATAAGCACAAGGTCTCCCAGTACACGAGCCAATGCAATCCGCTTCCTCCCAGTGTGTCTATGCACAACTTCCCGGCTAAACCGCAGCCAACTCCGAGCAGCAAGACGATGAAAGTAACTGGCATTGCCTGGTTTGAAGTTGCTGAACGTTGGAGAAGCAGATAA

mRNA sequence

ATGAGGTCACAACTCACAAACACAAAGTGTAGCCGCACGTGCAGTACACAGACAGGCAAGCGTGCTTTTTTGTCCTCCCGGACCACCGCCGCGCGTCTTGTCTTGTCCCTCCCTCCTTGGTCCACGATGACCACGCCTGAGCCCATGTGTCTGGAAGGCCCAATTGTCATGTGGGCCGAGCCCAATGGCCCGATCGATCAGATTTCAAACCATTCCTGGAGCTCTGCGCACGCCGCCCGCCTTTCTCTGTCTCTTTCAAATTCGGACCTTTCCAACACGATGAGGAAAGCAACTAAGAGGAAAGCTAGAAACAAGGAAGAGGCTAACTCTGCGGAGAAGGAGAACCGTAAAGAATCAACCACAGCCACCGCCACAGCACCTACTCGAGCCAAGCGAGTCAAGGCTTCCAAACCTCATTCCGAACCGGAGTTCTTCGAGGATAAGCGCAATTTGGAAGACCTGTGGAAGGCAGCATTCCCTGTGGGAACAGAGTGGGATCAACTGGATTTTGTTTATCAGTACAACTGGAATTTCTCAAATCTTGAAAATGCATTTGAAGAAGGGGGAAAACTATACGGGGAGAAAGTTTACCTATTTGGTTCTACAGAGCCACAACTTGTCTCTTTCAAGGGTGAAAGTAAAGTTATCTGCATTCCTGTAGTGGTGGCTGTTGTCTCACCTTTCCCACCTTCCGATAAAATTGGGATTAACTCTGTTCAACGAGAGGCTGAAGAGATCATACCCATGAAACAAATGAAAATGGGTTGGGTTCCCTACATTCCTCTCGAGGATAGAGATAGCCGAGTTGATAAGCTGAAATCCCAAATATTTATATTAAGTTGCACTCAAAGAAGGGCTGCTCTGAAGCATCTGAAGATAGATCGTGTCAAGAAATATGAATACTGCTTGCCTTATTTCTACCAGCCCTTCAAGGAAGATGAGCTGGAGCAAAGCACTGAGGTCCAAATAATATTTCCCGCAGAACCAAAGCCAATTTTCTGTGAATTTGATTGGGAACTAGATGAACTTCAGGAGTTCACAGATAAGCTGATTGAAGAGGAGGAATTATCTGAAGATCAGAAGGATGCCTTCAAGGAGTTTGTTAAGGAGAAAGTTAGAGAAGCAAAGAAAGCTAATCGAGAGGCAAGGGAAGCACGTAAAAAAGTGCTCCAAGAAATGAGCGAGGAAACAAAAGCAGCATTTGAGAAAATGAGGTTCTATAAGTTCTACCCAGTTCAAACACAAGATAGCCCTGATATTTCCAACGTTAAGGCTCCATTCATAAACAGGTATTACGGAAAGGCTCACGAGGCTGTTGCCACTTTAAGATTAGGTCCAGGCACCATTATTGCCAGTCATAAGGAATGTGTAAAATTGAATCTCAGCATCCCGCAGCCACATCCTTGCTTCAGTAGCCGCCCGCTGCAGACATATCCGTCACCAGAAGAAAAAAATGATCTTTCCATTCACCAACACCTGCAAATGGAAGCAGCCCACACAGCAATGGCAGAAATACAAGCAGAAAGGATCAGAAACAGTGCAGATGTAAAAAGGTTGGCAAATAAGTGCAATAAGCACAAGGTCTCCCAGTACACGAGCCAATGCAATCCGCTTCCTCCCAGTGTGTCTATGCACAACTTCCCGGCTAAACCGCAGCCAACTCCGAGCAGCAAGACGATGAAAGTAACTGGCATTGCCTGGTTTGAAGTTGCTGAACGTTGGAGAAGCAGATAA

Coding sequence (CDS)

ATGAGGTCACAACTCACAAACACAAAGTGTAGCCGCACGTGCAGTACACAGACAGGCAAGCGTGCTTTTTTGTCCTCCCGGACCACCGCCGCGCGTCTTGTCTTGTCCCTCCCTCCTTGGTCCACGATGACCACGCCTGAGCCCATGTGTCTGGAAGGCCCAATTGTCATGTGGGCCGAGCCCAATGGCCCGATCGATCAGATTTCAAACCATTCCTGGAGCTCTGCGCACGCCGCCCGCCTTTCTCTGTCTCTTTCAAATTCGGACCTTTCCAACACGATGAGGAAAGCAACTAAGAGGAAAGCTAGAAACAAGGAAGAGGCTAACTCTGCGGAGAAGGAGAACCGTAAAGAATCAACCACAGCCACCGCCACAGCACCTACTCGAGCCAAGCGAGTCAAGGCTTCCAAACCTCATTCCGAACCGGAGTTCTTCGAGGATAAGCGCAATTTGGAAGACCTGTGGAAGGCAGCATTCCCTGTGGGAACAGAGTGGGATCAACTGGATTTTGTTTATCAGTACAACTGGAATTTCTCAAATCTTGAAAATGCATTTGAAGAAGGGGGAAAACTATACGGGGAGAAAGTTTACCTATTTGGTTCTACAGAGCCACAACTTGTCTCTTTCAAGGGTGAAAGTAAAGTTATCTGCATTCCTGTAGTGGTGGCTGTTGTCTCACCTTTCCCACCTTCCGATAAAATTGGGATTAACTCTGTTCAACGAGAGGCTGAAGAGATCATACCCATGAAACAAATGAAAATGGGTTGGGTTCCCTACATTCCTCTCGAGGATAGAGATAGCCGAGTTGATAAGCTGAAATCCCAAATATTTATATTAAGTTGCACTCAAAGAAGGGCTGCTCTGAAGCATCTGAAGATAGATCGTGTCAAGAAATATGAATACTGCTTGCCTTATTTCTACCAGCCCTTCAAGGAAGATGAGCTGGAGCAAAGCACTGAGGTCCAAATAATATTTCCCGCAGAACCAAAGCCAATTTTCTGTGAATTTGATTGGGAACTAGATGAACTTCAGGAGTTCACAGATAAGCTGATTGAAGAGGAGGAATTATCTGAAGATCAGAAGGATGCCTTCAAGGAGTTTGTTAAGGAGAAAGTTAGAGAAGCAAAGAAAGCTAATCGAGAGGCAAGGGAAGCACGTAAAAAAGTGCTCCAAGAAATGAGCGAGGAAACAAAAGCAGCATTTGAGAAAATGAGGTTCTATAAGTTCTACCCAGTTCAAACACAAGATAGCCCTGATATTTCCAACGTTAAGGCTCCATTCATAAACAGGTATTACGGAAAGGCTCACGAGGCTGTTGCCACTTTAAGATTAGGTCCAGGCACCATTATTGCCAGTCATAAGGAATGTGTAAAATTGAATCTCAGCATCCCGCAGCCACATCCTTGCTTCAGTAGCCGCCCGCTGCAGACATATCCGTCACCAGAAGAAAAAAATGATCTTTCCATTCACCAACACCTGCAAATGGAAGCAGCCCACACAGCAATGGCAGAAATACAAGCAGAAAGGATCAGAAACAGTGCAGATGTAAAAAGGTTGGCAAATAAGTGCAATAAGCACAAGGTCTCCCAGTACACGAGCCAATGCAATCCGCTTCCTCCCAGTGTGTCTATGCACAACTTCCCGGCTAAACCGCAGCCAACTCCGAGCAGCAAGACGATGAAAGTAACTGGCATTGCCTGGTTTGAAGTTGCTGAACGTTGGAGAAGCAGATAA

Protein sequence

MRSQLTNTKCSRTCSTQTGKRAFLSSRTTAARLVLSLPPWSTMTTPEPMCLEGPIVMWAEPNGPIDQISNHSWSSAHAARLSLSLSNSDLSNTMRKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLEDLWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGESKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLKSQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIFCEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEMSEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHEAVATLRLGPGTIIASHKECVKLNLSIPQPHPCFSSRPLQTYPSPEEKNDLSIHQHLQMEAAHTAMAEIQAERIRNSADVKRLANKCNKHKVSQYTSQCNPLPPSVSMHNFPAKPQPTPSSKTMKVTGIAWFEVAERWRSR
Homology
BLAST of Sgr026097 vs. NCBI nr
Match: XP_022133379.1 (uncharacterized protein LOC111005966 [Momordica charantia])

HSP 1 Score: 634.4 bits (1635), Expect = 9.5e-178
Identity = 327/344 (95.06%), Postives = 333/344 (96.80%), Query Frame = 0

Query: 94  MRKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLED 153
           MRK TKRKA   E+A  AEKENRKESTTATA   TRAKRVKASKP S+PE+F+DKRNLED
Sbjct: 1   MRKGTKRKASKNEDAKFAEKENRKESTTATAA--TRAKRVKASKPDSQPEYFQDKRNLED 60

Query: 154 LWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 213
           LWKAAFPVGTEWDQLD VYQYNWNFSNLE+AFEEGGKLYGEKVYLFGSTEPQLVSFKGES
Sbjct: 61  LWKAAFPVGTEWDQLDSVYQYNWNFSNLEDAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 120

Query: 214 KVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 273
           +VICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK
Sbjct: 121 RVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 180

Query: 274 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 333
           SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF
Sbjct: 181 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 240

Query: 334 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 393
           CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM
Sbjct: 241 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 300

Query: 394 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE 438
           SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE
Sbjct: 301 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE 342

BLAST of Sgr026097 vs. NCBI nr
Match: KAF3965322.1 (hypothetical protein CMV_010480 [Castanea mollissima])

HSP 1 Score: 581.3 bits (1497), Expect = 9.5e-162
Identity = 299/362 (82.60%), Postives = 325/362 (89.78%), Query Frame = 0

Query: 94  MRKATKRKARNKEEAN-SAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLE 153
           MRK  KRKA  KEEA   A+++  K++T       ++AKRVKASKP +EPE+FEDKRNLE
Sbjct: 1   MRKGAKRKASQKEEAKPQAQQQESKKAT-------SQAKRVKASKPETEPEYFEDKRNLE 60

Query: 154 DLWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGE 213
           DLWK  FPVGTEWDQLD VYQ+NWNFSNLE+AFEEGGKLYG+KVYLFG TEPQLVSFKGE
Sbjct: 61  DLWKEVFPVGTEWDQLDAVYQFNWNFSNLEDAFEEGGKLYGKKVYLFGCTEPQLVSFKGE 120

Query: 214 SKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKL 273
           SKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKM WVPYIPLEDRDS+VD+L
Sbjct: 121 SKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMDWVPYIPLEDRDSQVDRL 180

Query: 274 KSQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPI 333
           KSQI+IL CTQRRAALKHLKIDR+KKYEYCLPYFYQPFKEDELEQSTEVQIIFP EPKPI
Sbjct: 181 KSQIYILRCTQRRAALKHLKIDRLKKYEYCLPYFYQPFKEDELEQSTEVQIIFPVEPKPI 240

Query: 334 FCEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQE 393
           FCEFDWELDEL+EFTDKLI+EEEL+EDQKDAFKEFVKEKVREAKKANREAREARKK L+E
Sbjct: 241 FCEFDWELDELEEFTDKLIQEEELAEDQKDAFKEFVKEKVREAKKANREAREARKKALEE 300

Query: 394 MSEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHEAVATLRLGPGTIIAS 453
           M+EE+KAAFE MRFYKFYPVQT DSPD+S+VKAPFINRYYGKAHE    L LG   ++  
Sbjct: 301 MTEESKAAFENMRFYKFYPVQTPDSPDVSSVKAPFINRYYGKAHE---ILELGGKPLLPH 352

Query: 454 HK 455
           HK
Sbjct: 361 HK 352

BLAST of Sgr026097 vs. NCBI nr
Match: KAF3440453.1 (hypothetical protein FNV43_RR18737 [Rhamnella rubrinervis])

HSP 1 Score: 576.6 bits (1485), Expect = 2.3e-160
Identity = 297/346 (85.84%), Postives = 318/346 (91.91%), Query Frame = 0

Query: 94  MRKATKRKA-RNKEEANSAEKENRKESTTATATAPTRAKRVKASK-PHSEPEFFEDKRNL 153
           MRK  KRKA   KEE NSA+  ++++  ++ AT  TRAKRVKASK P  EPE+FEDKRNL
Sbjct: 1   MRKGAKRKASTKKEEGNSAQDNHKQQQQSSKAT--TRAKRVKASKPPQPEPEYFEDKRNL 60

Query: 154 EDLWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKG 213
           EDLWK  FPVGTEWDQLD VYQ+NW+FSNLE AFEEGGKLYGEKVYLFG TEPQLV  KG
Sbjct: 61  EDLWKVTFPVGTEWDQLDSVYQFNWDFSNLEQAFEEGGKLYGEKVYLFGCTEPQLVPVKG 120

Query: 214 ESKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDK 273
           E+KVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKM WVPYIPLE RDS+VD+
Sbjct: 121 ENKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMDWVPYIPLEKRDSQVDR 180

Query: 274 LKSQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKP 333
           LKSQIFILSCTQRRAALKHLKIDR+KKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKP
Sbjct: 181 LKSQIFILSCTQRRAALKHLKIDRIKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKP 240

Query: 334 IFCEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQ 393
           IFCEFDWELDEL+EFTDKLI+EEELSEDQKDAFK FV+EKV+EAKKANREAREARKK L+
Sbjct: 241 IFCEFDWELDELEEFTDKLIQEEELSEDQKDAFKAFVREKVKEAKKANREAREARKKALE 300

Query: 394 EMSEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE 438
           EMSEETKAAFEKMRFYKFYPVQT D+PD+SNVKAPFINRYYGKAHE
Sbjct: 301 EMSEETKAAFEKMRFYKFYPVQTPDTPDVSNVKAPFINRYYGKAHE 344

BLAST of Sgr026097 vs. NCBI nr
Match: KAG7987386.1 (hypothetical protein I3843_03G131400 [Carya illinoinensis])

HSP 1 Score: 574.3 bits (1479), Expect = 1.2e-159
Identity = 295/344 (85.76%), Postives = 313/344 (90.99%), Query Frame = 0

Query: 94  MRKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLED 153
           MRK  KRKA  KE A    + +R+ES   T    ++AKRVKASKP SEPE+ EDKRNLED
Sbjct: 1   MRKGAKRKASQKEGAKPEHESHREESKKTT----SQAKRVKASKPESEPEYIEDKRNLED 60

Query: 154 LWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 213
           LWK AFPVGTEWDQLD VYQ NWNFSNLE+AFEEGGKL+G+K YLFG TEPQLVSFKGES
Sbjct: 61  LWKEAFPVGTEWDQLDSVYQVNWNFSNLEDAFEEGGKLHGKKAYLFGCTEPQLVSFKGES 120

Query: 214 KVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 273
           KVICIPVVVAVVSPFPPSDKIGINSVQRE+EEIIPMKQMKM WVPYIPLE+R S+VDKL 
Sbjct: 121 KVICIPVVVAVVSPFPPSDKIGINSVQRESEEIIPMKQMKMDWVPYIPLENRGSQVDKLH 180

Query: 274 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 333
           S+IFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFP EPKPIF
Sbjct: 181 SEIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPGEPKPIF 240

Query: 334 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 393
           CEFDWELDEL+EFTDKLI+EEELSEDQKDAFKEFVKE+VREAKKANREAREARKK L+EM
Sbjct: 241 CEFDWELDELEEFTDKLIQEEELSEDQKDAFKEFVKERVREAKKANREAREARKKALEEM 300

Query: 394 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE 438
           SEETKAAFE MRFYKFYPVQT D+PDISNVKAPFINRYYGKAHE
Sbjct: 301 SEETKAAFENMRFYKFYPVQTPDTPDISNVKAPFINRYYGKAHE 340

BLAST of Sgr026097 vs. NCBI nr
Match: XP_023883507.1 (protein HEAT INTOLERANT 4-like [Quercus suber] >XP_023883508.1 protein HEAT INTOLERANT 4-like [Quercus suber])

HSP 1 Score: 573.2 bits (1476), Expect = 2.6e-159
Identity = 293/345 (84.93%), Postives = 316/345 (91.59%), Query Frame = 0

Query: 94  MRKATKRKARNKEEAN-SAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLE 153
           M K  KRKA  KEEA   A+++  K++T       +RAKRVKASKP +EPE+FEDKRNLE
Sbjct: 1   MGKGAKRKASQKEEAKPQAQQQESKKAT-------SRAKRVKASKPETEPEYFEDKRNLE 60

Query: 154 DLWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGE 213
           DLWK  FPVGTEWDQLD VYQ+NWNFSNLE+AFEE GKLYG+KVYLFG TEPQLVSFKGE
Sbjct: 61  DLWKEVFPVGTEWDQLDAVYQFNWNFSNLEDAFEEDGKLYGKKVYLFGCTEPQLVSFKGE 120

Query: 214 SKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKL 273
           SKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKM WVPYIPLEDRDS+VD+L
Sbjct: 121 SKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMDWVPYIPLEDRDSQVDRL 180

Query: 274 KSQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPI 333
           KSQI+IL CTQRRAALKHLKIDR+KKYEYCLPYFYQPFKEDELEQSTEVQIIFP EPKPI
Sbjct: 181 KSQIYILRCTQRRAALKHLKIDRLKKYEYCLPYFYQPFKEDELEQSTEVQIIFPVEPKPI 240

Query: 334 FCEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQE 393
           FCEFDWELDEL+EFTDKLI+EEEL+EDQKDAFKEFVKEKVREAKKANREAREARKK L+E
Sbjct: 241 FCEFDWELDELEEFTDKLIQEEELAEDQKDAFKEFVKEKVREAKKANREAREARKKALEE 300

Query: 394 MSEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE 438
           M+EE+KAAFE MRFYKFYPVQT DSPD+S+VKAPFINRYYGKAHE
Sbjct: 301 MTEESKAAFENMRFYKFYPVQTPDSPDVSSVKAPFINRYYGKAHE 338

BLAST of Sgr026097 vs. ExPASy Swiss-Prot
Match: A2RVJ8 (Protein HEAT INTOLERANT 4 OS=Arabidopsis thaliana OX=3702 GN=HIT4 PE=1 SV=1)

HSP 1 Score: 456.1 bits (1172), Expect = 6.1e-127
Identity = 223/345 (64.64%), Postives = 279/345 (80.87%), Query Frame = 0

Query: 95  RKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLEDL 154
           +K   R+   ++ A   + E + E          +AK+ +A+K   EP +FE+KR+LEDL
Sbjct: 96  KKPVARRGGKRKRATKKDTEIKDEKKPV-----PKAKKPRAAKVKEEPVYFEEKRSLEDL 155

Query: 155 WKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGESK 214
           WK AFPVGTEWDQLD +Y++NW+F NLE A EEGGKLYG+KVY+FG TEPQLV +KG +K
Sbjct: 156 WKVAFPVGTEWDQLDALYEFNWDFQNLEEALEEGGKLYGKKVYVFGCTEPQLVPYKGANK 215

Query: 215 VICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLKS 274
           ++ +P VV + SPFPPSDKIGI SVQRE EEIIPMK+MKM W+PYIP+E RD +VDK+ S
Sbjct: 216 IVHVPAVVVIESPFPPSDKIGITSVQREVEEIIPMKKMKMDWLPYIPIEKRDRQVDKMNS 275

Query: 275 QIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIFC 334
           QIF L CTQRR+AL+H+K D++KK+EYCLPYFYQPFKEDELEQSTEVQI+FP+EP P+ C
Sbjct: 276 QIFTLGCTQRRSALRHMKEDQLKKFEYCLPYFYQPFKEDELEQSTEVQIMFPSEP-PVVC 335

Query: 335 EFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEMS 394
           EFDWE DELQEF DKL+EEE L  +Q D FKE+VKE+VR AKKANREA++ARKK ++EMS
Sbjct: 336 EFDWEFDELQEFVDKLVEEEALPAEQADEFKEYVKEQVRAAKKANREAKDARKKAIEEMS 395

Query: 395 EETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHEAV 440
           E+TK AF+KM+FYKFYP  + D+PD+S V++PFINRYYGKAHE +
Sbjct: 396 EDTKQAFQKMKFYKFYPQPSPDTPDVSGVQSPFINRYYGKAHEVL 434

BLAST of Sgr026097 vs. ExPASy TrEMBL
Match: A0A6J1BVU0 (uncharacterized protein LOC111005966 OS=Momordica charantia OX=3673 GN=LOC111005966 PE=4 SV=1)

HSP 1 Score: 634.4 bits (1635), Expect = 4.6e-178
Identity = 327/344 (95.06%), Postives = 333/344 (96.80%), Query Frame = 0

Query: 94  MRKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLED 153
           MRK TKRKA   E+A  AEKENRKESTTATA   TRAKRVKASKP S+PE+F+DKRNLED
Sbjct: 1   MRKGTKRKASKNEDAKFAEKENRKESTTATAA--TRAKRVKASKPDSQPEYFQDKRNLED 60

Query: 154 LWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 213
           LWKAAFPVGTEWDQLD VYQYNWNFSNLE+AFEEGGKLYGEKVYLFGSTEPQLVSFKGES
Sbjct: 61  LWKAAFPVGTEWDQLDSVYQYNWNFSNLEDAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 120

Query: 214 KVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 273
           +VICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK
Sbjct: 121 RVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 180

Query: 274 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 333
           SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF
Sbjct: 181 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 240

Query: 334 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 393
           CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM
Sbjct: 241 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 300

Query: 394 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE 438
           SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE
Sbjct: 301 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE 342

BLAST of Sgr026097 vs. ExPASy TrEMBL
Match: A0A6J1C502 (uncharacterized protein LOC111008415 OS=Momordica charantia OX=3673 GN=LOC111008415 PE=4 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 1.3e-159
Identity = 293/346 (84.68%), Postives = 311/346 (89.88%), Query Frame = 0

Query: 94  MRKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLED 153
           MRK TKRK   KEE    E + RKE       AP+RAKR K  KP SEPE+FEDKRNLED
Sbjct: 1   MRKGTKRKTARKEEDKPVEPK-RKE-------APSRAKRAKLPKPESEPEYFEDKRNLED 60

Query: 154 LWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 213
           LWKAAFPVGTEWDQLD VYQ+NWNFSNLE+AFEEGGKLYGEKVYLFG TEPQLV FKGE+
Sbjct: 61  LWKAAFPVGTEWDQLDTVYQFNWNFSNLEDAFEEGGKLYGEKVYLFGCTEPQLVPFKGEN 120

Query: 214 KVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 273
           KVICIP VVAVVSPFPPSDKIGINSVQREAEEI+PMKQMKM WVPYIPLE R+SRVDKLK
Sbjct: 121 KVICIPAVVAVVSPFPPSDKIGINSVQREAEEIVPMKQMKMDWVPYIPLEKRESRVDKLK 180

Query: 274 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 333
           SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDE EQSTEV IIFP +PKP+F
Sbjct: 181 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDEFEQSTEVPIIFPIDPKPVF 240

Query: 334 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 393
           CEFDWELDEL+EFTDKLIEEEELSE QKDAFK+FVKEKVREAKKANREAREARKK ++EM
Sbjct: 241 CEFDWELDELEEFTDKLIEEEELSESQKDAFKDFVKEKVREAKKANREAREARKKAIEEM 300

Query: 394 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHEAV 440
           S+ETK AFEKM+FYKFYPVQT D+PDISNVKAPFINRYYGKAHE +
Sbjct: 301 SKETKEAFEKMKFYKFYPVQTPDTPDISNVKAPFINRYYGKAHEVL 338

BLAST of Sgr026097 vs. ExPASy TrEMBL
Match: A0A2I4FW82 (protein HEAT INTOLERANT 4-like OS=Juglans regia OX=51240 GN=LOC109002555 PE=4 SV=1)

HSP 1 Score: 572.4 bits (1474), Expect = 2.1e-159
Identity = 295/344 (85.76%), Postives = 313/344 (90.99%), Query Frame = 0

Query: 94  MRKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLED 153
           MRK  KRKA  KE A    + +R+ES   T    +RAKRVKASKP SEPE+ EDKRNLED
Sbjct: 1   MRKGAKRKASQKEGAKPELESHREESKKTT----SRAKRVKASKPESEPEYIEDKRNLED 60

Query: 154 LWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 213
           LWK AFPVGTEWDQLD VYQ NWNFSNLE+AFEEGGKL+G+KVYLFG TEPQLVSFKGES
Sbjct: 61  LWKEAFPVGTEWDQLDSVYQVNWNFSNLEDAFEEGGKLHGKKVYLFGCTEPQLVSFKGES 120

Query: 214 KVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 273
           KVICIPVVVAVVSPFPPSDKIGINSVQRE+EEIIPMKQMKM WVPYIPLE+R S+VDKL 
Sbjct: 121 KVICIPVVVAVVSPFPPSDKIGINSVQRESEEIIPMKQMKMDWVPYIPLENRGSQVDKLH 180

Query: 274 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 333
           S+IFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFP EPKPIF
Sbjct: 181 SEIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPGEPKPIF 240

Query: 334 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 393
           CEFDWELDEL+EFTDKLI+EEELSEDQK+ FKEFVKEKVREAKKANREAREARKK ++EM
Sbjct: 241 CEFDWELDELEEFTDKLIQEEELSEDQKNTFKEFVKEKVREAKKANREAREARKKAVEEM 300

Query: 394 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHE 438
           SEETKAAFE MRFYKFYPVQT D+PDISNVKAPFINRYYGKAHE
Sbjct: 301 SEETKAAFETMRFYKFYPVQTPDTPDISNVKAPFINRYYGKAHE 340

BLAST of Sgr026097 vs. ExPASy TrEMBL
Match: A0A5N6QIW9 (Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_002744 PE=4 SV=1)

HSP 1 Score: 571.6 bits (1472), Expect = 3.7e-159
Identity = 292/346 (84.39%), Postives = 319/346 (92.20%), Query Frame = 0

Query: 94  MRKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLED 153
           MRK  KRKA   +EA +A+ E ++ES  AT    +RAKRVKAS P SEPE+FEDKRNLED
Sbjct: 1   MRKGAKRKASQAKEAETAQ-EKQQESKKAT----SRAKRVKASVPESEPEYFEDKRNLED 60

Query: 154 LWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 213
           LWKAAFPVGTEWDQLD VYQ+ WNFSNLE+AFEEGGKL+G+KVYLFG TEPQLVSFKGES
Sbjct: 61  LWKAAFPVGTEWDQLDAVYQFKWNFSNLEDAFEEGGKLHGKKVYLFGCTEPQLVSFKGES 120

Query: 214 KVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 273
           K+ICIPVVVAVVSPFPPSDKIG+NSVQREAEEIIPMKQMKM WVPYIPLE+R S+V+ L+
Sbjct: 121 KIICIPVVVAVVSPFPPSDKIGVNSVQREAEEIIPMKQMKMDWVPYIPLENRGSQVESLR 180

Query: 274 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 333
           SQIFILSCTQRRAALKHLKIDR+KKYEYCLPYFYQPFKEDELEQSTEVQIIFP++ KPIF
Sbjct: 181 SQIFILSCTQRRAALKHLKIDRLKKYEYCLPYFYQPFKEDELEQSTEVQIIFPSDTKPIF 240

Query: 334 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 393
           CEFDWELDEL+EFTDKLI+EEEL+EDQKDAFKEFVKEKVREAK+ANREAREARKK L+EM
Sbjct: 241 CEFDWELDELEEFTDKLIQEEELAEDQKDAFKEFVKEKVREAKRANREAREARKKALEEM 300

Query: 394 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHEAV 440
           SEETKAAFE MRFYKFYPVQT DSPD+SNVKAPFINRYYGKAHE +
Sbjct: 301 SEETKAAFENMRFYKFYPVQTPDSPDVSNVKAPFINRYYGKAHEVL 341

BLAST of Sgr026097 vs. ExPASy TrEMBL
Match: A0A0A0KUL7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G037620 PE=4 SV=1)

HSP 1 Score: 570.9 bits (1470), Expect = 6.2e-159
Identity = 290/346 (83.82%), Postives = 308/346 (89.02%), Query Frame = 0

Query: 94  MRKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLED 153
           MRK TKRKA  KEE   AE +  K        AP+RAKR K  KP SEPE+FEDKRN+ED
Sbjct: 1   MRKGTKRKAARKEEDKPAEPKPDK--------APSRAKRTKLPKPESEPEYFEDKRNMED 60

Query: 154 LWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGES 213
           LWKAAFPVGTEWDQLD VYQ+NWNFSNLE+AFEEGGKLYGEKVYLFG TEPQLV FKGE+
Sbjct: 61  LWKAAFPVGTEWDQLDSVYQFNWNFSNLEDAFEEGGKLYGEKVYLFGCTEPQLVPFKGEN 120

Query: 214 KVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLK 273
           KVICIPVVVAV SPFPPSDKIGINSVQREAEEIIPMKQMKM WVPYIPLE RD RVDKLK
Sbjct: 121 KVICIPVVVAVASPFPPSDKIGINSVQREAEEIIPMKQMKMDWVPYIPLEKRDRRVDKLK 180

Query: 274 SQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIF 333
           SQIFILSCTQRRAALKHLKIDR+KKYEYCLPYFYQPFK+DE EQSTEV IIFP +PKP+F
Sbjct: 181 SQIFILSCTQRRAALKHLKIDRLKKYEYCLPYFYQPFKDDEFEQSTEVPIIFPVDPKPVF 240

Query: 334 CEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEM 393
           CEFDWE DEL+EFTDKLIEEEELSE QKDAFK+FV+EKVREAKKANREAREARKK ++EM
Sbjct: 241 CEFDWEFDELEEFTDKLIEEEELSESQKDAFKDFVREKVREAKKANREAREARKKAIEEM 300

Query: 394 SEETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHEAV 440
           S ETK AFEKM+FYKFYPVQT DSPDISNVKAPFINRYYGKAHE +
Sbjct: 301 SNETKEAFEKMKFYKFYPVQTPDSPDISNVKAPFINRYYGKAHEVL 338

BLAST of Sgr026097 vs. TAIR 10
Match: AT5G10010.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: nucleolus; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G64910.1); Has 33260 Blast hits to 16857 proteins in 1270 species: Archae - 88; Bacteria - 3040; Metazoa - 11915; Fungi - 3137; Plants - 1371; Viruses - 424; Other Eukaryotes - 13285 (source: NCBI BLink). )

HSP 1 Score: 456.1 bits (1172), Expect = 4.3e-128
Identity = 223/345 (64.64%), Postives = 279/345 (80.87%), Query Frame = 0

Query: 95  RKATKRKARNKEEANSAEKENRKESTTATATAPTRAKRVKASKPHSEPEFFEDKRNLEDL 154
           +K   R+   ++ A   + E + E          +AK+ +A+K   EP +FE+KR+LEDL
Sbjct: 96  KKPVARRGGKRKRATKKDTEIKDEKKPV-----PKAKKPRAAKVKEEPVYFEEKRSLEDL 155

Query: 155 WKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSFKGESK 214
           WK AFPVGTEWDQLD +Y++NW+F NLE A EEGGKLYG+KVY+FG TEPQLV +KG +K
Sbjct: 156 WKVAFPVGTEWDQLDALYEFNWDFQNLEEALEEGGKLYGKKVYVFGCTEPQLVPYKGANK 215

Query: 215 VICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRVDKLKS 274
           ++ +P VV + SPFPPSDKIGI SVQRE EEIIPMK+MKM W+PYIP+E RD +VDK+ S
Sbjct: 216 IVHVPAVVVIESPFPPSDKIGITSVQREVEEIIPMKKMKMDWLPYIPIEKRDRQVDKMNS 275

Query: 275 QIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEPKPIFC 334
           QIF L CTQRR+AL+H+K D++KK+EYCLPYFYQPFKEDELEQSTEVQI+FP+EP P+ C
Sbjct: 276 QIFTLGCTQRRSALRHMKEDQLKKFEYCLPYFYQPFKEDELEQSTEVQIMFPSEP-PVVC 335

Query: 335 EFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKVLQEMS 394
           EFDWE DELQEF DKL+EEE L  +Q D FKE+VKE+VR AKKANREA++ARKK ++EMS
Sbjct: 336 EFDWEFDELQEFVDKLVEEEALPAEQADEFKEYVKEQVRAAKKANREAKDARKKAIEEMS 395

Query: 395 EETKAAFEKMRFYKFYPVQTQDSPDISNVKAPFINRYYGKAHEAV 440
           E+TK AF+KM+FYKFYP  + D+PD+S V++PFINRYYGKAHE +
Sbjct: 396 EDTKQAFQKMKFYKFYPQPSPDTPDVSGVQSPFINRYYGKAHEVL 434

BLAST of Sgr026097 vs. TAIR 10
Match: AT5G64910.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G10010.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 331.3 bits (848), Expect = 1.6e-90
Identity = 181/335 (54.03%), Postives = 232/335 (69.25%), Query Frame = 0

Query: 102 ARNKEEANSA-------EKENRKESTTATA----TAPTRAKRVKASKPH-SEPEFFEDKR 161
           A  KEEA  A        +  RK  T   A    + P   KR K +K   SEPE+FE+KR
Sbjct: 125 ASQKEEAKGASSSEPQLRRGKRKRGTKTEAEKKVSTPRAKKRAKTTKAQASEPEYFEEKR 184

Query: 162 NLEDLWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSF 221
           NLEDLWKA F VGTEWDQ D + ++NW+F+NLE A EEGG+LYG++VY+FG TE   V++
Sbjct: 185 NLEDLWKATFSVGTEWDQQDALNEFNWDFTNLEEALEEGGELYGKQVYVFGCTESHSVTY 244

Query: 222 KGESKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRV 281
           K E+K + +PVVV + SP PPSD+IG+ SVQ E  EII MK MKM WVPYIPLE RD +V
Sbjct: 245 KDENKDVLVPVVVCIDSPIPPSDEIGVASVQGEVGEIIAMKTMKMAWVPYIPLEQRDRQV 304

Query: 282 DKLKSQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEP 341
           D     IFIL CTQRR+ALKHL  DRVKK+ YCLPY   P+K D+ E+ST V+I+FP+EP
Sbjct: 305 DNKNFPIFILGCTQRRSALKHLPDDRVKKFNYCLPYINNPYKVDDSEKSTVVKIMFPSEP 364

Query: 342 KPIFCEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKV 401
            P+ CE+DW    ++EFTD LI EE L  +QK AF+EFVKEK  +A  A   A+EA +K 
Sbjct: 365 -PVECEYDWVKSVIEEFTDSLINEEVLLPEQKVAFEEFVKEKSDKAMAAYDTAQEALEKA 424

Query: 402 LQEMSEETKAAFEKMRFYKFYPVQTQDSPDISNVK 425
            + +SEETK A+++MR YKFYP+ + D+P  + ++
Sbjct: 425 KEGLSEETKKAYQEMRLYKFYPLPSPDTPHTAGIE 458

BLAST of Sgr026097 vs. TAIR 10
Match: AT5G64910.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G10010.1). )

HSP 1 Score: 324.7 bits (831), Expect = 1.5e-88
Identity = 180/335 (53.73%), Postives = 231/335 (68.96%), Query Frame = 0

Query: 102 ARNKEEANSA-------EKENRKESTTATA----TAPTRAKRVKASKPH-SEPEFFEDKR 161
           A  KEEA  A        +  RK  T   A    + P   KR K +K   SEPE+FE+KR
Sbjct: 125 ASQKEEAKGASSSEPQLRRGKRKRGTKTEAEKKVSTPRAKKRAKTTKAQASEPEYFEEKR 184

Query: 162 NLEDLWKAAFPVGTEWDQLDFVYQYNWNFSNLENAFEEGGKLYGEKVYLFGSTEPQLVSF 221
           NLEDLWKA F VGTEWDQ D + ++NW+F+NLE A EEGG+LYG++VY+FG TE    ++
Sbjct: 185 NLEDLWKATFSVGTEWDQQDALNEFNWDFTNLEEALEEGGELYGKQVYVFGCTE---FTY 244

Query: 222 KGESKVICIPVVVAVVSPFPPSDKIGINSVQREAEEIIPMKQMKMGWVPYIPLEDRDSRV 281
           K E+K + +PVVV + SP PPSD+IG+ SVQ E  EII MK MKM WVPYIPLE RD +V
Sbjct: 245 KDENKDVLVPVVVCIDSPIPPSDEIGVASVQGEVGEIIAMKTMKMAWVPYIPLEQRDRQV 304

Query: 282 DKLKSQIFILSCTQRRAALKHLKIDRVKKYEYCLPYFYQPFKEDELEQSTEVQIIFPAEP 341
           D     IFIL CTQRR+ALKHL  DRVKK+ YCLPY   P+K D+ E+ST V+I+FP+EP
Sbjct: 305 DNKNFPIFILGCTQRRSALKHLPDDRVKKFNYCLPYINNPYKVDDSEKSTVVKIMFPSEP 364

Query: 342 KPIFCEFDWELDELQEFTDKLIEEEELSEDQKDAFKEFVKEKVREAKKANREAREARKKV 401
            P+ CE+DW    ++EFTD LI EE L  +QK AF+EFVKEK  +A  A   A+EA +K 
Sbjct: 365 -PVECEYDWVKSVIEEFTDSLINEEVLLPEQKVAFEEFVKEKSDKAMAAYDTAQEALEKA 424

Query: 402 LQEMSEETKAAFEKMRFYKFYPVQTQDSPDISNVK 425
            + +SEETK A+++MR YKFYP+ + D+P  + ++
Sbjct: 425 KEGLSEETKKAYQEMRLYKFYPLPSPDTPHTAGIE 455

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022133379.19.5e-17895.06uncharacterized protein LOC111005966 [Momordica charantia][more]
KAF3965322.19.5e-16282.60hypothetical protein CMV_010480 [Castanea mollissima][more]
KAF3440453.12.3e-16085.84hypothetical protein FNV43_RR18737 [Rhamnella rubrinervis][more]
KAG7987386.11.2e-15985.76hypothetical protein I3843_03G131400 [Carya illinoinensis][more]
XP_023883507.12.6e-15984.93protein HEAT INTOLERANT 4-like [Quercus suber] >XP_023883508.1 protein HEAT INTO... [more]
Match NameE-valueIdentityDescription
A2RVJ86.1e-12764.64Protein HEAT INTOLERANT 4 OS=Arabidopsis thaliana OX=3702 GN=HIT4 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1BVU04.6e-17895.06uncharacterized protein LOC111005966 OS=Momordica charantia OX=3673 GN=LOC111005... [more]
A0A6J1C5021.3e-15984.68uncharacterized protein LOC111008415 OS=Momordica charantia OX=3673 GN=LOC111008... [more]
A0A2I4FW822.1e-15985.76protein HEAT INTOLERANT 4-like OS=Juglans regia OX=51240 GN=LOC109002555 PE=4 SV... [more]
A0A5N6QIW93.7e-15984.39Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_002744 PE=4 SV=1[more]
A0A0A0KUL76.2e-15983.82Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G037620 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G10010.14.3e-12864.64unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G64910.11.6e-9054.03unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G64910.21.5e-8853.73unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 365..399
NoneNo IPR availableCOILSCoilCoilcoord: 87..115
NoneNo IPR availableGENE3D6.10.250.2770coord: 92..177
e-value: 1.6E-15
score: 59.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 94..119
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 90..142
NoneNo IPR availablePANTHERPTHR33704:SF1PROTEIN HEAT INTOLERANT 4-RELATEDcoord: 94..437
IPR039313Protein HEAT INTOLERANT 4PANTHERPTHR33704PROTEIN HEAT INTOLERANT 4-RELATEDcoord: 94..437

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr026097.1Sgr026097.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900034 regulation of cellular response to heat