Sgr021985 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr021985
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function, DUF547
Location: tig00153870: 386198 .. 393780 (-)
RNA-Seq Expression: Sgr021985
Synteny: Sgr021985
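
The location field packs a contig name, two coordinates, and a strand into one string. As a minimal sketch, the following Python splits that string into its parts and derives the span of the gene on the contig. The helper name `parse_location` and the `Location` tuple are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a 'contig: start .. end (strand)' string (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = m.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("tig00153870: 386198 .. 393780 (-)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc.contig, loc.strand, span)
```

A strand of `-` means the gene is annotated on the reverse strand of the contig, so any sequence pulled from the contig at these coordinates would need to be reverse-complemented before comparison with the gene sequence.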
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAAGGTTGGTGCCTGTTTGGAAGCTCAGAAAAAGCAACTCCCTGATAGTCATGTTCAGAATTCCTTGAAGCAGGAGGTATGGTGAAAAACCCTGGACTTGTCAAAACAGTTGTGTGAAGAATCTACTAAGCTTGTATTTGGCTTTATCTTCTAGAATTTCCATGAGTTTAATCAGAAAAAAGAATAAAAATCAGGGAGAAACTGCAGGTAAATCGAAAAGCTGAAAGTATTTCGTGCCGGGCTGTGGATTATTTAGAAGCAATGTTTTTAAGCGAGGCGAAATGGTTTTGCTACTAACACTTTTGATTGAAAGACTAATCCCAAGTGTGTTTTTGAAATGCTTTTAAGCACTTCTAAGTCTTTGGACATACTTTTTTGATATGTAACCAAACACTACGAACTTTGAAAAGTACTTTTGATATGTAATCGAACTTCTAGGTTGGATCAAGTTTGCCTTGGATATTTCTATGGCTTATACTTGTATTTAGAACACATTTGTGCCTGTATTGTGGATTATTTACGAGTAATATCTTTTAGCGAAATAGTTCTCTCACTACACTTTTGATAGAAAGACTAATCCCAAGTGCTTTTTAAAGTGCTTTTAAACATTTCAAAGTCTTCAGAAATACTTTATGAAATGTAACTAAACACTACTTTGAAAAGTACTTTTGATGTGTAACCAAATGCATTTTTCAATCGAACTTCTAGGTCGGATTAGGTTCATTTATGATATTCTAGCTTATATTTAGAACATATTGTCTACTATTCTTCTACAGATTTTACAGCTTCAAGAACAACTACAGAGCCAATTTGTCATTCGTCATGCCTTGGAGAAGGCAATGAACTTTCAGCCTCTCTCACTTGATTCGGCAACCGAAAACTCGATCCCGAAGGTAACTTCATGCCTGAACTTCCTAGTTCCTATGCATACATGTAATCTTATCTGGTTGTATTGTTGTGTATGTGTATCATTGAGAGTGCTCATATGTTTGGACATCCACTTTGTTGCCTAGTTTGCTTTAGCCACCAATTCCATCAAATGCTTTGAAGTGTAATGTGCATAGTCTTAACTCTAGTTCTCATCGATGTCAGGCTGCGATGGAACTGATTAAGCAAATCGCAGTCTTGGAGATAGAAGTTGTTTACTTGGAAAAATATCTTCTGTCACTATATCGTCGAACGTTCAAGCAACAAGTATCCTCTTTTTCAACCATGGATGATCAGCTTGAATCCTATTCTGGGCCTCATATTGTGATAGACAGAGAACATTCTTTCATTCATTCTGACCATATCGTGTCGCCACAAACTTCATTGAGCAATCAATCAAAAGGAAGAAATGAAGTTGAGGAAGCGGAGAAGCTGTTACACTTTGGTCGCAGCTATTCATCTCTTTTGCAGAGATCGCCTGGTTCATCTAAAAACTACCCTCTGTCAAAGTATATGGCTAAAGCAGTAGATTCATACCATTCCCTTCCATTATCAATGCTGGAGGTAACTTGATGTTGATACATCTTGGACAAGGTAGAACTTAGAATATGCATCTGTTTCTCATTTGGGGATTTCATATTGTTTGTTCAGCAATCTCAGAGTGATGCTTCAAATTCTCTGAGCCTCAAGGAGCATCCCGGTGCCTGTATACCTGATCAAGCACATGTGTCGCCGAACTGGCTTTCGGAGGAGATGATCAAGTCTATCTCTGCAATATACTGTGAACTTGCAGAACCTCCTTTGATAAATCATAACAATCCTTCTCCAATCACACCATTGTCATCCATGTATGAGCTTTCTTCACGAGACTTAGGCAGCATGAGGAACTACGAGAAATTTGCGTTGTTCAACTCGCATTTTGATAACCCTTTTCACATTGAAGAATTTAGTGCACCATACTACACAATGTTGAAGGTGCAATGGATTTCTAGAGATAGAAAGAAGGACTCAGATATCAGCCACATGCTACAAGGCTTCAGGTGACTCCTCATTCAAAGAAACCAATCT
GTTTATTATGCCATGACTTTGATGTTCTTCATTCATTGTTTGTTGAAATCTCCACTGCACTGTCCGCAGGTCGTTTATTTATCGGCTCAAAGAAGTTGATCTCAAAGCGATGAAACACAAGGAAAGGCTCGCGTTTTGGATTAATGTACACAACACACTTGTAATGCATGTAAGTTCTAAACATAGCACAAACTATGTATATTTTTCATTTGATAACATCACAAAGCATACAAGTAAATGTTTCATATACATTGTTCCACAGGCATATTTGCAATATGGGATTCCCAAAAATAGTTTGAAGAGAATATCGTTGATACAGAAGGTGAGGATGGCTTCTGTTCTTTCAATCAAACAAAAATAAAATGGTATAAATGCTTCTTTCATTAGCTGCTGCACTTTTCAGGCTGCATATAATGTTGGGGGTCACATAATAAGTGTAGATATGATACAAAGCTCAATTCTCGGGTGTCGTTTGCCTCGTTTGGGACAGGTTGTTATCATCTCAGAATAATGTTTCTGTTCTCAGAATTTGTCCTATCAATATCAGTTCATTCAATATCTGACTGGTTTGCTACTTGATTAGTGGCTGCACCTGTTCCTCTCTTCAAAAACAAAATTTAAGGTTAATGATGCACTGAAATCCTTTTCAATCAACCACCCCGAACCTCGGTTATACTTCGCTCTATGTTGCGGGAGCCATTCTGATCCAGCGGTACGTTAGCATCTGATAATTACACAAACAACTATGTTGAAACATTCCAGTATTGGAGTCAATCCAAGCCAAACTATTTTACTTCTAAGAAAACCGGATTTGGCACAGTTTTGAACTTGCTGCTCTGTTATACGCAGGTCCGTATCTATACGGCTAAGAGGGTGAATGAGGAGCTGGAGGTTGCAAAGAAGACTACATCCTTTCAAATTTGAGAACACACAAAGGGCAGAGAATTCTACTTCCAAAGATTGTAGAGTCTTTTGCCAAGATTCAGGTTCATGCCTGGAAGATTTGGTGGACATTGTGGAGCGTTTAAAACCCGACGGGCAGGCAAACGACATTCAGCAGCAGCAACGGAAAAAGATTTGGAAAAGTATTGGGTGGATACCTCACAACTTCACCTTCAGCTTTCTGCTGTCCAAAGAATTGGCATGCCAGTCCCTGCCCTGATAGTTCCATTTCCAAGTTTGTTTGTTTTTCTCAACAGCTTCGAAACAGATGGCTCGAATCTGAAGCGTTCTCATTGAGCAAGATTCGATCTCGGAGCTACATCAACTCATCATTTGGCGTTCAAGGCCGGAAAGTTAAGGGCTACATCCATTTACGTATGGTAATTGCAGCTTGGAATGACAGATGAATCATAGCATAGCCTTTGTGGTTGAGAAGAGTTTGAAAAAGAAAAGAAATGGCTTACCTAGCAGGCTCATGATATAGACTCCAAAAGACAAAAAAATGGTGATTTTGGAGGAAAAACTTGAGCTTGCTTGGTTAGTCATAACTGTAGCAAGAATAGAAGTAGCAGGTTATGATTCAACTCTCATCCTAAGATGATGTCTTGTCTGATATCTAGTGCTGTTGATGTAATGTATCAAAATGCTGCAATGTTTTAAGACTTTGATTTTGACAATCTGCTCTGAGGAAGCATGCCAATTGATTTTTTTTTTTTTTAATTTGGGAAAATTGAGGATAGCTTCATTTACTTCAACTTTTTGAACATGAAATTGTGAATGGACTTTGAAGTGAAATGTGATTCTTTTTTTAAGTTTATTTCTTATCTTAGCCATATGGCATCATTTTTTTTTTTTGGGGAGAATATTGGCATCTAATTTTTGAAAGACTCTTTGAAAATTATAGTGATAAATTGTAAATAATATCTTTACACTCTAGGAATTACATTATATTCTTTGAGTTTACTTTTTTTTTGTTAATTTGGAAAGGCTAACACGATAGAAGTACACGAATTTAAAAGTAATGTACAACTTTATTATTAATTATATTAATGAGTAAA
GCAACAATTTTCTAAGTGCAAGAGTTTATGATTTTAAAAGAATAATTCAGTGCTATTCTGACTTGGTTTTATAATATAATATTATTTATTTTGAATTTAAACCTGTATTGTTTTATTTTTAATTTTTCTCAAAATTATTATTCTTATTTATATATCTATTTTTTTTAATTTTAGCTGACGTAAGATTATGTTATGATTTCAATAAATACATAATTAAAGTAAATAAGGTAAACTCAAATTACTAATCTAGTGGAATTAAACTTTTTCGAAAGAATTAAACCTTGAAAAAGGTTAATTATCAATAAAGTGCATATTATTATATTAAAAATCGATTGTTTAATTTTCATCTTATAATTATTGAACTAAAAAAAAAAAAAGGTGAATAAATACTAAGTATGATATATAATATAGTTAAAAATCATTGGTTCTGAATTTTATTTTTTATTGTCTAAATTCAAAATATCATTTGGTGAACTTTCAAATATCTATTGTCTTGAATTTAAATAATTTTCTTATTTATTTAAAAATAATATGGTATATGTATAGATGTAAGCAAGCAGTATCAGTCATAATTGAGTAATAATCAGTAATAATGATCAAAAAATATAAAAAAATGTATTTTAAATCAATAAAAAACATTGAAGTCATACAAAATAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTAAATAAATAAATAAATATATGAAGAGGAGAGAGAAGGAAATTGGCCTCGAAAGTCGAATCCCAACAAAGAAGCACCCGCCACAGAGTCATAGCTTGGCCGTTCTGTGAGACCCTTCCTTTCCTCCATTGTCATCGATTTCAAACCCCAACCCCACTTTCAGAACTAACCCTAATACGCCATTATCGTTCCTCAGAAACCCTCTTGCAGATAGGGACGCGGTTGGGCTGGGTTCGGGGCAGATGGAAGAACAAAGCACGGCTATAATTTTAGCAAGAGCAATGGAGCTGAGGCTGAAGATTAGAAGCTCTGTTAACACCACGACGACGGCCAGTTCGACGGTGACTTCCCAGGAAATTGGGGATGATCGGTCCGCCGTAGATGGAAATGGGGTTGCGGGATATAGTGGCACTGGTTCACGGCGGACTGAGGCCGATGCGAGTGGGGAGGCGGAGGAAGATGACGAAGCGGTGAGACTTTTGAATATTGCGATGCGCTCGAGTCTCTTGAGAACCAGCTCTCCTCCTTGCAGGTTTGCCCTTAAATTAACTAACCTTGAATTGATTTTACTTTCGTTGTTTCACGCATGAAGATTGAAGAACATGTTCCGATTCGATTAGCCACTTTCTTCATCATTTTTCTTGTTCGCGCTGATAATGTTGAAACTTTCTGCGCCTTGTATGACTGATGCAGACCTGGTTGATTCTGTTGATCAACAATTACGTTTGGACTATGATTGCCCTCTCAATTCCTCTAGTAGAATCGATGATGGTTTTGAAATACTGAAAAAATGAAGTGCAAAGAGACCTATACAGAGCTTTTTTCTTCTTCTTTTGTTTTTTGGGATGACGGGGTGTCAGCCCACTCTTTACATTATGCTCATGGCCCATCACACTACCCGAAAGGGGTCGTTCACCCATGCCAGTTCTAGTAGACAAGTATTTTAACTTATTACCAGGGCACTCCTTGGGACCTATACAGAGTAATTTAGAAAACATATGACCATCTTTTTAGATACAGAACTGTTTCTTCATTTGAAACTGTTCCTAAACGTACACTAGTAGTCTGAAAGCTTGCATATCAACATGCTTTTTACTTCTGTGGTAGTTGAGTAAAAAAAGGAATGGGGAGCATGATTCTCCCTCAATTTGAACAGCTTATTTTATTCATTTTCTAATGCTATAAGTAGCTGGCCAGTTGTAGCTGATGGTTTTGGTGCTTCAGGTTAGGCAACTTGTGGGGTTGTTCAAACCTTCAACACCCAACTTCACTGGCATGTTAATTGATATTGCTTGGATCTGTCTTTCA
ATGAGTCAGTATTAGTTATTTACCAAACTATTATTTGTTCTGAGGTGTCTATGCACTTGTCCATGTGAGCTTGTTCTTGTTACGACTATTGGTGGATTTCTCCTATCATCAACACCGTGTGCTGGATTCCATCTTTTTTTCTTTTTGAATTGTGCATATTTAGTATATAAGTGTATCAAATTCTCAGAAACCTGGTCCTTTACTGTCAATTGAATCTAGTATAAATGTTAAACTTCTAGGTAAGCTTGCCTATACATGATATTGAAAAGATTCTATATGTTAAATTTTATTTTAAAAAAAAGATTCTGTATGCTAATTATCCAAAAAGTACAAGGAGAGTGAGGATAGGTTATCCTTCACCAAAGCATCAAAGGGGACCGATAGAAAGCTCTCCCATTGGTTGCATAGAAAAAAAATGAAAAAGAGGAATTTAACAAAATTAGATAGAGAACTCCATTTACAAGCTAAAAATATATACTCGTTAATGTTAAAATCAGTTTACTGCCGATTGTTTTATCATCATTCAGGAAAAAGCTCTTTCACACAAGTATGGAGTTACTTCTTGGCTTATACATCGATCAATCTGGTGGCTAAGATATGTTAAATGTTTTCGATTTTGGCGGGGTTTGGTATTCTTCTTGGCTCAATTCAAGCACTGCACACAGAAGCTATTAAATTTATTTCTTAATTTTGACCTCCATGTTGTCTCTTCCTTCAGTTTGTAGTTTTAGAAGCCATTTTCAACAAGAATCTAAAGTTGGACGACAAAAAGTTTTTATGTTCAATATAATGGACCTAGAGAGAGACTAGATTTAGTTTCGGTGCAACAACATTTGTAGCATGGATGAAATACTATTTTATAGAACATTGTAATGTACTGGAACCATTCATTTAACCTTCTAATACTTCTATCTTTATCTTTTGATCTGCAGGATTTACAACAACGGCAAAGGTATGAGAAAGAAGCAGCCCTTTCCGAGATTGAGCATAGTCGTAAGATTTTACTAGATAAACTGAAGAAGTACAAAGGGGAGGATTTGGAAGTGATACATGAGGCTTCAGCTTTTGCTGGGGACACAGTGCAGCACAACCAGGATCTCATGCTTCCGCCATATCCAAGCCATCCTCTTCATTCCCCTTTAGGTAATGGCCACGTACATCCCTTCCCTTCTGGACACAAGTCTGTGAGTAATGGGCTAAAGGACATTGCGACAAATAAAGCTACAAAGGAACCCAATGAATCAGAAAGAAAATGCTCGCAAACGGATTCCAGGAACTCGAGGAATGGATTGGGATCTTTTGTTAGTGTAGCTGCAAAATCAGTGTTCACGATTGTTGGCATAGTATCCATATTGCACTTGTCTGGTTTTAGACCAAAGTTTGGGGGGAAAGTTGCTGCTTTGAAGGTTCTGGACCTTCTTCGACAGTCTGCAGCTGAAGATAATGGATCACACAATGAATGTCCTCCGGGTAAATTCCTCGTGATGGAAGATGGGGAGGCTCGATGCGTTGTGAAAGAGAGAATTGAAATTCCATTTTCTTCAGTTGTGGCTAAACCAGATGTAAACTATGGATGCGGGTAA

mRNA sequence

ATGGAAAAGGTTGGTGCCTGTTTGGAAGCTCAGAAAAAGCAACTCCCTGATAGTCATGTTCAGAATTCCTTGAAGCAGGAGATTTTACAGCTTCAAGAACAACTACAGAGCCAATTTGTCATTCGTCATGCCTTGGAGAAGGCAATGAACTTTCAGCCTCTCTCACTTGATTCGGCAACCGAAAACTCGATCCCGAAGGCTGCGATGGAACTGATTAAGCAAATCGCAGTCTTGGAGATAGAAGTTGTTTACTTGGAAAAATATCTTCTGTCACTATATCGTCGAACGTTCAAGCAACAAGTATCCTCTTTTTCAACCATGGATGATCAGCTTGAATCCTATTCTGGGCCTCATATTGTGATAGACAGAGAACATTCTTTCATTCATTCTGACCATATCGTGTCGCCACAAACTTCATTGAGCAATCAATCAAAAGGAAGAAATGAAGTTGAGGAAGCGGAGAAGCTGTTACACTTTGGTCGCAGCTATTCATCTCTTTTGCAGAGATCGCCTGGTTCATCTAAAAACTACCCTCTGTCAAAGTATATGGCTAAAGCAGTAGATTCATACCATTCCCTTCCATTATCAATGCTGGAGCAATCTCAGAGTGATGCTTCAAATTCTCTGAGCCTCAAGGAGCATCCCGGTGCCTGTATACCTGATCAAGCACATGTGTCGCCGAACTGGCTTTCGGAGGAGATGATCAAGTCTATCTCTGCAATATACTGTGAACTTGCAGAACCTCCTTTGATAAATCATAACAATCCTTCTCCAATCACACCATTGTCATCCATGTATGAGCTTTCTTCACGAGACTTAGGCAGCATGAGGAACTACGAGAAATTTGCGTTGTTCAACTCGCATTTTGATAACCCTTTTCACATTGAAGAATTTAGTGCACCATACTACACAATGTTGAAGGTGCAATGGATTTCTAGAGATAGAAAGAAGGACTCAGATATCAGCCACATGCTACAAGGCTTCAGGTCGTTTATTTATCGGCTCAAAGAAGTTGATCTCAAAGCGATGAAACACAAGGAAAGGCTCGCGTTTTGGATTAATGTACACAACACACTTGTAATGCATGCATATTTGCAATATGGGATTCCCAAAAATAGTTTGAAGAGAATATCGTTGATACAGAAGGCTGCATATAATGTTGGGGGTCACATAATAAGTGTAGATATGATACAAAGCTCAATTCTCGGGTGTCGTTTGCCTCGTTTGGGACAGTGGCTGCACCTGTTCCTCTCTTCAAAAACAAAATTTAAGGTTAATGATGCACTGAAATCCTTTTCAATCAACCACCCCGAACCTCGGTTATACTTCGCTCTATGTTGCGGGAGCCATTCTGATCCAGCGGTCCGTATCTATACGGCTAAGAGGGTGAATGAGGAGCTGGAGAGTCTTTTGCCAAGATTCAGGTTCATGCCTGGAAGATTTGGTGGACATTGTGGAGCGTTTAAAACCCGACGGGCAGGCAAACGACATTCAGCAGCAGCAACGGAAAAAGATTTGGAAAAGTATTGGGTGGATACCTCACAACTTCACCTTCAGCTTTCTGCTGTCCAAAGAATTGGCATGCCAGTCCCTGCCCTGATAGTTCCATTTCCAACAAGATTCGATCTCGGAGCTACATCAACTCATCATTTGGCGTTCAAGGCCGGAAAGTTAAGGGCTACATCCATTTACGTATGGAGAGAGAAGGAAATTGGCCTCGAAAGTCGAATCCCAACAAAGAAGCACCCGCCACAGAGTCATAGCTTGGCCGTTCTAAACCCTCTTGCAGATAGGGACGCGGTTGGGCTGGGTTCGGGGCAGATGGAAGAACAAAGCACGGCTATAATTTTAGCAAGAGCAATGGAGCTGAGGCTGAAGATTAGAAGCTCTGTTAACACCACGACGACGGCCAGTTCGACGGTGACTTCCCAGGAAATTGGGGATGATCGGTCCGCCGTAGATGGAAATGGGGTTGCGGGATATAGTGGCACTGGTTCACG
GCGGACTGAGGCCGATGCGAGTGGGGAGGCGGAGGAAGATGACGAAGCGGATTTACAACAACGGCAAAGGTATGAGAAAGAAGCAGCCCTTTCCGAGATTGAGCATAGTCGTAAGATTTTACTAGATAAACTGAAGAAGTACAAAGGGGAGGATTTGGAAGTGATACATGAGGCTTCAGCTTTTGCTGGGGACACAGTGCAGCACAACCAGGATCTCATGCTTCCGCCATATCCAAGCCATCCTCTTCATTCCCCTTTAGGTAATGGCCACGTACATCCCTTCCCTTCTGGACACAAGTCTGTGAGTAATGGGCTAAAGGACATTGCGACAAATAAAGCTACAAAGGAACCCAATGAATCAGAAAGAAAATGCTCGCAAACGGATTCCAGGAACTCGAGGAATGGATTGGGATCTTTTGTTAGTGTAGCTGCAAAATCAGTGTTCACGATTGTTGGCATAGTATCCATATTGCACTTGTCTGGTTTTAGACCAAAGTTTGGGGGGAAAGTTGCTGCTTTGAAGGTTCTGGACCTTCTTCGACAGTCTGCAGCTGAAGATAATGGATCACACAATGAATGTCCTCCGGGTAAATTCCTCGTGATGGAAGATGGGGAGGCTCGATGCGTTGTGAAAGAGAGAATTGAAATTCCATTTTCTTCAGTTGTGGCTAAACCAGATGTAAACTATGGATGCGGGTAA

Coding sequence (CDS)

ATGGAAAAGGTTGGTGCCTGTTTGGAAGCTCAGAAAAAGCAACTCCCTGATAGTCATGTTCAGAATTCCTTGAAGCAGGAGATTTTACAGCTTCAAGAACAACTACAGAGCCAATTTGTCATTCGTCATGCCTTGGAGAAGGCAATGAACTTTCAGCCTCTCTCACTTGATTCGGCAACCGAAAACTCGATCCCGAAGGCTGCGATGGAACTGATTAAGCAAATCGCAGTCTTGGAGATAGAAGTTGTTTACTTGGAAAAATATCTTCTGTCACTATATCGTCGAACGTTCAAGCAACAAGTATCCTCTTTTTCAACCATGGATGATCAGCTTGAATCCTATTCTGGGCCTCATATTGTGATAGACAGAGAACATTCTTTCATTCATTCTGACCATATCGTGTCGCCACAAACTTCATTGAGCAATCAATCAAAAGGAAGAAATGAAGTTGAGGAAGCGGAGAAGCTGTTACACTTTGGTCGCAGCTATTCATCTCTTTTGCAGAGATCGCCTGGTTCATCTAAAAACTACCCTCTGTCAAAGTATATGGCTAAAGCAGTAGATTCATACCATTCCCTTCCATTATCAATGCTGGAGCAATCTCAGAGTGATGCTTCAAATTCTCTGAGCCTCAAGGAGCATCCCGGTGCCTGTATACCTGATCAAGCACATGTGTCGCCGAACTGGCTTTCGGAGGAGATGATCAAGTCTATCTCTGCAATATACTGTGAACTTGCAGAACCTCCTTTGATAAATCATAACAATCCTTCTCCAATCACACCATTGTCATCCATGTATGAGCTTTCTTCACGAGACTTAGGCAGCATGAGGAACTACGAGAAATTTGCGTTGTTCAACTCGCATTTTGATAACCCTTTTCACATTGAAGAATTTAGTGCACCATACTACACAATGTTGAAGGTGCAATGGATTTCTAGAGATAGAAAGAAGGACTCAGATATCAGCCACATGCTACAAGGCTTCAGGTCGTTTATTTATCGGCTCAAAGAAGTTGATCTCAAAGCGATGAAACACAAGGAAAGGCTCGCGTTTTGGATTAATGTACACAACACACTTGTAATGCATGCATATTTGCAATATGGGATTCCCAAAAATAGTTTGAAGAGAATATCGTTGATACAGAAGGCTGCATATAATGTTGGGGGTCACATAATAAGTGTAGATATGATACAAAGCTCAATTCTCGGGTGTCGTTTGCCTCGTTTGGGACAGTGGCTGCACCTGTTCCTCTCTTCAAAAACAAAATTTAAGGTTAATGATGCACTGAAATCCTTTTCAATCAACCACCCCGAACCTCGGTTATACTTCGCTCTATGTTGCGGGAGCCATTCTGATCCAGCGGTCCGTATCTATACGGCTAAGAGGGTGAATGAGGAGCTGGAGAGTCTTTTGCCAAGATTCAGGTTCATGCCTGGAAGATTTGGTGGACATTGTGGAGCGTTTAAAACCCGACGGGCAGGCAAACGACATTCAGCAGCAGCAACGGAAAAAGATTTGGAAAAGTATTGGGTGGATACCTCACAACTTCACCTTCAGCTTTCTGCTGTCCAAAGAATTGGCATGCCAGTCCCTGCCCTGATAGTTCCATTTCCAACAAGATTCGATCTCGGAGCTACATCAACTCATCATTTGGCGTTCAAGGCCGGAAAGTTAAGGGCTACATCCATTTACGTATGGAGAGAGAAGGAAATTGGCCTCGAAAGTCGAATCCCAACAAAGAAGCACCCGCCACAGAGTCATAGCTTGGCCGTTCTAAACCCTCTTGCAGATAGGGACGCGGTTGGGCTGGGTTCGGGGCAGATGGAAGAACAAAGCACGGCTATAATTTTAGCAAGAGCAATGGAGCTGAGGCTGAAGATTAGAAGCTCTGTTAACACCACGACGACGGCCAGTTCGACGGTGACTTCCCAGGAAATTGGGGATGATCGGTCCGCCGTAGATGGAAATGGGGTTGCGGGATATAGTGGCACTGGTTCACG
GCGGACTGAGGCCGATGCGAGTGGGGAGGCGGAGGAAGATGACGAAGCGGATTTACAACAACGGCAAAGGTATGAGAAAGAAGCAGCCCTTTCCGAGATTGAGCATAGTCGTAAGATTTTACTAGATAAACTGAAGAAGTACAAAGGGGAGGATTTGGAAGTGATACATGAGGCTTCAGCTTTTGCTGGGGACACAGTGCAGCACAACCAGGATCTCATGCTTCCGCCATATCCAAGCCATCCTCTTCATTCCCCTTTAGGTAATGGCCACGTACATCCCTTCCCTTCTGGACACAAGTCTGTGAGTAATGGGCTAAAGGACATTGCGACAAATAAAGCTACAAAGGAACCCAATGAATCAGAAAGAAAATGCTCGCAAACGGATTCCAGGAACTCGAGGAATGGATTGGGATCTTTTGTTAGTGTAGCTGCAAAATCAGTGTTCACGATTGTTGGCATAGTATCCATATTGCACTTGTCTGGTTTTAGACCAAAGTTTGGGGGGAAAGTTGCTGCTTTGAAGGTTCTGGACCTTCTTCGACAGTCTGCAGCTGAAGATAATGGATCACACAATGAATGTCCTCCGGGTAAATTCCTCGTGATGGAAGATGGGGAGGCTCGATGCGTTGTGAAAGAGAGAATTGAAATTCCATTTTCTTCAGTTGTGGCTAAACCAGATGTAAACTATGGATGCGGGTAA

Protein sequence

MEKVGACLEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELESLLPRFRFMPGRFGGHCGAFKTRRAGKRHSAAATEKDLEKYWVDTSQLHLQLSAVQRIGMPVPALIVPFPTRFDLGATSTHHLAFKAGKLRATSIYVWREKEIGLESRIPTKKHPPQSHSLAVLNPLADRDAVGLGSGQMEEQSTAIILARAMELRLKIRSSVNTTTTASSTVTSQEIGDDRSAVDGNGVAGYSGTGSRRTEADASGEAEEDDEADLQQRQRYEKEAALSEIEHSRKILLDKLKKYKGEDLEVIHEASAFAGDTVQHNQDLMLPPYPSHPLHSPLGNGHVHPFPSGHKSVSNGLKDIATNKATKEPNESERKCSQTDSRNSRNGLGSFVSVAAKSVFTIVGIVSILHLSGFRPKFGGKVAALKVLDLLRQSAAEDNGSHNECPPGKFLVMEDGEARCVVKERIEIPFSSVVAKPDVNYGCG
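
The protein sequence should follow from the coding sequence by codon-wise translation. The sketch below checks this for the first 30 nucleotides of the CDS, assuming the standard nuclear genetic code applies to this gene. The function name `translate` is illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (an assumption for this gene).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
cds_prefix = "ATGGAAAAGGTTGGTGCCTGTTTGGAAGCT"
print(translate(cds_prefix))
```

The result can be compared with the first ten residues of the protein sequence above. Running the same function over the full CDS would be the natural way to check the whole record.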
Homology
BLAST of Sgr021985 vs. NCBI nr
Match: XP_022135648.1 (uncharacterized protein LOC111007555 isoform X1 [Momordica charantia])

HSP 1 Score: 776.5 bits (2004), Expect = 2.4e-220
Identity = 409/473 (86.47%), Positives = 432/473 (91.33%), Query Frame = 0

Query: 1   MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSA 60
           ME  GA LEA+KKQLPDSHV QNSLKQEI QLQEQLQSQFVIRHALEKA+NFQP SLDSA
Sbjct: 10  MEHAGAYLEAKKKQLPDSHVLQNSLKQEIQQLQEQLQSQFVIRHALEKAINFQPPSLDSA 69

Query: 61  TENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHI 120
           TE+SIPKAAMELIKQIAVLE+EVVYLEKYLLSLYRRTFKQQVSS STMDD+LESYSGP  
Sbjct: 70  TESSIPKAAMELIKQIAVLELEVVYLEKYLLSLYRRTFKQQVSSSSTMDDRLESYSGPLF 129

Query: 121 VIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNY 180
           VI+ E  HSFIHSDHIVSPQTS  NQSKGRNEVEE EKL H  RSYSSLL+RSPGSS NY
Sbjct: 130 VIEGEHKHSFIHSDHIVSPQTSFGNQSKGRNEVEEPEKLSHLHRSYSSLLRRSPGSSTNY 189

Query: 181 PLSKYMAKAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKS 240
           PLSK +AKAVDSYHSLPLSMLEQSQSDASNS+SL EH GA +P++A  SPNW+SEEMIKS
Sbjct: 190 PLSKSVAKAVDSYHSLPLSMLEQSQSDASNSMSLGEHFGAHVPERAPKSPNWISEEMIKS 249

Query: 241 ISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI 300
           IS IYCELA+PPL+ NHNNPSPI+PLSSM ELSS+D LGSMRNYEK   FNS+F NPFHI
Sbjct: 250 ISLIYCELAQPPLMNNHNNPSPISPLSSMCELSSQDHLGSMRNYEK--SFNSNFGNPFHI 309

Query: 301 EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINV 360
           EEFS PY TMLKVQWISR+RKKDSDI+HMLQGFRS IYRLKEVDLKAMKH+E+LAFWINV
Sbjct: 310 EEFSVPYCTMLKVQWISRERKKDSDINHMLQGFRSLIYRLKEVDLKAMKHEEKLAFWINV 369

Query: 361 HNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHL 420
           HNTLVMHAYLQYGIPKNSLKR SLI KAAYNVGGHIISVDMIQSSILGC LPR GQWLHL
Sbjct: 370 HNTLVMHAYLQYGIPKNSLKRTSLILKAAYNVGGHIISVDMIQSSILGCHLPRSGQWLHL 429

Query: 421 FLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
           FLSSKTKFKVNDA KSF+INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Sbjct: 430 FLSSKTKFKVNDARKSFAINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 480
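
Each HSP summary reports identity and positive counts as a fraction of the alignment length, together with a percentage. As a rough sketch, the Python below parses one such summary line and recomputes the percentages from the raw counts. The function name `parse_hsp_stats` is hypothetical, and the pattern is written only for the line layout seen in this record.

```python
import re

# Matches "Identity = 409/473 (86.47%)" style fields. "Posi?tives" also accepts
# the misspelling "Postives" that appears in some exported BLAST summaries.
FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Return counts, alignment length, and reported/recomputed percentages."""
    stats = {}
    for label, matched, length, reported in FIELD.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = {
            "matched": int(matched),
            "length": int(length),
            "reported_pct": float(reported),
            "recomputed_pct": round(100 * int(matched) / int(length), 2),
        }
    return stats


line = "Identity = 409/473 (86.47%), Positives = 432/473 (91.33%), Query Frame = 0"
for name, values in parse_hsp_stats(line).items():
    print(name, values)
```

The denominator is the alignment length including gaps, not the length of the query protein, which is why hits covering only part of the query can still show high percentages.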

BLAST of Sgr021985 vs. NCBI nr
Match: XP_022135649.1 (uncharacterized protein LOC111007555 isoform X2 [Momordica charantia])

HSP 1 Score: 766.1 bits (1977), Expect = 3.3e-217
Identity = 406/473 (85.84%), Positives = 429/473 (90.70%), Query Frame = 0

Query: 1   MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSA 60
           ME  GA LEA+KKQLPDSHV QNSLKQEI QLQEQLQSQFVIRHALEKA+NFQP SLDSA
Sbjct: 10  MEHAGAYLEAKKKQLPDSHVLQNSLKQEIQQLQEQLQSQFVIRHALEKAINFQPPSLDSA 69

Query: 61  TENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHI 120
           TE+SIPKAAMELIKQIAVLE+EVVYLEKYLLSLYRRTFKQQVSS STMDD+LESYSGP  
Sbjct: 70  TESSIPKAAMELIKQIAVLELEVVYLEKYLLSLYRRTFKQQVSSSSTMDDRLESYSGPLF 129

Query: 121 VIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNY 180
           VI+ E  HSFIHSDHIVSPQTS  NQSKGRNEVEE EKL H  RSYSSLL+RSPGSS NY
Sbjct: 130 VIEGEHKHSFIHSDHIVSPQTSFGNQSKGRNEVEEPEKLSHLHRSYSSLLRRSPGSSTNY 189

Query: 181 PLSKYMAKAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKS 240
           PLSK +AKAVDSYHSLPLSMLE   SDASNS+SL EH GA +P++A  SPNW+SEEMIKS
Sbjct: 190 PLSKSVAKAVDSYHSLPLSMLE---SDASNSMSLGEHFGAHVPERAPKSPNWISEEMIKS 249

Query: 241 ISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI 300
           IS IYCELA+PPL+ NHNNPSPI+PLSSM ELSS+D LGSMRNYEK   FNS+F NPFHI
Sbjct: 250 ISLIYCELAQPPLMNNHNNPSPISPLSSMCELSSQDHLGSMRNYEK--SFNSNFGNPFHI 309

Query: 301 EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINV 360
           EEFS PY TMLKVQWISR+RKKDSDI+HMLQGFRS IYRLKEVDLKAMKH+E+LAFWINV
Sbjct: 310 EEFSVPYCTMLKVQWISRERKKDSDINHMLQGFRSLIYRLKEVDLKAMKHEEKLAFWINV 369

Query: 361 HNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHL 420
           HNTLVMHAYLQYGIPKNSLKR SLI KAAYNVGGHIISVDMIQSSILGC LPR GQWLHL
Sbjct: 370 HNTLVMHAYLQYGIPKNSLKRTSLILKAAYNVGGHIISVDMIQSSILGCHLPRSGQWLHL 429

Query: 421 FLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
           FLSSKTKFKVNDA KSF+INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Sbjct: 430 FLSSKTKFKVNDARKSFAINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 477

BLAST of Sgr021985 vs. NCBI nr
Match: XP_008454883.1 (PREDICTED: uncharacterized protein LOC103495193 [Cucumis melo])

HSP 1 Score: 713.8 bits (1841), Expect = 1.9e-201
Identity = 369/461 (80.04%), Positives = 401/461 (86.98%), Query Frame = 0

Query: 8   LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKA 67
           ++  K+Q+ D  VQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A
Sbjct: 4   VKGNKQQISDGDVQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEA 63

Query: 68  AMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSF 127
            MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTMDD+LESY  P+ VI+ EHS 
Sbjct: 64  EMELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSC 123

Query: 128 IHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAV 187
           IHSDHIVSP+T   NQSKGRN VEE EKL H  RS SSL QRS GSS+NY LSKYMAKAV
Sbjct: 124 IHSDHIVSPETLFDNQSKGRNVVEEPEKLSHLHRSNSSLSQRSLGSSRNYSLSKYMAKAV 183

Query: 188 DSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAE 247
           DSYHS PLSMLEQS+ D  +S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAE
Sbjct: 184 DSYHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAE 243

Query: 248 PPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK 307
           PPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFHIEEF APY TMLK
Sbjct: 244 PPLMNHNNPSPISPLSSMYELSSQDFGSMRNYEK--SLNSHFENPFHIEEFIAPYDTMLK 303

Query: 308 VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQY 367
           VQWISR+RKKDSDI+HMLQGFRS I+RLKEV LK MKH E+LAFWINVHNTLVMHAYLQY
Sbjct: 304 VQWISRERKKDSDINHMLQGFRSLIFRLKEVKLKVMKHDEKLAFWINVHNTLVMHAYLQY 363

Query: 368 GIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVND 427
           GIPK+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLPR GQWLHLFLSSKTKFKVND
Sbjct: 364 GIPKHCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVND 423

Query: 428 ALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
             KSF INHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LE
Sbjct: 424 VQKSFPINHPEPRLYFALCCGNLSDPAVRLYTAKRVNEQLE 462

BLAST of Sgr021985 vs. NCBI nr
Match: TYK06707.1 (uncharacterized protein E5676_scaffold13G00080 [Cucumis melo var. makuwa])

HSP 1 Score: 713.8 bits (1841), Expect = 1.9e-201
Identity = 369/461 (80.04%), Positives = 401/461 (86.98%), Query Frame = 0

Query: 8   LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKA 67
           ++  K+Q+ D  VQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A
Sbjct: 2   VKGNKQQISDGDVQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEA 61

Query: 68  AMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSF 127
            MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTMDD+LESY  P+ VI+ EHS 
Sbjct: 62  EMELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSC 121

Query: 128 IHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAV 187
           IHSDHIVSP+T   NQSKGRN VEE EKL H  RS SSL QRS GSS+NY LSKYMAKAV
Sbjct: 122 IHSDHIVSPETLFDNQSKGRNVVEEPEKLSHLHRSNSSLSQRSLGSSRNYSLSKYMAKAV 181

Query: 188 DSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAE 247
           DSYHS PLSMLEQS+ D  +S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAE
Sbjct: 182 DSYHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAE 241

Query: 248 PPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK 307
           PPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFHIEEF APY TMLK
Sbjct: 242 PPLMNHNNPSPISPLSSMYELSSQDFGSMRNYEK--SLNSHFENPFHIEEFIAPYDTMLK 301

Query: 308 VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQY 367
           VQWISR+RKKDSDI+HMLQGFRS I+RLKEV LK MKH E+LAFWINVHNTLVMHAYLQY
Sbjct: 302 VQWISRERKKDSDINHMLQGFRSLIFRLKEVKLKVMKHDEKLAFWINVHNTLVMHAYLQY 361

Query: 368 GIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVND 427
           GIPK+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLPR GQWLHLFLSSKTKFKVND
Sbjct: 362 GIPKHCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVND 421

Query: 428 ALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
             KSF INHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LE
Sbjct: 422 VQKSFPINHPEPRLYFALCCGNLSDPAVRLYTAKRVNEQLE 460

BLAST of Sgr021985 vs. NCBI nr
Match: XP_011658927.1 (uncharacterized protein LOC101203131 isoform X2 [Cucumis sativus] >KGN43981.1 hypothetical protein Csa_017702 [Cucumis sativus])

HSP 1 Score: 706.1 bits (1821), Expect = 4.0e-199
Identity = 367/461 (79.61%), Positives = 396/461 (85.90%), Query Frame = 0

Query: 8   LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKA 67
           ++  K+Q+ D   Q SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A
Sbjct: 4   VKGNKQQISDGDAQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEA 63

Query: 68  AMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSF 127
            MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTMDD+LESY  P+ VI+ EHS 
Sbjct: 64  EMELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSC 123

Query: 128 IHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAV 187
           IHSDHI SP+T   NQSKGRN VEE E L H  RS SSL QRS GSS+NY LSK MAKAV
Sbjct: 124 IHSDHIGSPETLFDNQSKGRNVVEEPENLSHLHRSNSSLSQRSLGSSRNYSLSKSMAKAV 183

Query: 188 DSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAE 247
           DSYHS PLSMLEQS+ D  +S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAE
Sbjct: 184 DSYHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAE 243

Query: 248 PPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK 307
           PPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFH EEF APY TMLK
Sbjct: 244 PPLMNHNNPSPISPLSSMYELSSQDFGSMRNYEK--SLNSHFENPFHTEEFIAPYDTMLK 303

Query: 308 VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQY 367
           VQWISR+RK DSDI+HMLQGFRS I+RLKEV LKAMKH E+LAFWINVHNTLVMHAYLQY
Sbjct: 304 VQWISRERKNDSDINHMLQGFRSLIFRLKEVKLKAMKHDEKLAFWINVHNTLVMHAYLQY 363

Query: 368 GIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVND 427
           GI K+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLPR GQWLHLFLSSKTKFKVND
Sbjct: 364 GISKHCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVND 423

Query: 428 ALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
             KSF INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Sbjct: 424 VQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 462

BLAST of Sgr021985 vs. ExPASy Swiss-Prot
Match: Q9XII1 (Plastid division protein PDV2 OS=Arabidopsis thaliana OX=3702 GN=PDV2 PE=1 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 1.0e-48
Identity = 136/311 (43.73%), Positives = 181/311 (58.20%), Query Frame = 0

Query: 609 EEQSTAIILARAMELRLKIRSSV-NTTTTASSTVTSQE---IGDDR-SAVDGNGVAGYSG 668
           +E+   +ILARA ELRLKI   + N++TT S      E    G+ R S + GN    +  
Sbjct: 3   DEEGIGLILARATELRLKISDCIDNSSTTVSDNGDGNEDLSPGEGRKSEIIGNQDKDFDS 62

Query: 669 TGSRRT-EADASG--------EAEEDDEADLQ---QRQRYEKEAALSEIEHSRKILLDKL 728
             S    EA+A          EA E   A LQ   QRQ+YEK+ ALSEI++SRK+LL+KL
Sbjct: 63  ISSEDVDEAEAERLLRIRDALEALESQLASLQNLRQRQQYEKQLALSEIDYSRKMLLEKL 122

Query: 729 KKYKGEDLEVIHEASAFAGDTVQHNQDLMLPPYPSHP---LHSPLGNGHVHPFPSGHKSV 788
           K+YKG+D EV+ E + FAG+ V +  DL+LPPYP HP   L     NG++   PS  KS 
Sbjct: 123 KEYKGKDFEVLRETTTFAGERVDYENDLLLPPYPVHPPLSLGLDNNNGYLSHLPSKKKSD 182

Query: 789 SNGLKDIATNKATKEPNESERKCSQTDSRNSRNGLGSFVSVAAKSVFTIVGIVSILHLSG 848
           +NG        +    NE+E K     S  S +G+  F+   AK V  I+G++S+L  SG
Sbjct: 183 ANGF------GSGHVRNEAEAKSPNGGSGGSSHGVIRFLGSVAKIVLPIIGVISLLSASG 242

Query: 849 FRPKFGGKVAALKVLDLLRQSAAEDNGSHNECPPGKFLVMEDGEARCVVKERIEIPFSSV 900
           + P+   + A+L +  LL   A     + N+CPPGK LV+EDGEARC+VKER+EIPF SV
Sbjct: 243 YGPEMRKRGASLNLFGLLPHRATRGKRTPNQCPPGKVLVIEDGEARCLVKERVEIPFDSV 302

BLAST of Sgr021985 vs. ExPASy TrEMBL
Match: A0A6J1C5D8 (uncharacterized protein LOC111007555 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007555 PE=4 SV=1)

HSP 1 Score: 776.5 bits (2004), Expect = 1.2e-220
Identity = 409/473 (86.47%), Positives = 432/473 (91.33%), Query Frame = 0

Query: 1   MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSA 60
           ME  GA LEA+KKQLPDSHV QNSLKQEI QLQEQLQSQFVIRHALEKA+NFQP SLDSA
Sbjct: 10  MEHAGAYLEAKKKQLPDSHVLQNSLKQEIQQLQEQLQSQFVIRHALEKAINFQPPSLDSA 69

Query: 61  TENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHI 120
           TE+SIPKAAMELIKQIAVLE+EVVYLEKYLLSLYRRTFKQQVSS STMDD+LESYSGP  
Sbjct: 70  TESSIPKAAMELIKQIAVLELEVVYLEKYLLSLYRRTFKQQVSSSSTMDDRLESYSGPLF 129

Query: 121 VIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNY 180
           VI+ E  HSFIHSDHIVSPQTS  NQSKGRNEVEE EKL H  RSYSSLL+RSPGSS NY
Sbjct: 130 VIEGEHKHSFIHSDHIVSPQTSFGNQSKGRNEVEEPEKLSHLHRSYSSLLRRSPGSSTNY 189

Query: 181 PLSKYMAKAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKS 240
           PLSK +AKAVDSYHSLPLSMLEQSQSDASNS+SL EH GA +P++A  SPNW+SEEMIKS
Sbjct: 190 PLSKSVAKAVDSYHSLPLSMLEQSQSDASNSMSLGEHFGAHVPERAPKSPNWISEEMIKS 249

Query: 241 ISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI 300
           IS IYCELA+PPL+ NHNNPSPI+PLSSM ELSS+D LGSMRNYEK   FNS+F NPFHI
Sbjct: 250 ISLIYCELAQPPLMNNHNNPSPISPLSSMCELSSQDHLGSMRNYEK--SFNSNFGNPFHI 309

Query: 301 EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINV 360
           EEFS PY TMLKVQWISR+RKKDSDI+HMLQGFRS IYRLKEVDLKAMKH+E+LAFWINV
Sbjct: 310 EEFSVPYCTMLKVQWISRERKKDSDINHMLQGFRSLIYRLKEVDLKAMKHEEKLAFWINV 369

Query: 361 HNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHL 420
           HNTLVMHAYLQYGIPKNSLKR SLI KAAYNVGGHIISVDMIQSSILGC LPR GQWLHL
Sbjct: 370 HNTLVMHAYLQYGIPKNSLKRTSLILKAAYNVGGHIISVDMIQSSILGCHLPRSGQWLHL 429

Query: 421 FLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
           FLSSKTKFKVNDA KSF+INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Sbjct: 430 FLSSKTKFKVNDARKSFAINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 480

BLAST of Sgr021985 vs. ExPASy TrEMBL
Match: A0A6J1C220 (uncharacterized protein LOC111007555 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111007555 PE=4 SV=1)

HSP 1 Score: 766.1 bits (1977), Expect = 1.6e-217
Identity = 406/473 (85.84%), Positives = 429/473 (90.70%), Query Frame = 0

Query: 1   MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSA 60
           ME  GA LEA+KKQLPDSHV QNSLKQEI QLQEQLQSQFVIRHALEKA+NFQP SLDSA
Sbjct: 10  MEHAGAYLEAKKKQLPDSHVLQNSLKQEIQQLQEQLQSQFVIRHALEKAINFQPPSLDSA 69

Query: 61  TENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHI 120
           TE+SIPKAAMELIKQIAVLE+EVVYLEKYLLSLYRRTFKQQVSS STMDD+LESYSGP  
Sbjct: 70  TESSIPKAAMELIKQIAVLELEVVYLEKYLLSLYRRTFKQQVSSSSTMDDRLESYSGPLF 129

Query: 121 VIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNY 180
           VI+ E  HSFIHSDHIVSPQTS  NQSKGRNEVEE EKL H  RSYSSLL+RSPGSS NY
Sbjct: 130 VIEGEHKHSFIHSDHIVSPQTSFGNQSKGRNEVEEPEKLSHLHRSYSSLLRRSPGSSTNY 189

Query: 181 PLSKYMAKAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKS 240
           PLSK +AKAVDSYHSLPLSMLE   SDASNS+SL EH GA +P++A  SPNW+SEEMIKS
Sbjct: 190 PLSKSVAKAVDSYHSLPLSMLE---SDASNSMSLGEHFGAHVPERAPKSPNWISEEMIKS 249

Query: 241 ISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI 300
           IS IYCELA+PPL+ NHNNPSPI+PLSSM ELSS+D LGSMRNYEK   FNS+F NPFHI
Sbjct: 250 ISLIYCELAQPPLMNNHNNPSPISPLSSMCELSSQDHLGSMRNYEK--SFNSNFGNPFHI 309

Query: 301 EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINV 360
           EEFS PY TMLKVQWISR+RKKDSDI+HMLQGFRS IYRLKEVDLKAMKH+E+LAFWINV
Sbjct: 310 EEFSVPYCTMLKVQWISRERKKDSDINHMLQGFRSLIYRLKEVDLKAMKHEEKLAFWINV 369

Query: 361 HNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHL 420
           HNTLVMHAYLQYGIPKNSLKR SLI KAAYNVGGHIISVDMIQSSILGC LPR GQWLHL
Sbjct: 370 HNTLVMHAYLQYGIPKNSLKRTSLILKAAYNVGGHIISVDMIQSSILGCHLPRSGQWLHL 429

Query: 421 FLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
           FLSSKTKFKVNDA KSF+INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Sbjct: 430 FLSSKTKFKVNDARKSFAINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 477

BLAST of Sgr021985 vs. ExPASy TrEMBL
Match: A0A5D3C4C9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G00080 PE=4 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 9.3e-202
Identity = 369/461 (80.04%), Positives = 401/461 (86.98%), Query Frame = 0

Query: 8   LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKA 67
           ++  K+Q+ D  VQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A
Sbjct: 2   VKGNKQQISDGDVQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEA 61

Query: 68  AMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSF 127
            MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTMDD+LESY  P+ VI+ EHS 
Sbjct: 62  EMELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSC 121

Query: 128 IHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAV 187
           IHSDHIVSP+T   NQSKGRN VEE EKL H  RS SSL QRS GSS+NY LSKYMAKAV
Sbjct: 122 IHSDHIVSPETLFDNQSKGRNVVEEPEKLSHLHRSNSSLSQRSLGSSRNYSLSKYMAKAV 181

Query: 188 DSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAE 247
           DSYHS PLSMLEQS+ D  +S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAE
Sbjct: 182 DSYHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAE 241

Query: 248 PPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK 307
           PPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFHIEEF APY TMLK
Sbjct: 242 PPLMNHNNPSPISPLSSMYELSSQDFGSMRNYEK--SLNSHFENPFHIEEFIAPYDTMLK 301

Query: 308 VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQY 367
           VQWISR+RKKDSDI+HMLQGFRS I+RLKEV LK MKH E+LAFWINVHNTLVMHAYLQY
Sbjct: 302 VQWISRERKKDSDINHMLQGFRSLIFRLKEVKLKVMKHDEKLAFWINVHNTLVMHAYLQY 361

Query: 368 GIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVND 427
           GIPK+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLPR GQWLHLFLSSKTKFKVND
Sbjct: 362 GIPKHCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVND 421

Query: 428 ALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
             KSF INHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LE
Sbjct: 422 VQKSFPINHPEPRLYFALCCGNLSDPAVRLYTAKRVNEQLE 460

BLAST of Sgr021985 vs. ExPASy TrEMBL
Match: A0A1S3BZ51 (uncharacterized protein LOC103495193 OS=Cucumis melo OX=3656 GN=LOC103495193 PE=4 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 9.3e-202
Identity = 369/461 (80.04%), Positives = 401/461 (86.98%), Query Frame = 0

Query: 8   LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKA 67
           ++  K+Q+ D  VQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A
Sbjct: 4   VKGNKQQISDGDVQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEA 63

Query: 68  AMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSF 127
            MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTMDD+LESY  P+ VI+ EHS 
Sbjct: 64  EMELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSC 123

Query: 128 IHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAV 187
           IHSDHIVSP+T   NQSKGRN VEE EKL H  RS SSL QRS GSS+NY LSKYMAKAV
Sbjct: 124 IHSDHIVSPETLFDNQSKGRNVVEEPEKLSHLHRSNSSLSQRSLGSSRNYSLSKYMAKAV 183

Query: 188 DSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAE 247
           DSYHS PLSMLEQS+ D  +S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAE
Sbjct: 184 DSYHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAE 243

Query: 248 PPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK 307
           PPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFHIEEF APY TMLK
Sbjct: 244 PPLMNHNNPSPISPLSSMYELSSQDFGSMRNYEK--SLNSHFENPFHIEEFIAPYDTMLK 303

Query: 308 VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQY 367
           VQWISR+RKKDSDI+HMLQGFRS I+RLKEV LK MKH E+LAFWINVHNTLVMHAYLQY
Sbjct: 304 VQWISRERKKDSDINHMLQGFRSLIFRLKEVKLKVMKHDEKLAFWINVHNTLVMHAYLQY 363

Query: 368 GIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVND 427
           GIPK+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLPR GQWLHLFLSSKTKFKVND
Sbjct: 364 GIPKHCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVND 423

Query: 428 ALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
             KSF INHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LE
Sbjct: 424 VQKSFPINHPEPRLYFALCCGNLSDPAVRLYTAKRVNEQLE 462

BLAST of Sgr021985 vs. ExPASy TrEMBL
Match: A0A0A0K861 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G075610 PE=4 SV=1)

HSP 1 Score: 706.1 bits (1821), Expect = 1.9e-199
Identity = 367/461 (79.61%), Positives = 396/461 (85.90%), Query Frame = 0

Query: 8   LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKA 67
           ++  K+Q+ D   Q SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A
Sbjct: 4   VKGNKQQISDGDAQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEA 63

Query: 68  AMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSF 127
            MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTMDD+LESY  P+ VI+ EHS 
Sbjct: 64  EMELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSC 123

Query: 128 IHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAV 187
           IHSDHI SP+T   NQSKGRN VEE E L H  RS SSL QRS GSS+NY LSK MAKAV
Sbjct: 124 IHSDHIGSPETLFDNQSKGRNVVEEPENLSHLHRSNSSLSQRSLGSSRNYSLSKSMAKAV 183

Query: 188 DSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAE 247
           DSYHS PLSMLEQS+ D  +S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAE
Sbjct: 184 DSYHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAE 243

Query: 248 PPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK 307
           PPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFH EEF APY TMLK
Sbjct: 244 PPLMNHNNPSPISPLSSMYELSSQDFGSMRNYEK--SLNSHFENPFHTEEFIAPYDTMLK 303

Query: 308 VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQY 367
           VQWISR+RK DSDI+HMLQGFRS I+RLKEV LKAMKH E+LAFWINVHNTLVMHAYLQY
Sbjct: 304 VQWISRERKNDSDINHMLQGFRSLIFRLKEVKLKAMKHDEKLAFWINVHNTLVMHAYLQY 363

Query: 368 GIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVND 427
           GI K+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLPR GQWLHLFLSSKTKFKVND
Sbjct: 364 GISKHCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVND 423

Query: 428 ALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 469
             KSF INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Sbjct: 424 VQKSFPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE 462

BLAST of Sgr021985 vs. TAIR 10
Match: AT5G66600.1 (Protein of unknown function, DUF547 )

HSP 1 Score: 348.2 bits (892), Expect = 2.0e-95
Identity = 214/476 (44.96%), Positives = 289/476 (60.71%), Query Frame = 0

Query: 18  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQ 77
           S+ + SLKQEI  L+ +LQ QF +R ALEKA+ ++  S   L    + ++PK A +LIK 
Sbjct: 68  SNTETSLKQEITHLETRLQDQFKVRCALEKALGYRTASSYVLTETNDIAMPKPATDLIKD 127

Query: 78  IAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSFIHSD--- 137
           +AVLE+EV++LE+YLLSLYR+ F+QQ+SS S   +  +  S P     R   F   D   
Sbjct: 128 VAVLEMEVIHLEQYLLSLYRKAFEQQISSVSPNLENKKPKSPPVTTPRRRLDFSEDDDTP 187

Query: 138 -----HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMA 197
                H V       NQSK      V+  +    F RS+S   QRS   S+         
Sbjct: 188 SKTDQHTVPLLDDNQNQSKKTEIAAVDRDQMDPSFRRSHS---QRSAFGSRKASPEDSWG 247

Query: 198 KAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCE 257
           KA  S HS PL +      +  N +SL EH G  I D    +PN LSE M+K +S IYC+
Sbjct: 248 KASRSCHSQPLYV-----QNGDNLISLAEHLGTRISDHVPETPNKLSEGMVKCMSEIYCK 307

Query: 258 LAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE 317
           LAEPP + H    SP + LSS        Y+ SS   G+  +      F+   DN FH+E
Sbjct: 308 LAEPPSVLHRGLSSPNSSLSSSAFSPSDQYDTSSPGFGNSSS------FDVRLDNSFHVE 367

Query: 318 ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWI 377
              +FS PY ++++V  I RD KK S++  +LQ F+S I RL+EVD + +KH+E+LAFWI
Sbjct: 368 GEKDFSGPYSSIVEVLCIYRDAKKASEVEDLLQNFKSLISRLEEVDPRKLKHEEKLAFWI 427

Query: 378 NVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWL 437
           NVHN LVMHA+L YGIP+N++KR+ L+ KAAYN+GGH IS + IQSSILGC++   GQWL
Sbjct: 428 NVHNALVMHAFLAYGIPQNNVKRVLLLLKAAYNIGGHTISAEAIQSSILGCKMSHPGQWL 487

Query: 438 HLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES 470
            L  +S+ KFK  D   +++I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE+
Sbjct: 488 RLLFASR-KFKAGDERLAYAIDHPEPLLHFALTSGSHSDPAVRVYTPKRIQQELET 528

BLAST of Sgr021985 vs. TAIR 10
Match: AT5G66600.2 (Protein of unknown function, DUF547 )

HSP 1 Score: 348.2 bits (892), Expect = 2.0e-95
Identity = 214/476 (44.96%), Positives = 289/476 (60.71%), Query Frame = 0

Query: 18  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQ 77
           S+ + SLKQEI  L+ +LQ QF +R ALEKA+ ++  S   L    + ++PK A +LIK 
Sbjct: 48  SNTETSLKQEITHLETRLQDQFKVRCALEKALGYRTASSYVLTETNDIAMPKPATDLIKD 107

Query: 78  IAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSFIHSD--- 137
           +AVLE+EV++LE+YLLSLYR+ F+QQ+SS S   +  +  S P     R   F   D   
Sbjct: 108 VAVLEMEVIHLEQYLLSLYRKAFEQQISSVSPNLENKKPKSPPVTTPRRRLDFSEDDDTP 167

Query: 138 -----HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMA 197
                H V       NQSK      V+  +    F RS+S   QRS   S+         
Sbjct: 168 SKTDQHTVPLLDDNQNQSKKTEIAAVDRDQMDPSFRRSHS---QRSAFGSRKASPEDSWG 227

Query: 198 KAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCE 257
           KA  S HS PL +      +  N +SL EH G  I D    +PN LSE M+K +S IYC+
Sbjct: 228 KASRSCHSQPLYV-----QNGDNLISLAEHLGTRISDHVPETPNKLSEGMVKCMSEIYCK 287

Query: 258 LAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE 317
           LAEPP + H    SP + LSS        Y+ SS   G+  +      F+   DN FH+E
Sbjct: 288 LAEPPSVLHRGLSSPNSSLSSSAFSPSDQYDTSSPGFGNSSS------FDVRLDNSFHVE 347

Query: 318 ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWI 377
              +FS PY ++++V  I RD KK S++  +LQ F+S I RL+EVD + +KH+E+LAFWI
Sbjct: 348 GEKDFSGPYSSIVEVLCIYRDAKKASEVEDLLQNFKSLISRLEEVDPRKLKHEEKLAFWI 407

Query: 378 NVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWL 437
           NVHN LVMHA+L YGIP+N++KR+ L+ KAAYN+GGH IS + IQSSILGC++   GQWL
Sbjct: 408 NVHNALVMHAFLAYGIPQNNVKRVLLLLKAAYNIGGHTISAEAIQSSILGCKMSHPGQWL 467

Query: 438 HLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES 470
            L  +S+ KFK  D   +++I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE+
Sbjct: 468 RLLFASR-KFKAGDERLAYAIDHPEPLLHFALTSGSHSDPAVRVYTPKRIQQELET 508

BLAST of Sgr021985 vs. TAIR 10
Match: AT5G66600.3 (Protein of unknown function, DUF547 )

HSP 1 Score: 348.2 bits (892), Expect = 2.0e-95
Identity = 214/476 (44.96%), Positives = 289/476 (60.71%), Query Frame = 0

Query: 18  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQ 77
           S+ + SLKQEI  L+ +LQ QF +R ALEKA+ ++  S   L    + ++PK A +LIK 
Sbjct: 68  SNTETSLKQEITHLETRLQDQFKVRCALEKALGYRTASSYVLTETNDIAMPKPATDLIKD 127

Query: 78  IAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSFIHSD--- 137
           +AVLE+EV++LE+YLLSLYR+ F+QQ+SS S   +  +  S P     R   F   D   
Sbjct: 128 VAVLEMEVIHLEQYLLSLYRKAFEQQISSVSPNLENKKPKSPPVTTPRRRLDFSEDDDTP 187

Query: 138 -----HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMA 197
                H V       NQSK      V+  +    F RS+S   QRS   S+         
Sbjct: 188 SKTDQHTVPLLDDNQNQSKKTEIAAVDRDQMDPSFRRSHS---QRSAFGSRKASPEDSWG 247

Query: 198 KAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCE 257
           KA  S HS PL +      +  N +SL EH G  I D    +PN LSE M+K +S IYC+
Sbjct: 248 KASRSCHSQPLYV-----QNGDNLISLAEHLGTRISDHVPETPNKLSEGMVKCMSEIYCK 307

Query: 258 LAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE 317
           LAEPP + H    SP + LSS        Y+ SS   G+  +      F+   DN FH+E
Sbjct: 308 LAEPPSVLHRGLSSPNSSLSSSAFSPSDQYDTSSPGFGNSSS------FDVRLDNSFHVE 367

Query: 318 ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWI 377
              +FS PY ++++V  I RD KK S++  +LQ F+S I RL+EVD + +KH+E+LAFWI
Sbjct: 368 GEKDFSGPYSSIVEVLCIYRDAKKASEVEDLLQNFKSLISRLEEVDPRKLKHEEKLAFWI 427

Query: 378 NVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWL 437
           NVHN LVMHA+L YGIP+N++KR+ L+ KAAYN+GGH IS + IQSSILGC++   GQWL
Sbjct: 428 NVHNALVMHAFLAYGIPQNNVKRVLLLLKAAYNIGGHTISAEAIQSSILGCKMSHPGQWL 487

Query: 438 HLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES 470
            L  +S+ KFK  D   +++I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE+
Sbjct: 488 RLLFASR-KFKAGDERLAYAIDHPEPLLHFALTSGSHSDPAVRVYTPKRIQQELET 528

BLAST of Sgr021985 vs. TAIR 10
Match: AT5G66600.4 (Protein of unknown function, DUF547 )

HSP 1 Score: 348.2 bits (892), Expect = 2.0e-95
Identity = 214/476 (44.96%), Positives = 289/476 (60.71%), Query Frame = 0

Query: 18  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQ 77
           S+ + SLKQEI  L+ +LQ QF +R ALEKA+ ++  S   L    + ++PK A +LIK 
Sbjct: 83  SNTETSLKQEITHLETRLQDQFKVRCALEKALGYRTASSYVLTETNDIAMPKPATDLIKD 142

Query: 78  IAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVIDREHSFIHSD--- 137
           +AVLE+EV++LE+YLLSLYR+ F+QQ+SS S   +  +  S P     R   F   D   
Sbjct: 143 VAVLEMEVIHLEQYLLSLYRKAFEQQISSVSPNLENKKPKSPPVTTPRRRLDFSEDDDTP 202

Query: 138 -----HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMA 197
                H V       NQSK      V+  +    F RS+S   QRS   S+         
Sbjct: 203 SKTDQHTVPLLDDNQNQSKKTEIAAVDRDQMDPSFRRSHS---QRSAFGSRKASPEDSWG 262

Query: 198 KAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCE 257
           KA  S HS PL +      +  N +SL EH G  I D    +PN LSE M+K +S IYC+
Sbjct: 263 KASRSCHSQPLYV-----QNGDNLISLAEHLGTRISDHVPETPNKLSEGMVKCMSEIYCK 322

Query: 258 LAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE 317
           LAEPP + H    SP + LSS        Y+ SS   G+  +      F+   DN FH+E
Sbjct: 323 LAEPPSVLHRGLSSPNSSLSSSAFSPSDQYDTSSPGFGNSSS------FDVRLDNSFHVE 382

Query: 318 ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWI 377
              +FS PY ++++V  I RD KK S++  +LQ F+S I RL+EVD + +KH+E+LAFWI
Sbjct: 383 GEKDFSGPYSSIVEVLCIYRDAKKASEVEDLLQNFKSLISRLEEVDPRKLKHEEKLAFWI 442

Query: 378 NVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWL 437
           NVHN LVMHA+L YGIP+N++KR+ L+ KAAYN+GGH IS + IQSSILGC++   GQWL
Sbjct: 443 NVHNALVMHAFLAYGIPQNNVKRVLLLLKAAYNIGGHTISAEAIQSSILGCKMSHPGQWL 502

Query: 438 HLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES 470
            L  +S+ KFK  D   +++I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE+
Sbjct: 503 RLLFASR-KFKAGDERLAYAIDHPEPLLHFALTSGSHSDPAVRVYTPKRIQQELET 543

BLAST of Sgr021985 vs. TAIR 10
Match: AT2G23700.1 (Protein of unknown function, DUF547 )

HSP 1 Score: 282.3 bits (721), Expect = 1.3e-75
Identity = 210/579 (36.27%), Positives = 288/579 (49.74%), Query Frame = 0

Query: 9   EAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAA 68
           E +K   PD   ++SLKQEI +L+++LQ+QF +R ALEKA+ ++  S D    +S PK  
Sbjct: 56  EMKKDLSPDVKFKSSLKQEIQELEKRLQNQFDVRGALEKALGYKTPSRD-IKGDSTPKPP 115

Query: 69  MELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESYSGPHIVI------- 128
            ELIK+IAVLE+EV +LE+YLLSLYR+ F QQ SS S    + +S   P   +       
Sbjct: 116 TELIKEIAVLELEVSHLEQYLLSLYRKAFDQQTSSVSPPTSKQQSSCSPKSTLRGKRLDF 175

Query: 129 ---DREHSFIHSDHIVSP---------------QTSLSNQ--------------SKGRNE 188
                   F   + + SP               Q SL+ Q              S GR  
Sbjct: 176 SRTPESRCFSFDNRLKSPRLVEKELESPNLRCRQESLATQPRCFSFDNRLKEPSSAGRQC 235

Query: 189 VEEAEKL------------------LHFG------------------------------- 248
            +E  ++                   HF                                
Sbjct: 236 NQEVSRIDSRSFSFDNRVKEPGSAARHFNQEDSRIDSQCVSFDNRVKEPVSGVRQFDQES 295

Query: 249 ------------------------------RSYSSLLQRSPGSSKNYPLSKYMAKAVDSY 308
                                         R  SSL QRS  +++  P       +V + 
Sbjct: 296 SRIDSRCFSFDNRLKDQCFIEKEDIDSCVRRCQSSLNQRSTFNNRISP----PEDSVFAC 355

Query: 309 HSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPL 368
           HS PLS+ E  Q + SN  SL EH G  I D   ++PN LSEEMIK  SAIY +LA+PP 
Sbjct: 356 HSQPLSIHEYIQ-NGSNDASLAEHMGTRISDHIFMTPNKLSEEMIKCASAIYSKLADPPS 415

Query: 369 INHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLKVQW 428
           INH   SP +  SS  E S +D   M  +      NS FD+ F   EFS PY +M++V  
Sbjct: 416 INHGFSSPSSSPSSTSEFSPQDQYDM--WSPSFRKNSSFDDQF---EFSGPYSSMIEVSH 475

Query: 429 ISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIP 470
           I R+RK+  D+  M + F   + +L+ VD + + H+E+LAFWINVHN LVMH +L  GIP
Sbjct: 476 IHRNRKR-RDLDLMNRNFSLLLKQLESVDPRKLTHQEKLAFWINVHNALVMHTFLANGIP 535

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022135648.1 | 2.4e-220 | 86.47 | uncharacterized protein LOC111007555 isoform X1 [Momordica charantia]
XP_022135649.1 | 3.3e-217 | 85.84 | uncharacterized protein LOC111007555 isoform X2 [Momordica charantia]
XP_008454883.1 | 1.9e-201 | 80.04 | PREDICTED: uncharacterized protein LOC103495193 [Cucumis melo]
TYK06707.1 | 1.9e-201 | 80.04 | uncharacterized protein E5676_scaffold13G00080 [Cucumis melo var. makuwa]
XP_011658927.1 | 4.0e-199 | 79.61 | uncharacterized protein LOC101203131 isoform X2 [Cucumis sativus] >KGN43981.1 hy...
Match Name | E-value | Identity (%) | Description
Q9XII1 | 1.0e-48 | 43.73 | Plastid division protein PDV2 OS=Arabidopsis thaliana OX=3702 GN=PDV2 PE=1 SV=1
Match Name | E-value | Identity (%) | Description
A0A6J1C5D8 | 1.2e-220 | 86.47 | uncharacterized protein LOC111007555 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1C220 | 1.6e-217 | 85.84 | uncharacterized protein LOC111007555 isoform X2 OS=Momordica charantia OX=3673 G...
A0A5D3C4C9 | 9.3e-202 | 80.04 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BZ51 | 9.3e-202 | 80.04 | uncharacterized protein LOC103495193 OS=Cucumis melo OX=3656 GN=LOC103495193 PE=...
A0A0A0K861 | 1.9e-199 | 79.61 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G075610 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT5G66600.1 | 2.0e-95 | 44.96 | Protein of unknown function, DUF547
AT5G66600.2 | 2.0e-95 | 44.96 | Protein of unknown function, DUF547
AT5G66600.3 | 2.0e-95 | 44.96 | Protein of unknown function, DUF547
AT5G66600.4 | 2.0e-95 | 44.96 | Protein of unknown function, DUF547
AT2G23700.1 | 1.3e-75 | 36.27 | Protein of unknown function, DUF547
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006869 | Domain of unknown function DUF547 | PFAM | PF04784 | DUF547 | coord: 341..473; e-value: 4.7E-41; score: 139.8
IPR025757 | Ternary complex factor MIP1, leucine-zipper | PFAM | PF14389 | Lzipper-MIP1 | coord: 19..99; e-value: 4.2E-24; score: 84.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 632..649
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 739..799
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 779..795
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 632..684
None | No IPR available | PANTHER | PTHR23054 | UNCHARACTERIZED | coord: 1..468
None | No IPR available | PANTHER | PTHR23054:SF20 | TERNARY COMPLEX FACTOR MIP1 LEUCINE-ZIPPER PROTEIN | coord: 1..468
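
Each InterPro alignment is given as an inclusive residue range (`coord: start..end`), so the span length is `end - start + 1`. The sketch below is illustrative only: the helper name `span_length` is hypothetical, and it assumes the `coord: N..M` text format used in the annotations above.

```python
import re

# Matches an inclusive residue range such as "coord: 341..473".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment):
    """Return the number of residues covered by an inclusive coord range."""
    match = COORD.search(alignment)
    if match is None:
        raise ValueError(f"no coord range found in {alignment!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


# Ranges copied from the two PFAM rows of the InterPro annotations.
print(span_length("coord: 341..473"))  # DUF547 (PF04784)
print(span_length("coord: 19..99"))    # Lzipper-MIP1 (PF14389)
```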

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr021985.1 | Sgr021985.1 | mRNA