Sgr020620 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr020620
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionHistone H3.2
Locationtig00153552: 507595 .. 516621 (+)
RNA-Seq ExpressionSgr020620
SyntenySgr020620
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACCGACAACGACTCTGTAGCGAAGACCTTCCGAGCTCTGGTGGAGAGCGCGAACCGGAAGTTCGCTAGGGTCCAAGACGTCCCGGCTTACGGACATGTGGACAACCACCACTATTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTTCCAGCAGGAATTCCGCACCAAGCTTGTTGAATCTGGTCTTAACCGCTGGGAAATTGGCGAGATCGCTAGCCGGATCGGTCAGCTTTATTTCGGGCATTACATGAGAACCAGCGAGGCCAGGTTTTTGATTGAAGCTTATGTCTTCTACGAAGCCATTCTTAATCGAAGCTATTTTGAGGGATCCAAGAATTCGAGGAAGGATCTGGGAGCAAGATTCAAGGAGCTGAGGTTTTACGCAAGGTTTTTGCTTGTTTCTTTGTTCCTGAATCGTACGGACACGGTTCAGGTCCTCGCGGAACGATTGAAGGCTTTGGTAGACGATAGCAAGGCCGCTTTTCGGGTAATTTCTCATTTGCTCGATAATTGACTGGATTTTTGGATTATGTTACATATTTCCAGTTTTCTGATTGACCTCTTCGACGACTTGTTTATTTGTTGAAAAAATGAAGTTTTTTTTAACGATCCAGCTTTTTTTTTTTTTTTTTTTTTTCGTTTGGCTTCTTACAGAATGCTGTCTAGACTCTAGACCTTGTATAGCAGCTCAGATTGTTCGGTTGTTTTTCTTTCTTTATCATAATTCTAAATAATATTTTAAAAACTCAATTGTCTTCTCGTTGAATTTGAAGAGCAAATTTATTTTTGTTCCCGTTTCTTTTTGGTTTTCTCTCAGCAGCCAAACCTAACAGAGTATGGAAACATTTATGATATGCTTACAATATTTGTCATTTTTCTGTCCTCGAGATAATTTATGGGTCATATTAAGAAATTTACTGCTCGCAACAAATGCGAACAGTGGAGGGTTTTTTTTTTTTTTTTTTCATTTTTTGCTGGCATGTGGGCGGCTTTGGAATTTAACTTGCCAATATGCAATATTTGGGAGATTTTCTTATTATGACAGCTGAGATAAATATGAGTTCTCCATATTATTATTAGAATCACAGTTTGCTGCATCTAACATCTGTCCGGATCTCGTGCTTTTAGAGCACTTTCTCGATTTAAACTCGGGTCGTGTTCATACTCCACAGAGTTTCATGTGTAATCTTTGAAGAATGAGAATGATTTTAACACCGCTGCTTGATCCGATAAATGAGCACTCTTTATTTCACTTTATCTGGTTAGAAGCATAGATATCTTCTTCACTAGAAAAGTGCTTTTCACTGGCATAACTGAGGCATTTGAACTATTGCTTCAAGGGTACTGACTTTAAAGAATGGAGGTTAGTTGTACAAGAAATTTTCTGCTTCATGAAAGTAGCATCAGTCTCAATGAATATCAGACCTTTGCGTTATTCTGCTCCATTTGATTCCCATCCGTCATCCCTTCCATGTGTGGCTCGTTTCCATGCAAAGAGGGTTCTTAAATTCCGAGATGCTGTTTTGACAAGCTACCACCGAAATGAGGTGAATTAATTATCTATCATTTTTTTGTCTGCCTCAAGAGTAAATATTAATCTTGAATGAGAAGATTGTTAATGAAATTTACTGACTTCATGCACTATTTCTTTCAGGTTAAATTTGCAGAAATTACTTTGGACACTTATAGAATGCTGCAATGTCTAGAATGGGAGCCTGGTTTCTTCTACCAAAGGCATCCAGTTGAACCAAATGAAAATGGTGCTTCCATTGATCATTCTGGGGCGTCTGGAATAATTGATATTAACTTAGCTACTGATATTACTGATCCATCTTTACCTCCAAATCCAAGGAAAGCTATACTCTATCGGCCTTCTGTGACTCATTTGATAGCTGTAAGTTTTGATTTATTCATGAAATTGTCACTGCTTCAGTTGCATGGCAGAGTTCTTGGTTTATTCAATTTATAGTCTTTTCTCCATTGTTCAGTGAGTGTTGATCGCTTTTTAATTTCTAAGTTAGGTGATTTCTGCATTTTAATTCCTGGGAGGCTCTGATCTCGAGATATTGTCTTATTACCGTCTGAATTGGTTCTGAGTTATGCCCTAAAGCCTGTCACGACTCACATGTGATATAATGCCCTCAAGCTCTGAATTGGGTCTGCTTGCACTTATTAGCTCGGATTAGCTTCGCATGTGATCTAATAATGATAGGAACTGATGCATATTACTAAAACAAAGGAAAATACAAAGGTTAATAACATCTAGAGAGGGATCTCTCTCCCACAGTTCTGTACCCAAATATTTACCTACCTCACTCACTTTCCTCTTCCCTTATGTATAACCTCTGGTAATTACAATTCCCTCAATAAACGCTAACAAATCCTCCTACTGTCCCACATAACTACCCATATACCAACTAGCTCTCATGGCCTTATGCTAATAAACTCCGATTCCAATCACTGCCCACTGTCGAAGTATCACCTTGTTCTCAAGGTGATGGATTCAGAATTAAGATATGCCAAATCTCCTCTTGTGCTTGTTGTGACTTCAGCTATCCAACCTCTGAGAGAGAAATCACCATTTTTCCCCATAAATTCTGTTCCGATGGTTACCGTACTGCAGCCTGCGAGGGGAAGTATCAACAGCTCCTTTAGTGTGCACTCCACATTTCTCAGTCATCTTCACAATTTTGTAAGGTACCCAAGCACAGCGTCTCAGCCTGTGCTTCTCCTTTTGTTTTCCGTCAATTTTGGAGTTTGGTTAACAATTACCTCTAATAAAGTCATAAAAACAGCAACAAGCCCCAACTCTTCTCCATATCTATCCACATACTAACCTACCTTCATGCTTAAATTCCAATGCCTATCACAGCCCGCTGCCTCAGTGTCACCTTGTCCTCTAGGTGATGAATACTGAACTAGGATATACCCAATCTCCTCTTAGTGGCCATTGTGACTTCAGCTATCCACAGCAAGAGAGAGGAAGAACTTTTTTTTTTTTTTGGGTATAAAGTTCTGCTTCGCTAGCCATAGTACTTTAGCCACTCTAACCACAGTGTTGTAATGCTTGACATGGAAAGGTTGATTTGATTTGCAGTGACCTTCGCGTTTGCTAAGGAAGTATACCAGCTGCTTAAGAGTACAGTTGCATTTCTCAATCATCTTCACACTTTCGGAAGGTACCCAAGCACAGTGTCCTAGCCTGTGCTTCTTCTCTAGGTTCCCACCCTCATTTGAAGCTTCCTCAAATTTTCTGTCAAACCCATGTTTGAAAGTGGAAATTGGCCTTCCATTATATGGGTTTTTGGGTCTTGATGCAGTCATACTCTTGTTGGAGCGAGTTGCATTCTCCAAGAACATGTATTATTGCCCCAATCAAAGTACTTGAATTATTCTTGTCAGATATAAAAACTCTATTTTCTGGAGCTGTACTTTTGCTGTGACCAACAACAACTTCTGCTAAGCTATAGCCTTAGTCCAATCCATCCGCACAAAAGGAATAAAAATGAAACAAGAATCTGTGTTCACCGGATATGGAGTGGACTATTCTGAATCTCTTCTTCTCCCTCCCCTCTGCCAGAGGAGATTCGATTGATTTTTTGTATTGTGTATTATGATTGATTTGTAGTCCTATCAGCCCAGCCCACTCATGAGTTTTAGTACCCATAACCTCTCAGGCCCATGGAGTAGTATTCTGAACTATCAGCTATGTAGCCAATGCCACCAAGATTTCTACTCACTGGTTACTGAAATCGACCCTTGAAACCCTAATGTTCTGTTGGCTTGGTAAAACCACTCCTTTGGATTGTATCTACTAAGATTGACCATATCATCTTCTACGCACCACTACCTTCTATAGCCCTAAAGGCGTGGATTTTGCTTTCATTTGTCCAAGATTGGGTCGTTGCAGATCAGATACCCAAGATGGCTATTTATCCCTCAAGGTTTTATACCCAAATATTTGCCTACTTCACTTCTCTGAGTTCTCTCACCCCTTATTTATAACCAATGGTAATTAAGAGTTCCCTCTAACAAACTCTGTCTATCCCACGTAACATGCCACATACTAACCCACCCCTGTACTAATCAAACTCCAATGCCTATCAAATAATTAAATCTCACCATCTACCGTTTGGGAACTTTTTGAGTTGAGATCTTTACATTACTAGTTAACAAAAATTATCATTAAGCCATTTCTGAATCTGAGCAGGGTTTTCGTTCTGAGAATGCCTACTCTGTATCATGTTTGACAGATGGAGTCTGTGGTTGCTCTACCTGTTATTTTCTTTTTGTTGTCTTTTTATTTAGTAAGAACTTTGAATTGAGCTTCTTGGCTTGTGAGCAGGTCATGGCTACAATTTGCGAGGAGCTCCTTCCAGATAGTATCATGCTGATTTATCTTTCTGCAGCAGGTTAATGTCCTTCTATAATCAATGATTAAGTGAAAACGATCTTATTTATTCATTTTCGTTTTTACTATTTTTTATTGACAGTGGTGATTTAATGATAATTTTGTGCAGGGAAATGCTGTCAAAACAGTGTTAATCAAATGGCAAGTCATGGGGAATCCAGAAAATCCCTTAAACATAAAGTCATCTCCCAGAACTCACGAGAAAATTGTAATGCTCTGCCTGAGTCCTGTAAGAGTGAGAAGCCAGGGTCAAGTGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTACAGTCTTCTCATACTCTCTCACAATGGTATTTGCTGTTGAAAAATGTGGATATTCAATCATTTTAACTCTTTGTCGTTTCCAATTTCATTGATGACCTAAAAATCATTTTTTTAATGTTCAGGTCCAAACGTTCTATACCCTGGTGATATAATACCTTTCACACGTAGACCTGTTTTCTTGATAGTTGACAGTAATAACAGCCATGCATTCAAGGCAGGTTTGTCAAAACTTTTACACGTGTTGCCAGGTCATAATGGTTTGCTTCATACTACTATATTGAAATTAAATTTCCCCTTGATAGGTGGACATTTATCATTTTCTTCAGTATTATCATATCCGCTCTATATATGTGATTATTGAGGCTTTGTTGAGATGAGATAGATATGACCAGCAGATGCATTAGATTGATTTGTTTTCCTTGTCTAAGAAAACAGGTGTTTGAGGAAAAATATTGCATGAAACTTTCATGTTGCATTTTCAGTATGAATCTTCTTATTTACACAAAAAAGCTAGTTCACCAATATCTAACCAAACAGTTTCTTGTTGATGGACCTGTCTGGTCTAGAGTGAATTTAATGGGTCTTAAAAGATACAATGCTTCAAGGAGAAGGCACTTTAATTTTTCCATTTTTCTCTGATGCCTTTTGCCTATGTAATTAAATTTTATTTGACCAACCATAAAGCTGCAGGACTATCATAGAGTTCACCTCATTGAATGTTCTTACATATACAGTGCGATACTCAGAGTAATGTTTAGCACTACTCTCAAGTCAATCTGACTGCTCACATGGAAACAATATTTGATTTTCAGGTTCTACATGGGGCAGAAAGAGGAGAGACTGCCGCTATACTTCTTTCACATTTGAGGCCTGTATTCAAGAATCCCTTAGATGTTGATACAATTCAATCAGGAAGTCAGTTTACTTTTTTCTTGACTGCTCCTCTGCCTGCATTTTGCGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATCTAGTAAGTTAATTATAAGAGTTAATACTTTCTTGTTTTCTCTTGAAATATGTTCAGGACTTCCGCTTGATCCAACTTGTTCTACTTATAGGATGTTTACAATGATGCCGAGACGATAATCTCTTCTGCGTTTTCCGAGTGGGAAATAATTCTTTGTACATCAACTAGCTTAAACCTCGTCTGGGCCCAAGTTTTGTGTGATCATTTTTTACGCCGTCTCATTCTCAGGTATTTTCAGTCAAGTTGGTTACGTTATTTATGCAATGTTATATGTACATGAGCATCTTTTTTGTGTTATCGGCATGGAGGCTGCTGATGAAAGCCTGAATCATATGGAATGCTCTAGCCATGGAAATGTAGTAACAGTTCTCTTTTTTGTTACATAATCTGCAGATTTATGTTCTGCCGATCCGTGCTATCTTTCTTCAGTACTACAGAAGACGGCGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCCATCTCTTCGAATTCTGGAGTTGTCAGTTCAGCAGTTCGCCGTCTCGCAAAGCACCTAAACGTTGCTGACTTATTTAACTTCCACGAAGTATAATCACAATTGAGATATTGTGAAAGCCCGAATTCGAGCTCAAAGGCTGCCTTCAAGGTCAGTTCAGAACCGAAACAAGAGCATCATTGGCCTCTTGAGAGTTGCAATAGGATCACAAATTGGTGTTTGTTAAGCTAGAATTATGAAACACTGGTAGTGTGATGGGTGGTTGCCTTCCATTTTAATGCTTATGTGCAACCTACTGGTTTCCTTAAGTTTCAAGGTGGGTTACTGTTTAGTTAGATAAGAGTGTGATAATTTATTCTAGGTTTCTTTTGAGCCTTGAAGATTCATTTTAGTTCTTTTTTTTTTGCCAATGCCTTTTCATCTCTGTAAAAAAAAAAAAAAGAAAAGAAAAGAGAGAAAAATTGAAGCAGAGAAATAGAGCTCCTCCCCCTCCCCCTCCCAACCCCGCAAAGTCAGCATGTTTCAAGTTATTTGTAGATGTTGGCAAATCGGGAATGTATAATCGTTTTCCAGGACCCTTTTACAAAAGTGACAAATCTTGAGAAGCTTGACTTCGTCTTTGTAACTTAAGTTTGTTCGATATTGGAATATTATAAGAAAGCATTTTCCAAGTTCATGAACTTCTTACTCGTCCAAGATTAATCATACAATGTCTTATTAGTTTTTTGCCAGAAAAGTTTTTAACTCTTTAGTGCCTATGAAGTCTTTCAATGTTTCTCCTTGTTTTTTGATATTTTGGTTCTCCTCTTAGGGCTGGAGGATTGTCGTTGTGTTAATGGAAAAGAGGTTCAGAAAGTAGTATTTTGTGTAATTCCTATATGTTTCAAAAGCAGTTTTAAACACCCACAGGATTTGCAAATTTGTAAATCATAAGCTTTTGTCGGTTACTTATAATAGGATTAGTGCACAAATCCAAATTTGTGTTCAAAATATGATTCTTATCATATGTAATGAAGGGTTATTAAAAGATAGATTGATTTTAGGATGTCCAGGAAGCATAACTGGATAAATCTTACCTGGCGTGATTTGATTGGTCGGCTATTTCAATTCCCGTTTTCCCCTGCCAAAATTGATAAGCCAATTCATAACTTTATCTTGACAAGTTCAAAATCTAAAGAGCCCTTGAAAGTGGGTAAAAGCCCCAACTAACTCACGTTTCACGCTTTTGATTCTTTAATCCCCACGATTTTAAAAAAAAAAGCCACAGATTTATTGCAATTCCAACCAATAGGAAGCCAGTATTGTATCCTCACGATCCTCGTCTTCCCTTTCGAGATCACATCCTGACCGTTCATTGAACTCGAGAAAACACAACTCACCCCCTCATTTACCGTTAGATTCGTACACCGAAAAACACATGGATATGCTTTCTTAACCGTCCGATCCCCAGATTATCTCTTCGCACACGACGCTATAAATATGCCCTAACTGCTCTTGTTGTTAAACTTTTTGGCTTCTTTGACGGAGGAGAGAGGGATCAGCCCGGAAGAGAAGAAGCGCTAAGCGACTCGAGGTCGGTTAATTTTTTTTTTCTTTTTGTTTTTCTTTTAATTACTCGGGATGTTAATTAGGGTTTAATTTTGTTGGTTTTCATCTCATGGTTAGTGAATTTGTTCCATGATCTTTCTGTTTTTGTCTATGTTTTATCTCCTTGTCGGTTTTTCTCCTTCCTTAATTGTATTTCAAGATTTGTGTTTGAGATTTTGAAAATTTTGGAGTCCGAAGGTTTTAGGTTTCTGTTGGGTTCTCAACCTTCCCTTTATGCTTGATCTCCTTAGGGTTTTTGTGTTTTCTGATCACAAGTCTCTGGGGATGTTTTTTTCTCCTGAAATTGTTTAGAGAAGGACATCGAAATCTGCTTGTTGTTAGGAAATGGAATTGGTTATCTATTAGATACACGGAATTAGATGATACTACTGGCTGATTATTTTCTGAAATTTCATGATGGCGCTTTAAGTATGAGAATTTGCGTGACCTGGATTGGGTTTGGTGTAGGTCTAATAGTGAATAATTCGGAATTGGGGTTTATTTTCTTACAGATGGCCCGTACCAAGCAAACTGCCCGTAAGTCTACTGGAGGGAAGGCTCCCAGGAAGCAGCTGGCCACAAAGGTTTCCCCAATTCGCTCGCTTAAAATTCCTTGGATGAATTTGATTGAATTTTGTTATTGTGTAAACATTGCTAGTTATTTTTGTCTTTGCTATTCTAGGCTGCCCGTAAGTCAGCCCCAACTACAGGAGGGGTGAAGAAGCCTCACAGATACCGTCCTGGTACTGTTGCTCTTCGGTGAGTTTGCATGGTCTTTCTCTTCTTGATTTACCAATCTTGGAGGTGTGTTATTTTTCCATTGGATGCAACAAACAAGTCGTTAACTTTTTTAACCCCTTGTGGACTCATGCAGTGAAATTCGTAAGTATCAGAAGAGTACTGAGCTATTGATTAGGAAGTTACCGTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTCAAGGTATGTTCTTTATGTATGGCAATTTCTTTCATCTCCTGCTTGGCGATCTAGCTGTGTGCTTTTCTGGGTTTGATGTAATTAAGGTTTTTGCGTTGATTCTTGGTGCAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTTGCCTTGCAAGAGGCAGCCGAGGCATACCTTGTTGGTTTGTTTGAGGACACCAATCTGTGTGCCATTCATGCCAAGCGTGTTACCATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGGGGTGAACGTGCTTAG

mRNA sequence

ATGACCGACAACGACTCTGTAGCGAAGACCTTCCGAGCTCTGGTGGAGAGCGCGAACCGGAAGTTCGCTAGGGTCCAAGACGTCCCGGCTTACGGACATGTGGACAACCACCACTATTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTTCCAGCAGGAATTCCGCACCAAGCTTGTTGAATCTGGTCTTAACCGCTGGGAAATTGGCGAGATCGCTAGCCGGATCGGTCAGCTTTATTTCGGGCATTACATGAGAACCAGCGAGGCCAGGTTTTTGATTGAAGCTTATGTCTTCTACGAAGCCATTCTTAATCGAAGCTATTTTGAGGGATCCAAGAATTCGAGGAAGGATCTGGGAGCAAGATTCAAGGAGCTGAGGTTTTACGCAAGGTTTTTGCTTGTTTCTTTGTTCCTGAATCGTACGGACACGGTTCAGGTCCTCGCGGAACGATTGAAGGCTTTGGTAGACGATAGCAAGGCCGCTTTTCGGGGTACTGACTTTAAAGAATGGAGGTTAGTTGTACAAGAAATTTTCTGCTTCATGAAAGTAGCATCAGTCTCAATGAATATCAGACCTTTGCGTTATTCTGCTCCATTTGATTCCCATCCGTCATCCCTTCCATGTGTGGCTCGTTTCCATGCAAAGAGGGTTCTTAAATTCCGAGATGCTGTTTTGACAAGCTACCACCGAAATGAGGTTAAATTTGCAGAAATTACTTTGGACACTTATAGAATGCTGCAATGTCTAGAATGGGAGCCTGGTTTCTTCTACCAAAGGCATCCAGTTGAACCAAATGAAAATGGTGCTTCCATTGATCATTCTGGGGCGTCTGGAATAATTGATATTAACTTAGCTACTGATATTACTGATCCATCTTTACCTCCAAATCCAAGGAAAGCTATACTCTATCGGCCTTCTGTGACTCATTTGATAGCTGTCATGGCTACAATTTGCGAGGAGCTCCTTCCAGATAGTATCATGCTGATTTATCTTTCTGCAGCAGGGAAATGCTGTCAAAACAGTGTTAATCAAATGGCAAGTCATGGGGAATCCAGAAAATCCCTTAAACATAAAGTCATCTCCCAGAACTCACGAGAAAATTGTAATGCTCTGCCTGAGTCCTGTAAGAGTGAGAAGCCAGGGTCAAGTGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAAACGTTCTATACCCTGGTGATATAATACCTTTCACACGTAGACCTGTTTTCTTGATAGTTGACAGTAATAACAGCCATGCATTCAAGGTTCTACATGGGGCAGAAAGAGGAGAGACTGCCGCTATACTTCTTTCACATTTGAGGCCTGTATTCAAGAATCCCTTAGATGTTGATACAATTCAATCAGGAAGTCAGTTTACTTTTTTCTTGACTGCTCCTCTGCCTGCATTTTGCGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATCTAGATGTTTACAATGATGCCGAGACGATAATCTCTTCTGCGTTTTCCGAGTGGGAAATAATTCTTTGTACATCAACTAGCTTAAACCTCGTCTGGGCCCAAGTTTTGTGTGATCATTTTTTACGCCGTCTCATTCTCAGATTTATGTTCTGCCGATCCGTGCTATCTTTCTTCAGTACTACAGAAGACGGCGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCCATCTCTTCGAATTCTGGAGTTGTCAGTTCAGCAGTTCGCCGTCTCGCAAAGCACCTAAACGTTGCTGACTTATTTAACTTCCACGAAATTCGTACACCGAAAAACACATGGATATGCTTTCTTAACCGTCCGATCCCCAGATTATCTCTTCGCACACGACGCTATAAATATGCCCTAACTGCTCTTGTTGTTAAACTTTTTGGCTTCTTTGACGGAGGAGAGAGGGATCAGCCCGGAAGAGAAGAAGCGCTAAGCGACTCGAGTATGAGAATTTGCGTGACCTGGATTGGGTTTGGTGTAGGTCTAATAGTGAATAATTCGGAATTGGGGTTTATTTTCTTACAGATGGCCCGTACCAAGCAAACTGCCCGTAAGTCTACTGGAGGGAAGGCTCCCAGGAAGCAGCTGGCCACAAAGGCTGCCCGTAAGTCAGCCCCAACTACAGGAGGGGTGAAGAAGCCTCACAGATACCGTCCTGGTACTGTTGCTCTTCGTGAAATTCGTAAGTATCAGAAGAGTACTGAGCTATTGATTAGGAAGTTACCGTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTCAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTTGCCTTGCAAGAGGCAGCCGAGGCATACCTTGTTGGTTTGTTTGAGGACACCAATCTGTGTGCCATTCATGCCAAGCGTGTTACCATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGGGGTGAACGTGCTTAG

Coding sequence (CDS)

ATGACCGACAACGACTCTGTAGCGAAGACCTTCCGAGCTCTGGTGGAGAGCGCGAACCGGAAGTTCGCTAGGGTCCAAGACGTCCCGGCTTACGGACATGTGGACAACCACCACTATTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTTCCAGCAGGAATTCCGCACCAAGCTTGTTGAATCTGGTCTTAACCGCTGGGAAATTGGCGAGATCGCTAGCCGGATCGGTCAGCTTTATTTCGGGCATTACATGAGAACCAGCGAGGCCAGGTTTTTGATTGAAGCTTATGTCTTCTACGAAGCCATTCTTAATCGAAGCTATTTTGAGGGATCCAAGAATTCGAGGAAGGATCTGGGAGCAAGATTCAAGGAGCTGAGGTTTTACGCAAGGTTTTTGCTTGTTTCTTTGTTCCTGAATCGTACGGACACGGTTCAGGTCCTCGCGGAACGATTGAAGGCTTTGGTAGACGATAGCAAGGCCGCTTTTCGGGGTACTGACTTTAAAGAATGGAGGTTAGTTGTACAAGAAATTTTCTGCTTCATGAAAGTAGCATCAGTCTCAATGAATATCAGACCTTTGCGTTATTCTGCTCCATTTGATTCCCATCCGTCATCCCTTCCATGTGTGGCTCGTTTCCATGCAAAGAGGGTTCTTAAATTCCGAGATGCTGTTTTGACAAGCTACCACCGAAATGAGGTTAAATTTGCAGAAATTACTTTGGACACTTATAGAATGCTGCAATGTCTAGAATGGGAGCCTGGTTTCTTCTACCAAAGGCATCCAGTTGAACCAAATGAAAATGGTGCTTCCATTGATCATTCTGGGGCGTCTGGAATAATTGATATTAACTTAGCTACTGATATTACTGATCCATCTTTACCTCCAAATCCAAGGAAAGCTATACTCTATCGGCCTTCTGTGACTCATTTGATAGCTGTCATGGCTACAATTTGCGAGGAGCTCCTTCCAGATAGTATCATGCTGATTTATCTTTCTGCAGCAGGGAAATGCTGTCAAAACAGTGTTAATCAAATGGCAAGTCATGGGGAATCCAGAAAATCCCTTAAACATAAAGTCATCTCCCAGAACTCACGAGAAAATTGTAATGCTCTGCCTGAGTCCTGTAAGAGTGAGAAGCCAGGGTCAAGTGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAAACGTTCTATACCCTGGTGATATAATACCTTTCACACGTAGACCTGTTTTCTTGATAGTTGACAGTAATAACAGCCATGCATTCAAGGTTCTACATGGGGCAGAAAGAGGAGAGACTGCCGCTATACTTCTTTCACATTTGAGGCCTGTATTCAAGAATCCCTTAGATGTTGATACAATTCAATCAGGAAGTCAGTTTACTTTTTTCTTGACTGCTCCTCTGCCTGCATTTTGCGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATCTAGATGTTTACAATGATGCCGAGACGATAATCTCTTCTGCGTTTTCCGAGTGGGAAATAATTCTTTGTACATCAACTAGCTTAAACCTCGTCTGGGCCCAAGTTTTGTGTGATCATTTTTTACGCCGTCTCATTCTCAGATTTATGTTCTGCCGATCCGTGCTATCTTTCTTCAGTACTACAGAAGACGGCGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCCATCTCTTCGAATTCTGGAGTTGTCAGTTCAGCAGTTCGCCGTCTCGCAAAGCACCTAAACGTTGCTGACTTATTTAACTTCCACGAAATTCGTACACCGAAAAACACATGGATATGCTTTCTTAACCGTCCGATCCCCAGATTATCTCTTCGCACACGACGCTATAAATATGCCCTAACTGCTCTTGTTGTTAAACTTTTTGGCTTCTTTGACGGAGGAGAGAGGGATCAGCCCGGAAGAGAAGAAGCGCTAAGCGACTCGAGTATGAGAATTTGCGTGACCTGGATTGGGTTTGGTGTAGGTCTAATAGTGAATAATTCGGAATTGGGGTTTATTTTCTTACAGATGGCCCGTACCAAGCAAACTGCCCGTAAGTCTACTGGAGGGAAGGCTCCCAGGAAGCAGCTGGCCACAAAGGCTGCCCGTAAGTCAGCCCCAACTACAGGAGGGGTGAAGAAGCCTCACAGATACCGTCCTGGTACTGTTGCTCTTCGTGAAATTCGTAAGTATCAGAAGAGTACTGAGCTATTGATTAGGAAGTTACCGTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTCAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTTGCCTTGCAAGAGGCAGCCGAGGCATACCTTGTTGGTTTGTTTGAGGACACCAATCTGTGTGCCATTCATGCCAAGCGTGTTACCATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGGGGTGAACGTGCTTAG

Protein sequence

MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNPRKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEIRTPKNTWICFLNRPIPRLSLRTRRYKYALTALVVKLFGFFDGGERDQPGREEALSDSSMRICVTWIGFGVGLIVNNSELGFIFLQMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Homology
BLAST of Sgr020620 vs. NCBI nr
Match: KAG6577288.1 (Protein SCAI, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1316.2 bits (3405), Expect = 0.0e+00
Identity = 677/819 (82.66%), Postives = 703/819 (85.84%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTDN+S AKTFRALVESANRKFARVQDVPAYG +D++HYFHKVFKAYMRLWK QQE+R K
Sbjct: 1   MTDNESAAKTFRALVESANRKFARVQDVPAYGRMDSNHYFHKVFKAYMRLWKHQQEYRAK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+FYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYIFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFKELRFYARFL+VSLFLNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLMVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM +A+ SMN+RPLRYS  FDSHPSSLP VARFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNIATASMNVRPLRYSTAFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA+ID+SGASGIIDINL+TDITDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAAIDYSGASGIIDINLSTDITDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKH 360
           +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSV+Q AS+GESRKSLK 
Sbjct: 301 KKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVHQTASYGESRKSLKS 360

Query: 361 KVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420
           KVI QNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVIPQNSRENCNALPESCKSQKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAF 480
           LIVDSNNSHAFK LHGAERGETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAF
Sbjct: 421 LIVDSNNSHAFKALHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAF 480

Query: 481 CEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRF 540
           CEMVGLSSANLD+DVYNDAETIISSAFSEWEIILCTSTSLN+VWAQVL DHFLRRLILRF
Sbjct: 481 CEMVGLSSANLDIDVYNDAETIISSAFSEWEIILCTSTSLNIVWAQVLSDHFLRRLILRF 540

Query: 541 MFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEIRT 600
           +FCRSVLSFF+T ED DLP+CLPCLPDSISSN GVV SA+RRLA HLN  +         
Sbjct: 541 IFCRSVLSFFNTKEDDDLPICLPCLPDSISSNCGVVISAIRRLANHLNKRN--------- 600

Query: 601 PKNTWICFLNRPIPRLSLRTRRYKYALTALVVKLFGFFDGGERDQPGREEALSDSSMRIC 660
                                                                       
Sbjct: 601 ------------------------------------------------------------ 660

Query: 661 VTWIGFGVGLIVNNSELGFIFLQMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVK 720
                                 QMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVK
Sbjct: 661 ----------------------QMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVK 720

Query: 721 KPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA 780
           KPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA
Sbjct: 721 KPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA 728

Query: 781 EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA 820
           EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Sbjct: 781 EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA 728

BLAST of Sgr020620 vs. NCBI nr
Match: KAG6600643.1 (hypothetical protein SDJN03_05876, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1213.7 bits (3139), Expect = 0.0e+00
Identity = 635/815 (77.91%), Postives = 668/815 (81.96%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTD++SVAKTFRALVESA+RKFARVQDVPAYG VDNHHYFHKVFKAYMRLWKFQQEFR K
Sbjct: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNR EIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNR+YFE SKNSRKD
Sbjct: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFK+LRFYARFLLVSL LNRT TVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI
Sbjct: 121 LGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFMKVA++SMN                                           VKFAE
Sbjct: 181 FCFMKVATISMN-------------------------------------------VKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDT+RMLQCLEWEPGFFYQ+HPVEPNENGA+ID+SGASGIIDINLATD++DPSLPPNP
Sbjct: 241 ITLDTFRMLQCLEWEPGFFYQKHPVEPNENGATIDYSGASGIIDINLATDMSDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKH 360
           +KAILYRPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQM S+GESRKS++ 
Sbjct: 301 KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMVSYGESRKSVRS 360

Query: 361 KVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420
           KVI+QNSRENCN+LPESCKSEK GSSDLYDEYLWFGHR NGGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVITQNSRENCNSLPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAF 480
           LIVDSNNSHAFKVLHGAERGETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAF
Sbjct: 421 LIVDSNNSHAFKVLHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAF 480

Query: 481 CEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRF 540
            EMVGL SAN+D DVYNDAETI+SSA SEWE +LCTSTSLN+VWAQVL D+FLRRLILRF
Sbjct: 481 LEMVGLPSANMDTDVYNDAETIVSSALSEWETVLCTSTSLNIVWAQVLSDNFLRRLILRF 540

Query: 541 MFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEIRT 600
           +FCRSVLSFFST ED DLP+CLPCLPDS++SNSGVV SA+RRLAKHLNVADLFNFHE R+
Sbjct: 541 IFCRSVLSFFSTKEDDDLPICLPCLPDSVASNSGVVCSAIRRLAKHLNVADLFNFHERRS 600

Query: 601 PKNTWICFLNRPIPRLSLRTRRYKYALTALVVKLFGFFDGGERDQPGREEALSDSSMRIC 660
                                                                       
Sbjct: 601 SAKQH------------------------------------------------------- 660

Query: 661 VTWIGFGVGLIVNNSELGFIFLQMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVK 720
                                 +MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVK
Sbjct: 661 ----------------------EMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVK 695

Query: 721 KPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA 780
           KPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA
Sbjct: 721 KPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA 695

Query: 781 EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR 816
           EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR
Sbjct: 781 EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR 695

BLAST of Sgr020620 vs. NCBI nr
Match: XP_022136688.1 (protein SCAI isoform X1 [Momordica charantia])

HSP 1 Score: 1137.5 bits (2941), Expect = 0.0e+00
Identity = 563/597 (94.30%), Postives = 583/597 (97.65%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTD DSVAKTFRALV+SANRKFARVQDVPAYG VDNHHYFHKVFKA+MRLWKFQQEFRTK
Sbjct: 1   MTDYDSVAKTFRALVDSANRKFARVQDVPAYGRVDNHHYFHKVFKAFMRLWKFQQEFRTK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHY+RTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYLRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFKELRFYARFL VSLFLNRTDTVQVLAERLKALVDDSKA F GTDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLQVSLFLNRTDTVQVLAERLKALVDDSKAVFLGTDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFMKVA+VSMN+RPLRYSA FDSHPSSLP VARFHAKRVLKFRDAVLTSYHR+EVKFAE
Sbjct: 181 FCFMKVATVSMNVRPLRYSALFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRHEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFF+Q+HPVEPNENGA+IDHSGASGIIDINLATDI+DPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFFQKHPVEPNENGATIDHSGASGIIDINLATDISDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKH 360
           +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKSLK+
Sbjct: 301 KKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASQGESRKSLKN 360

Query: 361 KVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420
           KVI+QNSRENCNALPESCKSEK GSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVIAQNSRENCNALPESCKSEKAGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAF 480
           LIVDSNNSHAFKVLHGAERGETAAILLS LRPVFKNPLDVDT++SGSQFTFFLTAPLPAF
Sbjct: 421 LIVDSNNSHAFKVLHGAERGETAAILLSPLRPVFKNPLDVDTVRSGSQFTFFLTAPLPAF 480

Query: 481 CEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRF 540
           CEMVGLS ANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVL DHFLRRLILRF
Sbjct: 481 CEMVGLSPANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLSDHFLRRLILRF 540

Query: 541 MFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE 598
           +FCRSVLSFFSTTED +LPVCLPCLP+SI+SNSGVVSS VRR+AKHLNVADLFNFHE
Sbjct: 541 VFCRSVLSFFSTTEDFNLPVCLPCLPNSIASNSGVVSSVVRRIAKHLNVADLFNFHE 597

BLAST of Sgr020620 vs. NCBI nr
Match: XP_004146874.2 (LOW QUALITY PROTEIN: protein SCAI [Cucumis sativus])

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 547/598 (91.47%), Postives = 573/598 (95.82%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTD+D  AKTFRA+VE+ANRKFARVQDVPAYG VDNHHYFHKVFKAYMRLWK+QQEFR K
Sbjct: 1   MTDHDCEAKTFRAMVENANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGA FKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEI
Sbjct: 121 LGAXFKELRFYARFLLVSLLLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM +A+ S N+RPLRYS  FDSHP SLP V RFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNIATASTNVRPLRYSTAFDSHPPSLPFVGRFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAGIDHSGASGIIDINLATDVTDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKH 360
           +KAIL+RPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKSLK+
Sbjct: 301 KKAILHRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASVGESRKSLKN 360

Query: 361 KVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420
           KV +QNSRENCNAL ESCKSEKPGSSDLYDEYLWFGHRG+GGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVTAQNSRENCNALAESCKSEKPGSSDLYDEYLWFGHRGSGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAF 480
           LIVDSNNSHAFKVLHGAERGETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAF
Sbjct: 421 LIVDSNNSHAFKVLHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAF 480

Query: 481 CEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRF 540
           CEMVGLSSANLD+DVYNDA+TI+SSAFS+WEIILCTSTSLN+VWAQVL DHFLRRLILRF
Sbjct: 481 CEMVGLSSANLDIDVYNDADTILSSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRF 540

Query: 541 MFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI 599
           +FCRSVLSFF+T ED DLPVCLPCLPDS+SSNSGVVSSA+RRLAKHLNVADLFNFHE+
Sbjct: 541 IFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNVADLFNFHEV 598

BLAST of Sgr020620 vs. NCBI nr
Match: XP_008453803.1 (PREDICTED: protein SCAI [Cucumis melo])

HSP 1 Score: 1118.2 bits (2891), Expect = 0.0e+00
Identity = 549/598 (91.81%), Postives = 572/598 (95.65%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTDNDS AKTFRA+VE+ANRKFARVQDVPAYG VDNHHYFHKVFKAYMRLWK+QQEFR K
Sbjct: 1   MTDNDSEAKTFRAMVENANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLLVSLLLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM VA+ S N+RPLRYS  FDSHP SLP V RFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNVATASTNVRPLRYSTAFDSHPPSLPFVGRFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAGIDHSGASGIIDINLATDVTDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKH 360
           +KAILYRPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQM S GESRKSLK+
Sbjct: 301 KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQM-SFGESRKSLKN 360

Query: 361 KVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420
           KV +QNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVTAQNSRENCNALAESCKLEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAF 480
           LIVDSNNSHAFKVLHGAERGETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAF
Sbjct: 421 LIVDSNNSHAFKVLHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAF 480

Query: 481 CEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRF 540
           CEMVGLSSANLD+DVYNDA+TI+SSAFS+WEIILCTSTSLN+VWAQVL DHFLRRLILRF
Sbjct: 481 CEMVGLSSANLDIDVYNDADTILSSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRF 540

Query: 541 MFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI 599
           +FCR+VLSFF+T ED DLP CLPCLPDS+SSNSGVVSSA+RRLAKHLNVADLFNFHE+
Sbjct: 541 IFCRAVLSFFNTKEDDDLPFCLPCLPDSVSSNSGVVSSAIRRLAKHLNVADLFNFHEV 597

BLAST of Sgr020620 vs. ExPASy Swiss-Prot
Match: P59169 (Histone H3.3 OS=Arabidopsis thaliana OX=3702 GN=HTR4 PE=1 SV=2)

HSP 1 Score: 258.5 bits (659), Expect = 2.6e-67
Identity = 136/136 (100.00%), Postives = 136/136 (100.00%), Query Frame = 0

Query: 684 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 743
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 744 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 803
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 804 MPKDIQLARRIRGERA 820
           MPKDIQLARRIRGERA
Sbjct: 121 MPKDIQLARRIRGERA 136

BLAST of Sgr020620 vs. ExPASy Swiss-Prot
Match: Q6RUR1 (Histone H3.3 OS=Capsicum annuum OX=4072 PE=2 SV=3)

HSP 1 Score: 258.5 bits (659), Expect = 2.6e-67
Identity = 136/136 (100.00%), Postives = 136/136 (100.00%), Query Frame = 0

Query: 684 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 743
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 744 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 803
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 804 MPKDIQLARRIRGERA 820
           MPKDIQLARRIRGERA
Sbjct: 121 MPKDIQLARRIRGERA 136

BLAST of Sgr020620 vs. ExPASy Swiss-Prot
Match: Q71V89 (Histone H3.3 OS=Gossypium hirsutum OX=3635 GN=HIS3 PE=2 SV=3)

HSP 1 Score: 258.5 bits (659), Expect = 2.6e-67
Identity = 136/136 (100.00%), Postives = 136/136 (100.00%), Query Frame = 0

Query: 684 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 743
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 744 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 803
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 804 MPKDIQLARRIRGERA 820
           MPKDIQLARRIRGERA
Sbjct: 121 MPKDIQLARRIRGERA 136

BLAST of Sgr020620 vs. ExPASy Swiss-Prot
Match: Q3C2E5 (Histone H3.3 OS=Lolium multiflorum OX=4521 GN=RH3 PE=2 SV=3)

HSP 1 Score: 258.5 bits (659), Expect = 2.6e-67
Identity = 136/136 (100.00%), Postives = 136/136 (100.00%), Query Frame = 0

Query: 684 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 743
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 744 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 803
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 804 MPKDIQLARRIRGERA 820
           MPKDIQLARRIRGERA
Sbjct: 121 MPKDIQLARRIRGERA 136

BLAST of Sgr020620 vs. ExPASy Swiss-Prot
Match: P69245 (Histone H3.3 OS=Lolium temulentum OX=34176 PE=2 SV=2)

HSP 1 Score: 258.5 bits (659), Expect = 2.6e-67
Identity = 136/136 (100.00%), Postives = 136/136 (100.00%), Query Frame = 0

Query: 684 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 743
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 744 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 803
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 804 MPKDIQLARRIRGERA 820
           MPKDIQLARRIRGERA
Sbjct: 121 MPKDIQLARRIRGERA 136

BLAST of Sgr020620 vs. ExPASy TrEMBL
Match: A0A6J1C886 (protein SCAI isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008340 PE=4 SV=1)

HSP 1 Score: 1137.5 bits (2941), Expect = 0.0e+00
Identity = 563/597 (94.30%), Postives = 583/597 (97.65%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTD DSVAKTFRALV+SANRKFARVQDVPAYG VDNHHYFHKVFKA+MRLWKFQQEFRTK
Sbjct: 1   MTDYDSVAKTFRALVDSANRKFARVQDVPAYGRVDNHHYFHKVFKAFMRLWKFQQEFRTK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHY+RTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYLRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFKELRFYARFL VSLFLNRTDTVQVLAERLKALVDDSKA F GTDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLQVSLFLNRTDTVQVLAERLKALVDDSKAVFLGTDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFMKVA+VSMN+RPLRYSA FDSHPSSLP VARFHAKRVLKFRDAVLTSYHR+EVKFAE
Sbjct: 181 FCFMKVATVSMNVRPLRYSALFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRHEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFF+Q+HPVEPNENGA+IDHSGASGIIDINLATDI+DPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFFQKHPVEPNENGATIDHSGASGIIDINLATDISDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKH 360
           +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKSLK+
Sbjct: 301 KKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASQGESRKSLKN 360

Query: 361 KVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420
           KVI+QNSRENCNALPESCKSEK GSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVIAQNSRENCNALPESCKSEKAGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAF 480
           LIVDSNNSHAFKVLHGAERGETAAILLS LRPVFKNPLDVDT++SGSQFTFFLTAPLPAF
Sbjct: 421 LIVDSNNSHAFKVLHGAERGETAAILLSPLRPVFKNPLDVDTVRSGSQFTFFLTAPLPAF 480

Query: 481 CEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRF 540
           CEMVGLS ANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVL DHFLRRLILRF
Sbjct: 481 CEMVGLSPANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLSDHFLRRLILRF 540

Query: 541 MFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE 598
           +FCRSVLSFFSTTED +LPVCLPCLP+SI+SNSGVVSS VRR+AKHLNVADLFNFHE
Sbjct: 541 VFCRSVLSFFSTTEDFNLPVCLPCLPNSIASNSGVVSSVVRRIAKHLNVADLFNFHE 597

BLAST of Sgr020620 vs. ExPASy TrEMBL
Match: A0A0A0KU99 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G026910 PE=4 SV=1)

HSP 1 Score: 1120.9 bits (2898), Expect = 0.0e+00
Identity = 548/598 (91.64%), Postives = 574/598 (95.99%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTD+D  AKTFRA+VE+ANRKFARVQDVPAYG VDNHHYFHKVFKAYMRLWK+QQEFR K
Sbjct: 1   MTDHDCEAKTFRAMVENANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLLVSLLLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM +A+ S N+RPLRYS  FDSHP SLP V RFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNIATASTNVRPLRYSTAFDSHPPSLPFVGRFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAGIDHSGASGIIDINLATDVTDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKH 360
           +KAIL+RPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKSLK+
Sbjct: 301 KKAILHRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASVGESRKSLKN 360

Query: 361 KVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420
           KV +QNSRENCNAL ESCKSEKPGSSDLYDEYLWFGHRG+GGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVTAQNSRENCNALAESCKSEKPGSSDLYDEYLWFGHRGSGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAF 480
           LIVDSNNSHAFKVLHGAERGETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAF
Sbjct: 421 LIVDSNNSHAFKVLHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAF 480

Query: 481 CEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRF 540
           CEMVGLSSANLD+DVYNDA+TI+SSAFS+WEIILCTSTSLN+VWAQVL DHFLRRLILRF
Sbjct: 481 CEMVGLSSANLDIDVYNDADTILSSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRF 540

Query: 541 MFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI 599
           +FCRSVLSFF+T ED DLPVCLPCLPDS+SSNSGVVSSA+RRLAKHLNVADLFNFHE+
Sbjct: 541 IFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNVADLFNFHEV 598

BLAST of Sgr020620 vs. ExPASy TrEMBL
Match: A0A1S3BX87 (protein SCAI OS=Cucumis melo OX=3656 GN=LOC103494421 PE=4 SV=1)

HSP 1 Score: 1118.2 bits (2891), Expect = 0.0e+00
Identity = 549/598 (91.81%), Postives = 572/598 (95.65%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTDNDS AKTFRA+VE+ANRKFARVQDVPAYG VDNHHYFHKVFKAYMRLWK+QQEFR K
Sbjct: 1   MTDNDSEAKTFRAMVENANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLLVSLLLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM VA+ S N+RPLRYS  FDSHP SLP V RFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNVATASTNVRPLRYSTAFDSHPPSLPFVGRFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAGIDHSGASGIIDINLATDVTDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKH 360
           +KAILYRPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQM S GESRKSLK+
Sbjct: 301 KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQM-SFGESRKSLKN 360

Query: 361 KVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420
           KV +QNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVTAQNSRENCNALAESCKLEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAF 480
           LIVDSNNSHAFKVLHGAERGETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAF
Sbjct: 421 LIVDSNNSHAFKVLHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAF 480

Query: 481 CEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRF 540
           CEMVGLSSANLD+DVYNDA+TI+SSAFS+WEIILCTSTSLN+VWAQVL DHFLRRLILRF
Sbjct: 481 CEMVGLSSANLDIDVYNDADTILSSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRF 540

Query: 541 MFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI 599
           +FCR+VLSFF+T ED DLP CLPCLPDS+SSNSGVVSSA+RRLAKHLNVADLFNFHE+
Sbjct: 541 IFCRAVLSFFNTKEDDDLPFCLPCLPDSVSSNSGVVSSAIRRLAKHLNVADLFNFHEV 597

BLAST of Sgr020620 vs. ExPASy TrEMBL
Match: A0A6J1EMW7 (protein SCAI-like OS=Cucurbita moschata OX=3662 GN=LOC111436036 PE=4 SV=1)

HSP 1 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 549/598 (91.81%), Postives = 574/598 (95.99%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTDN+S AKTFRALVESANRKFARVQDVPAYG +D++HYFHKVFKAYMRLWK QQE+R K
Sbjct: 1   MTDNESAAKTFRALVESANRKFARVQDVPAYGRMDSNHYFHKVFKAYMRLWKHQQEYRAK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+FYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYIFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFKELRFYARFL+VSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLMVSLLLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM VA+ SMN+RPLRYS  FDSHPSSLP VARFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNVATASMNVRPLRYSTAFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA+ID+SGASGIIDINL+TDITDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAAIDYSGASGIIDINLSTDITDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASH-GESRKSLK 360
           +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSV+Q AS+ GESRKSLK
Sbjct: 301 KKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVHQTASYGGESRKSLK 360

Query: 361 HKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV 420
            KVI+QNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Sbjct: 361 SKVIAQNSRENCNALPESCKSQKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV 420

Query: 421 FLIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPA 480
           FLIVDSNNSHAFK LHGAERGETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPA
Sbjct: 421 FLIVDSNNSHAFKALHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPA 480

Query: 481 FCEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILR 540
           FCEMVGLSSANLD+DVYNDAETIISSAFSEWEIILCTSTSLN+VWAQVL DHFLRRLILR
Sbjct: 481 FCEMVGLSSANLDIDVYNDAETIISSAFSEWEIILCTSTSLNIVWAQVLSDHFLRRLILR 540

Query: 541 FMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE 598
           F+FCRSVLSFF+T ED DLP+CLPCLPDSISSN GVV SA+RRLA HLNVADLFNFHE
Sbjct: 541 FIFCRSVLSFFNTKEDDDLPICLPCLPDSISSNCGVVISAIRRLANHLNVADLFNFHE 598

BLAST of Sgr020620 vs. ExPASy TrEMBL
Match: A0A6J1JDH9 (protein SCAI-like OS=Cucurbita maxima OX=3661 GN=LOC111483509 PE=4 SV=1)

HSP 1 Score: 1113.2 bits (2878), Expect = 0.0e+00
Identity = 549/598 (91.81%), Postives = 575/598 (96.15%), Query Frame = 0

Query: 1   MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTK 60
           MTDN+S AKTFRALVESANRKFARVQDVPAYG +D++HYFHKVFKAYMRLWK QQE+R K
Sbjct: 1   MTDNESAAKTFRALVESANRKFARVQDVPAYGRMDSNHYFHKVFKAYMRLWKHQQEYRAK 60

Query: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120
           LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+FYEAILNRSYFEGSKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYIFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEI 180
           LGARFKELRFYARFL+VSLFLNRTDTVQVLAERLKALVDDSKAAFR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLMVSLFLNRTDTVQVLAERLKALVDDSKAAFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM VA+ SMN+RPLRYS  FDSH SSLP VARFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNVATASMNVRPLRYSTAFDSHLSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA+ID+SGASGIIDINL+TDITDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAAIDYSGASGIIDINLSTDITDPSLPPNP 300

Query: 301 RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASH-GESRKSLK 360
           +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSV+Q AS+ GESRKSLK
Sbjct: 301 KKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVHQTASYGGESRKSLK 360

Query: 361 HKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV 420
            KVI+QNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Sbjct: 361 SKVIAQNSRENCNALPESCKSQKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV 420

Query: 421 FLIVDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPA 480
           FLIVDSNNSHAFK LHGAERGETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPA
Sbjct: 421 FLIVDSNNSHAFKALHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPA 480

Query: 481 FCEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILR 540
           FCEMVGLSSANLD+DVYNDAETIISSAFSEWEIILCTSTSLN+VWAQVL DHFLRRLILR
Sbjct: 481 FCEMVGLSSANLDIDVYNDAETIISSAFSEWEIILCTSTSLNIVWAQVLSDHFLRRLILR 540

Query: 541 FMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE 598
           F+FCRSVLSFF+T ED DLP+CLPCLPD+ISSN GVV SA+RRLA HLNVADLFNFHE
Sbjct: 541 FIFCRSVLSFFNTKEDDDLPICLPCLPDAISSNCGVVISAIRRLANHLNVADLFNFHE 598

BLAST of Sgr020620 vs. TAIR 10
Match: AT4G40050.1 (Protein of unknown function (DUF3550/UPF0682) )

HSP 1 Score: 699.9 bits (1805), Expect = 2.4e-201
Identity = 366/596 (61.41%), Postives = 464/596 (77.85%), Query Frame = 0

Query: 5   DSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVES 64
           + V+  FRALVE+A+RKFARV+D+PA+G   + HYF KVFKAYM+LW +QQ  R+KLVES
Sbjct: 4   EDVSSNFRALVENADRKFARVRDLPAFGRAQS-HYFQKVFKAYMKLWNYQQSHRSKLVES 63

Query: 65  GLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKDLGAR 124
           GLNRWEIGEIASRIGQLYF  YMRTSEARFL+EA+VFYEAIL RSYF+ ++   KDLGAR
Sbjct: 64  GLNRWEIGEIASRIGQLYFSQYMRTSEARFLLEAFVFYEAILKRSYFDEAQG--KDLGAR 123

Query: 125 FKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFM 184
           FKELRFYARFLLVSL ++R   +  LA++L+ LVD S + FR T+FKEWRLVVQEI  F+
Sbjct: 124 FKELRFYARFLLVSLIVDRKQMLLHLADKLRLLVDHSISNFRETNFKEWRLVVQEITRFI 183

Query: 185 KVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLD 244
           +  +    +RPLRY A  DS+P+S   +ARFHAK++ KFRDA+L SYHRNEVK+AE+TLD
Sbjct: 184 ESDTNLTYLRPLRYCAMLDSYPASQTYLARFHAKKLFKFRDALLASYHRNEVKYAEVTLD 243

Query: 245 TYRMLQCLEWEP-GFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNPRKA 304
           TYRM+QCLEWEP G FYQ+ PVE  ENG  +DH+  SG+ID+NLA D+ DPSLPPNPRKA
Sbjct: 244 TYRMMQCLEWEPSGSFYQKKPVEAKENGFVVDHTLTSGLIDMNLAADMADPSLPPNPRKA 303

Query: 305 ILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVI 364
           ILYRP+V+HL+AV+A IC+EL P+++ML+YLSA+G   + +V Q  +   S ++ K K++
Sbjct: 304 ILYRPTVSHLLAVLAMICDELSPETVMLLYLSASGGPARENVAQPENSVGSSRTSKSKLL 363

Query: 365 SQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHR-GNGGPNVLYPGDIIPFTRRPVFLI 424
           ++ S+E  +   E   + K  S++ Y+ +LW G R G+ G N LYPGD+IPFTR+P+FLI
Sbjct: 364 ARASQEQKSYKSEPHSNGKLMSAEYYENHLWLGPRGGSSGSNNLYPGDLIPFTRKPLFLI 423

Query: 425 VDSNNSHAFKVLHGAERGETAAILLSHLRPVFKNPLDVDT-IQSGSQFTFFLTAPLPAFC 484
           +DS+ S AFKVL GAERGE  A+LLS L+P F+NP   DT   +GSQFTFFLTAPL AFC
Sbjct: 424 IDSDTSRAFKVLGGAERGEPVAMLLSPLKPSFENPSTDDTEALNGSQFTFFLTAPLQAFC 483

Query: 485 EMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFM 544
           +M+GLS+   D ++ ++AE+I+S++FSEWE IL TS  LNLVWAQVL D FLRRLILRF+
Sbjct: 484 QMLGLSNTKPDPELCDEAESILSASFSEWETILLTSKVLNLVWAQVLPDPFLRRLILRFI 543

Query: 545 FCRSVLSFFSTTEDGD--LPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNF 596
           FCRSVL+ FS TED D  LP C P LP+ +SS S  V S+V+RLA+HL VA  F+F
Sbjct: 544 FCRSVLTSFSRTEDDDPYLPQCHPNLPELLSSVSKPVQSSVQRLAEHLGVAKSFHF 596

BLAST of Sgr020620 vs. TAIR 10
Match: AT3G03570.1 (Protein of unknown function (DUF3550/UPF0682) )

HSP 1 Score: 488.0 bits (1255), Expect = 1.4e-137
Identity = 272/604 (45.03%), Postives = 387/604 (64.07%), Query Frame = 0

Query: 3   DNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLV 62
           +N  +++ + +LV  A++KF++++D+P Y      +YF KVFK Y +LWKFQQE R KLV
Sbjct: 9   NNIPLSEVYWSLVNKADKKFSKIRDLPFYERSRYENYFFKVFKVYTQLWKFQQENRQKLV 68

Query: 63  ESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKDLG 122
           E+GL RWEIGEIASRI QLY+GHYMRTS+A +L E+YVFYEAIL R YF+      +DL 
Sbjct: 69  EAGLKRWEIGEIASRIAQLYYGHYMRTSDAGYLSESYVFYEAILTREYFK--DGLFQDLN 128

Query: 123 ARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFC 182
              K+LRF ARFL+V L L R + V  L ++ K L+D+ K  F+ TDFKEW++V QEI  
Sbjct: 129 IANKQLRFLARFLMVCLVLGRREMVHQLVDQFKRLIDECKRTFQETDFKEWKVVAQEIVR 188

Query: 183 FMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEIT 242
           F+K  +  MNIRPLRYS   D +  +        A R L+  DA+L+SY+ NEVK++E+T
Sbjct: 189 FLKSDTAFMNIRPLRYSLVLDPNLDA----GTPRASRSLRLTDAILSSYYCNEVKYSELT 248

Query: 243 LDTYRMLQCLEWEP-GFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNPR 302
           LD++RMLQCLEWEP G  YQ         GA +  +   G+  IN +  + DP+LPPNPR
Sbjct: 249 LDSFRMLQCLEWEPSGSLYQ-------STGAKMGQNAPVGVARIN-SQSMNDPTLPPNPR 308

Query: 303 KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESR------ 362
           KA+LYRPS+TH +AV+ATICEEL    I+L+YLSA+GK  Q S + +++   +       
Sbjct: 309 KAVLYRPSITHFLAVLATICEELPSHGILLLYLSASGKIGQISSSPLSARSATSVEENIL 368

Query: 363 KSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDE--YLWFGHRGNGGPNVLYPGDIIP 422
           +  +   I Q +  +    P S +S +  S D       L FG  G  G + +YP D++P
Sbjct: 369 RDFESHTIKQETEPSLQITP-SGQSLRQISEDAVSTPCGLSFGSHGLTGSSYIYPSDLVP 428

Query: 423 FTRRPVFLIVDSNNSHAFKVLHGAERGETAAILL--SHLRPVFKNPLDVDTIQSGSQFTF 482
           FTR+P+F+I+DS++S  FK + GAE+GE AA+LL  SH  P+     D     SGS FT 
Sbjct: 429 FTRKPLFIIIDSDSSTVFKNICGAEKGEPAALLLSPSHTPPLIS--ADFSRQPSGSLFTI 488

Query: 483 FLTAPLPAFCEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDH 542
           FLT+P+ AFC +  +S+++++ D++  AE ++SS+ +EW   L TS +L+ VW+Q+L D 
Sbjct: 489 FLTSPVQAFCLLSEISASDMETDIFTKAEKLLSSSMNEWASTLATSDTLHPVWSQILKDP 548

Query: 543 FLRRLILRFMFCRSVLSFFSTTEDG--DLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNV 594
           FLRRL+LRF+FCR+VL+ ++   +   + P C P LP+S+   +  V SAV ++A     
Sbjct: 549 FLRRLLLRFIFCRAVLALYTPVFNNKQNQPECCPSLPESLLPTAPAVQSAVFQMANVFGA 595

BLAST of Sgr020620 vs. TAIR 10
Match: AT4G40030.2 (Histone superfamily protein )

HSP 1 Score: 259.6 bits (662), Expect = 8.4e-69
Identity = 136/138 (98.55%), Postives = 138/138 (100.00%), Query Frame = 0

Query: 682 LQMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKS 741
           ++MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKS
Sbjct: 27  IKMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKS 86

Query: 742 TELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRV 801
           TELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRV
Sbjct: 87  TELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRV 146

Query: 802 TIMPKDIQLARRIRGERA 820
           TIMPKDIQLARRIRGERA
Sbjct: 147 TIMPKDIQLARRIRGERA 164

BLAST of Sgr020620 vs. TAIR 10
Match: AT4G40030.1 (Histone superfamily protein )

HSP 1 Score: 258.5 bits (659), Expect = 1.9e-68
Identity = 136/136 (100.00%), Postives = 136/136 (100.00%), Query Frame = 0

Query: 684 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 743
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 744 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 803
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 804 MPKDIQLARRIRGERA 820
           MPKDIQLARRIRGERA
Sbjct: 121 MPKDIQLARRIRGERA 136

BLAST of Sgr020620 vs. TAIR 10
Match: AT4G40040.1 (Histone superfamily protein )

HSP 1 Score: 258.5 bits (659), Expect = 1.9e-68
Identity = 136/136 (100.00%), Postives = 136/136 (100.00%), Query Frame = 0

Query: 684 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 743
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 744 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 803
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 804 MPKDIQLARRIRGERA 820
           MPKDIQLARRIRGERA
Sbjct: 121 MPKDIQLARRIRGERA 136

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6577288.10.0e+0082.66Protein SCAI, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG6600643.10.0e+0077.91hypothetical protein SDJN03_05876, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022136688.10.0e+0094.30protein SCAI isoform X1 [Momordica charantia][more]
XP_004146874.20.0e+0091.47LOW QUALITY PROTEIN: protein SCAI [Cucumis sativus][more]
XP_008453803.10.0e+0091.81PREDICTED: protein SCAI [Cucumis melo][more]
Match NameE-valueIdentityDescription
P591692.6e-67100.00Histone H3.3 OS=Arabidopsis thaliana OX=3702 GN=HTR4 PE=1 SV=2[more]
Q6RUR12.6e-67100.00Histone H3.3 OS=Capsicum annuum OX=4072 PE=2 SV=3[more]
Q71V892.6e-67100.00Histone H3.3 OS=Gossypium hirsutum OX=3635 GN=HIS3 PE=2 SV=3[more]
Q3C2E52.6e-67100.00Histone H3.3 OS=Lolium multiflorum OX=4521 GN=RH3 PE=2 SV=3[more]
P692452.6e-67100.00Histone H3.3 OS=Lolium temulentum OX=34176 PE=2 SV=2[more]
Match NameE-valueIdentityDescription
A0A6J1C8860.0e+0094.30protein SCAI isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008340 PE=4 SV=1[more]
A0A0A0KU990.0e+0091.64Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G026910 PE=4 SV=1[more]
A0A1S3BX870.0e+0091.81protein SCAI OS=Cucumis melo OX=3656 GN=LOC103494421 PE=4 SV=1[more]
A0A6J1EMW70.0e+0091.81protein SCAI-like OS=Cucurbita moschata OX=3662 GN=LOC111436036 PE=4 SV=1[more]
A0A6J1JDH90.0e+0091.81protein SCAI-like OS=Cucurbita maxima OX=3661 GN=LOC111483509 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G40050.12.4e-20161.41Protein of unknown function (DUF3550/UPF0682) [more]
AT3G03570.11.4e-13745.03Protein of unknown function (DUF3550/UPF0682) [more]
AT4G40030.28.4e-6998.55Histone superfamily protein [more]
AT4G40030.11.9e-68100.00Histone superfamily protein [more]
AT4G40040.11.9e-68100.00Histone superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000164Histone H3/CENP-APRINTSPR00622HISTONEH3coord: 686..700
score: 97.1
coord: 797..818
score: 94.86
coord: 741..758
score: 97.46
coord: 763..781
score: 82.38
coord: 700..714
score: 92.32
coord: 781..797
score: 96.93
coord: 717..738
score: 91.8
IPR000164Histone H3/CENP-ASMARTSM00428h35coord: 717..819
e-value: 1.1E-74
score: 264.1
IPR000164Histone H3/CENP-APROSITEPS00322HISTONE_H3_1coord: 698..704
IPR000164Histone H3/CENP-APROSITEPS00959HISTONE_H3_2coord: 750..758
IPR022709Protein SCAIPFAMPF12070SCAIcoord: 14..551
e-value: 1.7E-186
score: 621.2
IPR009072Histone-foldGENE3D1.10.20.10Histone, subunit Acoord: 685..819
e-value: 4.8E-74
score: 248.6
IPR009072Histone-foldSUPERFAMILY47113Histone-foldcoord: 685..816
IPR007125Histone H2A/H2B/H3PFAMPF00125Histonecoord: 684..815
e-value: 6.7E-52
score: 175.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 691..724
NoneNo IPR availablePANTHERPTHR21243PROTEIN SCAIcoord: 5..597

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr020620.1Sgr020620.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009873 ethylene-activated signaling pathway
biological_process GO:0045892 negative regulation of transcription, DNA-templated
biological_process GO:0006351 transcription, DNA-templated
cellular_component GO:0000786 nucleosome
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding
molecular_function GO:0003700 DNA-binding transcription factor activity
molecular_function GO:0046982 protein heterodimerization activity
molecular_function GO:0003714 transcription corepressor activity