Sgr023231 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr023231
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain
Location: tig00000892: 1244480 .. 1266337 (-)
RNA-Seq Expression: Sgr023231
Synteny: Sgr023231
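
The Location field can be read programmatically. The following is a minimal Python sketch, not part of the database itself: the `Location` type and the `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as shown in the Location field (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "tig00000892: 1244480 .. 1266337 (-)".
_LOCATION_RE = re.compile(
    r"^\s*(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text: str) -> Location:
    """Parse 'seqid: start .. end (strand)' into a Location."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start_i, end_i = int(start), int(end)
    if start_i > end_i:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start_i, end_i, strand)


loc = parse_location("tig00000892: 1244480 .. 1266337 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under that coordinate assumption the interval spans 21,858 bp on the minus strand of contig tig00000892.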
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATTCTGAGGAGAATAAAGATTTTACCGAGAAGAACAAGAAACGGAAGCTCAAGACACCTAACCAGGTGATTGCTTTAGAGAAGTTCTATAATGGTAAGAGTACAGCTTTTTGTCAAATCCTCCTCTAAGTTCTTTGTTATCCCTTTAATCTGAAAAAAAGTTCTTAATTCCAATTCCAGAACACAAGTATCCTACAGAGGAAATGAAATCACAGCTTGCAGAGCAGCTAGGTTTGACTGAAAAGCAAATTTCTGGATGGTTTTGCCACAGAAGATTAAAAGACAAAAGGTTTTGTGAAACATATGCTAGTGTACGACAGGATCGTTCAAGTGGTGTTATCCAAGATCATGGCAGTGGGCTCGCACAAGATTCATGTGGTAGCACAAAAAATGGAGACTATTGGCATATCGATCCACGTGAAGTTGAAAGTCAAAAGCCTTATGGCCATGAGCACCCAGCTACAGATAATGTCCTTGAGCGTAGGAGTCAATTTACAGAAAATGTTAGTAATATGGATAATACATCTTCAGAAAGCAGCTCTTCTTTAAAAGATAGGTTATTATCTCAAAGTGAAAATCCGTATGATACGGAAGTTTCGCGGTATTTAACACATGATGGTGCTATTCCACCATCAAATCCTAAGGTTTTAAACTCCCTGCGATATAAACCATCAGGCTATTTAAAAGTGAAGGGCGAAGTTGAAAATGCTGCTATTACTGCTGTTAAGAGACAGTTGGGTGTGCAATATCGGGAGGACGGTCCACCGCTTGGTGTGGAATTCCAGCCACTTCCTCCTGGTGCATTTGAGTCCCCAGCGAAAGGTCCAATCCATGGTAAATAATAGAATGTTCTTCAATGTTGACAGTGATGCTATTGTTCTTGGCTAATATGTTTTCATAGTATTATCTAGTATAGTGTATAAATTAGCAGTTGTAAATAGACTTATTGTTATACACAATAAAAATTGTACTTACTATTTTGTGCCCTCAAACAATATGAATTGGTTAGTAATTATTAAGTTATTTCTGTATTAGTCATTTTGTACGATGCAATTAGATGTGCACAATAGGGTTCTGAGGCAAATTAACTACTACTAAGAGATTGGATTGCAATTAACTAAGTATGGATACTTTTTATTTTCACTATTGGTTGCTAAGAATTTACGTTCTAAAGAACATATGACATTTGTGAGGCAAATTTTTCTACTGGAAAGCTTGTCCCTAATATACAAAAACAAAAATGGATGAATTGAAGTGAATGAAAAATTAGATAATGAGTCAACCGCCTCCCTACACATTTTTTTGTTTGGGGGGGGGGGGGGGGGGTGTGGATTCAAGATGACTCTACAGTGTGATGCAAGAAGAAGAAATTTTTTTGGTAGTATAGATTCCGGTACTAAGACATTTCTAATCATTATGGGAACCGTATTTCTTGTTCAATAACTCTCCCTCTCCGACACTGGTTGGTGGGTAGAGGGAATTAGAAAGAGAAGGCTCTTAGATGTATGAAGATTTGGGAGTGAGAATCAGAATAGATTAGAATGTTAGGATTTTAGTAATATTTAAAGCAATAATCTTGGATCTAATGAGCTTAATGTCGTAAACTTACAGTGCCTCTGTACAGAGATATTCTTAGTTTAAGTGCTTTATCCTATACCAGAGTCTCTGTCATGCTTTACTAAGTATGAATATTTTCTTCTGGGTTTTATTTCCTATGATTTTTATTAAACTGTCACATCAAAATTTCATTGAGCCTCATACCTATTTTTTGATGGATACTTCACAGATTCATACTATGTTGGAAATCCCGTGCTTCCCCATTCTCCAGACATATTGACAATGAAGAAACAAAGAGCTCTTGGCTCTGTAAGTGGACCTTTATAATGGCTTTCTCAAGATTTTATTTTATTATGGATTTTGTTTGATATGACCCCAATACTAAATAAGGAGTTAAAATAAAGTATTAGATTGGATGATCTATAAGCATGCAATATATC
TGTGTGAATTGAAAGGCTTTATCTTATGCTACTCTGTCATCTTTCTATTTTCTCCAGTTTATCCCAAATTTTATTGGTAAATCTGTGGGAATATACTGGAAGGACCTGAAAATATTTCTTTTATATGCTTTTATTTAAGCAGGACAACAGTCCAATCTATTTAAGACATAAATATATACATACAAATACATACGCACATGTATATATATTTGCAATCTCACTGAAGCAGGCTTAGCTTCAAATGGATTTTTTATTCATGGGTTAGGGAGTTAAGATGATTAGTTTTTGCTTCATCTGGGATCGCCAAAAAGTTAGTGTACAAGTGTTCTAAAGTTAGTCCCTTCCAAGAAGAAAATTTTCATAACCTGCCCGTGCAAATTCCTTATTTAGTAATGAGTTGATACAATAATTGGAATAAAATTGAAAGCGCAAAGTTAGAAGAATCAAGGCACACACTTAGAGAAGCTTGAGATCTAAAATAACTTCCTTAGAAGAAGCAACTTTTATCACAGTCAAGGATGAAACACTAGTATTTAAATGCTTTAATAGGAATTCTGAAATTTTATAGTTCAATATGAACTCTAAACACTCGAGATAAGTTTGTAAATCTTTCATGATTATCAGCAATCTGGAATCAAAGATAATATTTCATCAAACAAAAAGATACTGAACATAGTACTTTCCCTAATCTTTTTCTTGCCACTCTCTCTGATTGCTCTGCAGAGGTATGAAATGCATAGTTCAAATATGAGTTCTCAGGACTCATATATGGAGGAAGCAATCCCCACCAGCACTACATGTAAACCTGAGTCTCAGGAGAAGAATTCTGTCTATCAATTAAAGAAAAGTTCCAACTATTACAACAAAACTGATCCTTTTCCCCGCCAGAACTCTCCCTTGAATATGTATGAGGAATCTGGTGGGTTAACATTTTCCAGCAGTAGCAAAAGGGATCATAAAATGAGCTCTAGCTATAACATTCCTAGAAGTAGAACTGATTCTGTTTCCATCAATCATGGCTCCTATACTTCAAAAGTTGCTAGTGAACAGACAGAGATGCAGCTGCATAACCATGGTAGTGTCGGCTCAAAGAGTTTTAATAGGAGTGGCTATTTGGACTATAATTCTAAGAAAATGTCAAAGGTCTGTACTTCATACCTTTTTATCGAATGTTCTTATATTATTTTTCCCAACAACCAAATCTGTAAATATTCATCTTGAAAATTTCTAATTCTCCAATGTTAATATAGGAAATGTTCAATGGAGAAGCAAAGCCAGTAAATGAATGTAGTGATCCAGTCAGAGTAAAGATCCCATCATCAAATGAATTGGCTGTGAGTTTAAGCGGTGTCATGTTTTGCACAATGTTATAATATTGTGGTATCGTGGTTCCTTGCTAACACTTCCACTTCTCATATATCCAGGTTGCAAATCGATGTCAGTTGGATTTTCCTCGGCCAGACTATGCTGTAAAAGCATCATTTTCTGAAAAACCAGGGCGGAAGAATCATACTAGAAGGTCAGTCCAAGTAACAATTGCTTGTAGCATGCAAAGTATTGCGCATATTTATATTGAAGTTCTAATTATATTATATCCTGCATTTTCTGCAAATGCTCTTCTCATCGACACTGAACTTTTTACTGGCGCAATCCCAATTTTGTTGCTGAGTCATGCAATGTAGTGGAAAAAAATCATCTTGATCCATAATGGTTCGATCAATTTTGGTTGCAAGTGGTAAGAATTTTTCTTTTTTCTTTTCAATAAGAAAGTCGATCTTTAAACGGGAATAAGTATAATAGAAGGTGAACACAAGGTGTCCTCCCACGCGAAAGGAGCTCTATTAAAAGGAGTTTCGGTTGGTATGTCAAAAAAAGATGTTAAAAAAGAAATTTGAAAAAGTACACGGCAAAGAACTTCAAAACCCCAACAGTCTCCCATTCGTCGACTTGAAGCTTTTGCTCTACTTAAAAATTCCATTGATTTCTTTATAGACATACCT
CCCATAGTGAGTCCATCACAGCATTCTTTCAAATTGTCTTAGCCTTCTTCTCGAAGGGATGCCCACATAAAAGTTTTTTAACATGGTCATTAGCTTTCTCTGGTAGGCACCAAGAAAGGACAAAACCTTGCAGTAACCTGGACCAGTACTCTAGGTATGATGGCAGTGCAGAAACAGGCAATCAAGATCTACCAGATTTCGGGTTACTCCAATTGGGGGAGATGGAGGTACATCAAAGGACGCCTTCTTCGGACCCTATTTTGCTGTGATTAAACCCCTCAAAACTAAGGACCACACCAACAACTTAGACCTTTTCAGGGACTTGCTATTTCTAAATGGAATTGGTGGTATGCTTGGCGAGTTCTTTAGTTAGTTTGATTAGCAAAATACTCACAAGGATAGCACCCACATTTTCTTAAGCTCTCACGCCCTAACAATTGAATCTCTTTGAAGAGTTCGGAAACTTTTTGTTGCCAATCTTGGGCATGTGGCCCACCTTAAGACCTACTTTGAAGTGTTATATATAGAACAGGTAAACATCTAGATCCACCTCAGAGTGTGCTTTCTTCTTTCTGTTTTGACTGAGGCTACTGAACAATTGCTGCATGGATGTCAAAACTATATACCTAATCAATGGTTGAGAAAGAAATACAATTACAGCTAGTATTTATAGCCTTCTTGCCTTCATCTTTAATGTGGTTAGAATCACTGTAGATATTTCCTGTACTGTAACATGAATGCTTCTCCCTGGAGATTTTTAATGAGTTATTTATTAACAACAGCACATGTCATGTAGATCCGCAATGGAGATGCCATACAGCTTCACAGTCGATGAGGCTGCAGATACCAGTTCATCGTTGGATTGAGCAACGCTAGAAGTTTGGTGGGGTCTCGAGATTCAGAGTACATAATTTGGAATTATTGTAAATTGGGTAGCAATATTAGTGAGTTGCAGCACACCTTAGGCAGATTCTGGCGGTCAGGCTCTCTACCTTTCCCTCTCCTCTCTTGTGTATAAGTACAAAGCCTTCATTATAACTGTCTTTCTGAGGAAAATGCTACAAGGTGAAGTCTATCTTCTCTCCTCTCCTTTTTTTTTTTTTTTTTGCGCGTGGATCTCCTATCAGGTTTCTGGTCCTGTCGATTTGTATTCCTGTAATTTGTTTGATTGACAGAAGCATCAACAGTGTATTATAACTGCATCGCCCATTTACGACTCGATAGTTTCATTTCAAGGCACTGTTTGGAAGTCTTAACAATATGTTAGACCTTACATTTTAGCTATTCATTTGTTCACCAAAAGGGGTACTCACCACGTGCCATACCCAAACATGCCTTTGTTAGATTTTTTAGTATATTTGCTTGCCATATTATTTATTTTATCCCATGATAACATATGTAGCAGACCAGAAAGACCCATTAATTGTGAAAATAGCTCACTTATCTACGAAGAATTTGCAGAAGGATTTCTCAGCCACACATTTCTCTCTCTCTCTCTCTCTCTAAATTATGTTATGGTGTCTTTAAGTGGTTCTATTGATCATGCTGTTGTGACAGAATACAAGACAAACAGTCTCTTAATGCTTCAATCATCAAATTTGTTAGCACTTGGAAATATTTTATCTCAATCTCAATCATTAAATTGTATGATTCCAATAGAAATTGTTGTGCACGTCCCTGCACATCAATTTCACGCAAACTTTGCTGAACTCTTGCATGTATTATTGTTTTCAATCAGTCAAGAGTTGTCATCTCAAATTGAAAACTTAATAACTTAAAAATGTTACGAAAGAAACAACCACGTTTGAGTTTTCTTTTAAAAACTTTGCTTCTCCTACCAACACAGATCGCATGAACCATTCATTACCTTGACTGGGAAAAATGGTAAAAAATGGCTTAAGGAAGTGGCTTTTTAGGTAGGGGTGTGGCTTTATTTTGCATCATGTGGGTGGGGGGGTTGCAATTGGAGGGGCCTGCCCCGGCCTCTCAATTTGTTGGAT
AAGCACATCTCACACAATAAAAAAGAGTGCTCAGTTTACAAGAAAACTGAGGAATTATTGAAACATGAGATTGTTGAAATATAGAATCAAAGCATTCAATGGATGGTGTCCAGTGTTAGGGCTTCATTTACACCTCAAGATTTAGAATGTTAAAAGGTTGGGGATTAGGAGGACCACTTGGGGACTTAGAAAAAGGAAGTTACAATACAGAGACTTGTGGCCTCCATCACTCTCACCCCTCACACCCCCCACTTTTTCAAAAGTGTACCCACCCACCTTGTTGTTGCTCTCATTCTTCATTCCTTGTAATCAAAACTTGGTATGACAAAACTCTCTCTCTCTCTCTGGCTTTTCTTAGTCTTTGTATTTCTCTTTCTAAATTGCAGATTCTTGAAAGCTTCGTCTGAATGTTTGGCTTACGAGTTGATTTGTTTAACTTAAGATTCAAGTCTACATTTAGAGTGAGTTTTGGTATGATTTTTGGAATATTGAAACATGCTTTTAGCCAATTAAAAACGTTTTCCAAAATCTTTGTAATGTTTGTTACATGGTTTAAAGTGATTTTGAACATTCTAAAAGTACTTCAAGGCACTTTAAAGTAATATTAGATAAACGCTTAATTAGTTTTTCTTTAAAAATTTTTATGAAAAATATGTTTCACTTAAAAGTACTATTCTCAAAAGTCATCTCAAATTATCTCTCAACGCCCTTCATGTATTTAATTGATATATTTGACTTGAATGAGCATCCTTCTTAATCTAATAAAATTTTTAATTTGTGATTTCTTGTTGCTTTAATTTAACAATTCATTCCTAAGTTATTGGCTTCTTCCATGGGGTAGTTCTCAAAATGCAATTAATTTGTAAAAGAGATGTATTGATGTCAGCAAAACAATGGCTTTTGCTAATAACAATTTGATGTTAACGTTTCATTTTTTCATTATTATTTTTAAATTTAAAATGCAAAAAAAAGTTAGACATGCCACATTTTTAATATACATCTTTTATCAATATCATCAAATAATTCTAAGTTTGATTAAAATTAAAATCAAACGACGCTACAATTTTATATATATATATTTAGTTTAACGAATATATGAATGAGGATCGAACTTCGACCTTATAAAAGTAATAAGTACTTTATTCACTAACTGAAAAAAAAAAAAGAGATTAACGAAGAGATAATATAACTATTAGTGGATTAGTTAAGTAAGTTGAGTAAATAATGAGGGTTTAGAAAATTGGAACTGAAAAAATTGACGCCGTTGCTCGGGCATGTGTCAAAGTCCAGCCGACGGAGAATGGGAGAAAATGGAAAGAGCAAGTGTCAACATCGAAATTTGACAAAATATCGTGGGCATTGTCACTTATCATATCAACTTTTATTTTTTTCATTAATACTTTCTTTCCAATTCCAATTTCTATACTATTTCCTTTCAACGGAAAAAAAAGTATACTTTTATTTTTTTAAAAAAATAAGTATACCATGTTACAAATACGGATAATGATTCAACCTACAACCTCTTAAAAAAAAATTATCTTAATTATTAGGCTATAAAATAATAAAACTATAGTTACTTGTAATAGTAAAAGAATTTATAGTGTTTGTCTTAAAGTTACTCGTTTTATTTTGGTTCTTAAAATTTTAAAAATTTTATTTTTCATTGCTTGTAAAATATAAGTTTGAAATTTAATAAAAAATGACATAATCTATTTTATTATCGATTAACATCCCCATATAATTAAATGTTAGAAAATGAGCATTTAAGATCCTAATGTCATATAATAAAAATATGAATCTTGTCACTTTTCATTCAAAGGTTTTAATGTATTAAATGTGAAATTTATTAAAGCTTGAGAGACCATACTAAACTTTTTTAAACTTTATGAACCAAATTAGAAATGAAGTAAGCTTTAGAGACCAAAATTATATTTTTATTTATGTCAGATCTATCAATTGATAAGTAATTTTTACGCAGACAAGCGTTGAAACCACATCCC
ACATTTAATATGTTAGATGTTGAATGTCAAAATTGATCAAGATTTATTCATGATTAACAAAAATCAATGACAATGACTAAAATATTTTTTTAATATAAAGTTTAGGGATAAAAACATAACATTAAAGAGTTTAATGACTAAAATATAATATTTGAATATTTAGAAACTAAAACATAATTTAAATATTAATTTTACACTAACCTAATCAACTAAATTTTTGGTCTAACATTGGTTTTAACTGATATCATAAAGTATTGAATTCAAACATTTTAAAGTGCTATTACTTCCATAACCTACTTTGTTAGTTCTCTTATTAAAATTGAATTTCACAAATTAGAGGAACGTTACAAATATATTGAAAATATCAAATTTATCGTATGTTAATCTTTTAAATTTTATCTGTAATTCTTTTTAAAAAAGTTATAGGTTCAAATCTTAATTCCATATTTGTGATGCGATACTCCGAGAGAAAATAATCTCCCCTTTAATGTTATATTATAAAGAGATAATAAATTAAAAAAATGGATATATTTTTCATAGAATTTGTAAATGTCACCATGGAAATGAATTTTGTATTTTTAATTATTATTCCTTTTTATTTTCCATTTGTTCCGATTAAGATTGAGCTCTCTCTCCGAAAGAACAACAAATTTTTCTTTTTTTTTTTTTTTTATACTTTTTACAATTCATTCTATTCGTCTGCTCTACACTCTGTCATTCTCTCTCCTCGACTTTCCACTCTCTCTCTCTCTTTCTCTCCTGCAACTCCGAATCAGATCATAGTTCCCAGTTTTCTTTCACTTTCTTGCACTTTCTCGGTCAGGTACTTGTTCTCATACGTCAGAGGCATAACTCGAACTTCTGCAGTCTGTTCACGCGAATCAGTGGAGTTTCACCTCTGTTTTCGGCCATTTCGATTCTTAAACCTAGTTGTTAGTTTCTCCTGTCAAGTCATTGAAGCAGACTAAGCCAATTCAGCCGAGTTAATCAAGAATTTGCAGCTATAGTGCTTCAGAGTCAGTGCAGTTTCAAGCGGTGGCTGGACGAATCGGACTGGAGCTGTATAGTTGAGATGGATGTTTTTAGTATTGTAGATTGGAATTGCTCGTTGAATTTGAGTTTTAGGCGGAGAGTAGGCGGCTTCTCGCAGGAATTGGTCAATGGAAGCGCTTGGTTTTGATAATGGGTGAGCTAAAGTTAACTCTTTAGATTTTGAGGGTTCAAAAGGAAGAAATCTTGATGTCGAGGATGGATAGGATGGCTTCAGATCTTAATAGGAATGGTTCGGTGGAAAGGGATATCGAGCAGGTGTCTCATCCGTTTCCTTTTGTCCCTCTTTCTTTTCTTTTGCATCCTTGTACCATTCAAACTTTTGTCCTTCTGATTTGCTAATTTATAATGCATTCATTGCTATGTTTCTCTCAAATCAGTGTAACAACCGTATAAATGAAGTAGTGGAATTTTTGTCAGTGTCGTTTTGCACTGCCGTCCCCATGTAGCATAATCTAGTCTCGTTTTCAGTGCAAACAAGTGCCGTTTTAACTACGATAGAGTCGTCAGAAGTTGAGTTGTAATCTTTGATGAATTTATGGAGGCAAGTTTGATTTACTTAGCGGGATATGAAAAGTAAATTAATTAGATGAAGGTAGTTGTGTGCTGAATATTGACGACTATTTTATGGAAAGCTACCTACACTTTCAGGTCCTAGTGATGCAATTTAGCATTGGTGAAAAACAATAGCACTGGAGGATTTCATATGATCTGCTGATGTATGATTTTGAAAATTGCTTAGACAAACCGAAGGATGCATCCCACACCCACTTTTGTCGATCTAAATCAACTAGGCAACCTTCTTATACAATGACAGTGGTTGAAGATGAATATAGTAGAACCATCACATCCCTTCTCAACACCGATGCCCTTGATCTAAATTTAATTCATAGGATCAACATTAGTTTAGTAATTATGTTTTAATTTTCATGAGCAATCATTGCCAAGGA
ATCAAAGATGTGCTTTTCTTTCCTATGTTTTATGCTCATTTATTGATCCAAGTAGTCAAGTAGATTCCAAAGACTAGTCGTTGCAGCCAATGATGTCAATTGGCAACATGTTCAAGGCTTTTATTAAGTGCAACATGACATTTTAGACTGTAGAGAGGAAGATGGAAACCATGATGAATACACATAACATTTTATTTTGTTCATATATTACAGTGTAAGTGTTACTGATAAAGAATTGAAAGGTACCATCTTTTCATTGATATTCTTGGAGGAGGCGTGAGGTTTGGCCCATTTTAATGTCTCCATCTGGATGTCCTAGACTAATTCATTGTAATTTTTTTTTAAGTTTTGACTTTTAACAATTGCAGCATCTTTTTGTAAAATTGGCTCCTCCTCGTTTGGGCTTGACTTTTTGTATGGCTCTTGCTTTTGTTCCTCCCCTCATCTTAATAAAAGTTCGTTTTTTCATCAATAAAAATATTTTCATTCTTATTATGTTTTTTGTTAATGTATCATTTCTTGCCTGGAATCTGCAGCATCTACATTTCCACCACTTGATAGCTAAACTTCTGGAAGCCATTTAGATGGCCTCCGCATGGATATAGTATGTAATACTTATTGCTGTTTTACATATTACAGTCCATTATTGCTCTAAAGAAGGGAGCATATCTGCTAAAGTATGGAAGGAGGGGAAAACCAAAATTCTGTCCTTTCCGGCTTTCCAATGTAAGTTTTGGTTCTGTTTTAACAAATTTGTTGTTGCATTTGTTCAAAAATTACACTTTTGTATTAGTGATTATCGTTGTATGCATGATATTACCTTCTGATGGGGTAAGCTCAATTTTTACACCATCACGGTGCATGAACTTTGCTCGATGATATCATGTTTATGAGAAAATTGCATCTTCTGAAAATGTCTAGGTGATTGATTTAGCTAAAGAGGTCATTGTCTGTGTAGCATATTTACTTGGAGCTTGATCACTGGATAACATATCATCATAAAGAAAGGGATGTCCAACTGGATGACATGACATTCTCCCCCAAAATTTGACGAAAGTAGATTCCAATTGGATAAGAAAAGAAGAGCAGATTATTTGATGTGTTTAAGATCCTAACATAATCGAGTCAATATTAAACTCTTGTCATACTCTTTTCTTTTATTTTGGTTGTTAATGGAGCTCCATACTATGTTTCCCTCTCTTCTTGATCTACTTACATGGGGTAGTGTCATAAGAACTCAAAGTGATTGGCAATGTTTTATGTTTTCAGGATGAGTCTGTTCTAATTTGGTTTTCAGGGAAAGAGGAGAAACACCTTAAACTAAGCCATGTTTCTAGAATAATATCTGGGCAACGCACTGTAAGTATCACTTTCTAAAGATTAGATTACTTCAATACCTATGTCTATTCTCCCCCCGCCTTTGTGGCCACATCCACATTGATTTGCATCCAATGTGATAGTGTGATAATTATCAAGTGAGACGTGGTAAAGTTTATGACGCTGAACTTTAATAATCTCTAATTTCTAATTTTTTGTATACTGAAAAGTCACAATTGAAAAGGTTGCTGCTTTCATTGCATATGGTTTATTTAGAACGTTAAGAATGTGATATCTTAGAAATTTTAAGCTGCTTGCATGTCATCTATTTTGTTAAACCTTTCTAGAAGCTGTTCATGTGTCGGCCTATAACCAAGAAAATGTAGAGAATAATAGAAATCTTAAGGAGAAGATATTTTTTGTTCATTGCCTTTTGAAGGTCCTATGTAAAACTTGCTTTCCTATTTCAATCTTTGGGTAGGTTCATTAAACAGTGTCCTTGGAAGAACCACCTTGAGCACATAAGTTTAGTCAGTCATTGTGTATGGTGTTTTTATTCTTTTTCTTTCACATAAAAAGAAAAGTTTTAATTGGTATTTTGGCATTTGCTTATACCTTATAGGTTTAAAATTATTTGGGTCCCTAAATTTTTACACTTATTTCGTTTGGTCCTTATACTTTGA
TAATCTTTTTTTTCAGTCCTTGAACTTAAAAAAATATTACACTTAGTCTTTGTTTTTACATTTCTATATTGCAATTAATAGGACAATTAGTCCACCCTTTCTAAACTTGATATCAAACATACATCTACATTGGTGCACGTAAGGTTTCTGCTAAATTAACTATAGAAATTTAACAAAACATATCAAAATTCATTGTTATGCAAAGTTTAGTGATCATAATAGAACATTTGAAAGTATTAGAATCACAATAGAATAAGTGAGAAAGTTTTGGAACAAACATAAGATTTAAACCTAACTTATATTTTGAGGCTTATTTCAACCAAAAACATTAATTGGAATCATTGACTATTCTACTTATGGCTTTATGATGTATGTATGCTACTACTGATCAGATCTGTTTCACAAAACTTTCCAGCCAATATTTCAAAGGTATCCACGGCCAGAAAAGGAATACCAGTCATTTTCTCTAATATATAACGAAAGATCTTTAGATTTGGTATGTTATTGTTCTAATTATTATTTCATTTATTATTCTGTTTATATTCAGTTAATTGGCCTCACCCTTATTTATCTTGCCTACTTTGTGATTTTAAATCTATAATAATCTGGTTATTTATTCCGCTTACTAGATTTGCAAGGATAAAGATGAAGCTGAGGTTTGGTTCAATGGTTTGAAAACATTAATTTCTCGTAGCCATCACCGTAAATGGAGGACAGAATCTAGGAGCGATGGAATGCAGTCTGAAGCAAATAGTCCTCGAACTTACACCAGAAGAAGTTCTCCTCTTAATTCACCATTTGGTAGTAATGATAGCTTGCAAAAGGTAAACTTATTTCTTTCTCTATCTTCTTATCTATTACGCAAAGTAAATTCAGATAACCAATGTTTTTTCCCCTTAAAATTTAGGATGGTGATTTTCGACTTCACAGTCCATATGAAAGTCCTCCTAAAAATGGAATGGATAAGGCGTTATCAGATGTGATACTGTATGCTGTTCCTCCCAAGGGCTTCTTCCCTTCTGATTCTGCCAGTATATCAGTTAATTCTTTATCATCAGGTAGCTCAGACATGCATGGTCCCATGAAAGCAATGGGAATTGATGCTTTTAGAGTTAGTTTATCAAGTGCTGTCAGCTCATCCAGCCAAGGCTCAGGTCATGATGATGGTGATGCCTTGGGGGACGTTTTTATTTGGGGTGAAGGAACTGGGGATGGTGTTCTTGGTGGTGGAAGTCATAGAGTTGGAAGTTGTTTAAGTATCAAAATGGATTCTTTGCTGCCTAAAGCCCTGGAATCTGCTGTAGTTCTGGACGTTCAGAACATTGCCTGTGGTGGACGCCATGCTGCCTTAGTGACCAAGCAAGGAGAAATTTTCACCTGGGGGGAGGAATCAGGAGGCAGGCTTGGGCATGGTGTCGATTCTGATGTTTTGCAACCAAAGCTTATAGATGCCCTTGGTAATACAAATATTGAATTGGTATCTTGTGGTGAGTACCACACATCTGCTGTAACACTTTCTGGTGATTTGTACACATGGGGTGATGGAACTTACAATTTTGGTCTTCTTGGCCATGGGAATGAAGTAAGCCACTGGATCCCTAAAAAAATAAATGGACCATTGGAGGGCATACATGTCTCTTCTATCTCTTGTGGACCTTGGCACACTGCAGTTGTAACCTCTGCAGGGCAACTTTTTACCTTTGGTGATGGAACGTTTGGTGTTTTAGGCCATGGAGATCGCAACAGTGTCTCAATGCCTAGAGAAGTGGAGTCCCTCAAGGGTCTACGCACTGTGCGGGCTGCTTGTGGCGTTTGGCATACTGCTGCTGTAGTTGAAGTTATGGTTGGAAGCTCAAGTTCCAGCAATTGCTCTTCAGGGAAGCTATTCACATGGGGAGATGGAGATAAGGGTCGACTAGGGCATGGTGACAAAGAGACTAAACTTGTGCCTACTTGTGTTGCAGCTCTTGTTGAACCTAATTTTTGTCGAGTTTCATGTG
GGCACAGCCTGACAGTCGCCCTTACAACATCTGGCCATGTCTACACAATGGGGAGTCCTGTTTATGGTCAGTTAGGAAATCCTCATGCTGATGGGAAGGTTCCAGTTCGAGTTGAAGGAAAGCTTTCCAAAAGTTTTGTGGAAGAAATAGCTTGTGGTGCTTATCATGTTGCTGTTTTAACTTCAAGAACGGAAGTCTACACTTGGGGCAAGGGTGCAAATGGTCGTTTGGGTCATGGTGATACAGATGACAGAAATTCACCAACGTTAGTAGAAGCTTTGAAAGACAAGCAAGTTAAAAGCATTGCATGTGGTACAAATTTTACTGCAGCTATCTGCCTTCATAAATGGGTTTCTGGTGTTGATCAGTCTATGTGTTCAGGCTGCCACTTACCATTTAACTTCAAAAGGAAGCGGCATAATTGTTATAACTGTGGACTTGTTTTCTGCCATTCATGCAGCAGTAAGAAATGTCACAAGGCTTCTATGGCCCCAAATCCTAACAAACCTTATCGTGTATGTGATAACTGTTATAACAAACTACGGAAGGCACTTGAGACTGATGCTTCTTCTCAGTCTTCAGTGAGCCGAAGAAGAAGCATCAATCAAGGAACGACTGAATTTGTTGAGAAAGATGAGAAACCGGAATCTGTCAAGTCTCGTGCTCAACTTGCTCGGTTTTCTTCCATGGAATCTGTGAAGCAAGTTGAAAACCAATCTTCCAAGAAAAACAAAAAATTTGAATGTAATAGTAGCAGGGTGTCGCCCGTTCCAAATGGAGGATCCCAGTGGGGAGTTATTTCCAAATCATTTAATCCAGTGTTTGGGTCATCTAAAAAGTTCTTTTCAGCTTCGGTTCCTGGTTCTAGAATTGTTTCCAGAGCAACATCCCCAATATCAAGGCGAGCAAGTCCACCTCGCTCAACAACACCTACCCCAACTCTTGGAGGTCTTACCTCACCAAAGATTGCAGTAGATGATGGTAAAAGGACAAATGATAGTCTTAGCCAGGAGGTTATTAAGTTAAAAGCTCAGGTACACAGTATCCATTTTTCTCTCGTATCATTTCTCCTGCTTATCATTTCGGTGTTGCATTTAATGGTATATGAAACTAAGTAATCTTGAGGAGCTGTAATGTAACACAAGAATTTTTCCATAGAGAGATATAGCCACTTCGAGGCTTGAAAGAAGCCTTAATTTTGATTTCTATGATACAATAAATGATGATGTAGAATCAAGTTGCCATTCCTTATTATTATTCTTTATGGTCGATTTACAATAGATTTCATTGCCAAAATGTCCGATTTACCAAATTGGTTTCCACATTAGAAAAAACTATACGGTATAAAGTAAATAATGCAGTATTCAGAGGTATATATGTATTCTCTTATTTATTTATTTTCCCTCCAAATTGTTGCAGGCAATCATTTGCTCAAAATACAAATGAATGAATTTATAAAAAAATTAATCGTTGGAAAGAAACCATAGCTTATCAAATTTCTTGGATCCTCTTGGTACTCTGTCAAGATTTTTTCTAATTATCCTTCTTTTTTGATTAATGCCAAATGGAGTTCTTCTTTAGAAGTTCCTTTGGTAGGTGGAAATCTATCTCTTATCTAAAACAATTTATAGAAAATGTTGATTCTACAACCTTTGCTTCAGTAATGGAACTGTTGTACCAAACTTCTTCAAGGTTCTTTGATTCAGTTGCATATTGCTTTTGCTACATTGTCCTTTTTCCTTTATTGTACGAAAACCATAATTTGCATAGCAGTCCTGAAAAGCATCAAGCCTTACCAATTGTTTTTTTTCAGGTTGAAAATCTTACCCGCAAAGCCCAACTTCAAGAAGTTGAGTTGGAAAGAACGACCAAACAGTTGAAGGAAGCACTATCATTTGCTGCAGGAGAAGCAACAAAGTGCAATGCAGCAAAGGAAGTAATCAAGTCACTTACTGCCCAAGTAAGATGTATTCTTGTATGTGATGGTATACTGGTTTATG
GAGTTATATATCTATGTCCGCTATTATTTCTTTTTCTTACCTGCAAATAGGAATCTGCATGCCATTTACTTGAGCTCCCTTACATTAGCATATCGTAAGTGCCTTCATAAAAAAATTACAAGTAGTAAACCACGAATTACTACTACCTATTTCTGTTCCTATTTTGGCTTCTATCTCCCCATTTAATATACCCATAAATACCAGGACCTCTCCATGTTCACAGAATTTGAAACTGGGGACTATTTTCTAGCTTTATAACTTTGATAGTGAAGAATTCTGAGTGACATTAGTTACCCTAGATGGAGGTGACCATAAAATAATGTGTTTTGAACTCACAGCCTCTTGCGCGAACCTCTAATTTCACTGATGAATAAAGATTCTTTAACCACTAATTCCTAACCTTTCGTGATAAGGGTTTGTCTAGGCATTTTACATTGGGCAAAAGAATCTACTTATGTCTCGCTAATTCAAAGAACTAAAGCTAGTTAGTCAAGCTAATTTATTTATATTATTGATCATACTAAGGATTAGAAAAATAAATTTGAAATCTTTTTGGCTGCAATTTGATGGTAGCTTTAGCTGCTGTTCCTGGTTGAGATCCCAGTTGTTGGATATTGTCTCTTACTTTCTGTTAGCACTACTTTCAGGTGCCAAATAAAATTTTCAATTTTACTCAGTCGATAGATAAACGACACTGACTTATTTTTACATTATCATTTTTACTTAAAGGTGCAAATTTCTTTTTCCTGTGTTTCAATAGTTGAAGGAAATGGCAGAAAGACTTCCAGTTGGAGCAGCTCGAAACATCAAATCACCTTCTCTTGCCTCCTTGGGCTCCAGTCCTCTCTTCAATGATGTGGTTACTCCATCAATTGACCGATCTAATGGTCAAACAATGTCTCTAGAAGCCGACATTATAGAATCAAACAGTCACTTGCTGTCTAATGGGTCCAGCACTGCAAGTAATCGTAGTTCAGGCCATAACAGACAAGGAAATTCTGATTCAACGACTAAAAATGGTAACAAGGTTAAAGAAAGTGATTCCCGTCATGATGCTGAATGGGTTGAGCAAGATGAGCCTGGTGTATATATCACGTTTACCTCCCTTCAGGGTGGTGCCAAAGATCTCAAGCGAGTGCGTTTCAGGTATAATTCTTTAGCCTCGTTGACTTGATCCTTGTAAAAGAATTGATCTGAAGCTAGTAGTTTAAATGATGTCCTTATCAAACTATCAAAAATGAAGGCACATGCCTCTGCTCCCCATCTGGATTCTTTTGCTGAACAGCAGCCTGTTATCCTCTCCTATGGGTGGCTTAGGTTGTCTGGCTTGGTAATAACTTAGTGCTTGAGAGGGTAGGTAATAATAAAATCAGTGACCATTGTAATAACCATCCAAGGGTAGGTAATAATAAAATCAATGACCATTGTAATAATTCATTAATAGTTGTTCATGCATGGTGTAAAAGAGGAAAAGTCAATGAAATTCATTGGCTGTCAATTAGTAACCATCAACTTCAAATTTTAATGTTTTCCCGTCATTATTGAGAATCAAAACTTTTTGAGATAAGAACAATCAGGAACAAGCATTTATTCTTGTTGTTGAATTTTGTGAAGCATGTTGTGTGGAGATGATGGGTCACGTGGTAGCCCATCTTTCCAATCTCCGTGTTAACAAAAGAATCACTTGTGTAAAGCATACGGTCCATCATATTTTGATGGTGCAGAATTTATAACATTTCACCATTTTTGGTTGCAGTCGGAAACGGTTTACCGAGAAGCAAGCAGAACAGTGGTGGGCAGAGAACCGAGCAAGAGTATATGATCAATATAATGTGCGTATGATCGATAAGTCCAGTGTGGGCGTTGGTAGCGAGGACTTGGCTCACTGACATGTGGAAATGGGTCAGTCCAATCCAATTGATTGTAAATGAACAGGATGTTTACAGGCTCACACCTTATATTACTGATAAGTGATAATATGATCAATTTGTTTGGTTTTA
TCCTTCCCTAGGATAGAAAAGAAAAAGCAATATATGGTATTGAGTTAGAGGGATAACTAGAGGGGTTTGTGTTTTTAATTGTTCCCCTGCTTTCACCTTCATTTTCTTGTTTGTTTCTCTGCCCTATTTTCTCTTTTGGGGGTCCCTCAATTTTTCCCCTATATATGTGTCTATGTGGGGGATTGAGTGTGAGAAATGTAAATTTTTAGTAACTCTGCTGGCTGTTGTTCATATCAGCTGCAATGTAAATCCGATAAGCGAATGCATCTGCATATTTTTGTGCTCATCACCTGTTCTGAACTGTCTATAATTCCATTTTTTCCATTTAAAGAGCAGCAATTTTCATGCTTTGAGTTTTGACTACTTTGCTCTCTCGGATGCGGTGCATTCTAGCAGTATGATGCTCAGAGTCCCTCGGACATGTTGCTTGTGTATAAAGAAATTTTTTTTGACTACTTACCTTGCATTATTGTTTGATCGTAGTGTGCTAGAAATTTTACCCCCACCCTGAAAAGAAACCGTTCCTTTTCCATTAGATTTGCAGACGTTACAGCCAGAAATATTCAAATTCTCCTACTTTGGATGAACTGTTCCAGCATTTACCAAAGAAAACGATCCTTGATGGGTGTAATTTGATCCATCAAGTCTTCTTTTAGAATTTTGTTAGTTTCATTCTCAGGATTCACTATAAAATTCAATCTCAGGACTCTCCACAAAGATTCAAAAGGTTAAAGTTGTTGGGTTGAGGTAAATTTATTATTATTATTTTATATGTTCTAATACTCTCACCTATGGTGGGTTTAGATGGTATCATGAAATAGAACTCACAAGACTTAGAAATTTAAGTTGAAATGGAGAAAAAAATCACTTGGTAGGTAGTATTTTGTTTTTAAAAATTAGAGATTCGAACCTATAATCTCTTAAAATATATAAAAATGTCTTAATCACTTGATTATCTTCTAAAAATCCGTAGTTATTATTTTATAATTTCAAGTTTAACTTTAATTTGTTAGATTAAAGTAAATTAAACTTTTTTATGTGTTCTCTTACGCATATTAATAAGAATAATTTTTTAGAAAATGAAAGAGGCAATAATAAAAATATACTTAATAATCAATTATTATAAATCCGTCTAATGATATACATTTTTTCGAGAAACGAAGGAACGTATCATTGCGAATTCCCGACAAGATGCATCATTTTTCACATCCTCGTGGCAAAAGCAGCTCATTATTGTTATCCTAACCAAAAAAAGCTCATTATTGTTCATTGGAACTTAATCTGAATACCGCCATTGATTAAAAAAAAAACCATCTAATACTACGAGAATAAACGGCTTAAAAATTTCAAAATAGTTTTAATAAATATAATGTTAAATTACAAGTCTAATCTCTAAACGTTTAGAATTATGTAAAATAAATCTCTAAATTTTAAAAAGTATTTATTAAACACAAAATTGAAAGTTTAGAAATTTATTAGACACTTTTTAAAATTTAAAGATCTATTTGACACAACAATAAAAGTTCAATAACTAAACTTGTAATTCAATCTAAATTCTACAAAATTTACTTTAATTCTACAATTCTTTTTGTTTAAAATTTCAATTCGTTATTCATTTTCCTTTCAATTCTTCTTTATTTTGATCGCGTCTCGTTTCACTTTCCTCCGTCGCAACACTGCATGAGCTCTCTCTCCCCAAACAATTCCACCTTCTCTCCACAAACCTCTCTCTCGATCTCTCCCGCTTTTCCAATTTAACTTTAAACTATAATGTATGGACTATACGAGGCTGTCATCTGCCTTGAGATCTTGCGCTAGCTCCAAGTTACTGAAACAAGGCAAACTCATTCACCAGAGAATATTTTCTTCAGGCTTTCAAACCAACATCGCCCTCTGCAAAACCCTCATCGACTTTTACTTCTCTTGCCATGATTACAATCAGCGAAGCTTGTTTTTCAGACCACCGACTGCCCATTGGATGTTTCTCTGTGGAATGCTCT
TCTGTCTGCTTACACCAAGAACTTCATGTTCGATGAAGCTTTGCAACTCTTTGACCAGTTGAAGTGTCATTCTCATGTAAGACCTGATTGTTACACTTACCCAGTTGCTCTCAAGGCGTGCGGTGGATTGGGTAGAGTTGTTTGTGGGAGAAGGGTCCATAATCATTTGATAAAAACGGGTTTGATATGGGATGTTTTTGTGACGAGCTCTCTGATGAATATGTATGCGAAGTGTAATCAGTTTCATGATGCCATTAATCTGTTCGATGAATTGCCTCACAGAGATGTGGGGTGTTGGAACACAGTAATCTCTTGTTATTTTCAAGATGATAAGGCTGAGACTGCCCTGAAAATGTTCGATAAAATGAAAGATTCGGGTTTTGAGCCTAATTCAGTGACTTTTACTATTGTTATCTCTTCATGTACAAGGCTTTTGAATTTGGAAAGAGGTAAGGAGATTCGTAGGGAGTTGATGGAGAGCGGGGTTTTGTTGGATGCTTTTGTTCTATCTGCGCTTGTAGATATGTATGGAAAATGTGGTTGTTTAGAAATGGCCAAAGAAGTTTTTGAGCAAATTCCAAGGAAGAATGCGATCACTTGGAATTCCATGATCACAGGTTATAGCTTGAAAGGTGACAGCAGATCGTGCATTGAACTTCTAAAGAGGATGAATGACGAAGGAACCAAACCGACTTTGTCAACTTTAACCAGCATAATATCAGCTAGCTCGAGATCAGTTCAACCTTGGCATGGAAAATTCATACATGGATATTTTTTAAGAAATAGAATGGATGCTGATATCTTCATCGACATTTCTCTCATTGATCTATATTTCAAATGTGGATATGTTGCTTCAGCTGAAACTATCTTCAGAAATATATCCAAGAATGAAGTAGTTTCTTGGAATGTTATGATTTCTGGATATGTCATGGTGGGTAAGCACATTCAGGCTCTCCGCACCTATGATAACATGAAAGAACACTGCGTAAAACCAGATGCCGTTACATTTTCTAGCACCTTATCAGCTTGTTCACAGCTAGCAGCCTTGGAAAAGGGTAGGGAGCTTCACAACTGCATTATCAGTCATAAGTTGGAAACCAATGAAATTGTCATGGGGGCTCTTCTTGATATGTATGCTAAATGTGGTGATGTCGATGAAGCGCGGAAACTCTTTCATCGTATACCAGAGAGGGATCTTGTATCGTGGACAACAATGATCACTGCTTATGGATCTCATGGCCAACCGTCAGAAGCTTTGAGGATTTTTGATGAAATGCAGAAGTCGAACATACAAGCAGATTCAGTTACATTCCTAGCAGTCCTATCTGCTTGTAGCCATGCTGGATTGGTTGATGAAGGTTATAGATATTTCAACGAGATGATCATTCAGTATGACATTAAGCCCGGCATTGAACACAATTCATGCTTGATAGATCTTCTCGGACGTGCTGGAAGATTATGTGAAGCTTATGAGATTCTCCAAAGATCAGAAGAGACTAGGAATGATATTGGATTGTTAAGCACATTGTTTTCTGCTTGTCGCTTACATAACAATTTCGTTTTAGGTATAGAAATTGGCAAAATGCTTGTGGAGGTAGATCCCGATGATCCTTCTACTTACATTTTGCTGTCGAATATGTATGCTTCTGTCAATAAATGGGAGGAGGTACGAAAAGTACGACAAAAAATGAAAGAACTAGGATTGAATAAAAGCCCTGGTTGCAGCTGGATAGAGATAAACCAGAGGATCCAGCCATTCTTTGTTGAAGATAAGTCAAACCCTCTGGTTGATGGGGTCTATGAATGTCTAAGCAGTCTAGCTCGTCATATGGAGAAGTATGAATTAGAGCTGCAGTAG

mRNA sequence

ATGCATTCTGAGGAGAATAAAGATTTTACCGAGAAGAACAAGAAACGGAAGCTCAAGACACCTAACCAGGTGATTGCTTTAGAGAAGTTCTATAATGAACACAAGTATCCTACAGAGGAAATGAAATCACAGCTTGCAGAGCAGCTAGGTTTGACTGAAAAGCAAATTTCTGGATGGTTTTGCCACAGAAGATTAAAAGACAAAAGGTTTTGTGAAACATATGCTAGTGTACGACAGGATCGTTCAAGTGGTGTTATCCAAGATCATGGCAGTGGGCTCGCACAAGATTCATGTGGTAGCACAAAAAATGGAGACTATTGGCATATCGATCCACGTGAAGTTGAAAGTCAAAAGCCTTATGGCCATGAGCACCCAGCTACAGATAATGTCCTTGAGCGTAGGAGTCAATTTACAGAAAATGTTAGTAATATGGATAATACATCTTCAGAAAGCAGCTCTTCTTTAAAAGATAGGTTATTATCTCAAAGTGAAAATCCGTATGATACGGAAGTTTCGCGGTATTTAACACATGATGGTGCTATTCCACCATCAAATCCTAAGGTTTTAAACTCCCTGCGATATAAACCATCAGGCTATTTAAAAGTGAAGGGCGAAGTTGAAAATGCTGCTATTACTGCTGTTAAGAGACAGTTGGGTGTGCAATATCGGGAGGACGGTCCACCGCTTGGTGTGGAATTCCAGCCACTTCCTCCTGGTGCATTTGAGTCCCCAGCGAAAGGTCCAATCCATGATTCATACTATGTTGGAAATCCCGTGCTTCCCCATTCTCCAGACATATTGACAATGAAGAAACAAAGAGCTCTTGGCTCTAGGTATGAAATGCATAGTTCAAATATGAGTTCTCAGGACTCATATATGGAGGAAGCAATCCCCACCAGCACTACATGTAAACCTGAGTCTCAGGAGAAGAATTCTGTCTATCAATTAAAGAAAAGTTCCAACTATTACAACAAAACTGATCCTTTTCCCCGCCAGAACTCTCCCTTGAATATGTATGAGGAATCTGGTGGGTTAACATTTTCCAGCAGTAGCAAAAGGGATCATAAAATGAGCTCTAGCTATAACATTCCTAGAAGTAGAACTGATTCTGTTTCCATCAATCATGGCTCCTATACTTCAAAAGTTGCTAGTGAACAGACAGAGATGCAGCTGCATAACCATGGTAGTGTCGGCTCAAAGAGTTTTAATAGGAGTGGCTATTTGGACTATAATTCTAAGAAAATGTCAAAGGAAATGTTCAATGGAGAAGCAAAGCCAGTAAATGAATGTAGTGATCCAGTCAGAGTAAAGATCCCATCATCAAATGAATTGGCTGTTGCAAATCGATGTCAGTTGGATTTTCCTCGGCCAGACTATGCTGTAAAAGCATCATTTTCTGAAAAACCAGGGCGGAAGAATCATACTAGAAGGCACCAAGAAAGGACAAAACCTTGCAGTAACCTGGACCAGTACTCTAGGTATGATGGCAGTGCAGAAACAGGCAATCAAGATCTACCAGATTTCGGGTTACTCCAATTGGGGGAGATGGAGGTACTTGTTCTCATACGTCAGAGGCATAACTCGAACTTCTGCAGTCTGTTCACGCGAATCAGTGGAGTTTCACCTCTGTTTTCGGCCATTTCGATTCTTAAACCTAGTTCTATAGTGCTTCAGAGTCAGTGCAGTTTCAAGCGGTGGCTGGACGAATCGGACTGGAGCTGTATAGTTGAGATGGATGTTTTTAGTATTGTAGATTGGAATTGCTCGTTGAATTTGAGTTTTAGGCGGAGAATTTTGAGGGTTCAAAAGGAAGAAATCTTGATGTCGAGGATGGATAGGATGGCTTCAGATCTTAATAGGAATGGTTCGGTGGAAAGGGATATCGAGCAGTCCATTATTGCTCTAAAGAAGGGAGCATATCTGCTAAAGTATGGAAGGAGGGGAAAACCAAAATTCTGTCCTTTCCGGCTTTCCAATGATGAGTCTGTTCTAATTTGGTT
TTCAGGGAAAGAGGAGAAACACCTTAAACTAAGCCATGTTTCTAGAATAATATCTGGGCAACGCACTCCAATATTTCAAAGGTATCCACGGCCAGAAAAGGAATACCAGTCATTTTCTCTAATATATAACGAAAGATCTTTAGATTTGATTTGCAAGGATAAAGATGAAGCTGAGGTTTGGTTCAATGGTTTGAAAACATTAATTTCTCGTAGCCATCACCGTAAATGGAGGACAGAATCTAGGAGCGATGGAATGCAGTCTGAAGCAAATAGTCCTCGAACTTACACCAGAAGAAGTTCTCCTCTTAATTCACCATTTGGTAGTAATGATAGCTTGCAAAAGGATGGTGATTTTCGACTTCACAGTCCATATGAAAGTCCTCCTAAAAATGGAATGGATAAGGCGTTATCAGATGTGATACTGTATGCTGTTCCTCCCAAGGGCTTCTTCCCTTCTGATTCTGCCAGTATATCAGTTAATTCTTTATCATCAGGTAGCTCAGACATGCATGGTCCCATGAAAGCAATGGGAATTGATGCTTTTAGAGTTAGTTTATCAAGTGCTGTCAGCTCATCCAGCCAAGGCTCAGGTCATGATGATGGTGATGCCTTGGGGGACGTTTTTATTTGGGGTGAAGGAACTGGGGATGGTGTTCTTGGTGGTGGAAGTCATAGAGTTGGAAGTTGTTTAAGTATCAAAATGGATTCTTTGCTGCCTAAAGCCCTGGAATCTGCTGTAGTTCTGGACGTTCAGAACATTGCCTGTGGTGGACGCCATGCTGCCTTAGTGACCAAGCAAGGAGAAATTTTCACCTGGGGGGAGGAATCAGGAGGCAGGCTTGGGCATGGTGTCGATTCTGATGTTTTGCAACCAAAGCTTATAGATGCCCTTGGTAATACAAATATTGAATTGGTATCTTGTGGTGAGTACCACACATCTGCTGTAACACTTTCTGGTGATTTGTACACATGGGGTGATGGAACTTACAATTTTGGTCTTCTTGGCCATGGGAATGAAGTAAGCCACTGGATCCCTAAAAAAATAAATGGACCATTGGAGGGCATACATGTCTCTTCTATCTCTTGTGGACCTTGGCACACTGCAGTTGTAACCTCTGCAGGGCAACTTTTTACCTTTGGTGATGGAACGTTTGGTGTTTTAGGCCATGGAGATCGCAACAGTGTCTCAATGCCTAGAGAAGTGGAGTCCCTCAAGGGTCTACGCACTGTGCGGGCTGCTTGTGGCGTTTGGCATACTGCTGCTGTAGTTGAAGTTATGGTTGGAAGCTCAAGTTCCAGCAATTGCTCTTCAGGGAAGCTATTCACATGGGGAGATGGAGATAAGGGTCGACTAGGGCATGGTGACAAAGAGACTAAACTTGTGCCTACTTGTGTTGCAGCTCTTGTTGAACCTAATTTTTGTCGAGTTTCATGTGGGCACAGCCTGACAGTCGCCCTTACAACATCTGGCCATGTCTACACAATGGGGAGTCCTGTTTATGGTCAGTTAGGAAATCCTCATGCTGATGGGAAGGTTCCAGTTCGAGTTGAAGGAAAGCTTTCCAAAAGTTTTGTGGAAGAAATAGCTTGTGGTGCTTATCATGTTGCTGTTTTAACTTCAAGAACGGAAGTCTACACTTGGGGCAAGGGTGCAAATGGTCGTTTGGGTCATGGTGATACAGATGACAGAAATTCACCAACGTTAGTAGAAGCTTTGAAAGACAAGCAAGTTAAAAGCATTGCATGTGGTACAAATTTTACTGCAGCTATCTGCCTTCATAAATGGGTTTCTGGTGTTGATCAGTCTATGTGTTCAGGCTGCCACTTACCATTTAACTTCAAAAGGAAGCGGCATAATTGTTATAACTGTGGACTTGTTTTCTGCCATTCATGCAGCAGTAAGAAATGTCACAAGGCTTCTATGGCCCCAAATCCTAACAAACCTTATCGTGTATGTGATAACTGTTATAACAAACTACGGAAGGCACTTGAGACTGATG
CTTCTTCTCAGTCTTCAGTGAGCCGAAGAAGAAGCATCAATCAAGGAACGACTGAATTTGTTGAGAAAGATGAGAAACCGGAATCTGTCAAGTCTCGTGCTCAACTTGCTCGGTTTTCTTCCATGGAATCTGTGAAGCAAGTTGAAAACCAATCTTCCAAGAAAAACAAAAAATTTGAATGTAATAGTAGCAGGGTGTCGCCCGTTCCAAATGGAGGATCCCAGTGGGGAGTTATTTCCAAATCATTTAATCCAGTGTTTGGGTCATCTAAAAAGTTCTTTTCAGCTTCGGTTCCTGGTTCTAGAATTGTTTCCAGAGCAACATCCCCAATATCAAGGCGAGCAAGTCCACCTCGCTCAACAACACCTACCCCAACTCTTGGAGGTCTTACCTCACCAAAGATTGCAGTAGATGATGGTAAAAGGACAAATGATAGTCTTAGCCAGGAGGTTATTAAGTTAAAAGCTCAGGTTGAAAATCTTACCCGCAAAGCCCAACTTCAAGAAGTTGAGTTGGAAAGAACGACCAAACAGTTGAAGGAAGCACTATCATTTGCTGCAGGAGAAGCAACAAAGTGCAATGCAGCAAAGGAAGTAATCAAGTCACTTACTGCCCAATTGAAGGAAATGGCAGAAAGACTTCCAGTTGGAGCAGCTCGAAACATCAAATCACCTTCTCTTGCCTCCTTGGGCTCCAGTCCTCTCTTCAATGATGTGGTTACTCCATCAATTGACCGATCTAATGGTCAAACAATGTCTCTAGAAGCCGACATTATAGAATCAAACAGTCACTTGCTGTCTAATGGGTCCAGCACTGCAAGTAATCGTAGTTCAGGCCATAACAGACAAGGAAATTCTGATTCAACGACTAAAAATGGTAACAAGGTTAAAGAAAGTGATTCCCGTCATGATGCTGAATGGGTTGAGCAAGATGAGCCTGGTGTATATATCACGTTTACCTCCCTTCAGGGTGGTGCCAAAGATCTCAAGCGAGTGCGTTTCAGTCGGAAACGGTTTACCGAGAAGCAAGCAGAACAGTGGTGGGCAGAGAACCGAGCAAGAGTATATGATCAATATAATGTGCGTATGATCGATAAGTCCACGAAGCTTGTTTTTCAGACCACCGACTGCCCATTGGATGTTTCTCTGTGGAATGCTCTTCTGTCTGCTTACACCAAGAACTTCATGTTCGATGAAGCTTTGCAACTCTTTGACCAGTTGAAGTGTCATTCTCATGTAAGACCTGATTGTTACACTTACCCAGTTGCTCTCAAGGCGTGCGGTGGATTGGGTAGAGTTGTTTGTGGGAGAAGGGTCCATAATCATTTGATAAAAACGGGTTTGATATGGGATGTTTTTGTGACGAGCTCTCTGATGAATATGTATGCGAAGTGTAATCAGTTTCATGATGCCATTAATCTGTTCGATGAATTGCCTCACAGAGATGTGGGGTGTTGGAACACAGTAATCTCTTGTTATTTTCAAGATGATAAGGCTGAGACTGCCCTGAAAATGTTCGATAAAATGAAAGATTCGGGTTTTGAGCCTAATTCAGTGACTTTTACTATTGTTATCTCTTCATGTACAAGGCTTTTGAATTTGGAAAGAGGTAAGGAGATTCGTAGGGAGTTGATGGAGAGCGGGGTTTTGTTGGATGCTTTTGTTCTATCTGCGCTTGTAGATATGTATGGAAAATGTGGTTGTTTAGAAATGGCCAAAGAAGTTTTTGAGCAAATTCCAAGGAAGAATGCGATCACTTGGAATTCCATGATCACAGGTTATAGCTTGAAAGGTGACAGCAGATCGTGCATTGAACTTCTAAAGAGGATGAATGACGAAGGAACCAAACCGACTTTGTCAACTTTAACCAGCATAATATCAGCTAGCTCGAGATCAGTTCAACCTTGGCATGGAAAATTCATACATGGATATTTTTTAAGAAATAGAATGGATGCTGATATCTTCATCGACATTTCTCTCATTGATCTATATTTCAAATGT
GGATATGTTGCTTCAGCTGAAACTATCTTCAGAAATATATCCAAGAATGAAGTAGTTTCTTGGAATGTTATGATTTCTGGATATGTCATGGTGGGTAAGCACATTCAGGCTCTCCGCACCTATGATAACATGAAAGAACACTGCGTAAAACCAGATGCCGTTACATTTTCTAGCACCTTATCAGCTTGTTCACAGCTAGCAGCCTTGGAAAAGGGTAGGGAGCTTCACAACTGCATTATCAGTCATAAGTTGGAAACCAATGAAATTGTCATGGGGGCTCTTCTTGATATGTATGCTAAATGTGGTGATGTCGATGAAGCGCGGAAACTCTTTCATCGTATACCAGAGAGGGATCTTGTATCGTGGACAACAATGATCACTGCTTATGGATCTCATGGCCAACCGTCAGAAGCTTTGAGGATTTTTGATGAAATGCAGAAGTCGAACATACAAGCAGATTCAGTTACATTCCTAGCAGTCCTATCTGCTTGTAGCCATGCTGGATTGGTTGATGAAGGTTATAGATATTTCAACGAGATGATCATTCAGTATGACATTAAGCCCGGCATTGAACACAATTCATGCTTGATAGATCTTCTCGGACGTGCTGGAAGATTATGTGAAGCTTATGAGATTCTCCAAAGATCAGAAGAGACTAGGAATGATATTGGATTGTTAAGCACATTGTTTTCTGCTTGTCGCTTACATAACAATTTCGTTTTAGGTATAGAAATTGGCAAAATGCTTGTGGAGGTAGATCCCGATGATCCTTCTACTTACATTTTGCTGTCGAATATGTATGCTTCTGTCAATAAATGGGAGGAGGTACGAAAAGTACGACAAAAAATGAAAGAACTAGGATTGAATAAAAGCCCTGGTTGCAGCTGGATAGAGATAAACCAGAGGATCCAGCCATTCTTTGTTGAAGATAAGTCAAACCCTCTGGTTGATGGGGTCTATGAATGTCTAAGCAGTCTAGCTCGTCATATGGAGAAGTATGAATTAGAGCTGCAGTAG

Coding sequence (CDS)

ATGCATTCTGAGGAGAATAAAGATTTTACCGAGAAGAACAAGAAACGGAAGCTCAAGACACCTAACCAGGTGATTGCTTTAGAGAAGTTCTATAATGAACACAAGTATCCTACAGAGGAAATGAAATCACAGCTTGCAGAGCAGCTAGGTTTGACTGAAAAGCAAATTTCTGGATGGTTTTGCCACAGAAGATTAAAAGACAAAAGGTTTTGTGAAACATATGCTAGTGTACGACAGGATCGTTCAAGTGGTGTTATCCAAGATCATGGCAGTGGGCTCGCACAAGATTCATGTGGTAGCACAAAAAATGGAGACTATTGGCATATCGATCCACGTGAAGTTGAAAGTCAAAAGCCTTATGGCCATGAGCACCCAGCTACAGATAATGTCCTTGAGCGTAGGAGTCAATTTACAGAAAATGTTAGTAATATGGATAATACATCTTCAGAAAGCAGCTCTTCTTTAAAAGATAGGTTATTATCTCAAAGTGAAAATCCGTATGATACGGAAGTTTCGCGGTATTTAACACATGATGGTGCTATTCCACCATCAAATCCTAAGGTTTTAAACTCCCTGCGATATAAACCATCAGGCTATTTAAAAGTGAAGGGCGAAGTTGAAAATGCTGCTATTACTGCTGTTAAGAGACAGTTGGGTGTGCAATATCGGGAGGACGGTCCACCGCTTGGTGTGGAATTCCAGCCACTTCCTCCTGGTGCATTTGAGTCCCCAGCGAAAGGTCCAATCCATGATTCATACTATGTTGGAAATCCCGTGCTTCCCCATTCTCCAGACATATTGACAATGAAGAAACAAAGAGCTCTTGGCTCTAGGTATGAAATGCATAGTTCAAATATGAGTTCTCAGGACTCATATATGGAGGAAGCAATCCCCACCAGCACTACATGTAAACCTGAGTCTCAGGAGAAGAATTCTGTCTATCAATTAAAGAAAAGTTCCAACTATTACAACAAAACTGATCCTTTTCCCCGCCAGAACTCTCCCTTGAATATGTATGAGGAATCTGGTGGGTTAACATTTTCCAGCAGTAGCAAAAGGGATCATAAAATGAGCTCTAGCTATAACATTCCTAGAAGTAGAACTGATTCTGTTTCCATCAATCATGGCTCCTATACTTCAAAAGTTGCTAGTGAACAGACAGAGATGCAGCTGCATAACCATGGTAGTGTCGGCTCAAAGAGTTTTAATAGGAGTGGCTATTTGGACTATAATTCTAAGAAAATGTCAAAGGAAATGTTCAATGGAGAAGCAAAGCCAGTAAATGAATGTAGTGATCCAGTCAGAGTAAAGATCCCATCATCAAATGAATTGGCTGTTGCAAATCGATGTCAGTTGGATTTTCCTCGGCCAGACTATGCTGTAAAAGCATCATTTTCTGAAAAACCAGGGCGGAAGAATCATACTAGAAGGCACCAAGAAAGGACAAAACCTTGCAGTAACCTGGACCAGTACTCTAGGTATGATGGCAGTGCAGAAACAGGCAATCAAGATCTACCAGATTTCGGGTTACTCCAATTGGGGGAGATGGAGGTACTTGTTCTCATACGTCAGAGGCATAACTCGAACTTCTGCAGTCTGTTCACGCGAATCAGTGGAGTTTCACCTCTGTTTTCGGCCATTTCGATTCTTAAACCTAGTTCTATAGTGCTTCAGAGTCAGTGCAGTTTCAAGCGGTGGCTGGACGAATCGGACTGGAGCTGTATAGTTGAGATGGATGTTTTTAGTATTGTAGATTGGAATTGCTCGTTGAATTTGAGTTTTAGGCGGAGAATTTTGAGGGTTCAAAAGGAAGAAATCTTGATGTCGAGGATGGATAGGATGGCTTCAGATCTTAATAGGAATGGTTCGGTGGAAAGGGATATCGAGCAGTCCATTATTGCTCTAAAGAAGGGAGCATATCTGCTAAAGTATGGAAGGAGGGGAAAACCAAAATTCTGTCCTTTCCGGCTTTCCAATGATGAGTCTGTTCTAATTTGGTT
TTCAGGGAAAGAGGAGAAACACCTTAAACTAAGCCATGTTTCTAGAATAATATCTGGGCAACGCACTCCAATATTTCAAAGGTATCCACGGCCAGAAAAGGAATACCAGTCATTTTCTCTAATATATAACGAAAGATCTTTAGATTTGATTTGCAAGGATAAAGATGAAGCTGAGGTTTGGTTCAATGGTTTGAAAACATTAATTTCTCGTAGCCATCACCGTAAATGGAGGACAGAATCTAGGAGCGATGGAATGCAGTCTGAAGCAAATAGTCCTCGAACTTACACCAGAAGAAGTTCTCCTCTTAATTCACCATTTGGTAGTAATGATAGCTTGCAAAAGGATGGTGATTTTCGACTTCACAGTCCATATGAAAGTCCTCCTAAAAATGGAATGGATAAGGCGTTATCAGATGTGATACTGTATGCTGTTCCTCCCAAGGGCTTCTTCCCTTCTGATTCTGCCAGTATATCAGTTAATTCTTTATCATCAGGTAGCTCAGACATGCATGGTCCCATGAAAGCAATGGGAATTGATGCTTTTAGAGTTAGTTTATCAAGTGCTGTCAGCTCATCCAGCCAAGGCTCAGGTCATGATGATGGTGATGCCTTGGGGGACGTTTTTATTTGGGGTGAAGGAACTGGGGATGGTGTTCTTGGTGGTGGAAGTCATAGAGTTGGAAGTTGTTTAAGTATCAAAATGGATTCTTTGCTGCCTAAAGCCCTGGAATCTGCTGTAGTTCTGGACGTTCAGAACATTGCCTGTGGTGGACGCCATGCTGCCTTAGTGACCAAGCAAGGAGAAATTTTCACCTGGGGGGAGGAATCAGGAGGCAGGCTTGGGCATGGTGTCGATTCTGATGTTTTGCAACCAAAGCTTATAGATGCCCTTGGTAATACAAATATTGAATTGGTATCTTGTGGTGAGTACCACACATCTGCTGTAACACTTTCTGGTGATTTGTACACATGGGGTGATGGAACTTACAATTTTGGTCTTCTTGGCCATGGGAATGAAGTAAGCCACTGGATCCCTAAAAAAATAAATGGACCATTGGAGGGCATACATGTCTCTTCTATCTCTTGTGGACCTTGGCACACTGCAGTTGTAACCTCTGCAGGGCAACTTTTTACCTTTGGTGATGGAACGTTTGGTGTTTTAGGCCATGGAGATCGCAACAGTGTCTCAATGCCTAGAGAAGTGGAGTCCCTCAAGGGTCTACGCACTGTGCGGGCTGCTTGTGGCGTTTGGCATACTGCTGCTGTAGTTGAAGTTATGGTTGGAAGCTCAAGTTCCAGCAATTGCTCTTCAGGGAAGCTATTCACATGGGGAGATGGAGATAAGGGTCGACTAGGGCATGGTGACAAAGAGACTAAACTTGTGCCTACTTGTGTTGCAGCTCTTGTTGAACCTAATTTTTGTCGAGTTTCATGTGGGCACAGCCTGACAGTCGCCCTTACAACATCTGGCCATGTCTACACAATGGGGAGTCCTGTTTATGGTCAGTTAGGAAATCCTCATGCTGATGGGAAGGTTCCAGTTCGAGTTGAAGGAAAGCTTTCCAAAAGTTTTGTGGAAGAAATAGCTTGTGGTGCTTATCATGTTGCTGTTTTAACTTCAAGAACGGAAGTCTACACTTGGGGCAAGGGTGCAAATGGTCGTTTGGGTCATGGTGATACAGATGACAGAAATTCACCAACGTTAGTAGAAGCTTTGAAAGACAAGCAAGTTAAAAGCATTGCATGTGGTACAAATTTTACTGCAGCTATCTGCCTTCATAAATGGGTTTCTGGTGTTGATCAGTCTATGTGTTCAGGCTGCCACTTACCATTTAACTTCAAAAGGAAGCGGCATAATTGTTATAACTGTGGACTTGTTTTCTGCCATTCATGCAGCAGTAAGAAATGTCACAAGGCTTCTATGGCCCCAAATCCTAACAAACCTTATCGTGTATGTGATAACTGTTATAACAAACTACGGAAGGCACTTGAGACTGATG
CTTCTTCTCAGTCTTCAGTGAGCCGAAGAAGAAGCATCAATCAAGGAACGACTGAATTTGTTGAGAAAGATGAGAAACCGGAATCTGTCAAGTCTCGTGCTCAACTTGCTCGGTTTTCTTCCATGGAATCTGTGAAGCAAGTTGAAAACCAATCTTCCAAGAAAAACAAAAAATTTGAATGTAATAGTAGCAGGGTGTCGCCCGTTCCAAATGGAGGATCCCAGTGGGGAGTTATTTCCAAATCATTTAATCCAGTGTTTGGGTCATCTAAAAAGTTCTTTTCAGCTTCGGTTCCTGGTTCTAGAATTGTTTCCAGAGCAACATCCCCAATATCAAGGCGAGCAAGTCCACCTCGCTCAACAACACCTACCCCAACTCTTGGAGGTCTTACCTCACCAAAGATTGCAGTAGATGATGGTAAAAGGACAAATGATAGTCTTAGCCAGGAGGTTATTAAGTTAAAAGCTCAGGTTGAAAATCTTACCCGCAAAGCCCAACTTCAAGAAGTTGAGTTGGAAAGAACGACCAAACAGTTGAAGGAAGCACTATCATTTGCTGCAGGAGAAGCAACAAAGTGCAATGCAGCAAAGGAAGTAATCAAGTCACTTACTGCCCAATTGAAGGAAATGGCAGAAAGACTTCCAGTTGGAGCAGCTCGAAACATCAAATCACCTTCTCTTGCCTCCTTGGGCTCCAGTCCTCTCTTCAATGATGTGGTTACTCCATCAATTGACCGATCTAATGGTCAAACAATGTCTCTAGAAGCCGACATTATAGAATCAAACAGTCACTTGCTGTCTAATGGGTCCAGCACTGCAAGTAATCGTAGTTCAGGCCATAACAGACAAGGAAATTCTGATTCAACGACTAAAAATGGTAACAAGGTTAAAGAAAGTGATTCCCGTCATGATGCTGAATGGGTTGAGCAAGATGAGCCTGGTGTATATATCACGTTTACCTCCCTTCAGGGTGGTGCCAAAGATCTCAAGCGAGTGCGTTTCAGTCGGAAACGGTTTACCGAGAAGCAAGCAGAACAGTGGTGGGCAGAGAACCGAGCAAGAGTATATGATCAATATAATGTGCGTATGATCGATAAGTCCACGAAGCTTGTTTTTCAGACCACCGACTGCCCATTGGATGTTTCTCTGTGGAATGCTCTTCTGTCTGCTTACACCAAGAACTTCATGTTCGATGAAGCTTTGCAACTCTTTGACCAGTTGAAGTGTCATTCTCATGTAAGACCTGATTGTTACACTTACCCAGTTGCTCTCAAGGCGTGCGGTGGATTGGGTAGAGTTGTTTGTGGGAGAAGGGTCCATAATCATTTGATAAAAACGGGTTTGATATGGGATGTTTTTGTGACGAGCTCTCTGATGAATATGTATGCGAAGTGTAATCAGTTTCATGATGCCATTAATCTGTTCGATGAATTGCCTCACAGAGATGTGGGGTGTTGGAACACAGTAATCTCTTGTTATTTTCAAGATGATAAGGCTGAGACTGCCCTGAAAATGTTCGATAAAATGAAAGATTCGGGTTTTGAGCCTAATTCAGTGACTTTTACTATTGTTATCTCTTCATGTACAAGGCTTTTGAATTTGGAAAGAGGTAAGGAGATTCGTAGGGAGTTGATGGAGAGCGGGGTTTTGTTGGATGCTTTTGTTCTATCTGCGCTTGTAGATATGTATGGAAAATGTGGTTGTTTAGAAATGGCCAAAGAAGTTTTTGAGCAAATTCCAAGGAAGAATGCGATCACTTGGAATTCCATGATCACAGGTTATAGCTTGAAAGGTGACAGCAGATCGTGCATTGAACTTCTAAAGAGGATGAATGACGAAGGAACCAAACCGACTTTGTCAACTTTAACCAGCATAATATCAGCTAGCTCGAGATCAGTTCAACCTTGGCATGGAAAATTCATACATGGATATTTTTTAAGAAATAGAATGGATGCTGATATCTTCATCGACATTTCTCTCATTGATCTATATTTCAAATGT
GGATATGTTGCTTCAGCTGAAACTATCTTCAGAAATATATCCAAGAATGAAGTAGTTTCTTGGAATGTTATGATTTCTGGATATGTCATGGTGGGTAAGCACATTCAGGCTCTCCGCACCTATGATAACATGAAAGAACACTGCGTAAAACCAGATGCCGTTACATTTTCTAGCACCTTATCAGCTTGTTCACAGCTAGCAGCCTTGGAAAAGGGTAGGGAGCTTCACAACTGCATTATCAGTCATAAGTTGGAAACCAATGAAATTGTCATGGGGGCTCTTCTTGATATGTATGCTAAATGTGGTGATGTCGATGAAGCGCGGAAACTCTTTCATCGTATACCAGAGAGGGATCTTGTATCGTGGACAACAATGATCACTGCTTATGGATCTCATGGCCAACCGTCAGAAGCTTTGAGGATTTTTGATGAAATGCAGAAGTCGAACATACAAGCAGATTCAGTTACATTCCTAGCAGTCCTATCTGCTTGTAGCCATGCTGGATTGGTTGATGAAGGTTATAGATATTTCAACGAGATGATCATTCAGTATGACATTAAGCCCGGCATTGAACACAATTCATGCTTGATAGATCTTCTCGGACGTGCTGGAAGATTATGTGAAGCTTATGAGATTCTCCAAAGATCAGAAGAGACTAGGAATGATATTGGATTGTTAAGCACATTGTTTTCTGCTTGTCGCTTACATAACAATTTCGTTTTAGGTATAGAAATTGGCAAAATGCTTGTGGAGGTAGATCCCGATGATCCTTCTACTTACATTTTGCTGTCGAATATGTATGCTTCTGTCAATAAATGGGAGGAGGTACGAAAAGTACGACAAAAAATGAAAGAACTAGGATTGAATAAAAGCCCTGGTTGCAGCTGGATAGAGATAAACCAGAGGATCCAGCCATTCTTTGTTGAAGATAAGTCAAACCCTCTGGTTGATGGGGTCTATGAATGTCTAAGCAGTCTAGCTCGTCATATGGAGAAGTATGAATTAGAGCTGCAGTAG

Protein sequence

MHSEENKDFTEKNKKRKLKTPNQVIALEKFYNEHKYPTEEMKSQLAEQLGLTEKQISGWFCHRRLKDKRFCETYASVRQDRSSGVIQDHGSGLAQDSCGSTKNGDYWHIDPREVESQKPYGHEHPATDNVLERRSQFTENVSNMDNTSSESSSSLKDRLLSQSENPYDTEVSRYLTHDGAIPPSNPKVLNSLRYKPSGYLKVKGEVENAAITAVKRQLGVQYREDGPPLGVEFQPLPPGAFESPAKGPIHDSYYVGNPVLPHSPDILTMKKQRALGSRYEMHSSNMSSQDSYMEEAIPTSTTCKPESQEKNSVYQLKKSSNYYNKTDPFPRQNSPLNMYEESGGLTFSSSSKRDHKMSSSYNIPRSRTDSVSINHGSYTSKVASEQTEMQLHNHGSVGSKSFNRSGYLDYNSKKMSKEMFNGEAKPVNECSDPVRVKIPSSNELAVANRCQLDFPRPDYAVKASFSEKPGRKNHTRRHQERTKPCSNLDQYSRYDGSAETGNQDLPDFGLLQLGEMEVLVLIRQRHNSNFCSLFTRISGVSPLFSAISILKPSSIVLQSQCSFKRWLDESDWSCIVEMDVFSIVDWNCSLNLSFRRRILRVQKEEILMSRMDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVWFNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRLHSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDAFRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNTNIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSISCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGHSLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVDQSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSKKNKKFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELERTTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSPLFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGNKVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYDQYNVRMIDKSTKLVFQTTDCPLDVSLWNALLSAYTKNFMFDEALQLFDQLKCHSHVRPDCYTYPVALKACGGLGRVVCGRRVHNHLIKTGLIWDVFVTSSLMNMYAKCNQFHDAINLFDELPHRDVGCWNTVISCYFQDDKAETALKMFDKMKDSGFEPNSVTFTIVISSCTRLLNLERGKEIRRELMESGVLLDAFVLSALVDMYGKCGCLEMAKEVFEQIPRKNAITWNSMITGYSLKGDSRSCIELLKRMNDEGTKPTLSTLTSIISASSRSVQPWHGKFIHGYFLRNRMDADIFIDISLIDLYFKC
GYVASAETIFRNISKNEVVSWNVMISGYVMVGKHIQALRTYDNMKEHCVKPDAVTFSSTLSACSQLAALEKGRELHNCIISHKLETNEIVMGALLDMYAKCGDVDEARKLFHRIPERDLVSWTTMITAYGSHGQPSEALRIFDEMQKSNIQADSVTFLAVLSACSHAGLVDEGYRYFNEMIIQYDIKPGIEHNSCLIDLLGRAGRLCEAYEILQRSEETRNDIGLLSTLFSACRLHNNFVLGIEIGKMLVEVDPDDPSTYILLSNMYASVNKWEEVRKVRQKMKELGLNKSPGCSWIEINQRIQPFFVEDKSNPLVDGVYECLSSLARHMEKYELELQ
Homology
BLAST of Sgr023231 vs. NCBI nr
Match: EXC12413.1 (E3 ubiquitin-protein ligase HERC2 [Morus notabilis])

HSP 1 Score: 2141.3 bits (5547), Expect = 0.0e+00
Identity = 1167/1728 (67.53%), Positives = 1293/1728 (74.83%), Query Frame = 0

Query: 1    MHSEENKDFTEKNKKRKLKTPNQVIALEKFYNEHKYPTEEMKSQLAEQLGLTEKQISGWF 60
            +HS+ENK  ++ NKKR+LKTP+QV+ALEKFYNEHKYPTEEMKS+LAE+LGLTEKQISGWF
Sbjct: 7    VHSDENK-VSQDNKKRQLKTPSQVMALEKFYNEHKYPTEEMKSELAEELGLTEKQISGWF 66

Query: 61   CHRRLKDKRF--CETYASVRQDRSSGVIQDHGSGLAQDSCGSTKNGDYWHIDPREVESQK 120
            CHRRLKDKR    E  +S RQ+R SG+IQD GSG  QDSCGSTK+ DY H+DPREVES++
Sbjct: 67   CHRRLKDKRSLKVEKCSSGRQERLSGIIQDLGSGFGQDSCGSTKHADYRHVDPREVESRR 126

Query: 121  PY--GHEHPATDNVLERRSQFTENVSNMDNTSSESSSSLKDRLLSQSENPYDTEVSRYLT 180
             Y  GH+ PA D   E RS +TE VS MDNTSSESSSSL+D   S +E+P+  E SRYL 
Sbjct: 127  LYDKGHDFPAADLSHENRSHYTERVSGMDNTSSESSSSLRDG-FSPTEDPHIVESSRYLA 186

Query: 181  HDGAIPPSNPKVLNSLRYKPSGYLKVKGEVENAAITAVKRQLGVQYREDGPPLGVEFQPL 240
             DG + P N K    + YKPSGYLKVKGE+ENAAITAVKRQLG QYREDGPPLGVEF PL
Sbjct: 187  QDGLVAPLNSKGARHMGYKPSGYLKVKGEIENAAITAVKRQLGRQYREDGPPLGVEFDPL 246

Query: 241  PPGAFESPAKGPIHDSYYVGNPVLPHSPDILTMKKQRALGSRYEMHSSNMSSQDSYMEEA 300
            PPGAFESP + P+H+ YY G PVL HSPDI  +K+Q +  +RYE+H+S +SS+DSY++EA
Sbjct: 247  PPGAFESPIRDPVHEPYYAGIPVLSHSPDISVVKRQPSPSTRYEVHNSKLSSRDSYLQEA 306

Query: 301  IPTSTTCKPESQEKNSVYQLKKSSNYYNKTDPFPRQNSPLNMYEESGGLTFSSSSKRDHK 360
                       QEK    QL++ S Y + T  FP +NS L+M ++S     S  S R  K
Sbjct: 307  --PGIMHGVNHQEKKHCNQLRQKSTYLDHTSNFPGRNSSLDMCDDS-----SYKSNRSRK 366

Query: 361  MSSSYNIPRSRTDSVSINHGSYTSKVASEQTEMQLHNHGSVGSKSFNRS----------- 420
            M S +      +DS   + G Y  K+AS+ ++  LH    +  K   RS           
Sbjct: 367  MGSKHGAEGMTSDSFLNHQGHYGGKIASKPSQSGLHEDDVLSPKIVQRSEHSKFKASIST 426

Query: 421  ----GYLDYNSKKMS-----KEMFNGEAKPVNECSDPVRVKIPSSNELAVANRCQLDFPR 480
                G  D   K +S     ++ F GE K + +    V+VK+   +E+ VA R ++DFPR
Sbjct: 427  RNHCGTPDIEEKGVSTMMAQEDKFGGEGKAMKD----VKVKMRPVSEMLVAKRVKVDFPR 486

Query: 481  PDYAVKASFSEKPGRKNHTRRHQERTKPCSNLDQYSRYDGSAETGNQDLPDFGLLQLGEM 540
             +    +SFSE   RKNH +                                GL      
Sbjct: 487  QENVTNSSFSEMLPRKNHMK--------------------------------GL------ 546

Query: 541  EVLVLIRQRHNSNFCSLFTRISGVSPLFSAISILKPSSIVLQSQCSFKRWLDESDWSCIV 600
                                                                        
Sbjct: 547  ------------------------------------------------------------ 606

Query: 601  EMDVFSIVDWNCSLNLSFRRRILRVQKEEILMSRMDRMASDLNRNGSVERDIEQSIIALK 660
                                                                        
Sbjct: 607  ------------------------------------------------------------ 666

Query: 661  KGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHLKLSHVSRIISGQRTPIFQRYP 720
                                   DESVLIW SGKEEKHLKLSHVSRIISGQRTPIFQRYP
Sbjct: 667  -----------------------DESVLIWISGKEEKHLKLSHVSRIISGQRTPIFQRYP 726

Query: 721  RPEKEYQSFSLIYNERSLDLICKDKDEAEVWFNGLKTLISRSHHRKWRTESRSDGMQSEA 780
            RPEKEYQSFSLIYN+RSLDLICKDKDEAEVWF+GLK LISRSHHRKWRTESRSDG+ SEA
Sbjct: 727  RPEKEYQSFSLIYNDRSLDLICKDKDEAEVWFSGLKALISRSHHRKWRTESRSDGIPSEA 786

Query: 781  NSPRTYTRRSSPLNSPFGSNDSLQKDGD--FRLHSPYESPPKNGMDKALSDVILYAVPPK 840
            NSPRT TRRSSPL+SPFGSNDSLQKDG    RLHSPYESPPKNG+DKALSDVILYAVPPK
Sbjct: 787  NSPRTCTRRSSPLHSPFGSNDSLQKDGSDHLRLHSPYESPPKNGLDKALSDVILYAVPPK 846

Query: 841  GFFPSDSASISVNSLSSGSSD-MHGPMKAMGIDAFRVSLSSAVSSSSQGSGHDDGDALGD 900
            GFFPSDSAS SV+SLSSG SD +HG +KAM +DAFRVSLSSAVSS SQGSGHDDGDALGD
Sbjct: 847  GFFPSDSASASVHSLSSGGSDSVHGHVKAMPVDAFRVSLSSAVSSLSQGSGHDDGDALGD 906

Query: 901  VFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALESAVVLDVQNIACGGRHAALVTKQ 960
            VFIWGEG GDGVLG G HRVGSC S K+DSLLPK LESAVVLDVQN+ACGGRHAALVTKQ
Sbjct: 907  VFIWGEGMGDGVLGSGPHRVGSCFSGKIDSLLPKRLESAVVLDVQNVACGGRHAALVTKQ 966

Query: 961  GEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNTNIELVSCGEYHTSAVTLSGDLYTWGD 1020
            GEIF+WGEESGGRLGHGVDSDVLQPKLIDAL  TNIE V+CGEYHT AVTLSG+LYTWGD
Sbjct: 967  GEIFSWGEESGGRLGHGVDSDVLQPKLIDALSTTNIEFVACGEYHTCAVTLSGELYTWGD 1026

Query: 1021 GTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSISCGPWHTAVVTSAGQLFTFGDGTFGV 1080
            GTYNFGLLGHGNEVSHW+PK++NGPLEGIHVS ISCGPWHTAVVTSAGQLFTFGDGTFGV
Sbjct: 1027 GTYNFGLLGHGNEVSHWMPKRVNGPLEGIHVSYISCGPWHTAVVTSAGQLFTFGDGTFGV 1086

Query: 1081 LGHGDRNSVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVGSSSSSNCSSGKLFTWGDG 1140
            LGHGDR SVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVG+SSSSNCSSGKLFTWGDG
Sbjct: 1087 LGHGDRTSVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVGNSSSSNCSSGKLFTWGDG 1146

Query: 1141 DKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGHSLTVALTTSGHVYTMGSPVYGQLGNP 1200
            DKGRLGHG+KE +LVPTCVAALVEPNFC+V+CGHSLTVALTTSGHVYTMGSPVYGQLGNP
Sbjct: 1147 DKGRLGHGEKEARLVPTCVAALVEPNFCQVACGHSLTVALTTSGHVYTMGSPVYGQLGNP 1206

Query: 1201 HADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTEVYTWGKGANGRLGHGDTDDRNSP 1260
             ADGK+P RVEGK SK FVEEIACGAYHVAVLTS+TEVYTWGKGANGRLGHGD DDRNSP
Sbjct: 1207 QADGKLPTRVEGKHSKRFVEEIACGAYHVAVLTSKTEVYTWGKGANGRLGHGDIDDRNSP 1266

Query: 1261 TLVEALKDKQVKSIACGTNFTAAICLHKWVSGVDQSMCSGCHLPFNFKRKRHNCYNCGLV 1320
            TLVEALKDKQVKSIACGTNFTAAICLHKWVS +DQSMCSGC LPFNFKRKRHNCYNCG V
Sbjct: 1267 TLVEALKDKQVKSIACGTNFTAAICLHKWVSEIDQSMCSGCRLPFNFKRKRHNCYNCGFV 1326

Query: 1321 FCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKALETDASSQSSVSRRRSINQGTTEF 1380
            FCHSCSSKK  KASMAPNPNKPYRVCDNC+NKLRKA+ETD+SS  SVSRR SINQG+ EF
Sbjct: 1327 FCHSCSSKKSLKASMAPNPNKPYRVCDNCFNKLRKAIETDSSSH-SVSRRGSINQGSNEF 1386

Query: 1381 VEKDEKPESVKSRAQLARFSSMESVKQVENQSSKKNKKFECNSSRVSPVPNGGSQWGVIS 1440
            ++K+EK +S +SRAQLARFSSMES+KQVE +SSKKNKK E NSSRVSPVPNGGSQWG I 
Sbjct: 1387 IDKEEKLDS-RSRAQLARFSSMESLKQVETRSSKKNKKLEFNSSRVSPVPNGGSQWGAI- 1446

Query: 1441 KSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASPPRSTTPTPTLGGLTSPKIAVDDG 1500
            KSFNP FGSSKKFFSASVPGSRIVSRATSPISRR SPPR+TTPTPTL GLTSPKI VD+ 
Sbjct: 1447 KSFNPGFGSSKKFFSASVPGSRIVSRATSPISRRPSPPRATTPTPTLEGLTSPKIGVDNT 1506

Query: 1501 KRTNDSLSQEVIKLKAQVENLTRKAQLQEVELERTTKQLKEALSFAAGEATKCNAAKEVI 1560
            KRTNDSLSQEVIKL+AQVENLTR+AQLQEVELERTTKQLKEAL+ A  E  KC AAKEVI
Sbjct: 1507 KRTNDSLSQEVIKLRAQVENLTRQAQLQEVELERTTKQLKEALAIAGEETAKCKAAKEVI 1537

Query: 1561 KSLTAQLKEMAERLPVGAARNIKSPSLASLGSSPLFNDVVTPSIDRSNGQTMSLEADIIE 1620
            KSLTAQLK+MAERLPVGAARN+KSPSLASLGS  + +DV  PS+DR N Q +S E D   
Sbjct: 1567 KSLTAQLKDMAERLPVGAARNVKSPSLASLGSDLVGSDVSNPSVDRLNSQILSQEPDSNG 1537

Query: 1621 SNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGNKVKESDSRHDAEWVEQDEPGVYITFT 1680
            S+S L SNGS+T +NRSS HN+QG+SD TT+NG + K+ DSR+D EWVEQDEPGVYIT T
Sbjct: 1627 SHSQLHSNGSTTTANRSSSHNKQGHSDVTTRNGTRTKDIDSRNDTEWVEQDEPGVYITLT 1537

Query: 1681 SLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYDQYNVRMIDKST 1702
            SL GGAKDLKRVRFSRKRF+EKQAEQWWAENRARVY+QYNVRMIDKS+
Sbjct: 1687 SLPGGAKDLKRVRFSRKRFSEKQAEQWWAENRARVYEQYNVRMIDKSS 1537
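
Note on the HSP summary figures: the percentages reported for each hit are the ratio of matching positions to alignment length (for example, 1167/1728 for the hit above). The sketch below re-derives them from a summary line. It is an illustration only; the function name `hsp_percentages` and the regular expression are assumptions made here and are not part of the database or of BLAST itself.

```python
import re

# Matches "Identity = a/b" and "Positives = a/b"; also tolerates the
# misspelling "Postives" that appears in some exported reports.
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line):
    """Return {field: percent} for each 'name = matched/length' pair in the line."""
    result = {}
    for name, matched, length in HSP_STATS.findall(line):
        key = "Positives" if name.startswith("Pos") else name
        result[key] = round(100 * int(matched) / int(length), 2)
    return result


# Summary line copied from the first hit (Morus notabilis, EXC12413.1).
line = "Identity = 1167/1728 (67.53%), Positives = 1293/1728 (74.83%), Query Frame = 0"
print(hsp_percentages(line))
```

The same function can be applied to the summary line of any other hit on this page to check that the reported percentages agree with the raw counts.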

BLAST of Sgr023231 vs. NCBI nr
Match: XP_022153015.1 (uncharacterized protein LOC111020619 [Momordica charantia])

HSP 1 Score: 2122.1 bits (5497), Expect = 0.0e+00
Identity = 1067/1094 (97.53%), Positives = 1080/1094 (98.72%), Query Frame = 0

Query: 608  MSRMDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 667
            MSRMDRMASDLNRNG VERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF
Sbjct: 1    MSRMDRMASDLNRNGPVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 60

Query: 668  SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVW 727
            SGKEEK LKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN+RSLDLICKDKDEAEVW
Sbjct: 61   SGKEEKLLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNDRSLDLICKDKDEAEVW 120

Query: 728  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 787
            FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL
Sbjct: 121  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 180

Query: 788  HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDA 847
            HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAM IDA
Sbjct: 181  HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMAIDA 240

Query: 848  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPK 907
            FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDG+LGGGSHRVGS LSIKMDSLLPK
Sbjct: 241  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGILGGGSHRVGSSLSIKMDSLLPK 300

Query: 908  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT 967
            ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT
Sbjct: 301  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT 360

Query: 968  NIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 1027
            NIELVSCGEYHT AVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI
Sbjct: 361  NIELVSCGEYHTCAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 420

Query: 1028 SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA 1087
            SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA
Sbjct: 421  SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA 480

Query: 1088 AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH 1147
            AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH
Sbjct: 481  AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH 540

Query: 1148 SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS 1207
            SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVR+EGK+SKSFVEEIACGAYHVAVLTS
Sbjct: 541  SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRIEGKISKSFVEEIACGAYHVAVLTS 600

Query: 1208 RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 1267
            +TEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD
Sbjct: 601  KTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 660

Query: 1268 QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 1327
            QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR
Sbjct: 661  QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 720

Query: 1328 KALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSK 1387
            KALETDASSQSSVSRRRSINQG+TEFVEKDEKPESVKSRAQLARFSSMESVKQVENQS K
Sbjct: 721  KALETDASSQSSVSRRRSINQGSTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSYK 780

Query: 1388 KNKKFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 1447
            KNKKF+CNSSRVSPVPNGGSQWG ISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR
Sbjct: 781  KNKKFDCNSSRVSPVPNGGSQWGAISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 840

Query: 1448 ASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELER 1507
            ASPPRSTTPTPTLGGLTSPKIAVDD KRTNDSL QEV+KLKAQVENLTRKAQLQEVELER
Sbjct: 841  ASPPRSTTPTPTLGGLTSPKIAVDDAKRTNDSLRQEVVKLKAQVENLTRKAQLQEVELER 900

Query: 1508 TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 1567
            TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARN+KSPSLASLGSSP
Sbjct: 901  TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNVKSPSLASLGSSP 960

Query: 1568 LFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGN 1627
             FN+VV PSIDRSNGQTM LEA+IIESNSHLLSNGSSTASNRSSGHNRQGNSDS  +NGN
Sbjct: 961  PFNEVVAPSIDRSNGQTMPLEAEIIESNSHLLSNGSSTASNRSSGHNRQGNSDSAARNGN 1020

Query: 1628 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1687
            KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR
Sbjct: 1021 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1080

Query: 1688 VYDQYNVRMIDKST 1702
            VYDQYNVRMIDKS+
Sbjct: 1081 VYDQYNVRMIDKSS 1094

BLAST of Sgr023231 vs. NCBI nr
Match: XP_038900986.1 (PH, RCC1 and FYVE domains-containing protein 1 [Benincasa hispida])

HSP 1 Score: 2109.0 bits (5463), Expect = 0.0e+00
Identity = 1058/1094 (96.71%), Positives = 1076/1094 (98.35%), Query Frame = 0

Query: 608  MSRMDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 667
            MSRMDRM SDLNRNG VERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF
Sbjct: 1    MSRMDRMTSDLNRNGPVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 60

Query: 668  SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVW 727
            SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN+RSLDLICKDKDEAEVW
Sbjct: 61   SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNDRSLDLICKDKDEAEVW 120

Query: 728  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 787
            FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL
Sbjct: 121  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 180

Query: 788  HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDA 847
             SPY SPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSG S+MHGPMKAMGIDA
Sbjct: 181  QSPYGSPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGGSEMHGPMKAMGIDA 240

Query: 848  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPK 907
            FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSH+VGSC S+KMDSLLPK
Sbjct: 241  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHKVGSCFSLKMDSLLPK 300

Query: 908  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT 967
            ALESAVVLDVQNIACGG HAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKL+DALGNT
Sbjct: 301  ALESAVVLDVQNIACGGHHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLVDALGNT 360

Query: 968  NIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 1027
            NIELVSCGEYHT AVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI
Sbjct: 361  NIELVSCGEYHTCAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 420

Query: 1028 SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA 1087
            SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSV+MPREVESLKGLRTVRAACGVWHTA
Sbjct: 421  SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVTMPREVESLKGLRTVRAACGVWHTA 480

Query: 1088 AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH 1147
            A+VEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALV+PNFCRVSCGH
Sbjct: 481  AIVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVDPNFCRVSCGH 540

Query: 1148 SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS 1207
            SLTVALTTSGHVY MGSPVYGQLGNPHADGKVPVRVEGKLSK FVEEIACGAYHVAVLTS
Sbjct: 541  SLTVALTTSGHVYAMGSPVYGQLGNPHADGKVPVRVEGKLSKCFVEEIACGAYHVAVLTS 600

Query: 1208 RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 1267
            RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD
Sbjct: 601  RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 660

Query: 1268 QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 1327
            QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCY+KLR
Sbjct: 661  QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYSKLR 720

Query: 1328 KALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSK 1387
            KALETD SSQSSVSRRRSINQG+ +FVEKDEKPESVKSRAQLARFSSMESVKQ E+Q SK
Sbjct: 721  KALETDTSSQSSVSRRRSINQGSNDFVEKDEKPESVKSRAQLARFSSMESVKQGESQFSK 780

Query: 1388 KNKKFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 1447
            KNKKFECNSSRVSPVPNGGSQWG ISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR
Sbjct: 781  KNKKFECNSSRVSPVPNGGSQWGAISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 840

Query: 1448 ASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELER 1507
            ASPPRSTTPTPTLGGLTSPKIAVDD KRTNDSLSQEV+KLKAQVENLTRKAQLQEVE+ER
Sbjct: 841  ASPPRSTTPTPTLGGLTSPKIAVDDAKRTNDSLSQEVVKLKAQVENLTRKAQLQEVEMER 900

Query: 1508 TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 1567
            TTKQLKEAL+FAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP
Sbjct: 901  TTKQLKEALAFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 960

Query: 1568 LFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGN 1627
             FNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGS+TAS RSSGHNRQGNSDSTT+NGN
Sbjct: 961  PFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSTTASIRSSGHNRQGNSDSTTRNGN 1020

Query: 1628 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1687
            KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR
Sbjct: 1021 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1080

Query: 1688 VYDQYNVRMIDKST 1702
            VYDQYNVRMIDKS+
Sbjct: 1081 VYDQYNVRMIDKSS 1094

BLAST of Sgr023231 vs. NCBI nr
Match: XP_011657620.1 (PH, RCC1 and FYVE domains-containing protein 1 [Cucumis sativus] >KGN48105.1 hypothetical protein Csa_003363 [Cucumis sativus])

HSP 1 Score: 2099.7 bits (5439), Expect = 0.0e+00
Identity = 1055/1094 (96.44%), Positives = 1073/1094 (98.08%), Query Frame = 0

Query: 608  MSRMDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 667
            MSRMDRM SDLNRNG VERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF
Sbjct: 1    MSRMDRMTSDLNRNGPVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 60

Query: 668  SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVW 727
            SGKEEK LKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN+RSLDLICKDKDEAEVW
Sbjct: 61   SGKEEKLLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNDRSLDLICKDKDEAEVW 120

Query: 728  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 787
            FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL
Sbjct: 121  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 180

Query: 788  HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDA 847
             SPY SPPKNGMDKALSDVILY VPPKGFFPSDSASISVNSLSSGSS+MHGPMKAMGIDA
Sbjct: 181  QSPYGSPPKNGMDKALSDVILYTVPPKGFFPSDSASISVNSLSSGSSEMHGPMKAMGIDA 240

Query: 848  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPK 907
            FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSH+VGSC S+KMDSLLPK
Sbjct: 241  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHKVGSCFSLKMDSLLPK 300

Query: 908  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT 967
            ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKL+DALGNT
Sbjct: 301  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLVDALGNT 360

Query: 968  NIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 1027
            NIELVSCGEYHT AVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI
Sbjct: 361  NIELVSCGEYHTCAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 420

Query: 1028 SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA 1087
            +CGPWHTAVVTSAGQLFTFGDGTFGVLGHGDR SV+MPREVESLKGLRTVRAACGVWHTA
Sbjct: 421  ACGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRGSVTMPREVESLKGLRTVRAACGVWHTA 480

Query: 1088 AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH 1147
            AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALV+PNFCRVSCGH
Sbjct: 481  AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVDPNFCRVSCGH 540

Query: 1148 SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS 1207
            S+TVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS
Sbjct: 541  SMTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS 600

Query: 1208 RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 1267
            RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSG D
Sbjct: 601  RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGFD 660

Query: 1268 QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 1327
            QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR
Sbjct: 661  QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 720

Query: 1328 KALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSK 1387
            KALETDASSQSSVSRRRSINQG+T+FVEK+EKPESVKSRAQLARFSSMESVKQ ENQ SK
Sbjct: 721  KALETDASSQSSVSRRRSINQGSTDFVEKEEKPESVKSRAQLARFSSMESVKQGENQFSK 780

Query: 1388 KNKKFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 1447
            KNKKFECNSSRVSPVPNGGSQWG ISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR
Sbjct: 781  KNKKFECNSSRVSPVPNGGSQWGAISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 840

Query: 1448 ASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELER 1507
            ASPPRSTTPTPTLGGLTSPKIAVDD KRTNDSLSQEV+KLKAQVENLTRKAQLQEVE+ER
Sbjct: 841  ASPPRSTTPTPTLGGLTSPKIAVDDAKRTNDSLSQEVVKLKAQVENLTRKAQLQEVEMER 900

Query: 1508 TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 1567
            TTKQLKEAL+FAA EATKCNAAKEVI SLTAQLKEMAERLPVGAARNIKSPSLASLGSSP
Sbjct: 901  TTKQLKEALAFAAAEATKCNAAKEVIMSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 960

Query: 1568 LFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGN 1627
             FNDVVTPSIDRSNGQTMSLEAD+IESNSHLLSNGSSTAS RSSGHNR  NSDSTT+NGN
Sbjct: 961  PFNDVVTPSIDRSNGQTMSLEADVIESNSHLLSNGSSTASIRSSGHNRPANSDSTTRNGN 1020

Query: 1628 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1687
            KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR
Sbjct: 1021 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1080

Query: 1688 VYDQYNVRMIDKST 1702
            VYDQYNVR IDKS+
Sbjct: 1081 VYDQYNVRTIDKSS 1094

BLAST of Sgr023231 vs. NCBI nr
Match: KAA0061774.1 (Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain isoform 1 [Cucumis melo var. makuwa])

HSP 1 Score: 2095.9 bits (5429), Expect = 0.0e+00
Identity = 1051/1091 (96.33%), Positives = 1072/1091 (98.26%), Query Frame = 0

Query: 611  MDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGK 670
            MDRM SDLNRNG VERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGK
Sbjct: 1    MDRMTSDLNRNGPVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGK 60

Query: 671  EEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVWFNG 730
            EEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN+RSLDLICKDKDEAEVWFNG
Sbjct: 61   EEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNDRSLDLICKDKDEAEVWFNG 120

Query: 731  LKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRLHSP 790
            LKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL SP
Sbjct: 121  LKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRLQSP 180

Query: 791  YESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDAFRV 850
            Y SPPKNGMDKALSDVILY VPPKGFFPSDSAS+SVNSLSSGSS+MHGPMKAM IDAFRV
Sbjct: 181  YGSPPKNGMDKALSDVILYTVPPKGFFPSDSASMSVNSLSSGSSEMHGPMKAMAIDAFRV 240

Query: 851  SLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALE 910
            SLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSH+VGSC S+KMDSLLPKALE
Sbjct: 241  SLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHKVGSCFSLKMDSLLPKALE 300

Query: 911  SAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNTNIE 970
            SAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKL+DALGNTNIE
Sbjct: 301  SAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLVDALGNTNIE 360

Query: 971  LVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSISCG 1030
            LVSCGEYHT AVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI+CG
Sbjct: 361  LVSCGEYHTCAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSIACG 420

Query: 1031 PWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTAAVV 1090
            PWHTAVVTSAGQLFTFGDGTFGVLGHGDR+SV+MPREVESLKGLRTVRAACGVWHTAAVV
Sbjct: 421  PWHTAVVTSAGQLFTFGDGTFGVLGHGDRSSVTMPREVESLKGLRTVRAACGVWHTAAVV 480

Query: 1091 EVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGHSLT 1150
            EVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALV+PNFCRVSCGHS+T
Sbjct: 481  EVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVDPNFCRVSCGHSMT 540

Query: 1151 VALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTE 1210
            VALTTSG VYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTE
Sbjct: 541  VALTTSGQVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTE 600

Query: 1211 VYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVDQSM 1270
            VYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSG DQSM
Sbjct: 601  VYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGFDQSM 660

Query: 1271 CSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKAL 1330
            CSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKAL
Sbjct: 661  CSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKAL 720

Query: 1331 ETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSKKNK 1390
            ETDASSQSSVSRRRSINQG+T+FVEKDEKPESVKSRAQLARFSSMESVKQ E+Q SKKNK
Sbjct: 721  ETDASSQSSVSRRRSINQGSTDFVEKDEKPESVKSRAQLARFSSMESVKQGESQFSKKNK 780

Query: 1391 KFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASP 1450
            KFECNSSRVSPVPNGGSQWG ISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASP
Sbjct: 781  KFECNSSRVSPVPNGGSQWGAISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASP 840

Query: 1451 PRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELERTTK 1510
            PRSTTPTPTLGGLTSPKIAVDD KRTNDSLSQEV+KLKAQVENLTRKAQLQEVE+ERTTK
Sbjct: 841  PRSTTPTPTLGGLTSPKIAVDDAKRTNDSLSQEVVKLKAQVENLTRKAQLQEVEMERTTK 900

Query: 1511 QLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSPLFN 1570
            QLKEAL+FAA EATKCNAAKEVI SLTAQLKEMAERLPVGAARNIKSP+LASLGSSP FN
Sbjct: 901  QLKEALAFAAAEATKCNAAKEVIMSLTAQLKEMAERLPVGAARNIKSPTLASLGSSPPFN 960

Query: 1571 DVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGNKVK 1630
            DVVTPSIDRSNGQTMSLEAD+IESNSHLLSNGSSTAS RSSGHNRQGNSDSTT+NGNKVK
Sbjct: 961  DVVTPSIDRSNGQTMSLEADVIESNSHLLSNGSSTASIRSSGHNRQGNSDSTTRNGNKVK 1020

Query: 1631 ESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYD 1690
            ESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYD
Sbjct: 1021 ESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYD 1080

Query: 1691 QYNVRMIDKST 1702
            QYNVR IDKS+
Sbjct: 1081 QYNVRTIDKSS 1091

BLAST of Sgr023231 vs. ExPASy Swiss-Prot
Match: Q947D2 (PH, RCC1 and FYVE domains-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=PRAF1 PE=1 SV=1)

HSP 1 Score: 1040.8 bits (2690), Expect = 2.3e-302
Identity = 579/1123 (51.56%), Positives = 756/1123 (67.32%), Query Frame = 0

Query: 616  SDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHL 675
            +DL    + + ++EQ++I LKKG  LLKYGR+GKPKF PFRLS+DE  LIW S   EK L
Sbjct: 2    ADLVTYSNADHNLEQALITLKKGTQLLKYGRKGKPKFYPFRLSSDEKSLIWISSSGEKRL 61

Query: 676  KLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN--ERSLDLICKDKDEAEVWFNGLKT 735
            KL+ VS+I+ GQRT +FQRY RPEK+Y SFSL+YN  ++SLDLICKDK EAE+W  GLKT
Sbjct: 62   KLASVSKIVPGQRTAVFQRYLRPEKDYLSFSLLYNGKKKSLDLICKDKVEAEIWIGGLKT 121

Query: 736  LISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRLHSPYES 795
            LIS     + + +  S G  S  ++ R  T  SSP +S   ++      G     +P+  
Sbjct: 122  LISTGQGGRSKIDGWSGGGLS-VDASRELT-SSSPSSSSASASRGHSSPG-----TPFNI 181

Query: 796  PPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDAFRVSLS 855
             P      A  +     VPP     S+ + +++++ +  +       K  G D FRVS+S
Sbjct: 182  DPITSPKSAEPE-----VPPT---DSEKSHVALDNKNMQT-------KVSGSDGFRVSVS 241

Query: 856  SAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALESAV 915
            SA SSSS GS  DD DALGDV+IWGE   D V+  G  +  S L+ + D L+PK LES +
Sbjct: 242  SAQSSSSHGSAADDSDALGDVYIWGEVICDNVVKVGIDKNASYLTTRTDVLVPKPLESNI 301

Query: 916  VLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDAL-GNTNIELV 975
            VLDV  IACG RHAA VT+QGEIFTWGEESGGRLGHG+  DV  P+L+++L   ++++ V
Sbjct: 302  VLDVHQIACGVRHAAFVTRQGEIFTWGEESGGRLGHGIGKDVFHPRLVESLTATSSVDFV 361

Query: 976  SCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSISCGPW 1035
            +CGE+HT AVTL+G+LYTWGDGT+N GLLGHG+++SHWIPK+I G LEG+HV+S+SCGPW
Sbjct: 362  ACGEFHTCAVTLAGELYTWGDGTHNVGLLGHGSDISHWIPKRIAGSLEGLHVASVSCGPW 421

Query: 1036 HTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTAAVVEV 1095
            HTA++TS G+LFTFGDGTFGVLGHGD+ +V  PREVESL GLRT+  +CGVWHTAAVVE+
Sbjct: 422  HTALITSYGRLFTFGDGTFGVLGHGDKETVQYPREVESLSGLRTIAVSCGVWHTAAVVEI 481

Query: 1096 MVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGHSLTVA 1155
            +V  S+SS+ SSGKLFTWGDGDK RLGHGDK+ +L PTCV AL++ NF +++CGHSLTV 
Sbjct: 482  IVTQSNSSSVSSGKLFTWGDGDKNRLGHGDKDPRLKPTCVPALIDYNFHKIACGHSLTVG 541

Query: 1156 LTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTEVY 1215
            LTTSG V+TMGS VYGQLGN   DGK+P  VE KL+  FVEEI+CGAYHVA LTSR EVY
Sbjct: 542  LTTSGQVFTMGSTVYGQLGNLQTDGKLPCLVEDKLASEFVEEISCGAYHVAALTSRNEVY 601

Query: 1216 TWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVDQSMCS 1275
            TWGKGANGRLGHGD +DR  PT+VEALKD+ VK IACG+N+TAAICLHKWVSG +QS CS
Sbjct: 602  TWGKGANGRLGHGDLEDRKVPTIVEALKDRHVKYIACGSNYTAAICLHKWVSGAEQSQCS 661

Query: 1276 GCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKALET 1335
             C L F F RKRHNCYNCGLV CHSCSSKK  +A++AP+  + YRVCD+CY KL K  E 
Sbjct: 662  TCRLAFGFTRKRHNCYNCGLVHCHSCSSKKAFRAALAPSAGRLYRVCDSCYVKLSKVSEI 721

Query: 1336 DASSQ--SSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARF--SSMESVKQVENQSSKK 1395
            + +++  S+V R    N+   +           KS  +LA+F  S+M+ +KQ++++++K+
Sbjct: 722  NDTNRRNSAVPRLSGENRDRLD-----------KSEIRLAKFGTSNMDLIKQLDSKAAKQ 781

Query: 1396 NKKFECNS-SRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 1455
             KK +  S  R S +P+       +  +   +  ++ K   A    S I SR+ SP SRR
Sbjct: 782  GKKTDTFSLGRNSQLPSLLQLKDAVQSNIGDMRRATPKLAQAP---SGISSRSVSPFSRR 841

Query: 1456 ASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELER 1515
            +SPPRS TP P+  GL  P    D+ K+TN+ L+QE++KL+ QV++LT+K + QEVEL+ 
Sbjct: 842  SSPPRSATPMPSTSGLYFPVGIADNMKKTNEILNQEIVKLRTQVDSLTQKCEFQEVELQN 901

Query: 1516 TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNI------------ 1575
            + K+ +EAL+ A  E+ K  AAKE IKSL AQLK++AE+LP G +  +            
Sbjct: 902  SVKKTQEALALAEEESAKSRAAKEAIKSLIAQLKDVAEKLPPGESVKLACLQNGLDQNGF 961

Query: 1576 -----------KSPSLASLGSSPLFNDVVTPSIDRSNGQTMSLEADIIESNSHL------ 1635
                       +S S+ S  SS    D    +   SN Q+        E NS+       
Sbjct: 962  HFPEENGFHPSRSESMTSSISSVAPFDFAFANASWSNLQSPKQTPRASERNSNAYPADPR 1021

Query: 1636 LSNGSSTASNRSSGHNRQGNSDSTTKNGNKVKESDSRHDAEWVEQDEPGVYITFTSLQGG 1695
            LS+  S  S R      Q NSD+ +        + ++ +AEW+EQ EPGVYIT  +L  G
Sbjct: 1022 LSSSGSVISERIEPFQFQNNSDNGSSQTG--VNNTNQVEAEWIEQYEPGVYITLVALHDG 1081

Query: 1696 AKDLKRVRFSRKRFTEKQAEQWWAENRARVYDQYNVRMIDKST 1702
             +DL+RVRFSR+RF E QAE WW+ENR +VY++YNVR+ +KST
Sbjct: 1082 TRDLRRVRFSRRRFGEHQAETWWSENREKVYEKYNVRVSEKST 1086

BLAST of Sgr023231 vs. ExPASy Swiss-Prot
Match: O04659 (Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E14 PE=2 SV=2)

HSP 1 Score: 740.7 bits (1911), Expect = 5.0e-212
Identity = 353/632 (55.85%), Positives = 469/632 (74.21%), Query Frame = 0

Query: 1700 STKLVFQTTDCPLDVSLWNALLSAYTKNFMFDEALQLFDQLKCHSHVRPDCYTYPVALKA 1759
            S + VF+  D   DV +WN+L+S Y+KN MF + L++F +L   S   PD +T+P  +KA
Sbjct: 57   SARHVFENFDIRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKA 116

Query: 1760 CGGLGRVVCGRRVHNHLIKTGLIWDVFVTSSLMNMYAKCNQFHDAINLFDELPHRDVGCW 1819
             G LGR   GR +H  ++K+G + DV V SSL+ MYAK N F +++ +FDE+P RDV  W
Sbjct: 117  YGALGREFLGRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASW 176

Query: 1820 NTVISCYFQDDKAETALKMFDKMKDSGFEPNSVTFTIVISSCTRLLNLERGKEIRRELME 1879
            NTVISC++Q  +AE AL++F +M+ SGFEPNSV+ T+ IS+C+RLL LERGKEI R+ ++
Sbjct: 177  NTVISCFYQSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVK 236

Query: 1880 SGVLLDAFVLSALVDMYGKCGCLEMAKEVFEQIPRKNAITWNSMITGYSLKGDSRSCIEL 1939
             G  LD +V SALVDMYGKC CLE+A+EVF+++PRK+ + WNSMI GY  KGDS+SC+E+
Sbjct: 237  KGFELDEYVNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEI 296

Query: 1940 LKRMNDEGTKPTLSTLTSIISASSRSVQPWHGKFIHGYFLRNRMDADIFIDISLIDLYFK 1999
            L RM  EGT+P+ +TLTSI+ A SRS    HGKFIHGY +R+ ++ADI+++ SLIDLYFK
Sbjct: 297  LNRMIIEGTRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFK 356

Query: 2000 CGYVASAETIFRNISKNEVVSWNVMISGYVMVGKHIQALRTYDNMKEHCVKPDAVTFSST 2059
            CG    AET+F    K+   SWNVMIS Y+ VG   +A+  YD M    VKPD VTF+S 
Sbjct: 357  CGEANLAETVFSKTQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSV 416

Query: 2060 LSACSQLAALEKGRELHNCIISHKLETNEIVMGALLDMYAKCGDVDEARKLFHRIPERDL 2119
            L ACSQLAALEKG+++H  I   +LET+E+++ ALLDMY+KCG+  EA ++F+ IP++D+
Sbjct: 417  LPACSQLAALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRIFNSIPKKDV 476

Query: 2120 VSWTTMITAYGSHGQPSEALRIFDEMQKSNIQADSVTFLAVLSACSHAGLVDEGYRYFNE 2179
            VSWT MI+AYGSHGQP EAL  FDEMQK  ++ D VT LAVLSAC HAGL+DEG ++F++
Sbjct: 477  VSWTVMISAYGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACGHAGLIDEGLKFFSQ 536

Query: 2180 MIIQYDIKPGIEHNSCLIDLLGRAGRLCEAYEILQRSEETRNDIGLLSTLFSACRLHNNF 2239
            M  +Y I+P IEH SC+ID+LGRAGRL EAYEI+Q++ ET ++  LLSTLFSAC LH   
Sbjct: 537  MRSKYGIEPIIEHYSCMIDILGRAGRLLEAYEIIQQTPETSDNAELLSTLFSACCLHLEH 596

Query: 2240 VLGIEIGKMLVEVDPDDPSTYILLSNMYASVNKWEEVRKVRQKMKELGLNKSPGCSWIEI 2299
             LG  I ++LVE  PDD STY++L N+YAS   W+  R+VR KMKE+GL K PGCSWIE+
Sbjct: 597  SLGDRIARLLVENYPDDASTYMVLFNLYASGESWDAARRVRLKMKEMGLRKKPGCSWIEM 656

Query: 2300 NQRIQPFFVEDKSNPLVDGVYECLSSLARHME 2332
            + ++  FF ED+S+   + VYECL+ L+ HME
Sbjct: 657  SDKVCHFFAEDRSHLRAENVYECLALLSGHME 688

BLAST of Sgr023231 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 442.6 bits (1137), Expect = 2.8e-122
Identity = 223/636 (35.06%), Positives = 363/636 (57.08%), Query Frame = 0

Query: 1697 IDKSTKLVFQTTDCPLDVSLWNALLSAYTKNFMFDEALQLFDQLKCHSHVRPDCYTYPVA 1756
            +D++ + VF+  D  L+V L++ +L  + K    D+ALQ F +++ +  V P  Y +   
Sbjct: 85   VDEAAR-VFEPIDSKLNV-LYHTMLKGFAKVSDLDKALQFFVRMR-YDDVEPVVYNFTYL 144

Query: 1757 LKACGGLGRVVCGRRVHNHLIKTGLIWDVFVTSSLMNMYAKCNQFHDAINLFDELPHRDV 1816
            LK CG    +  G+ +H  L+K+G   D+F  + L NMYAKC Q ++A  +FD +P RD+
Sbjct: 145  LKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDL 204

Query: 1817 GCWNTVISCYFQDDKAETALKMFDKMKDSGFEPNSVTFTIVISSCTRLLNLERGKEIRRE 1876
              WNT+++ Y Q+  A  AL+M   M +   +P+ +T   V+ + + L  +  GKEI   
Sbjct: 205  VSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGY 264

Query: 1877 LMESGVLLDAFVLSALVDMYGKCGCLEMAKEVFEQIPRKNAITWNSMITGYSLKGDSRSC 1936
             M SG      + +ALVDMY KCG LE A+++F+ +  +N ++WNSMI  Y    + +  
Sbjct: 265  AMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEA 324

Query: 1937 IELLKRMNDEGTKPTLSTLTSIISASSRSVQPWHGKFIHGYFLRNRMDADIFIDISLIDL 1996
            + + ++M DEG KPT  ++   + A +       G+FIH   +   +D ++ +  SLI +
Sbjct: 325  MLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISM 384

Query: 1997 YFKCGYVASAETIFRNISKNEVVSWNVMISGYVMVGKHIQALRTYDNMKEHCVKPDAVTF 2056
            Y KC  V +A ++F  +    +VSWN MI G+   G+ I AL  +  M+   VKPD  T+
Sbjct: 385  YCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTY 444

Query: 2057 SSTLSACSQLAALEKGRELHNCIISHKLETNEIVMGALLDMYAKCGDVDEARKLFHRIPE 2116
             S ++A ++L+     + +H  ++   L+ N  V  AL+DMYAKCG +  AR +F  + E
Sbjct: 445  VSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSE 504

Query: 2117 RDLVSWTTMITAYGSHGQPSEALRIFDEMQKSNIQADSVTFLAVLSACSHAGLVDEGYRY 2176
            R + +W  MI  YG+HG    AL +F+EMQK  I+ + VTFL+V+SACSH+GLV+ G + 
Sbjct: 505  RHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKC 564

Query: 2177 FNEMIIQYDIKPGIEHNSCLIDLLGRAGRLCEAYEILQRSEETRNDIGLLSTLFSACRLH 2236
            F  M   Y I+  ++H   ++DLLGRAGRL EA++ + +    +  + +   +  AC++H
Sbjct: 565  FYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQM-PVKPAVNVYGAMLGACQIH 624

Query: 2237 NNFVLGIEIGKMLVEVDPDDPSTYILLSNMYASVNKWEEVRKVRQKMKELGLNKSPGCSW 2296
             N     +  + L E++PDD   ++LL+N+Y + + WE+V +VR  M   GL K+PGCS 
Sbjct: 625  KNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSM 684

Query: 2297 IEINQRIQPFFVEDKSNPLVDGVYECLSSLARHMEK 2333
            +EI   +  FF    ++P    +Y  L  L  H+++
Sbjct: 685  VEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKE 716

BLAST of Sgr023231 vs. ExPASy Swiss-Prot
Match: Q0WN60 (Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H8 PE=2 SV=2)

HSP 1 Score: 421.0 bits (1081), Expect = 8.7e-116
Identity = 243/841 (28.89%), Positives = 423/841 (50.30%), Query Frame = 0

Query: 1595 NSHLLSNGSSTASN---RSSGHNRQGNSDSTTKNGNK-VKESDSRHDAEWVEQDEPGVYI 1654
            NS+ +S+ S+ A++   R S     G+ D + +   + V + +S  DA  + ++  G+ +
Sbjct: 32   NSNSISSNSTNANHFLRRISNFCETGDLDKSFRTVQEFVGDDESSSDAFLLVREALGLLL 91

Query: 1655 TFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYDQYNVRMIDKSTKLVFQTTDC 1714
                  G  KD++  R   +  +     +       R+   Y +      ++ VF     
Sbjct: 92   ---QASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVFDALRS 151

Query: 1715 PLDVSLWNALLSAYTKNFMFDEALQLFDQLKCHSHVRPDCYTYPVALKACGGLGRVVCGR 1774
              ++  WNA++S+Y++N ++DE L+ F ++   + + PD +TYP  +KAC G+  V  G 
Sbjct: 152  K-NLFQWNAVISSYSRNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGIGL 211

Query: 1775 RVHNHLIKTGLIWDVFVTSSLMNMYAKCNQFHDAINLFDELPHRDVGCWNTVISCYFQDD 1834
             VH  ++KTGL+ DVFV ++L++ Y       DA+ LFD +P R++  WN++I  +  + 
Sbjct: 212  AVHGLVVKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDNG 271

Query: 1835 KAETAL----KMFDKMKDSGFEPNSVTFTIVISSCTRLLNLERGKEIRRELMESGVLLDA 1894
             +E +     +M ++  D  F P+  T   V+  C R   +  GK +    ++  +  + 
Sbjct: 272  FSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKEL 331

Query: 1895 FVLSALVDMYGKCGCLEMAKEVFEQIPRKNAITWNSMITGYSLKGDSRSCIELLKRM--- 1954
             + +AL+DMY KCGC+  A+ +F+    KN ++WN+M+ G+S +GD+    ++L++M   
Sbjct: 332  VLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQMLAG 391

Query: 1955 ------------------------------------------------------------ 2014
                                                                        
Sbjct: 392  GEDVKADEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGSLS 451

Query: 2015 ---------------------------ND-------------EGTKPTLSTLTSIISASS 2074
                                       ND              G  P   T+ S++SA S
Sbjct: 452  YAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSACS 511

Query: 2075 RSVQPWHGKFIHGYFLRNRMDADIFIDISLIDLYFKCGYVASAETIFRNISKNEVVSWNV 2134
            +      GK +HG+ +RN ++ D+F+ +S++ LY  CG + + + +F  +    +VSWN 
Sbjct: 512  KLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSWNT 571

Query: 2135 MISGYVMVGKHIQALRTYDNMKEHCVKPDAVTFSSTLSACSQLAALEKGRELHNCIISHK 2194
            +I+GY+  G   +AL  +  M  + ++   ++      ACS L +L  GRE H   + H 
Sbjct: 572  VITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALKHL 631

Query: 2195 LETNEIVMGALLDMYAKCGDVDEARKLFHRIPERDLVSWTTMITAYGSHGQPSEALRIFD 2254
            LE +  +  +L+DMYAK G + ++ K+F+ + E+   SW  MI  YG HG   EA+++F+
Sbjct: 632  LEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKLFE 691

Query: 2255 EMQKSNIQADSVTFLAVLSACSHAGLVDEGYRYFNEMIIQYDIKPGIEHNSCLIDLLGRA 2314
            EMQ++    D +TFL VL+AC+H+GL+ EG RY ++M   + +KP ++H +C+ID+LGRA
Sbjct: 692  EMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGRA 751

Query: 2315 GRLCEAYEILQRSEETRNDIGLLSTLFSACRLHNNFVLGIEIGKMLVEVDPDDPSTYILL 2325
            G+L +A  ++        D+G+  +L S+CR+H N  +G ++   L E++P+ P  Y+LL
Sbjct: 752  GQLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYVLL 811

BLAST of Sgr023231 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 1.9e-115
Identity = 216/620 (34.84%), Positives = 362/620 (58.39%), Query Frame = 0

Query: 1713 DVSLWNALLSAYTKNFMFDEALQLFDQLKCHSHVRPDCYTYPVALKACGGLGRVVCGRRV 1772
            D+  WN+L+S Y+ +  ++EAL+++ +LK +S + PD +T    L A G L  V  G+ +
Sbjct: 171  DLVSWNSLISGYSSHGYYEEALEIYHELK-NSWIVPDSFTVSSVLPAFGNLLVVKQGQGL 230

Query: 1773 HNHLIKTGLIWDVFVTSSLMNMYAKCNQFHDAINLFDELPHRDVGCWNTVISCYFQDDKA 1832
            H   +K+G+   V V + L+ MY K  +  DA  +FDE+  RD   +NT+I  Y + +  
Sbjct: 231  HGFALKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMV 290

Query: 1833 ETALKMFDKMKDSGFEPNSVTFTIVISSCTRLLNLERGKEIRRELMESGVLLDAFVLSAL 1892
            E +++MF +  D  F+P+ +T + V+ +C  L +L   K I   ++++G +L++ V + L
Sbjct: 291  EESVRMFLENLDQ-FKPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNIL 350

Query: 1893 VDMYGKCGCLEMAKEVFEQIPRKNAITWNSMITGYSLKGDSRSCIELLKRMNDEGTKPTL 1952
            +D+Y KCG +  A++VF  +  K+ ++WNS+I+GY   GD    ++L K M     +   
Sbjct: 351  IDVYAKCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADH 410

Query: 1953 STLTSIISASSRSVQPWHGKFIHGYFLRNRMDADIFIDISLIDLYFKCGYVASAETIFRN 2012
             T   +IS S+R      GK +H   +++ +  D+ +  +LID+Y KCG V  +  IF +
Sbjct: 411  ITYLMLISVSTRLADLKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSS 470

Query: 2013 ISKNEVVSWNVMISGYVMVGKHIQALRTYDNMKEHCVKPDAVTFSSTLSACSQLAALEKG 2072
            +   + V+WN +IS  V  G     L+    M++  V PD  TF  TL  C+ LAA   G
Sbjct: 471  MGTGDTVTWNTVISACVRFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLG 530

Query: 2073 RELHNCIISHKLETNEIVMGALLDMYAKCGDVDEARKLFHRIPERDLVSWTTMITAYGSH 2132
            +E+H C++    E+   +  AL++MY+KCG ++ + ++F R+  RD+V+WT MI AYG +
Sbjct: 531  KEIHCCLLRFGYESELQIGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMY 590

Query: 2133 GQPSEALRIFDEMQKSNIQADSVTFLAVLSACSHAGLVDEGYRYFNEMIIQYDIKPGIEH 2192
            G+  +AL  F +M+KS I  DSV F+A++ ACSH+GLVDEG   F +M   Y I P IEH
Sbjct: 591  GEGEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEH 650

Query: 2193 NSCLIDLLGRAGRLCEAYEILQRSEETRNDIGLLSTLFSACRLHNNFVLGIEIGKMLVEV 2252
             +C++DLL R+ ++ +A E +Q +   + D  + +++  ACR   +      + + ++E+
Sbjct: 651  YACVVDLLSRSQKISKAEEFIQ-AMPIKPDASIWASVLRACRTSGDMETAERVSRRIIEL 710

Query: 2253 DPDDPSTYILLSNMYASVNKWEEVRKVRQKMKELGLNKSPGCSWIEINQRIQPFFVEDKS 2312
            +PDDP   IL SN YA++ KW++V  +R+ +K+  + K+PG SWIE+ + +  F   D S
Sbjct: 711  NPDDPGYSILASNAYAALRKWDKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDS 770

Query: 2313 NPLVDGVYECLSSLARHMEK 2333
             P  + +Y+ L  L   M K
Sbjct: 771  APQSEAIYKSLEILYSLMAK 787

BLAST of Sgr023231 vs. ExPASy TrEMBL
Match: W9RV99 (E3 ubiquitin-protein ligase HERC2 OS=Morus notabilis OX=981085 GN=L484_001795 PE=4 SV=1)

HSP 1 Score: 2141.3 bits (5547), Expect = 0.0e+00
Identity = 1167/1728 (67.53%), Positives = 1293/1728 (74.83%), Query Frame = 0

Query: 1    MHSEENKDFTEKNKKRKLKTPNQVIALEKFYNEHKYPTEEMKSQLAEQLGLTEKQISGWF 60
            +HS+ENK  ++ NKKR+LKTP+QV+ALEKFYNEHKYPTEEMKS+LAE+LGLTEKQISGWF
Sbjct: 7    VHSDENK-VSQDNKKRQLKTPSQVMALEKFYNEHKYPTEEMKSELAEELGLTEKQISGWF 66

Query: 61   CHRRLKDKRF--CETYASVRQDRSSGVIQDHGSGLAQDSCGSTKNGDYWHIDPREVESQK 120
            CHRRLKDKR    E  +S RQ+R SG+IQD GSG  QDSCGSTK+ DY H+DPREVES++
Sbjct: 67   CHRRLKDKRSLKVEKCSSGRQERLSGIIQDLGSGFGQDSCGSTKHADYRHVDPREVESRR 126

Query: 121  PY--GHEHPATDNVLERRSQFTENVSNMDNTSSESSSSLKDRLLSQSENPYDTEVSRYLT 180
             Y  GH+ PA D   E RS +TE VS MDNTSSESSSSL+D   S +E+P+  E SRYL 
Sbjct: 127  LYDKGHDFPAADLSHENRSHYTERVSGMDNTSSESSSSLRDG-FSPTEDPHIVESSRYLA 186

Query: 181  HDGAIPPSNPKVLNSLRYKPSGYLKVKGEVENAAITAVKRQLGVQYREDGPPLGVEFQPL 240
             DG + P N K    + YKPSGYLKVKGE+ENAAITAVKRQLG QYREDGPPLGVEF PL
Sbjct: 187  QDGLVAPLNSKGARHMGYKPSGYLKVKGEIENAAITAVKRQLGRQYREDGPPLGVEFDPL 246

Query: 241  PPGAFESPAKGPIHDSYYVGNPVLPHSPDILTMKKQRALGSRYEMHSSNMSSQDSYMEEA 300
            PPGAFESP + P+H+ YY G PVL HSPDI  +K+Q +  +RYE+H+S +SS+DSY++EA
Sbjct: 247  PPGAFESPIRDPVHEPYYAGIPVLSHSPDISVVKRQPSPSTRYEVHNSKLSSRDSYLQEA 306

Query: 301  IPTSTTCKPESQEKNSVYQLKKSSNYYNKTDPFPRQNSPLNMYEESGGLTFSSSSKRDHK 360
                       QEK    QL++ S Y + T  FP +NS L+M ++S     S  S R  K
Sbjct: 307  --PGIMHGVNHQEKKHCNQLRQKSTYLDHTSNFPGRNSSLDMCDDS-----SYKSNRSRK 366

Query: 361  MSSSYNIPRSRTDSVSINHGSYTSKVASEQTEMQLHNHGSVGSKSFNRS----------- 420
            M S +      +DS   + G Y  K+AS+ ++  LH    +  K   RS           
Sbjct: 367  MGSKHGAEGMTSDSFLNHQGHYGGKIASKPSQSGLHEDDVLSPKIVQRSEHSKFKASIST 426

Query: 421  ----GYLDYNSKKMS-----KEMFNGEAKPVNECSDPVRVKIPSSNELAVANRCQLDFPR 480
                G  D   K +S     ++ F GE K + +    V+VK+   +E+ VA R ++DFPR
Sbjct: 427  RNHCGTPDIEEKGVSTMMAQEDKFGGEGKAMKD----VKVKMRPVSEMLVAKRVKVDFPR 486

Query: 481  PDYAVKASFSEKPGRKNHTRRHQERTKPCSNLDQYSRYDGSAETGNQDLPDFGLLQLGEM 540
             +    +SFSE   RKNH +                                GL      
Sbjct: 487  QENVTNSSFSEMLPRKNHMK--------------------------------GL------ 546

Query: 541  EVLVLIRQRHNSNFCSLFTRISGVSPLFSAISILKPSSIVLQSQCSFKRWLDESDWSCIV 600
                                                                        
Sbjct: 547  ------------------------------------------------------------ 606

Query: 601  EMDVFSIVDWNCSLNLSFRRRILRVQKEEILMSRMDRMASDLNRNGSVERDIEQSIIALK 660
                                                                        
Sbjct: 607  ------------------------------------------------------------ 666

Query: 661  KGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHLKLSHVSRIISGQRTPIFQRYP 720
                                   DESVLIW SGKEEKHLKLSHVSRIISGQRTPIFQRYP
Sbjct: 667  -----------------------DESVLIWISGKEEKHLKLSHVSRIISGQRTPIFQRYP 726

Query: 721  RPEKEYQSFSLIYNERSLDLICKDKDEAEVWFNGLKTLISRSHHRKWRTESRSDGMQSEA 780
            RPEKEYQSFSLIYN+RSLDLICKDKDEAEVWF+GLK LISRSHHRKWRTESRSDG+ SEA
Sbjct: 727  RPEKEYQSFSLIYNDRSLDLICKDKDEAEVWFSGLKALISRSHHRKWRTESRSDGIPSEA 786

Query: 781  NSPRTYTRRSSPLNSPFGSNDSLQKDGD--FRLHSPYESPPKNGMDKALSDVILYAVPPK 840
            NSPRT TRRSSPL+SPFGSNDSLQKDG    RLHSPYESPPKNG+DKALSDVILYAVPPK
Sbjct: 787  NSPRTCTRRSSPLHSPFGSNDSLQKDGSDHLRLHSPYESPPKNGLDKALSDVILYAVPPK 846

Query: 841  GFFPSDSASISVNSLSSGSSD-MHGPMKAMGIDAFRVSLSSAVSSSSQGSGHDDGDALGD 900
            GFFPSDSAS SV+SLSSG SD +HG +KAM +DAFRVSLSSAVSS SQGSGHDDGDALGD
Sbjct: 847  GFFPSDSASASVHSLSSGGSDSVHGHVKAMPVDAFRVSLSSAVSSLSQGSGHDDGDALGD 906

Query: 901  VFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALESAVVLDVQNIACGGRHAALVTKQ 960
            VFIWGEG GDGVLG G HRVGSC S K+DSLLPK LESAVVLDVQN+ACGGRHAALVTKQ
Sbjct: 907  VFIWGEGMGDGVLGSGPHRVGSCFSGKIDSLLPKRLESAVVLDVQNVACGGRHAALVTKQ 966

Query: 961  GEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNTNIELVSCGEYHTSAVTLSGDLYTWGD 1020
            GEIF+WGEESGGRLGHGVDSDVLQPKLIDAL  TNIE V+CGEYHT AVTLSG+LYTWGD
Sbjct: 967  GEIFSWGEESGGRLGHGVDSDVLQPKLIDALSTTNIEFVACGEYHTCAVTLSGELYTWGD 1026

Query: 1021 GTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSISCGPWHTAVVTSAGQLFTFGDGTFGV 1080
            GTYNFGLLGHGNEVSHW+PK++NGPLEGIHVS ISCGPWHTAVVTSAGQLFTFGDGTFGV
Sbjct: 1027 GTYNFGLLGHGNEVSHWMPKRVNGPLEGIHVSYISCGPWHTAVVTSAGQLFTFGDGTFGV 1086

Query: 1081 LGHGDRNSVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVGSSSSSNCSSGKLFTWGDG 1140
            LGHGDR SVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVG+SSSSNCSSGKLFTWGDG
Sbjct: 1087 LGHGDRTSVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVGNSSSSNCSSGKLFTWGDG 1146

Query: 1141 DKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGHSLTVALTTSGHVYTMGSPVYGQLGNP 1200
            DKGRLGHG+KE +LVPTCVAALVEPNFC+V+CGHSLTVALTTSGHVYTMGSPVYGQLGNP
Sbjct: 1147 DKGRLGHGEKEARLVPTCVAALVEPNFCQVACGHSLTVALTTSGHVYTMGSPVYGQLGNP 1206

Query: 1201 HADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTEVYTWGKGANGRLGHGDTDDRNSP 1260
             ADGK+P RVEGK SK FVEEIACGAYHVAVLTS+TEVYTWGKGANGRLGHGD DDRNSP
Sbjct: 1207 QADGKLPTRVEGKHSKRFVEEIACGAYHVAVLTSKTEVYTWGKGANGRLGHGDIDDRNSP 1266

Query: 1261 TLVEALKDKQVKSIACGTNFTAAICLHKWVSGVDQSMCSGCHLPFNFKRKRHNCYNCGLV 1320
            TLVEALKDKQVKSIACGTNFTAAICLHKWVS +DQSMCSGC LPFNFKRKRHNCYNCG V
Sbjct: 1267 TLVEALKDKQVKSIACGTNFTAAICLHKWVSEIDQSMCSGCRLPFNFKRKRHNCYNCGFV 1326

Query: 1321 FCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKALETDASSQSSVSRRRSINQGTTEF 1380
            FCHSCSSKK  KASMAPNPNKPYRVCDNC+NKLRKA+ETD+SS  SVSRR SINQG+ EF
Sbjct: 1327 FCHSCSSKKSLKASMAPNPNKPYRVCDNCFNKLRKAIETDSSSH-SVSRRGSINQGSNEF 1386

Query: 1381 VEKDEKPESVKSRAQLARFSSMESVKQVENQSSKKNKKFECNSSRVSPVPNGGSQWGVIS 1440
            ++K+EK +S +SRAQLARFSSMES+KQVE +SSKKNKK E NSSRVSPVPNGGSQWG I 
Sbjct: 1387 IDKEEKLDS-RSRAQLARFSSMESLKQVETRSSKKNKKLEFNSSRVSPVPNGGSQWGAI- 1446

Query: 1441 KSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASPPRSTTPTPTLGGLTSPKIAVDDG 1500
            KSFNP FGSSKKFFSASVPGSRIVSRATSPISRR SPPR+TTPTPTL GLTSPKI VD+ 
Sbjct: 1447 KSFNPGFGSSKKFFSASVPGSRIVSRATSPISRRPSPPRATTPTPTLEGLTSPKIGVDNT 1506

Query: 1501 KRTNDSLSQEVIKLKAQVENLTRKAQLQEVELERTTKQLKEALSFAAGEATKCNAAKEVI 1560
            KRTNDSLSQEVIKL+AQVENLTR+AQLQEVELERTTKQLKEAL+ A  E  KC AAKEVI
Sbjct: 1507 KRTNDSLSQEVIKLRAQVENLTRQAQLQEVELERTTKQLKEALAIAGEETAKCKAAKEVI 1537

Query: 1561 KSLTAQLKEMAERLPVGAARNIKSPSLASLGSSPLFNDVVTPSIDRSNGQTMSLEADIIE 1620
            KSLTAQLK+MAERLPVGAARN+KSPSLASLGS  + +DV  PS+DR N Q +S E D   
Sbjct: 1567 KSLTAQLKDMAERLPVGAARNVKSPSLASLGSDLVGSDVSNPSVDRLNSQILSQEPDSNG 1537

Query: 1621 SNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGNKVKESDSRHDAEWVEQDEPGVYITFT 1680
            S+S L SNGS+T +NRSS HN+QG+SD TT+NG + K+ DSR+D EWVEQDEPGVYIT T
Sbjct: 1627 SHSQLHSNGSTTTANRSSSHNKQGHSDVTTRNGTRTKDIDSRNDTEWVEQDEPGVYITLT 1537

Query: 1681 SLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYDQYNVRMIDKST 1702
            SL GGAKDLKRVRFSRKRF+EKQAEQWWAENRARVY+QYNVRMIDKS+
Sbjct: 1687 SLPGGAKDLKRVRFSRKRFSEKQAEQWWAENRARVYEQYNVRMIDKSS 1537

BLAST of Sgr023231 vs. ExPASy TrEMBL
Match: A0A6J1DFK1 (uncharacterized protein LOC111020619 OS=Momordica charantia OX=3673 GN=LOC111020619 PE=4 SV=1)

HSP 1 Score: 2122.1 bits (5497), Expect = 0.0e+00
Identity = 1067/1094 (97.53%), Positives = 1080/1094 (98.72%), Query Frame = 0

Query: 608  MSRMDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 667
            MSRMDRMASDLNRNG VERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF
Sbjct: 1    MSRMDRMASDLNRNGPVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 60

Query: 668  SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVW 727
            SGKEEK LKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN+RSLDLICKDKDEAEVW
Sbjct: 61   SGKEEKLLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNDRSLDLICKDKDEAEVW 120

Query: 728  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 787
            FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL
Sbjct: 121  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 180

Query: 788  HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDA 847
            HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAM IDA
Sbjct: 181  HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMAIDA 240

Query: 848  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPK 907
            FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDG+LGGGSHRVGS LSIKMDSLLPK
Sbjct: 241  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGILGGGSHRVGSSLSIKMDSLLPK 300

Query: 908  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT 967
            ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT
Sbjct: 301  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT 360

Query: 968  NIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 1027
            NIELVSCGEYHT AVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI
Sbjct: 361  NIELVSCGEYHTCAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 420

Query: 1028 SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA 1087
            SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA
Sbjct: 421  SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA 480

Query: 1088 AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH 1147
            AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH
Sbjct: 481  AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH 540

Query: 1148 SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS 1207
            SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVR+EGK+SKSFVEEIACGAYHVAVLTS
Sbjct: 541  SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRIEGKISKSFVEEIACGAYHVAVLTS 600

Query: 1208 RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 1267
            +TEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD
Sbjct: 601  KTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 660

Query: 1268 QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 1327
            QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR
Sbjct: 661  QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 720

Query: 1328 KALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSK 1387
            KALETDASSQSSVSRRRSINQG+TEFVEKDEKPESVKSRAQLARFSSMESVKQVENQS K
Sbjct: 721  KALETDASSQSSVSRRRSINQGSTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSYK 780

Query: 1388 KNKKFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 1447
            KNKKF+CNSSRVSPVPNGGSQWG ISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR
Sbjct: 781  KNKKFDCNSSRVSPVPNGGSQWGAISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 840

Query: 1448 ASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELER 1507
            ASPPRSTTPTPTLGGLTSPKIAVDD KRTNDSL QEV+KLKAQVENLTRKAQLQEVELER
Sbjct: 841  ASPPRSTTPTPTLGGLTSPKIAVDDAKRTNDSLRQEVVKLKAQVENLTRKAQLQEVELER 900

Query: 1508 TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 1567
            TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARN+KSPSLASLGSSP
Sbjct: 901  TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNVKSPSLASLGSSP 960

Query: 1568 LFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGN 1627
             FN+VV PSIDRSNGQTM LEA+IIESNSHLLSNGSSTASNRSSGHNRQGNSDS  +NGN
Sbjct: 961  PFNEVVAPSIDRSNGQTMPLEAEIIESNSHLLSNGSSTASNRSSGHNRQGNSDSAARNGN 1020

Query: 1628 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1687
            KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR
Sbjct: 1021 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1080

Query: 1688 VYDQYNVRMIDKST 1702
            VYDQYNVRMIDKS+
Sbjct: 1081 VYDQYNVRMIDKSS 1094

BLAST of Sgr023231 vs. ExPASy TrEMBL
Match: A0A0A0KEK0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G432780 PE=4 SV=1)

HSP 1 Score: 2099.7 bits (5439), Expect = 0.0e+00
Identity = 1055/1094 (96.44%), Positives = 1073/1094 (98.08%), Query Frame = 0

Query: 608  MSRMDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 667
            MSRMDRM SDLNRNG VERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF
Sbjct: 1    MSRMDRMTSDLNRNGPVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 60

Query: 668  SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVW 727
            SGKEEK LKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN+RSLDLICKDKDEAEVW
Sbjct: 61   SGKEEKLLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNDRSLDLICKDKDEAEVW 120

Query: 728  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 787
            FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL
Sbjct: 121  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 180

Query: 788  HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDA 847
             SPY SPPKNGMDKALSDVILY VPPKGFFPSDSASISVNSLSSGSS+MHGPMKAMGIDA
Sbjct: 181  QSPYGSPPKNGMDKALSDVILYTVPPKGFFPSDSASISVNSLSSGSSEMHGPMKAMGIDA 240

Query: 848  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPK 907
            FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSH+VGSC S+KMDSLLPK
Sbjct: 241  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHKVGSCFSLKMDSLLPK 300

Query: 908  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT 967
            ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKL+DALGNT
Sbjct: 301  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLVDALGNT 360

Query: 968  NIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 1027
            NIELVSCGEYHT AVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI
Sbjct: 361  NIELVSCGEYHTCAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 420

Query: 1028 SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA 1087
            +CGPWHTAVVTSAGQLFTFGDGTFGVLGHGDR SV+MPREVESLKGLRTVRAACGVWHTA
Sbjct: 421  ACGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRGSVTMPREVESLKGLRTVRAACGVWHTA 480

Query: 1088 AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH 1147
            AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALV+PNFCRVSCGH
Sbjct: 481  AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVDPNFCRVSCGH 540

Query: 1148 SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS 1207
            S+TVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS
Sbjct: 541  SMTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS 600

Query: 1208 RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 1267
            RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSG D
Sbjct: 601  RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGFD 660

Query: 1268 QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 1327
            QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR
Sbjct: 661  QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 720

Query: 1328 KALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSK 1387
            KALETDASSQSSVSRRRSINQG+T+FVEK+EKPESVKSRAQLARFSSMESVKQ ENQ SK
Sbjct: 721  KALETDASSQSSVSRRRSINQGSTDFVEKEEKPESVKSRAQLARFSSMESVKQGENQFSK 780

Query: 1388 KNKKFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 1447
            KNKKFECNSSRVSPVPNGGSQWG ISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR
Sbjct: 781  KNKKFECNSSRVSPVPNGGSQWGAISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 840

Query: 1448 ASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELER 1507
            ASPPRSTTPTPTLGGLTSPKIAVDD KRTNDSLSQEV+KLKAQVENLTRKAQLQEVE+ER
Sbjct: 841  ASPPRSTTPTPTLGGLTSPKIAVDDAKRTNDSLSQEVVKLKAQVENLTRKAQLQEVEMER 900

Query: 1508 TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 1567
            TTKQLKEAL+FAA EATKCNAAKEVI SLTAQLKEMAERLPVGAARNIKSPSLASLGSSP
Sbjct: 901  TTKQLKEALAFAAAEATKCNAAKEVIMSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 960

Query: 1568 LFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGN 1627
             FNDVVTPSIDRSNGQTMSLEAD+IESNSHLLSNGSSTAS RSSGHNR  NSDSTT+NGN
Sbjct: 961  PFNDVVTPSIDRSNGQTMSLEADVIESNSHLLSNGSSTASIRSSGHNRPANSDSTTRNGN 1020

Query: 1628 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1687
            KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR
Sbjct: 1021 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1080

Query: 1688 VYDQYNVRMIDKST 1702
            VYDQYNVR IDKS+
Sbjct: 1081 VYDQYNVRTIDKSS 1094

BLAST of Sgr023231 vs. ExPASy TrEMBL
Match: A0A5A7V3C5 (Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold212G001530 PE=4 SV=1)

HSP 1 Score: 2095.9 bits (5429), Expect = 0.0e+00
Identity = 1051/1091 (96.33%), Positives = 1072/1091 (98.26%), Query Frame = 0

Query: 611  MDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGK 670
            MDRM SDLNRNG VERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGK
Sbjct: 1    MDRMTSDLNRNGPVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGK 60

Query: 671  EEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVWFNG 730
            EEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN+RSLDLICKDKDEAEVWFNG
Sbjct: 61   EEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNDRSLDLICKDKDEAEVWFNG 120

Query: 731  LKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRLHSP 790
            LKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL SP
Sbjct: 121  LKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRLQSP 180

Query: 791  YESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDAFRV 850
            Y SPPKNGMDKALSDVILY VPPKGFFPSDSAS+SVNSLSSGSS+MHGPMKAM IDAFRV
Sbjct: 181  YGSPPKNGMDKALSDVILYTVPPKGFFPSDSASMSVNSLSSGSSEMHGPMKAMAIDAFRV 240

Query: 851  SLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALE 910
            SLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSH+VGSC S+KMDSLLPKALE
Sbjct: 241  SLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHKVGSCFSLKMDSLLPKALE 300

Query: 911  SAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNTNIE 970
            SAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKL+DALGNTNIE
Sbjct: 301  SAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLVDALGNTNIE 360

Query: 971  LVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSISCG 1030
            LVSCGEYHT AVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI+CG
Sbjct: 361  LVSCGEYHTCAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSIACG 420

Query: 1031 PWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTAAVV 1090
            PWHTAVVTSAGQLFTFGDGTFGVLGHGDR+SV+MPREVESLKGLRTVRAACGVWHTAAVV
Sbjct: 421  PWHTAVVTSAGQLFTFGDGTFGVLGHGDRSSVTMPREVESLKGLRTVRAACGVWHTAAVV 480

Query: 1091 EVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGHSLT 1150
            EVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALV+PNFCRVSCGHS+T
Sbjct: 481  EVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVDPNFCRVSCGHSMT 540

Query: 1151 VALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTE 1210
            VALTTSG VYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTE
Sbjct: 541  VALTTSGQVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTE 600

Query: 1211 VYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVDQSM 1270
            VYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSG DQSM
Sbjct: 601  VYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGFDQSM 660

Query: 1271 CSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKAL 1330
            CSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKAL
Sbjct: 661  CSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKAL 720

Query: 1331 ETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSKKNK 1390
            ETDASSQSSVSRRRSINQG+T+FVEKDEKPESVKSRAQLARFSSMESVKQ E+Q SKKNK
Sbjct: 721  ETDASSQSSVSRRRSINQGSTDFVEKDEKPESVKSRAQLARFSSMESVKQGESQFSKKNK 780

Query: 1391 KFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASP 1450
            KFECNSSRVSPVPNGGSQWG ISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASP
Sbjct: 781  KFECNSSRVSPVPNGGSQWGAISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASP 840

Query: 1451 PRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELERTTK 1510
            PRSTTPTPTLGGLTSPKIAVDD KRTNDSLSQEV+KLKAQVENLTRKAQLQEVE+ERTTK
Sbjct: 841  PRSTTPTPTLGGLTSPKIAVDDAKRTNDSLSQEVVKLKAQVENLTRKAQLQEVEMERTTK 900

Query: 1511 QLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSPLFN 1570
            QLKEAL+FAA EATKCNAAKEVI SLTAQLKEMAERLPVGAARNIKSP+LASLGSSP FN
Sbjct: 901  QLKEALAFAAAEATKCNAAKEVIMSLTAQLKEMAERLPVGAARNIKSPTLASLGSSPPFN 960

Query: 1571 DVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGNKVK 1630
            DVVTPSIDRSNGQTMSLEAD+IESNSHLLSNGSSTAS RSSGHNRQGNSDSTT+NGNKVK
Sbjct: 961  DVVTPSIDRSNGQTMSLEADVIESNSHLLSNGSSTASIRSSGHNRQGNSDSTTRNGNKVK 1020

Query: 1631 ESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYD 1690
            ESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYD
Sbjct: 1021 ESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYD 1080

Query: 1691 QYNVRMIDKST 1702
            QYNVR IDKS+
Sbjct: 1081 QYNVRTIDKSS 1091

BLAST of Sgr023231 vs. ExPASy TrEMBL
Match: A0A1S3BNH2 (LOW QUALITY PROTEIN: uncharacterized protein LOC103491479 OS=Cucumis melo OX=3656 GN=LOC103491479 PE=4 SV=1)

HSP 1 Score: 2090.8 bits (5416), Expect = 0.0e+00
Identity = 1051/1094 (96.07%), Positives = 1072/1094 (97.99%), Query Frame = 0

Query: 608  MSRMDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 667
            MSRMDRM SDLNRNG VERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF
Sbjct: 1    MSRMDRMTSDLNRNGPVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 60

Query: 668  SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVW 727
            SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN+RSLDLICKDKDEAEVW
Sbjct: 61   SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNDRSLDLICKDKDEAEVW 120

Query: 728  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 787
            FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL
Sbjct: 121  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL 180

Query: 788  HSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDA 847
             SPY SPPKNGMDKALSDVILY VPPKGFFPSDSAS+SVNSLSSGSS+MHGPMKAM IDA
Sbjct: 181  QSPYGSPPKNGMDKALSDVILYTVPPKGFFPSDSASMSVNSLSSGSSEMHGPMKAMAIDA 240

Query: 848  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPK 907
            FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSH+VGSC S+KMDSLLPK
Sbjct: 241  FRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHKVGSCFSLKMDSLLPK 300

Query: 908  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNT 967
            ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKL+DALGNT
Sbjct: 301  ALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLVDALGNT 360

Query: 968  NIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 1027
            NIELVSCGEYHT AVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI
Sbjct: 361  NIELVSCGEYHTCAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSI 420

Query: 1028 SCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTA 1087
            +CGPWHTAVVTSAGQLFTFGDGTFGVLGHGDR+SV+MPREVESLKGLRTVRAACGVWHTA
Sbjct: 421  ACGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRSSVTMPREVESLKGLRTVRAACGVWHTA 480

Query: 1088 AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGH 1147
            AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALV+PNFCRVSCGH
Sbjct: 481  AVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVDPNFCRVSCGH 540

Query: 1148 SLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTS 1207
            S+TVALTTSG VYTMGSPVYGQLGNPHADGKVPVRVEGKLSKS VEEIACGAYHVAVLTS
Sbjct: 541  SMTVALTTSGQVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSLVEEIACGAYHVAVLTS 600

Query: 1208 RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVD 1267
            RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSG D
Sbjct: 601  RTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGFD 660

Query: 1268 QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 1327
            QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR
Sbjct: 661  QSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLR 720

Query: 1328 KALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQSSK 1387
            KALETDASSQSSVSRRRSINQG+T+FVEKDEKPESVKSRAQLARFSSMESVKQ E+Q S 
Sbjct: 721  KALETDASSQSSVSRRRSINQGSTDFVEKDEKPESVKSRAQLARFSSMESVKQGESQFS- 780

Query: 1388 KNKKFECNSSRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 1447
            KNKKFECNSSRVSPVPNGGSQWG ISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR
Sbjct: 781  KNKKFECNSSRVSPVPNGGSQWGAISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 840

Query: 1448 ASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELER 1507
            ASPPRSTTPTPTLGGLTSPKIAVDD KRTNDSLSQEV+KLKAQVENLTRKAQLQEVE+ER
Sbjct: 841  ASPPRSTTPTPTLGGLTSPKIAVDDAKRTNDSLSQEVVKLKAQVENLTRKAQLQEVEMER 900

Query: 1508 TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLASLGSSP 1567
            TTKQLKEAL+FAA EATKCNAAKEVI SLTAQLKEMAERLPVGAARNIKSP+LASLGSSP
Sbjct: 901  TTKQLKEALAFAAAEATKCNAAKEVIMSLTAQLKEMAERLPVGAARNIKSPTLASLGSSP 960

Query: 1568 LFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSDSTTKNGN 1627
             FNDVVTPSIDRSNGQTMSLEAD+IESNSHLLSNGS TAS RSSGHNRQGNSDSTT+NGN
Sbjct: 961  PFNDVVTPSIDRSNGQTMSLEADVIESNSHLLSNGSGTASIRSSGHNRQGNSDSTTRNGN 1020

Query: 1628 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1687
            KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR
Sbjct: 1021 KVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRAR 1080

Query: 1688 VYDQYNVRMIDKST 1702
            VYDQYNVR IDKS+
Sbjct: 1081 VYDQYNVRTIDKSS 1093

BLAST of Sgr023231 vs. TAIR 10
Match: AT5G19420.1 (Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain )

HSP 1 Score: 1686.4 bits (4366), Expect = 0.0e+00
Identity = 862/1103 (78.15%), Positives = 971/1103 (88.03%), Query Frame = 0

Query: 608  MSRMDRMA-SDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIW 667
            MSR  RM  SDL+R G V RDIEQ+I ALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIW
Sbjct: 1    MSRNGRMTPSDLSRAGPVTRDIEQAITALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIW 60

Query: 668  FSGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEV 727
            FSGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIY+ERSLDLICKDKDEAEV
Sbjct: 61   FSGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYDERSLDLICKDKDEAEV 120

Query: 728  WFNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGD-- 787
            WF+GLK LISR H RKWRTESRSDG  SEANSPRTYTRRSSPL+SPF SN+S QK+G   
Sbjct: 121  WFSGLKALISRCHQRKWRTESRSDGTPSEANSPRTYTRRSSPLHSPFSSNESFQKEGSNH 180

Query: 788  FRLHSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSD-MHGPMKAM 847
             RLHSPYESPPKNG+DKA SD+ LYAVPPKGFFP  SA++SV+SLSSG SD +HG MK M
Sbjct: 181  LRLHSPYESPPKNGVDKAFSDMSLYAVPPKGFFPPGSATMSVHSLSSGGSDTLHGHMKGM 240

Query: 848  GIDAFRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDS 907
            G+DAFRVSLSSA+SSSS GSGHDDGD LGDVF+WGEG G+GVLGGG+HRVGS L IKMDS
Sbjct: 241  GMDAFRVSLSSAISSSSHGSGHDDGDTLGDVFMWGEGIGEGVLGGGNHRVGSSLEIKMDS 300

Query: 908  LLPKALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDA 967
            LLPKALES +VLDVQNIACGG+HA LVTKQGE F+WGEES GRLGHGVDS+V  PKLIDA
Sbjct: 301  LLPKALESTIVLDVQNIACGGQHAVLVTKQGESFSWGEESEGRLGHGVDSNVQHPKLIDA 360

Query: 968  LGNTNIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIH 1027
            L  TNIELV+CGEYH+ AVTLSGDLYTWG G  +FG+LGHGNEVSHW+PK++N  +EGIH
Sbjct: 361  LNTTNIELVACGEYHSCAVTLSGDLYTWGKG--DFGILGHGNEVSHWVPKRVNFLMEGIH 420

Query: 1028 VSSISCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGV 1087
            VSSI+CGP+HTAVVTSAGQLFTFGDGTFGVLGHGDR SV +PREV+SLKGLRTVRAACGV
Sbjct: 421  VSSIACGPYHTAVVTSAGQLFTFGDGTFGVLGHGDRKSVFIPREVDSLKGLRTVRAACGV 480

Query: 1088 WHTAAVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRV 1147
            WHTAAVVEVMVGSSSSSNCSSGKLFTWGDGDK RLGHGDKE KLVPTCVAALVEPNFC+V
Sbjct: 481  WHTAAVVEVMVGSSSSSNCSSGKLFTWGDGDKSRLGHGDKEPKLVPTCVAALVEPNFCQV 540

Query: 1148 SCGHSLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVA 1207
            +CGHSLTVALTTSGHVYTMGSPVYGQLGNPHADGKVP RV+GKL KSFVEEIACGAYHVA
Sbjct: 541  ACGHSLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPTRVDGKLHKSFVEEIACGAYHVA 600

Query: 1208 VLTSRTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWV 1267
            VLTSRTEVYTWGKG+NGRLGHGD DDRNSPTLVE+LKDKQVKSIACG+NFTAA+CLHKW 
Sbjct: 601  VLTSRTEVYTWGKGSNGRLGHGDADDRNSPTLVESLKDKQVKSIACGSNFTAAVCLHKWA 660

Query: 1268 SGVDQSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCY 1327
            SG+DQSMCSGC  PFNFKRKRHNCYNCGLVFCHSCS+KK  KA MAPNPNKPYRVCD C+
Sbjct: 661  SGMDQSMCSGCRQPFNFKRKRHNCYNCGLVFCHSCSNKKSLKACMAPNPNKPYRVCDRCF 720

Query: 1328 NKLRKALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVEN 1387
            NKL+KA+ETD SS SS+SRR S+NQG ++ +++DEK ++ +S  QLARFS +E ++QV++
Sbjct: 721  NKLKKAMETDPSSHSSLSRRESVNQG-SDAIDRDEKLDT-RSDGQLARFSLLEPMRQVDS 780

Query: 1388 QSSKKNKKFECNSSRVSPVPNGGSQWGV--ISKSFNPVFGSSKKFFSASVPGSRIVSRAT 1447
            + SKKNKK+E NSSRVSP+P+GGS  G   I+KSFNP FGSSKKFFSASVPGSRI SRAT
Sbjct: 781  R-SKKNKKYEFNSSRVSPIPSGGSHRGSLNITKSFNPTFGSSKKFFSASVPGSRIASRAT 840

Query: 1448 SPISRRASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQ 1507
            SPISRR SPPRSTTPTPTL GLT+PKI VDD KR+ND+LSQEV+ L++QVENLTRKAQLQ
Sbjct: 841  SPISRRPSPPRSTTPTPTLSGLTTPKIVVDDTKRSNDNLSQEVVMLRSQVENLTRKAQLQ 900

Query: 1508 EVELERTTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKSPSLA 1567
            EVELERTTKQLKEAL+ A+ E+ +C AAKEVIKSLTAQLK+MAERLPVG+AR +KSPSL 
Sbjct: 901  EVELERTTKQLKEALAIASEESARCKAAKEVIKSLTAQLKDMAERLPVGSARTVKSPSLN 960

Query: 1568 SLGSSPLFNDVVTPSIDRSNGQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGN--S 1627
            S GSSP +    + +++R N +    ++D + +   + SNG+ST    S  + +Q N  +
Sbjct: 961  SFGSSPDYAAPSSNTLNRPNSR--ETDSDSL-TTVPMFSNGTSTPVFDSGSYRQQANHAA 1020

Query: 1628 DSTTKNGNKVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQ 1687
            ++  +   + KES+ R++ EWVEQDEPGVYIT T+L GGA+DLKRVRFSRKRF+EKQAE+
Sbjct: 1021 EAINRISTRSKESEPRNENEWVEQDEPGVYITLTALAGGARDLKRVRFSRKRFSEKQAEE 1080

Query: 1688 WWAENRARVYDQYNVR-MIDKST 1702
            WWAENR RVY+QYNVR ++DKS+
Sbjct: 1081 WWAENRGRVYEQYNVRIVVDKSS 1095

BLAST of Sgr023231 vs. TAIR 10
Match: AT5G19420.2 (Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain )

HSP 1 Score: 1668.7 bits (4320), Expect = 0.0e+00
Identity = 860/1129 (76.17%), Positives = 970/1129 (85.92%), Query Frame = 0

Query: 609  SRMDRMA-SDLNRNGSVERDIEQ---------------------------SIIALKKGAY 668
            +R  RM  SDL+R G V RDIEQ                           +I ALKKGAY
Sbjct: 9    TRNGRMTPSDLSRAGPVTRDIEQLKIELYSTFGVSKLDSSYILENKNALHAITALKKGAY 68

Query: 669  LLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEK 728
            LLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEK
Sbjct: 69   LLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEK 128

Query: 729  EYQSFSLIYNERSLDLICKDKDEAEVWFNGLKTLISRSHHRKWRTESRSDGMQSEANSPR 788
            EYQSFSLIY+ERSLDLICKDKDEAEVWF+GLK LISR H RKWRTESRSDG  SEANSPR
Sbjct: 129  EYQSFSLIYDERSLDLICKDKDEAEVWFSGLKALISRCHQRKWRTESRSDGTPSEANSPR 188

Query: 789  TYTRRSSPLNSPFGSNDSLQKDGD--FRLHSPYESPPKNGMDKALSDVILYAVPPKGFFP 848
            TYTRRSSPL+SPF SN+S QK+G    RLHSPYESPPKNG+DKA SD+ LYAVPPKGFFP
Sbjct: 189  TYTRRSSPLHSPFSSNESFQKEGSNHLRLHSPYESPPKNGVDKAFSDMSLYAVPPKGFFP 248

Query: 849  SDSASISVNSLSSGSSD-MHGPMKAMGIDAFRVSLSSAVSSSSQGSGHDDGDALGDVFIW 908
              SA++SV+SLSSG SD +HG MK MG+DAFRVSLSSA+SSSS GSGHDDGD LGDVF+W
Sbjct: 249  PGSATMSVHSLSSGGSDTLHGHMKGMGMDAFRVSLSSAISSSSHGSGHDDGDTLGDVFMW 308

Query: 909  GEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALESAVVLDVQNIACGGRHAALVTKQGEIF 968
            GEG G+GVLGGG+HRVGS L IKMDSLLPKALES +VLDVQNIACGG+HA LVTKQGE F
Sbjct: 309  GEGIGEGVLGGGNHRVGSSLEIKMDSLLPKALESTIVLDVQNIACGGQHAVLVTKQGESF 368

Query: 969  TWGEESGGRLGHGVDSDVLQPKLIDALGNTNIELVSCGEYHTSAVTLSGDLYTWGDGTYN 1028
            +WGEES GRLGHGVDS+V  PKLIDAL  TNIELV+CGEYH+ AVTLSGDLYTWG G  +
Sbjct: 369  SWGEESEGRLGHGVDSNVQHPKLIDALNTTNIELVACGEYHSCAVTLSGDLYTWGKG--D 428

Query: 1029 FGLLGHGNEVSHWIPKKINGPLEGIHVSSISCGPWHTAVVTSAGQLFTFGDGTFGVLGHG 1088
            FG+LGHGNEVSHW+PK++N  +EGIHVSSI+CGP+HTAVVTSAGQLFTFGDGTFGVLGHG
Sbjct: 429  FGILGHGNEVSHWVPKRVNFLMEGIHVSSIACGPYHTAVVTSAGQLFTFGDGTFGVLGHG 488

Query: 1089 DRNSVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVGSSSSSNCSSGKLFTWGDGDKGR 1148
            DR SV +PREV+SLKGLRTVRAACGVWHTAAVVEVMVGSSSSSNCSSGKLFTWGDGDK R
Sbjct: 489  DRKSVFIPREVDSLKGLRTVRAACGVWHTAAVVEVMVGSSSSSNCSSGKLFTWGDGDKSR 548

Query: 1149 LGHGDKETKLVPTCVAALVEPNFCRVSCGHSLTVALTTSGHVYTMGSPVYGQLGNPHADG 1208
            LGHGDKE KLVPTCVAALVEPNFC+V+CGHSLTVALTTSGHVYTMGSPVYGQLGNPHADG
Sbjct: 549  LGHGDKEPKLVPTCVAALVEPNFCQVACGHSLTVALTTSGHVYTMGSPVYGQLGNPHADG 608

Query: 1209 KVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTEVYTWGKGANGRLGHGDTDDRNSPTLVE 1268
            KVP RV+GKL KSFVEEIACGAYHVAVLTSRTEVYTWGKG+NGRLGHGD DDRNSPTLVE
Sbjct: 609  KVPTRVDGKLHKSFVEEIACGAYHVAVLTSRTEVYTWGKGSNGRLGHGDADDRNSPTLVE 668

Query: 1269 ALKDKQVKSIACGTNFTAAICLHKWVSGVDQSMCSGCHLPFNFKRKRHNCYNCGLVFCHS 1328
            +LKDKQVKSIACG+NFTAA+CLHKW SG+DQSMCSGC  PFNFKRKRHNCYNCGLVFCHS
Sbjct: 669  SLKDKQVKSIACGSNFTAAVCLHKWASGMDQSMCSGCRQPFNFKRKRHNCYNCGLVFCHS 728

Query: 1329 CSSKKCHKASMAPNPNKPYRVCDNCYNKLRKALETDASSQSSVSRRRSINQGTTEFVEKD 1388
            CS+KK  KA MAPNPNKPYRVCD C+NKL+KA+ETD SS SS+SRR S+NQG ++ +++D
Sbjct: 729  CSNKKSLKACMAPNPNKPYRVCDRCFNKLKKAMETDPSSHSSLSRRESVNQG-SDAIDRD 788

Query: 1389 EKPESVKSRAQLARFSSMESVKQVENQSSKKNKKFECNSSRVSPVPNGGSQWGV--ISKS 1448
            EK ++ +S  QLARFS +E ++QV+++ SKKNKK+E NSSRVSP+P+GGS  G   I+KS
Sbjct: 789  EKLDT-RSDGQLARFSLLEPMRQVDSR-SKKNKKYEFNSSRVSPIPSGGSHRGSLNITKS 848

Query: 1449 FNPVFGSSKKFFSASVPGSRIVSRATSPISRRASPPRSTTPTPTLGGLTSPKIAVDDGKR 1508
            FNP FGSSKKFFSASVPGSRI SRATSPISRR SPPRSTTPTPTL GLT+PKI VDD KR
Sbjct: 849  FNPTFGSSKKFFSASVPGSRIASRATSPISRRPSPPRSTTPTPTLSGLTTPKIVVDDTKR 908

Query: 1509 TNDSLSQEVIKLKAQVENLTRKAQLQEVELERTTKQLKEALSFAAGEATKCNAAKEVIKS 1568
            +ND+LSQEV+ L++QVENLTRKAQLQEVELERTTKQLKEAL+ A+ E+ +C AAKEVIKS
Sbjct: 909  SNDNLSQEVVMLRSQVENLTRKAQLQEVELERTTKQLKEALAIASEESARCKAAKEVIKS 968

Query: 1569 LTAQLKEMAERLPVGAARNIKSPSLASLGSSPLFNDVVTPSIDRSNGQTMSLEADIIESN 1628
            LTAQLK+MAERLPVG+AR +KSPSL S GSSP +    + +++R N +    ++D + + 
Sbjct: 969  LTAQLKDMAERLPVGSARTVKSPSLNSFGSSPDYAAPSSNTLNRPNSR--ETDSDSL-TT 1028

Query: 1629 SHLLSNGSSTASNRSSGHNRQGN--SDSTTKNGNKVKESDSRHDAEWVEQDEPGVYITFT 1688
              + SNG+ST    S  + +Q N  +++  +   + KES+ R++ EWVEQDEPGVYIT T
Sbjct: 1029 VPMFSNGTSTPVFDSGSYRQQANHAAEAINRISTRSKESEPRNENEWVEQDEPGVYITLT 1088

Query: 1689 SLQGGAKDLKRVRFSRKRFTEKQAEQWWAENRARVYDQYNVR-MIDKST 1702
            +L GGA+DLKRVRFSRKRF+EKQAE+WWAENR RVY+QYNVR ++DKS+
Sbjct: 1089 ALAGGARDLKRVRFSRKRFSEKQAEEWWAENRGRVYEQYNVRIVVDKSS 1129

BLAST of Sgr023231 vs. TAIR 10
Match: AT5G12350.1 (Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain )

HSP 1 Score: 1659.4 bits (4296), Expect = 0.0e+00
Identity = 854/1102 (77.50%), Positives = 955/1102 (86.66%), Query Frame = 0

Query: 608  MSRMDRMASDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWF 667
            MSR  RMASDL+R G VERDIEQ+IIALKKGAYLLKYGRRGKPKFCPFRLSNDE+VLIWF
Sbjct: 1    MSRNGRMASDLSRAGPVERDIEQAIIALKKGAYLLKYGRRGKPKFCPFRLSNDETVLIWF 60

Query: 668  SGKEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYNERSLDLICKDKDEAEVW 727
            SG EEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIY+ERSLD+ICKDKDEAEVW
Sbjct: 61   SGNEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYSERSLDVICKDKDEAEVW 120

Query: 728  FNGLKTLISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGD--F 787
            F GLK LIS  H R  RTESRSDG  SEANSPRTYTRRSSPL+SPF SNDSLQKDG    
Sbjct: 121  FTGLKALISHCHQRNRRTESRSDGTPSEANSPRTYTRRSSPLHSPFSSNDSLQKDGSNHL 180

Query: 788  RLHSPYESPPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGI 847
            R+HSP+ESPPKNG+DKA SD+ LYAVPPKGF+PSDSA+ISV+  S GS  MHG M+ MG+
Sbjct: 181  RIHSPFESPPKNGLDKAFSDMALYAVPPKGFYPSDSATISVH--SGGSDSMHGHMRGMGM 240

Query: 848  DAFRVSLSSAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLL 907
            DAFRVS+SSAVSSSS GSGHDDGDALGDVFIWGEG G+GVLGGG+ RVGS   IKMDSLL
Sbjct: 241  DAFRVSMSSAVSSSSHGSGHDDGDALGDVFIWGEGIGEGVLGGGNRRVGSSFDIKMDSLL 300

Query: 908  PKALESAVVLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALG 967
            PKALES +VLDVQNIACGG+HA LVTKQGE F+WGEES GRLGHGVDS++ QPKLIDAL 
Sbjct: 301  PKALESTIVLDVQNIACGGQHAVLVTKQGESFSWGEESEGRLGHGVDSNIQQPKLIDALN 360

Query: 968  NTNIELVSCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVS 1027
             TNIELV+CGE+H+ AVTLSGDLYTWG G  +FG+LGHGNEVSHW+PK++N  LEGIHVS
Sbjct: 361  TTNIELVACGEFHSCAVTLSGDLYTWGKG--DFGVLGHGNEVSHWVPKRVNFLLEGIHVS 420

Query: 1028 SISCGPWHTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWH 1087
            SI+CGP+HTAVVTSAGQLFTFGDGTFGVLGHGD+ SV +PREV+SLKGLRTVRAACGVWH
Sbjct: 421  SIACGPYHTAVVTSAGQLFTFGDGTFGVLGHGDKKSVFIPREVDSLKGLRTVRAACGVWH 480

Query: 1088 TAAVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSC 1147
            TAAVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHG+KE KLVPTCVAALVEPNFC+V+C
Sbjct: 481  TAAVVEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGNKEPKLVPTCVAALVEPNFCQVAC 540

Query: 1148 GHSLTVALTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVL 1207
            GHSLTVALTTSGHVYTMGSPVYGQLGN HADGK P RVEGKL KSFVEEIACGAYHVAVL
Sbjct: 541  GHSLTVALTTSGHVYTMGSPVYGQLGNSHADGKTPNRVEGKLHKSFVEEIACGAYHVAVL 600

Query: 1208 TSRTEVYTWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSG 1267
            TSRTEVYTWGKG+NGRLGHGD DDRNSPTLVE+LKDKQVKSIACGTNFTAA+C+H+W SG
Sbjct: 601  TSRTEVYTWGKGSNGRLGHGDVDDRNSPTLVESLKDKQVKSIACGTNFTAAVCIHRWASG 660

Query: 1268 VDQSMCSGCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNK 1327
            +DQSMCSGC  PF+FKRKRHNCYNCGLVFCHSC+SKK  KA MAPNPNKPYRVCD C+NK
Sbjct: 661  MDQSMCSGCRQPFSFKRKRHNCYNCGLVFCHSCTSKKSLKACMAPNPNKPYRVCDKCFNK 720

Query: 1328 LRKALETDASSQSSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARFSSMESVKQVENQS 1387
            L+K +ETD SS SS+SRR SINQG ++ ++KD+K +S +S  QLARFS MES++QV+++ 
Sbjct: 721  LKKTMETDPSSHSSLSRRGSINQG-SDPIDKDDKFDS-RSDGQLARFSLMESMRQVDSR- 780

Query: 1388 SKKNKKFECNSSRVSPVPNGGSQWGV--ISKSFNPVFGSSKKFFSASVPGSRIVSRATSP 1447
             KKNKK+E NSSRVSP+P+G SQ G   I+KSFNPVFG+SKKFFSASVPGSRIVSRATSP
Sbjct: 781  HKKNKKYEFNSSRVSPIPSGSSQRGALNIAKSFNPVFGASKKFFSASVPGSRIVSRATSP 840

Query: 1448 ISRRASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEV 1507
            ISRR SPPRSTTPTPTL GL +PK  VDD KRTND+LSQEV+KL++QVE+LTRKAQLQEV
Sbjct: 841  ISRRPSPPRSTTPTPTLSGLATPKFVVDDTKRTNDNLSQEVVKLRSQVESLTRKAQLQEV 900

Query: 1508 ELERTTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNIKS-PSLAS 1567
            ELERTTKQLKEAL+    E T+C AAKEVIKSLTAQLK+MAERLPVG+AR +KS PSL S
Sbjct: 901  ELERTTKQLKEALAITNEETTRCKAAKEVIKSLTAQLKDMAERLPVGSARTVKSPPSLNS 960

Query: 1568 LGSSPLFNDVVTPSIDRSN--GQTMSLEADIIESNSHLLSNGSSTASNRSSGHNRQGNSD 1627
             GSSP         ID  N   Q  S E++     + + SNG+ T +         GN +
Sbjct: 961  FGSSP-------GRIDPFNILNQANSQESEPNGITTPMFSNGTMTPA--------FGNGE 1020

Query: 1628 STTKNGNKVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQAEQW 1687
            +T         +++R++ EWVEQDEPGVYIT T+L GGA+DLKRVRFSRKRF+E QAEQW
Sbjct: 1021 AT---------NEARNEKEWVEQDEPGVYITLTALAGGARDLKRVRFSRKRFSEIQAEQW 1071

Query: 1688 WAENRARVYDQYNVRMIDKSTK 1703
            WA+NR RVY+QYNVRM+DK+++
Sbjct: 1081 WADNRGRVYEQYNVRMVDKASE 1071

BLAST of Sgr023231 vs. TAIR 10
Match: AT1G76950.1 (Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain )

HSP 1 Score: 1040.8 bits (2690), Expect = 1.7e-303
Identity = 579/1123 (51.56%), Positives = 756/1123 (67.32%), Query Frame = 0

Query: 616  SDLNRNGSVERDIEQSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHL 675
            +DL    + + ++EQ++I LKKG  LLKYGR+GKPKF PFRLS+DE  LIW S   EK L
Sbjct: 2    ADLVTYSNADHNLEQALITLKKGTQLLKYGRKGKPKFYPFRLSSDEKSLIWISSSGEKRL 61

Query: 676  KLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYN--ERSLDLICKDKDEAEVWFNGLKT 735
            KL+ VS+I+ GQRT +FQRY RPEK+Y SFSL+YN  ++SLDLICKDK EAE+W  GLKT
Sbjct: 62   KLASVSKIVPGQRTAVFQRYLRPEKDYLSFSLLYNGKKKSLDLICKDKVEAEIWIGGLKT 121

Query: 736  LISRSHHRKWRTESRSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRLHSPYES 795
            LIS     + + +  S G  S  ++ R  T  SSP +S   ++      G     +P+  
Sbjct: 122  LISTGQGGRSKIDGWSGGGLS-VDASRELT-SSSPSSSSASASRGHSSPG-----TPFNI 181

Query: 796  PPKNGMDKALSDVILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDAFRVSLS 855
             P      A  +     VPP     S+ + +++++ +  +       K  G D FRVS+S
Sbjct: 182  DPITSPKSAEPE-----VPPT---DSEKSHVALDNKNMQT-------KVSGSDGFRVSVS 241

Query: 856  SAVSSSSQGSGHDDGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALESAV 915
            SA SSSS GS  DD DALGDV+IWGE   D V+  G  +  S L+ + D L+PK LES +
Sbjct: 242  SAQSSSSHGSAADDSDALGDVYIWGEVICDNVVKVGIDKNASYLTTRTDVLVPKPLESNI 301

Query: 916  VLDVQNIACGGRHAALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDAL-GNTNIELV 975
            VLDV  IACG RHAA VT+QGEIFTWGEESGGRLGHG+  DV  P+L+++L   ++++ V
Sbjct: 302  VLDVHQIACGVRHAAFVTRQGEIFTWGEESGGRLGHGIGKDVFHPRLVESLTATSSVDFV 361

Query: 976  SCGEYHTSAVTLSGDLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSISCGPW 1035
            +CGE+HT AVTL+G+LYTWGDGT+N GLLGHG+++SHWIPK+I G LEG+HV+S+SCGPW
Sbjct: 362  ACGEFHTCAVTLAGELYTWGDGTHNVGLLGHGSDISHWIPKRIAGSLEGLHVASVSCGPW 421

Query: 1036 HTAVVTSAGQLFTFGDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTAAVVEV 1095
            HTA++TS G+LFTFGDGTFGVLGHGD+ +V  PREVESL GLRT+  +CGVWHTAAVVE+
Sbjct: 422  HTALITSYGRLFTFGDGTFGVLGHGDKETVQYPREVESLSGLRTIAVSCGVWHTAAVVEI 481

Query: 1096 MVGSSSSSNCSSGKLFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGHSLTVA 1155
            +V  S+SS+ SSGKLFTWGDGDK RLGHGDK+ +L PTCV AL++ NF +++CGHSLTV 
Sbjct: 482  IVTQSNSSSVSSGKLFTWGDGDKNRLGHGDKDPRLKPTCVPALIDYNFHKIACGHSLTVG 541

Query: 1156 LTTSGHVYTMGSPVYGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTEVY 1215
            LTTSG V+TMGS VYGQLGN   DGK+P  VE KL+  FVEEI+CGAYHVA LTSR EVY
Sbjct: 542  LTTSGQVFTMGSTVYGQLGNLQTDGKLPCLVEDKLASEFVEEISCGAYHVAALTSRNEVY 601

Query: 1216 TWGKGANGRLGHGDTDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVDQSMCS 1275
            TWGKGANGRLGHGD +DR  PT+VEALKD+ VK IACG+N+TAAICLHKWVSG +QS CS
Sbjct: 602  TWGKGANGRLGHGDLEDRKVPTIVEALKDRHVKYIACGSNYTAAICLHKWVSGAEQSQCS 661

Query: 1276 GCHLPFNFKRKRHNCYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKALET 1335
             C L F F RKRHNCYNCGLV CHSCSSKK  +A++AP+  + YRVCD+CY KL K  E 
Sbjct: 662  TCRLAFGFTRKRHNCYNCGLVHCHSCSSKKAFRAALAPSAGRLYRVCDSCYVKLSKVSEI 721

Query: 1336 DASSQ--SSVSRRRSINQGTTEFVEKDEKPESVKSRAQLARF--SSMESVKQVENQSSKK 1395
            + +++  S+V R    N+   +           KS  +LA+F  S+M+ +KQ++++++K+
Sbjct: 722  NDTNRRNSAVPRLSGENRDRLD-----------KSEIRLAKFGTSNMDLIKQLDSKAAKQ 781

Query: 1396 NKKFECNS-SRVSPVPNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRR 1455
             KK +  S  R S +P+       +  +   +  ++ K   A    S I SR+ SP SRR
Sbjct: 782  GKKTDTFSLGRNSQLPSLLQLKDAVQSNIGDMRRATPKLAQAP---SGISSRSVSPFSRR 841

Query: 1456 ASPPRSTTPTPTLGGLTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELER 1515
            +SPPRS TP P+  GL  P    D+ K+TN+ L+QE++KL+ QV++LT+K + QEVEL+ 
Sbjct: 842  SSPPRSATPMPSTSGLYFPVGIADNMKKTNEILNQEIVKLRTQVDSLTQKCEFQEVELQN 901

Query: 1516 TTKQLKEALSFAAGEATKCNAAKEVIKSLTAQLKEMAERLPVGAARNI------------ 1575
            + K+ +EAL+ A  E+ K  AAKE IKSL AQLK++AE+LP G +  +            
Sbjct: 902  SVKKTQEALALAEEESAKSRAAKEAIKSLIAQLKDVAEKLPPGESVKLACLQNGLDQNGF 961

Query: 1576 -----------KSPSLASLGSSPLFNDVVTPSIDRSNGQTMSLEADIIESNSHL------ 1635
                       +S S+ S  SS    D    +   SN Q+        E NS+       
Sbjct: 962  HFPEENGFHPSRSESMTSSISSVAPFDFAFANASWSNLQSPKQTPRASERNSNAYPADPR 1021

Query: 1636 LSNGSSTASNRSSGHNRQGNSDSTTKNGNKVKESDSRHDAEWVEQDEPGVYITFTSLQGG 1695
            LS+  S  S R      Q NSD+ +        + ++ +AEW+EQ EPGVYIT  +L  G
Sbjct: 1022 LSSSGSVISERIEPFQFQNNSDNGSSQTG--VNNTNQVEAEWIEQYEPGVYITLVALHDG 1081

Query: 1696 AKDLKRVRFSRKRFTEKQAEQWWAENRARVYDQYNVRMIDKST 1702
             +DL+RVRFSR+RF E QAE WW+ENR +VY++YNVR+ +KST
Sbjct: 1082 TRDLRRVRFSRRRFGEHQAETWWSENREKVYEKYNVRVSEKST 1086

BLAST of Sgr023231 vs. TAIR 10
Match: AT5G42140.1 (Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain )

HSP 1 Score: 1029.6 bits (2661), Expect = 3.8e-300
Identity = 584/1105 (52.85%), Positives = 753/1105 (68.14%), Query Frame = 0

Query: 630  QSIIALKKGAYLLKYGRRGKPKFCPFRLSNDESVLIWFSGKEEKHLKLSHVSRIISGQRT 689
            Q++IALKKGA LLKYGR+GKPKFCPFRLSNDE+ LIW S   EK LKL+ VS+I+ GQRT
Sbjct: 11   QALIALKKGAQLLKYGRKGKPKFCPFRLSNDETSLIWISNGGEKRLKLATVSKIVPGQRT 70

Query: 690  PIFQRYPRPEKEYQSFSLIYN--ERSLDLICKDKDEAEVWFNGLKTLISRSHHRKWRTES 749
             +FQRY RP+K+Y SFSLIY+  +R+LDLICKDK EAEVW  GLK LIS    R  + + 
Sbjct: 71   AVFQRYLRPDKDYLSFSLIYSNRKRTLDLICKDKVEAEVWIAGLKALISGQAGRS-KIDG 130

Query: 750  RSDGMQSEANSPRTYTRRSSPLNSPFGSNDSLQKDGDFRL-HSPYESPPKNGMDKALSDV 809
             SDG  S A+S      R   L+SP  +N S+    DF +  SPY S             
Sbjct: 131  WSDGGLSIADS------RDLTLSSP--TNSSVCASRDFNIADSPYNSTNF---------- 190

Query: 810  ILYAVPPKGFFPSDSASISVNSLSSGSSDMHGPMKAMGIDAFRVSLSSAVSSSSQGSGHD 869
                  P+     +S S   + ++S S +M   ++  G DAFRVS+SS  SSSS GS  D
Sbjct: 191  ------PRTSRTENSVSSERSHVASDSPNM--LVRGTGSDAFRVSVSSVQSSSSHGSAPD 250

Query: 870  DGDALGDVFIWGEGTGDGVLGGGSHRVGSCLSIKMDSLLPKALESAVVLDVQNIACGGRH 929
            D DALGDV+IWGE   + V   G+ +    L  + D L+PK LES VVLDV +IACG +H
Sbjct: 251  DCDALGDVYIWGEVLCENVTKFGADKNIGYLGSRSDVLIPKPLESNVVLDVHHIACGVKH 310

Query: 930  AALVTKQGEIFTWGEESGGRLGHGVDSDVLQPKLIDALGNTNIELVSCGEYHTSAVTLSG 989
            AALV++QGE+FTWGE SGGRLGHG+  DV  P+LI++L  T+I+ V+CGE+HT AVT++G
Sbjct: 311  AALVSRQGEVFTWGEASGGRLGHGMGKDVTGPQLIESLAATSIDFVACGEFHTCAVTMTG 370

Query: 990  DLYTWGDGTYNFGLLGHGNEVSHWIPKKINGPLEGIHVSSISCGPWHTAVVTSAGQLFTF 1049
            ++YTWGDGT+N GLLGHG +VSHWIPK+I+GPLEG+ ++S+SCGPWHTA++TS GQLFTF
Sbjct: 371  EIYTWGDGTHNAGLLGHGTDVSHWIPKRISGPLEGLQIASVSCGPWHTALITSTGQLFTF 430

Query: 1050 GDGTFGVLGHGDRNSVSMPREVESLKGLRTVRAACGVWHTAAVVEVMVGSSSSSNCSSGK 1109
            GDGTFGVLGHGD+ +V  PREVESL GLRT+  ACGVWH AA+VEV+V + SSS+ SSGK
Sbjct: 431  GDGTFGVLGHGDKETVFYPREVESLSGLRTIAVACGVWHAAAIVEVIV-THSSSSVSSGK 490

Query: 1110 LFTWGDGDKGRLGHGDKETKLVPTCVAALVEPNFCRVSCGHSLTVALTTSGHVYTMGSPV 1169
            LFTWGDGDK RLGHGDKE +L PTCV+AL++  F RV+CGHSLTV LTTSG VYTMGS V
Sbjct: 491  LFTWGDGDKSRLGHGDKEPRLKPTCVSALIDHTFHRVACGHSLTVGLTTSGKVYTMGSTV 550

Query: 1170 YGQLGNPHADGKVPVRVEGKLSKSFVEEIACGAYHVAVLTSRTEVYTWGKGANGRLGHGD 1229
            YGQLGNP+ADGK+P  VE KL+K  VEEIACGAYHVAVLTSR EV+TWGKGANGRLGHGD
Sbjct: 551  YGQLGNPNADGKLPCLVEDKLTKDCVEEIACGAYHVAVLTSRNEVFTWGKGANGRLGHGD 610

Query: 1230 TDDRNSPTLVEALKDKQVKSIACGTNFTAAICLHKWVSGVDQSMCSGCHLPFNFKRKRHN 1289
             +DR +PTLV+ALK++ VK+IACG+NFTAAICLHKWVSG +QS CS C   F F RKRHN
Sbjct: 611  VEDRKAPTLVDALKERHVKNIACGSNFTAAICLHKWVSGTEQSQCSACRQAFGFTRKRHN 670

Query: 1290 CYNCGLVFCHSCSSKKCHKASMAPNPNKPYRVCDNCYNKLRKALETDASSQSSVSRRRSI 1349
            CYNCGLV CHSCSSKK  KA++APNP KPYRVCD+C++KL K  E +  S+ +V  R S 
Sbjct: 671  CYNCGLVHCHSCSSKKSLKAALAPNPGKPYRVCDSCHSKLSKVSEANIDSRKNVMPRLS- 730

Query: 1350 NQGTTEFVEKDEKPESVKSRAQLARF---SSMESVKQVENQSSKKNKKFECNS-SRVSPV 1409
                      + K    K+  +LA+    S+++ +KQ++N+++++ KK +  S  R S  
Sbjct: 731  ---------GENKDRLDKTEIRLAKSGIPSNIDLIKQLDNRAARQGKKADTFSLVRTSQT 790

Query: 1410 PNGGSQWGVISKSFNPVFGSSKKFFSASVPGSRIVSRATSPISRRASPPRSTTPTPTLGG 1469
            P    +   ++   +   G  K    A  P S   SR  SP SRR+SPPRS TP P   G
Sbjct: 791  PLTQLK-DALTNVADLRRGPPK---PAVTPSS---SRPVSPFSRRSSPPRSVTPIPLNVG 850

Query: 1470 LTSPKIAVDDGKRTNDSLSQEVIKLKAQVENLTRKAQLQEVELERTTKQLKEALSFAAGE 1529
            L       +  K+TN+ L+QEV++L+AQ E+L  + ++QE E++++ K+++EA+S AA E
Sbjct: 851  LGFSTSIAESLKKTNELLNQEVVRLRAQAESLRHRCEVQEFEVQKSVKKVQEAMSLAAEE 910

Query: 1530 ATKCNAAKEVIKSLTAQLKEMAERLPVGA--ARNIKSPSLA------------------- 1589
            + K  AAKEVIKSLTAQ+K++A  LP GA  A   ++ +L                    
Sbjct: 911  SAKSEAAKEVIKSLTAQVKDIAALLPPGAYEAETTRTANLLNGFEQNGFHFTNANGQRQS 970

Query: 1590 ---SLGSSPLFNDVVTPSIDRSNGQTMSLEA--DIIESNSHLLSNGSSTASNRSSGHNRQ 1649
               S+  + L + +  P+    NG   + ++  +   S   LLS G   ++  S      
Sbjct: 971  RSDSMSDTSLASPLAMPA-RSMNGLWRNSQSPRNTDASMGELLSEGVRISNGFSEDGRNS 1030

Query: 1650 GNSDSTTKNGNKVKESDSRHDAEWVEQDEPGVYITFTSLQGGAKDLKRVRFSRKRFTEKQ 1702
             +S ++  N ++V       +AEW+EQ EPGVYIT  +L  G +DLKRVRFSR+RF E+Q
Sbjct: 1031 RSSAASASNASQV-------EAEWIEQYEPGVYITLLALGDGTRDLKRVRFSRRRFREQQ 1062

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
EXC12413.1 | 0.0e+00 | 67.53 | E3 ubiquitin-protein ligase HERC2 [Morus notabilis]
XP_022153015.1 | 0.0e+00 | 97.53 | uncharacterized protein LOC111020619 [Momordica charantia]
XP_038900986.1 | 0.0e+00 | 96.71 | PH, RCC1 and FYVE domains-containing protein 1 [Benincasa hispida]
XP_011657620.1 | 0.0e+00 | 96.44 | PH, RCC1 and FYVE domains-containing protein 1 [Cucumis sativus] >KGN48105.1 hyp...
KAA0061774.1 | 0.0e+00 | 96.33 | Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain ...
Match Name | E-value | Identity | Description
Q947D2 | 2.3e-302 | 51.56 | PH, RCC1 and FYVE domains-containing protein 1 OS=Arabidopsis thaliana OX=3702 G...
O04659 | 5.0e-212 | 55.85 | Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX...
Q3E6Q1 | 2.8e-122 | 35.06 | Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q0WN60 | 8.7e-116 | 28.89 | Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX...
Q9SS60 | 1.9e-115 | 34.84 | Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX...
Match Name | E-value | Identity | Description
W9RV99 | 0.0e+00 | 67.53 | E3 ubiquitin-protein ligase HERC2 OS=Morus notabilis OX=981085 GN=L484_001795 PE...
A0A6J1DFK1 | 0.0e+00 | 97.53 | uncharacterized protein LOC111020619 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A0A0KEK0 | 0.0e+00 | 96.44 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G432780 PE=4 SV=1
A0A5A7V3C5 | 0.0e+00 | 96.33 | Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain ...
A0A1S3BNH2 | 0.0e+00 | 96.07 | LOW QUALITY PROTEIN: uncharacterized protein LOC103491479 OS=Cucumis melo OX=365...
Match Name | E-value | Identity | Description
AT5G19420.1 | 0.0e+00 | 78.15 | Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain
AT5G19420.2 | 0.0e+00 | 76.17 | Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain
AT5G12350.1 | 0.0e+00 | 77.50 | Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain
AT1G76950.1 | 1.7e-303 | 51.56 | Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain
AT5G42140.1 | 3.8e-300 | 52.85 | Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 1526..1546
None | No IPR available | COILS | Coil | Coil | coord: 1477..1511
None | No IPR available | PFAM | PF16627 | BRX_assoc | coord: 1562..1631
e-value: 3.4E-17
score: 62.4
None | No IPR available | GENE3D | 1.10.10.60 | | coord: 2..83
e-value: 2.2E-18
score: 67.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1367..1382
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 299..373
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1598..1625
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1438..1475
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 463..489
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1333..1400
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1598..1640
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1333..1348
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..20
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1626..1640
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 750..778
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 140..161
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1349..1365
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 742..798
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1438..1462
None | No IPR available | PANTHER | PTHR22870:SF397 | INACTIVE PHOSPHOLIPASE C-LIKE PROTEIN 2 | coord: 628..1697
None | No IPR available | PANTHER | PTHR22870 | REGULATOR OF CHROMOSOME CONDENSATION | coord: 628..1697
None | No IPR available | CDD | cd00065 | FYVE_like_SF | coord: 1270..1323
e-value: 1.311E-16
score: 73.7204
None | No IPR available | CDD | cd13365 | PH_PLC_plant-like | coord: 626..736
e-value: 7.54949E-51
score: 173.624
None | No IPR available | SUPERFAMILY | 50729 | PH domain-like | coord: 628..735
IPR000408 | Regulator of chromosome condensation, RCC1 | PRINTS | PR00633 | RCCNDNSATION | coord: 936..952
score: 28.43
coord: 986..1002
score: 36.27
coord: 1043..1057
score: 41.11
coord: 1146..1164
score: 31.58
coord: 1204..1225
score: 37.12
coord: 917..930
score: 30.95
coord: 1027..1043
score: 28.43
IPR000408Regulator of chromosome condensation, RCC1PFAMPF00415RCC1coord: 1210..1257
e-value: 8.2E-14
score: 51.9
coord: 985..1037
e-value: 9.1E-10
score: 39.0
coord: 1104..1153
e-value: 2.3E-12
score: 47.3
coord: 934..982
e-value: 1.0E-8
score: 35.7
coord: 1041..1089
e-value: 3.8E-11
score: 43.4
coord: 1156..1205
e-value: 5.4E-8
score: 33.3
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS00626RCC1_2coord: 1079..1089
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS00626RCC1_2coord: 1027..1037
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS00626RCC1_2coord: 920..930
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS50012RCC1_3coord: 934..985
score: 14.6045
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS50012RCC1_3coord: 872..933
score: 11.870899
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS50012RCC1_3coord: 1105..1156
score: 15.026599
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS50012RCC1_3coord: 1041..1092
score: 12.614599
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS50012RCC1_3coord: 986..1040
score: 13.941199
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS50012RCC1_3coord: 1209..1260
score: 14.6648
IPR000408Regulator of chromosome condensation, RCC1PROSITEPS50012RCC1_3coord: 1157..1208
score: 13.1774
IPR001356Homeobox domainSMARTSM00389HOX_1coord: 12..74
e-value: 6.6E-17
score: 72.2
IPR001356Homeobox domainPFAMPF00046Homeodomaincoord: 14..69
e-value: 2.5E-16
score: 59.3
IPR001356Homeobox domainPROSITEPS50071HOMEOBOX_2coord: 10..70
score: 17.361786
IPR001356Homeobox domainCDDcd00086homeodomaincoord: 14..69
e-value: 2.21246E-18
score: 78.8244
IPR000306FYVE zinc fingerSMARTSM00064fyve_4coord: 1259..1328
e-value: 2.2E-16
score: 70.5
IPR000306FYVE zinc fingerPFAMPF01363FYVEcoord: 1261..1327
e-value: 6.7E-13
score: 48.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 2118..2165
e-value: 5.8E-12
score: 45.6
coord: 1815..1861
e-value: 6.6E-15
score: 55.1
coord: 1915..1961
e-value: 2.3E-7
score: 30.9
coord: 2018..2063
e-value: 1.4E-10
score: 41.2
coord: 1713..1760
e-value: 2.0E-8
score: 34.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 2259..2288
e-value: 0.46
score: 10.9
coord: 1789..1814
e-value: 0.047
score: 14.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 1716..1750
e-value: 3.3E-5
score: 21.8
coord: 2120..2153
e-value: 2.5E-8
score: 31.6
coord: 2155..2188
e-value: 8.8E-4
score: 17.3
coord: 1818..1850
e-value: 1.4E-8
score: 32.4
coord: 2019..2053
e-value: 8.5E-6
score: 23.6
coord: 1918..1951
e-value: 1.9E-4
score: 19.4
coord: 2093..2119
e-value: 0.0019
score: 16.2
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 1713..1747
score: 9.985802
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 2118..2152
score: 12.528824
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 2087..2117
score: 9.196589
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 1850..1884
score: 8.659485
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 2153..2188
score: 9.284279
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 1916..1950
score: 11.355965
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 1885..1915
score: 8.506026
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 2017..2051
score: 10.654441
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 1815..1849
score: 12.090371
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 2256..2290
score: 9.185627
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 1912..1968
e-value: 2.4E-5
score: 25.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 1714..1880
e-value: 1.9E-36
score: 128.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 2070..2183
e-value: 2.1E-31
score: 111.5
coord: 2184..2309
e-value: 6.8E-7
score: 31.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 1977..2069
e-value: 6.3E-17
score: 63.5
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 1792..2283
IPR009091Regulator of chromosome condensation 1/beta-lactamase-inhibitor protein IIGENE3D2.130.10.30coord: 1102..1272
e-value: 6.4E-50
score: 172.0
IPR009091Regulator of chromosome condensation 1/beta-lactamase-inhibitor protein IIGENE3D2.130.10.30coord: 830..995
e-value: 1.5E-28
score: 101.9
coord: 996..1094
e-value: 5.4E-27
score: 96.7
IPR009091Regulator of chromosome condensation 1/beta-lactamase-inhibitor protein IISUPERFAMILY50985RCC1/BLIP-IIcoord: 871..1262
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3D3.30.40.10Zinc/RING finger domain, C3HC4 (zinc finger)coord: 1273..1333
e-value: 1.6E-13
score: 52.8
IPR027988Transcription factor BREVIS RADIX, N-terminal domainPFAMPF13713BRX_Ncoord: 1522..1555
e-value: 5.7E-16
score: 58.0
IPR001849Pleckstrin homology domainPFAMPF16457PH_12coord: 629..735
e-value: 2.4E-7
score: 31.3
IPR013591Brevis radix (BRX) domainPFAMPF08381BRXcoord: 1638..1692
e-value: 2.5E-28
score: 97.2
IPR013591Brevis radix (BRX) domainPROSITEPS51514BRXcoord: 1638..1693
score: 33.531193
IPR011993PH-like domain superfamilyGENE3D2.30.29.30coord: 627..744
e-value: 5.6E-35
score: 121.9
IPR017455Zinc finger, FYVE-relatedPROSITEPS50178ZF_FYVEcoord: 1265..1327
score: 11.656281
IPR011011Zinc finger, FYVE/PHD-typeSUPERFAMILY57903FYVE/PHD zinc fingercoord: 1261..1332
IPR009057Homeobox-like domain superfamilySUPERFAMILY46689Homeodomain-likecoord: 12..73
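
The alignment coordinates in the InterPro table are positions on the predicted protein, written as `coord: start..end`. As a minimal sketch of how these spans can be read and compared, the Python below parses a few coordinate strings copied from the table and reports span lengths and overlaps. The helper names (`parse_coord`, `span_length`, `overlaps`) are hypothetical and not part of the database, and the code assumes the coordinates are 1-based and inclusive.

```python
import re

# Hypothetical helpers for illustration; they are not provided by the database.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: start..end' string."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinate found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length in residues, assuming 1-based inclusive coordinates."""
    return end - start + 1


def overlaps(a, b):
    """True if two (start, end) spans share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# Coordinate strings copied from the InterPro table above.
homeodomain = parse_coord("coord: 14..69")            # PFAM PF00046
fyve = parse_coord("coord: 1261..1327")               # PFAM PF01363
rcc1_superfamily = parse_coord("coord: 871..1262")    # SUPERFAMILY 50985

print("Homeodomain length:", span_length(*homeodomain))
print("FYVE length:", span_length(*fyve))
print("FYVE overlaps RCC1/BLIP-II:", overlaps(fyve, rcc1_superfamily))
print("Homeodomain overlaps FYVE:", overlaps(homeodomain, fyve))
```

Sorting the parsed spans by start position is one way to read off the order of the annotated domains along the protein.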

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr023231.1 | Sgr023231.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0046872 | metal ion binding
molecular_function | GO:0005515 | protein binding