Sgr022873 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr022873
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: DNA binding protein, putative isoform 1
Location: tig00000589: 2679357 .. 2692728 (-)
RNA-Seq Expression: Sgr022873
Synteny: Sgr022873
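
The Location field gives a contig name, a start and end coordinate, and a strand flag. The sketch below shows how such a string could be parsed and how a minus-strand feature could be pulled from a contig sequence. It assumes the coordinates are 1-based and inclusive and that a minus-strand feature is reported as the reverse complement of the contig interval; neither is stated on this page. The helper names and the toy contig are illustrative only.

```python
import re

def parse_location(text):
    """Parse 'contig: start .. end (strand)' into (contig, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

def reverse_complement(seq):
    """Complement each base, then reverse the string."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]

def extract_feature(contig_seq, start, end, strand):
    """Slice a 1-based inclusive interval (assumed convention) and orient it."""
    sub = contig_seq[start - 1:end]
    return reverse_complement(sub) if strand == "-" else sub

contig, start, end, strand = parse_location("tig00000589: 2679357 .. 2692728 (-)")
span = end - start + 1  # interval length under the 1-based inclusive assumption

# Toy contig (hypothetical), not the real tig00000589 sequence.
toy = extract_feature("AAACCCGGGT", 2, 5, "-")
```

With the real contig sequence loaded from a FASTA file, `extract_feature` would be called with the parsed coordinates instead of the toy values.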
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAACATCATCATCAAGTGGAAGTGGAGCCTCCGGCAGCGGAAGCATCCACTGGCTGCAAGAGAGGGAAGAAGAAACCAGTGGGTCGGGAAAAGAAGGAAGCGCAGGGAAGAGCTACGAAGAAGAAGGCGGGGGCAACTTCGGTCGACGAAGAGCAATCTACAGGCCGTTTAGATGGCATGCCGATTAAGGTTTCGGAGTTTGATCATTGTGTTGAAAATCATTTTAGCGCCATTGATACAATTTCCAAGCTCTGTGGTGAGGCAGAGGATGGCGATGGCGGAATTGACGAAAGTGACATTCAGAGGTTTTCGTCATCCACCATTTTCTTAAGGTACGCATGGTTGCAAATGTTTTTTAGCAGATGCACTTATATACAAACAAGTATGCTTTACGCTTAGTACGTAGCTCGAGGATTCTTGCATTGTTTGCACTGATAAGTTATTTGGCAGTAATGACATGTTATACAACTGAAGATACTTCAGTTTATTTTAAGCGTGAAATGCGTTGAATATCGTGTGATGATGATTTTTATTCTCGCACAATTTCTTGGAAAAATGACGTGAAATCTTCTTCCTGCATTATCTGTCACCGCTGCGAATCTGTACTTAATTTTTCAAATTATACTTAACTAGGGAATGGAGGTACTATAATTACGAATCGAAAACCGTCAAGTTTGCTAGTGACTCGAGAGGCCCTGAGGGTAAGGATGCTGACATCACGATAAACTTACCACAGTTTTCTTCTGCTGCTGTTCTGAAGGTGCTGGGTTGTAGGTCATGTTTAATATGTTGGCTTTTTATTTCTATTTATATTCCCTGCTGTTTTTTTTTCCTTGTTTGCCCTCTCCTCCTTTTCCCCCACTGTTTGCTTCTTTACAGGCGATGAACTGAAGTAATTGTTATGCAGCATAAAGCATTAACATAAGTCTTAATTTACTACGCAGAATGGAACACCATCTGAAGCCACTACATCTCTAGACTGGAGGTACATTTTTTTGGACTTGGTTTTTTGAAAGTTATGGTTTTTGTTGATCCCATCGTAGGAATTTTAAGGAATAAGCGAGCTCTGTATGAATGTCGAGGTTTTCATTTTTCAAGTTGAACTCTGATGTAATGGATGCATAATGACCTTACTTTAAGGTATTTTAGGAATTTGTATGTTGCATCTACTGCATAAATACTTTAAAAAGACTGCATACCATGAAATTTTTTATTTCGCTCTTGCTGCTAGGTATCTTCCCCTCAAGACATTTGTGAGTGGAGCTTTCTGCCTATGATGTGATTAAAGTACATGATGTTGCATACCGTGCCAATTTGATTTTTTTATTCCTTATGCTGTTGGTTACAGTAGTTTTCATTATTGGGCTAATTTTGTCAGTTGTGAATCAATTGAAGCTACTTTTTTGGTGTAATTATAACTTTCAGCATATGATATGTACATTCAATCAATCACTTTTATTGCTATTTTGGACAGAAATTTTGTTATGTATGTTGGCGGGCCTGTTTGGGCTTTAGATTGGTGTCCTCAAGTTCATGAAAGAACTGACTCCCTAATCAAATGTGAGGTATCCCTCCTCTTCTATTGGTCCTTTTGGAAGTAAAACTGCTATGGACACACATTATAATTTATGAATGTAAAATATTGTTTTATTCATCTTCACTTATGAGCTTCCATTTTCATACACACAGATAGGCACTGCTTTCTTTTTGTTCGTACTTTCTCTGTTTATCAGAAGTTCTTATTCATTTTCTTTTTTTTATTTGTCTCAACTCATATGACAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGTACCCCACTCACAGGAAGAGGTATGGTGCAGATATGGTGCTTACTCCATGGCACTGAAAACCATGAAGCAGAGCCAACCAATGCAGCAAAGGGCAAATCGAACCCTAAAAAGGTTGAGGTATCATCAGACTTATCATCTCAACCAAAGAGGCCTAGAGGAAGACC
ACCAGGGCCTAAGAAAAATGGGGCATCAGACTTGCCATCTCAACCGAAGAGGCCTAGAGGAAGACCTAAAAAGAAAAAAGAAGAATCCAATGATAACATGGGTGACAATTACCAATTTGTTCAGGCCCTTTCTATTGAATACCCAGTTGTCCCTAAAAATTCTGAAGAACTTGTATTACTGGAAAATAGTGTTGAAAGACAGAAGTGTACCTTACAAGAAGTTTCTACATGCAATTCTGAAGATGAAGTTCCTGCGCAGAAGAGAAGAGTGAGAAGAAAAGCTGGGACTAAGAATCATATTGATGACATGGGGATGTTAACACTTACAGAAAATCGAGAAGATGGATCCAATGGTATCAATCTTCAGGCAAATGAGAATGTTATAAGCAAAAATTCTGGGGAAGACACTCTATTATGTAATAATATTTCAAAGAATGCTGTATTAGACACTAGCTCAATTGAATTCTCTATTCCCGGGAGTGTTGCTTTGCCTAGAGTGGTATTGTGCTTAGCTCACAATGGGAAGGTGGCATGGGATTTGAAATGGAAGCCATTTAATGCGCGTACTAGCAAATGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTGTAATTGAATTGCTTTAACTGTCTTATGACATATATGCTTAGTTACTTTATCTTTAAGTTTTTTGTCCATTCTTTATAAAACTATTTGTCTATTTAATTTATCACGTCGCTTGTGTACTTTTTAGTTATTCTCGTAGGTTTACTGAGAGCTTTTTGTGCTTGCTAAAGATCTAAAATATGTAAAGGATTTGGAAGAATATACTTAAGCAATGTTGGATAATTTAGTTCGATTCGAATTTGTTGAATATGACAGGTTTTGTACTGCTCAGTCCTAGAATCAGCTTCCATGAAATTGCTATTAGTGGATCAAATGTTTGACGTTACTTGTTTGCTAGGATAAATGACATTTTATTTGAGCATGTCATAACAGAAAAACTACTTTTGAAGTAAGTTCTACAAAGTTTCTGGTGAACGTACAATATGAGCCTTTATTTTTATCATTTTCTATAAATATTATTTCTTTGTAGAGTTGACTTATGATTTTATTTTTTGAAAAATTAATTCATAATTCTAGTTTTCATGTTTAATTAGAGCAGCACAATGCATTATACCAACATGATGTACACCACATCGGAAATTCTATTATAATATATAGCCTGCAAAGTTCTTGCATTATCATAGTTTGTTTGCTGTTACCGGCAACTTCGGCTGTTTTAAGGTTTTCATGCTGTTTATTATGTGTCTCTTATGATTTGCAATATTGGTTCTTTAGCGGGAACTGTGCTATCTGATGATCCTATTAACTAATTGCAGCTGGGAGGTCCCTTTTCCTCACGTAGTGAAGGCAATCTATTCTAAATTCAATGGGGAGGGTATGGATCCTCGCTTTGTTAAGTTGAAGCCTATTTTCAAATGCTCGATGTTGAGAAGTGCAAATACACAGAGGTATTCTGCGTTTACTTTAGATATAATGACAACTCTTCAAGTTCAATATGATGCAGTTTTGTGAAAAGAAAAACCAATGCGTTCTGTATTGCTATCAATGGAAGAGTTTTCTTTAATCATGAGAAGAGACTGCCATAGAGATGGTCATACTCTTTATTATACACTAGCAGTCATTAGTTTTACTTTTATGCTCATCATACTTGATTAAAAGCTTATGACCCTGAAAGTTATAGTTTGAAAGTGATCATCTTTTCCTCATTTTAGTATTTTTAGGGTTTCTACATACTGGATTCCTACAAGTAGATTTGGGCCCTAGGTTGGTGTGTCTCATGCATGAAGGGATCAGAGACACAAGATCATCTTTTCATCAATTGTCCATTTGCTGAAAATTACTGGTCCTCCATCCTTTATGGCAGATTGCTCTACCCTACAACCTAAACTGCCTCCTATCCTACATCTTTGCAGGACATTCACTTAAAAAGGAA
AGGGAGACCTCTTGGCTCCAGATTAATAGGGCCTTCTTTTGGAACGTTTGGCTTGAAAGAAACAACAAAACTTTAGACAAAAAGGGAAGCCTTTTGAGAGTTTTTAGGAATAGGTTTTGTGACTGTTTTAACTTGGGATAAATGTGTCTCTCCCTATAAGCAATTGTTGTCTTAATGATCTTTTGGCGGCTTGGCCAATTGGAAGAATTTTTTGGTTATCTTTCACAAGGGATTGGCTTTCCTCCCCTATTTGTAATATCATGTTATCAATGAAATATTTGTTTCTTACGGAAAAAAAGGTAAATAATAAATTTTGTACCCAATCGTTCACTACTGAAGTCTCACTCTCTGCATCATTGCACATTGCACCATTCATCATGTACAACCAACCTCCCCTTGTATTCTGATTAAATCATCCTTGTCATTGAGAGAACTAACTTAAGCTGGAAAACTTTTGAATGAAGTCCTCGGCTTTGCCTGTGTTAGAAGAGGCGTGGTATTATGAAACTAGTGCGTATAGACTATTCAGTTCATTCTTGTTGAATTCAGAAATTATCCTAGGTTCCTTGCAATGTTAACATACGGACATTTTCACCCTTTTTCCTTGTAAATCAACCCCTTATCTCCCCAATTTTAGTACCTGATAGTGGCTTCCTTGCGATGGATGGATAGTTCAAATATGGAATATTTGCAATATGTTCACCTCTACTTGGTAGTTGATTCCATAAAGGAAAAAATATATCATTCTGATTGGATTAAATTTAAGTGGTGAAGCAAACCCTTCCATTGATTTTCATTTTTAATATATTCTGGGATTTTGGAATTGTCATTCTTTTGGTAAAATTGCATCACAAGTTAGCTATCTCTCACGAGTTTTATTTTGTCACTAACTCTCTTTGTTACTTCTAATTCATTGTCTAAATTGTTGTGCTGCAGCATCCCTCTGACAGTGGAATGGTCGTCAACACCTCCTTATGATTATCTATTCGCTGGATGCCATGATGGAACAGTTATTATTTAATTTCCATTTTCCCTTTCTTTTTATCTTCTTTTGGTTGCTCATAGCAAAGAATAAAGTTCTAAATATATGGTTCTTTCATATAGGAAGAACTTCTTCATTTGGTTTGAGAAATTTCGGGAACTGCTTACTCATTCTTTTGATGAAATGTTTTTGGGAGCATGAGAAAATTGAAATTCTTGAGGATTACTCTATCCAAGGATACACGTTTTTGGATGTTTTCTTAATTTTGTTTGTCATTGATATGTCTTTATTACTGATACAATTCCTGATCCGGTTTTGGGTGGTCCTGCTCTCTTGTTGCTTCGAATTTTAATTTTATGATGACTAAAACATAGTTCATCCATTTAGGTTGCCTTGTGGAAGTTTTCTGCAAATAGTACCTGTGAAGGTTGTTTTTCTTGTCCGAACTTGTGTTTCTAATTCCTTCTTTTGTTTGTATTTTGAATTTCTTCCCTTGTGCAATTATAGTGATATAGACATCATTATTTCCTTTTGTTCTTCAGATACAAGGCCTTTACTTCGTTTTAGTGCAGATACAGTTCCCATAAGAGCGGTTGCATGGGCACCAAGTGAAAGGTTTGTAATGATAGAGCTCAAGTTTTATTTATTCTTCTTCATATTCAATATTGGGTCAACCATAATAAACAAAAGAGGATAAAGAGAGAAGGATGAAAACAGGAGTGTCTTCAGTTGTTAGCGACTCTCATTTCATTTGAAATTTTGGGATTGATATTACTGTTGGTTCACAGTAGTTGGGCAATTTTACCTCCTTAGAAAAAGGTTACAAGTTCAATTCCCCTCCCTCATGCCGCTTGTCTATGTAATGGGGAAAGGCATTTTTACCTTTGATGGAGAAAATAATAAGAGGAATATAACTACTATGAGATGAGGATGACCAGTCTAAATTTTATTTTATTTCAGTTATTTACCGACATATGGAAACTATAAATATACATATACTTTCTTCTCATTATGTGCA
GCGATCCAGAAAGTGCAAATGTGGTACTTACTGCTGGTCATGGAGGTTTAAAGTTTTGGGACCTAAGGTTGGTGGATTTTTTTTGTTTTTATTTTATTTTATTGAAGGAAGTTAATTAATTCGATTGGAGCAACTTTTTTATACCATTTCATCCCTACGAGATGCATTCTGTATTCCATGTCCTAATTCAAGCTTCATTTACAATGGTTTTTTACTAGCCAACATCCTCAAATATCTTGAGGTAATTTGAAGATGTAAAACAGCAAAACTAACACCCTCCTCTTCCACTCTTTAGACACACCCGAGGAAATTTCGTTGAAACTATTCTTCCTCCAACAATGGAGTAACTTCTTGATATTTGCAAAGAGGAACTTCCTTATCTGACAAGTGACGAGGACAAAGGTTTTGAATTTGGAAGTTCCTGTCCCGATTTGAAACTTCTCTGGAGGCATTTTGAAGATTTCTATTGGAAGACTTGCAGAGAAAGGGAGGAATATTGCAAATTAAGATTCAAGGCGGCTTTGTATTTTTTTGTGATGGGAACCATTTAATGTTAGTCAGCCTTAATGAAACTTTGATGAGCCTTGTCTTTGAAGGAGCCAATGTTATTTCTGTTGCTGTCTTGCTGAAGACCAATTAGTTTGACCACTTGTGTTTATAAAGTAATAGCAGAAGTGTTGTCAGAGAGGTTTGAAAAAGTTATCACCTCCATGATCTCTATTATGTGATCAACTTTTATTGGTGGGAGACAAATTCTAGACCCTATGCTTATAGACAATCAAGTAGCCAATGCAGCAAGAGAGGGGGATGGAAAAGCTTATGATTTTGTTGATTGGATCATTTGTGGCCAGAATCTTGAGCTGAAAGTCTTTGGCCCTAGGTGGAGGAGATGGATTTGGGGATGCCTGTCCAGCACTAACTTTCTTTTCTTGTTAATGATAGACCTGATGCGAAGTTAATCTCTTTTTGGGGTGTGAGGTAAGGTGACTCCCCTCTTCCCTCTCTCTATATTATTGTGGTTCATGTGTTTAGTAGGCTCTCGGAGAAAAGAGCTTGAGAAGGACTTATTAAAGGTTGTGAGGTTGGGGGAGATAAAGTTCCAGTTTCTGAGTTGCAATTTGCCGGTGATCTCTTGCTTTTCTCTTCTTTTGATAAAGCTAATTGTGTAGAGGCGACAAGGCTGGGTTGAAAAAGCAAAATAAAATGTCTAAAAGGAGGGAGATGAGAATTCAACTTTTTTTTTTTTCCTCGCTTCATTGGCTCTTGGATGTTGAAAAGAAAAGTGCTTTTATTTCAAATTTCAATACTTGTTTGGAATTTAGATGAAAGTACAGTAGAAGGTTGCGGCCTAGAGGGTGAGATTGTCAGCTTTTTCAGCAAATTGTACACAGAAGTAGAGGGGGACAGATTCTTTGCCTATGGGAATTAAATGAAAGTCGAATTTTAAGTGTGAAAAAAATTGGTTGGAAAGGACCTTTGAGGAGATAGCGGCGTTCAACATTATTAAAAATCTTGGAAGATTAAAACTTTAGGTCTTGACGGGTTCACCATAAACTGACTTAAGTTTTGGAACATTTTAAAACCAGATTTCTTAGATTTCCTTGAAGATTTTTCTCAGGAGGGGCATCTCAATTTGTGCATGAAGGAGGTCTTTATTTGCTTAATACAAAAGAATGAAAGGCCTGTTAGAGTGAAGGATTTCAAGCCCAAAGCCTTGTGGCTAGCGTGGTTCTAGCTGAAAGGTTGAAGAGTGTGTTGTCGTCTACTATCTCTCCCAATTAGAATGGCTTCATAAAAGGAAGATAAATTATTAATCTTATTTTATTTGCAAATAAGTCGCAGAAGATTACAACGTTAGAAAGAAGGAAGAAAGGATGGATCTTCAAATTATATGTGGAGAAATCATATGATAGAATTATTTTGAAATTACTGTGGTAATGGAAAGAAAAGAATTTGGGGCTAGTAGTGATTTAAAGAAGCTCGCCTAGGTGCACACATAT
GGTGCATTGCCTTCACTTGAGGCGGAAGTGTCTTTTAATTACTTTAAAGAAAATAAAGAAATGAGAAGAAAACCTCTGTATCTTTAAAGAATCTCTTGAAGAAGTCCTTTAAATTTCATAATTTTATTATTTACAATACTATTGTTTCTATAATTAAAATAAGTTATTTTTTTCTATTCTGCGCTCCACACAAAAAAAGTCCTCACTTTTTTACACTTAAATGTGCCTCACACTTTGAAAAAGATTAAGGGCTAGATGGTTGAAATGGATGGAAGAATGTCTTTATGGAACTAAGTTCTCTATTTTTTTCCTCATGAGGCCTAGAGAAAGAGTGCTTGCATCAAGAGGGTTGGGGCAAGGTGCTCTTTTCCCTCTTCTTAACTGTAGGTGATGTTTTGAGTAGGATGATAGACCATGGCATTTTAGGGGATTGGCTGGAAGGTTTACTCATTGGGAGGGTGAAAATACACATTTCCCTCTTACAATGCGCAGATGTTATGTGGCCAAGAGCGATAACTTGTTTAGTGTCCTTCAAATATTTGAGAAGGCATCAAGGTTGAAGATTAACATGACAAGTCTGTCAATTGGTGGTGTGAACTTAAATGTGGATCGCATGCAGGAGGTGACCAAGAGACTGAGGTGTAGTTCTGATGAATTTCTATTGTTGAGAGTATATTGAAAAATATTCAATATATTATAACCCACCAGTTTAAGCTTTTGGGTCTAGTGGTGATTTATTAAATGAGGAAGTCATAGACCTTTGTCATATCATTCCTTTCCCAATCAATATTAATAATTTGCATGTTGGGCTTCGTGAGTGTTTGAGAATATATTGAGAAAGATTAAATATACCGTAACCTATCACCTCAAGCTTTTGGGTCTAGTGGTGATTTTATATTGATGTCTTGGGTGTGCCCTTGGTGAAAACCCTAAAAAGGTAGGTTTCTGAAACCTTTTTTTTTTTTTAAGAATAGAGAGGAAGTTGCGGATGTAGAAGTATTTTTGCCTATCAATAGGGAGTAGGGCCTATCTTGTGCCAACTGGTGTTATCCAACTTACCTGTATACTTTACGTCTTATCTTTCATTTTTCTTGAAAAGTGGCATTGTTGATAGAGAGAAAGATCTAGGCTTTGCTGTGGGAAGGCAATAAAGAAAGGAGGCTGTCACCTACTGAATTGGAATATCATTTCAAAAACCTTGGAAAATAGTGGATGAGGTGTTTGGAATGTAAGGAAAAGAAATGAGGTCCTATCAACAAAATGGCTTTGGGGGTTTCCCTGTGAAACTAGTGAGGTATGGCATCAAATAATGACTAGCATTCATAGGAAAGGGGAAAATGGATGGTCTACCGAAGGAAGGAAAACACGTAGCAATAGGAATCCATGGTGGAATATGTTAGGATCCCAATTACAAGAAAACAGTAATATAACTATATTGCAATATCAAAGAGAAATTACAATATCAATAGTGTTTCGAGAGGGCTATCTCTCCCACAATCCATCCACTAAGGACTAACTTCCTAAAACTATGCCCCCTAAGCTCTCTTACTCCCTCTATTTATAATCGACCCACCCTAACAAACTACCCAGTCACTACTAATATCCTACCAATGTTTCCCATACTTCCCTATATTTCCCTATTAATACTCTCACAGAATATCAGCTAGCTTAAAGGGAATTTGGAGATCTTCACCACAATCAAGGTGGAAATGGTGAGAGAACATCATGATACATGGGTGGAGAATGAGAATCAAACACTTTGCTGCCGATTTGCTAGACTTTTAAAAATGTGATATCATTAGAAAAGGATACAACGGTGCAGGAATATGAGAGGAAAATACCAACTCATGGAGGATTTTGGCTAGAAGAAACCTACATGATGAAGAGTGGGATTAGCGACTGGAGCTCCTTCACTTGGTGGATAGACTACGGCTAACAAGGTAGCAAGATGGAAGAGTTTGGAGGCTTGAGAAATCGAGTACCTTCTTGGTTAAATCTAC
CTTCAGTGAACTATTAGGCTTATCTCTATTACTTAATTGGGAACTAGTGGCTGCGTTATGGAAGCCAATTTTCTGATGCACTCTTTGTTCTTGGGGAGTCAGTACAGCGGGAGATTATAAGCAAGATGCCCAAATATGACACTTAGACGAAAGGAAACTGCTGTATACTTTGTGAAGGAGCAGAGGACATTCAAGACCATATTATATATAAGATTCAAGGCAACAACAAAAAAGACAAGATACAACGCATTTTTGCAATAATTCTTAATTGATTAATTGATGTGAACGGTCACTTGTTATTTCATTTATTCTGTGCTCATTTTATGCTCTTCTTTTGTTATAGTTTAAAATTTTGGTGGTTTTGGATCCACATTATAATGTTTCATTTCAATTCAGTTGCAATCTGATATGAATTGTTACAATATTATATCAGAGGCCATGCACCAGAACCATTGATTGCTAGGTCCATCTGGGATTTCCTGTATTTTTTCCCCCATGATTGTGTTAGTGACAATTGTTAATTTTTAACAAACTGAAGAATATACCACTTCCCTGCAGAGATCCATTCCGTCCTTTGTGGGACCTTCATCCAGCGCCCAGGATGATATATAGCCTGGATTGGCTTCCTGATCCTAGGTACATATTTTATCTTGATAGAAAGCAGGTTAGGCACATCAAGTTATTGTTTTACTAATTCTCTTTTATTTACAATGACCCTATTCTGTTTAGATGCGTTATTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGACGTTCCAGTAACCGGTAAACCCTTCACAGGGACTAAACAACAAGGTTTACATAGTTACTGTTGTTCATCATTTGCTATCTGGAGTGTTCAAGTGTCACGGCAGACAGGTATATCTGTAATTCGTGGTTTGTTAGCTTGGATTGTGTTTCACTTGCTTAAAGGATTTATACAACTATCGGTAGCTAGAAGAATTAACTGGAATCAAAGTGGTTTACTGGAGACCTAATTTATAACATCAAACAAGGAAACATTGGATTTGCTTCAGCACTGAATTGGTTTATGTGTGGTATCATCAGTTGCCCGTATTACTCAGTGGTAGAGCATTAGCTATGAACTGAAGGTTTATTTGTACCTGTATCATTCTCCTGATTATTTAAGTTTCTGATTGATTTTAGAAAAGAAGGCTTAAATAATTGTTAAATTTGAGGTCGTGGTACTAGATTTGCAACTTTTTTTTTTTTGCTTGGCTTTTTTCAAGTGATTGAAAACAAATAAAAGAAACATTCTCGTTGATGTGATTACGTGCTTGTTTTGATTTAATTTTCAGCTGTGTAGAAATAAATAACAAATAGTATCACATCATGTCTGAATTCAAAAAGGAAATGGGCACCAAAATATCACCAAGCAGGAGCTGTGTGAATCTTAGTTCTTTCGAAAAAAAAAACAATAATCTCCCTACTTTATACACCCAGTATCATAAGTTTCCAGTGAGCAATGATGATTTTAATTTGTGTTTTGTATACAAATAAATTTTTTGGTCAAAATGTAGAAGCTTTGCCTGGTTCTGATGTTTTAAAGATCAAATTTGATTTTACTTTTAAATTTTGACCCAATATAATGAACTATTTGATTCTTATCATATACATTTAAGACATCAACTGATGATTTCAGGCATGGTTGCATACTGCGGTGCTGATGGAGCTGTTCACCGTTTCCAGGTGAGTTCAACTTGCAGTAGATTTCACTAGGGAGGTGAAAGCTCGTCTCTTCGGCACTTTCCATTCCATTGGTTTCAGTTTTTATACTCTTATTCTGTTGCTTTCTATAATCTTATTGCTGTGTATTTATGTATGTGTGTGTATATTTATAAACTACTTTTCCAATGAAGAGGGAAAATTACAAACAGGAGTAGACAAGGCATCTAATAACCCCTAATCTTAGGGAGGATTACAGCAATCCCTCCCGTTGGTATTCAAATAACAAGA
GATATAATTACAAAAAATTTTTGGTAGGAGACTGGAGAAACCAGAAATCTGGCTTCTACCCAAAAATATTAATGATGGGTCTGCATATCTTTGGTTTGGGTTTAAGCATGAGATTGGAATATCCAGCAGTCCTCTCCCATTGTTTAAACCCTATCAAATCCATGGGTTTCCTGAGGATGGATCAAAGAGGGAGAACATAAAATTATTTGATCAATTAAATTTTTTTAATAATAACTTCGATAAAGAGTTATTTGACTATAAAATGTACTTTTGATGGTCAAAATAGATAAATAGTTGAAAAACTACGCAGTTCTTATATGTTTTTTTTCCAGTATGCGTCACCAATGGAATTCAAAGATCATTGATACATACATTAAATGCACTTTCATTAATATAGCATTATCCTCCATAAATCATGCCTTGAATTATCATTTCATAATTTGTTTTTCTTTCACGATGAATTTATCAGCTTACTACCAGAGCGGTGGAGAAAGATCATTCACGTAGTCGAACCCCGCATTTTGTATGCGAATACTTAACTGAGGAGCAATCAACTATTACAATCCACTCTCCAGCATCAGATGTTCCATTCCCTTTGAAGAAGCTGTCCAACAAATCCGACCCGCCATTGTCCATGCGAGCTATTTTATCTGATTCAATTCAGTCAAATGAAGAAAATCACAAGACGGCCATGGCTTCTGCATCGGAAAATGAGACATTAGGTAGCATGTCGATCCTCGACTTCTCCCAATCATTTTTTAAGAAACTTTCACCAAATTGTTGATGCTATTGCAGCCCTTAGCTATGGCAATGATGTCAGTATTGAATCTGGGTCTGAGGATACACTGATGTCCATCAAGAAGAGAAACCAAACTCAATCAAAGGGCAAGAAGAAGGGAGTGGATAACCAAGCATTGGAATGTAGTGATGAGCCTAACGATGCACAAGCGAAAGCTGACGTATTGCCGGGTTCGGGCAATGATTTTGAAATTTTCCCTCCCAAATCAGTTGCAATGCATAGAGTGAGATGGAACATGAACACAGGGAGTGAAAGATGGTTGTGCTATGGTGGAGCAGCTGGAATTCTACGATGTCAGGAGATTGTGTTGTCTGACCTCGATAAGAAGTTAATGATGAAGAAATGAAATTTGTTTGAAACAGAGGCTTTGCCCAAATGTTCAATGGGAGCTGACGTTCACGAAGCGACTCGTGCTCTTAAGCAATTCTTTGTACAGTGGTGAGATCTGGTTTGCTTCTGTTGCTTAAAGGGAAGCCATTTCTAATCAGAACAGTGATCACGGTGCCTTCTTCATCCTTGAGGGGCCAAATTATGGGAGCCGCCAAAAAGAATGACAGAGCATGGGATAACTGA

mRNA sequence

ATGGAACATCATCATCAAGTGGAAGTGGAGCCTCCGGCAGCGGAAGCATCCACTGGCTGCAAGAGAGGGAAGAAGAAACCAGTGGGTCGGGAAAAGAAGGAAGCGCAGGGAAGAGCTACGAAGAAGAAGGCGGGGGCAACTTCGGTCGACGAAGAGCAATCTACAGGCCGTTTAGATGGCATGCCGATTAAGGTTTCGGAGTTTGATCATTGTGTTGAAAATCATTTTAGCGCCATTGATACAATTTCCAAGCTCTGTGGTGAGGCAGAGGATGGCGATGGCGGAATTGACGAAAGTGACATTCAGAGGTTTTCGTCATCCACCATTTTCTTAAGGGAATGGAGGTACTATAATTACGAATCGAAAACCGTCAAGTTTGCTAGTGACTCGAGAGGCCCTGAGGGTAAGGATGCTGACATCACGATAAACTTACCACAGTTTTCTTCTGCTGCTGTTCTGAAGAATGGAACACCATCTGAAGCCACTACATCTCTAGACTGGAGAAATTTTGTTATGTATGTTGGCGGGCCTGTTTGGGCTTTAGATTGGTGTCCTCAAGTTCATGAAAGAACTGACTCCCTAATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGTACCCCACTCACAGGAAGAGGTATGGTGCAGATATGGTGCTTACTCCATGGCACTGAAAACCATGAAGCAGAGCCAACCAATGCAGCAAAGGGCAAATCGAACCCTAAAAAGGTTGAGGTATCATCAGACTTATCATCTCAACCAAAGAGGCCTAGAGGAAGACCACCAGGGCCTAAGAAAAATGGGGCATCAGACTTGCCATCTCAACCGAAGAGGCCTAGAGGAAGACCTAAAAAGAAAAAAGAAGAATCCAATGATAACATGGGTGACAATTACCAATTTGTTCAGGCCCTTTCTATTGAATACCCAGTTGTCCCTAAAAATTCTGAAGAACTTGTATTACTGGAAAATAGTGTTGAAAGACAGAAGTGTACCTTACAAGAAGTTTCTACATGCAATTCTGAAGATGAAGTTCCTGCGCAGAAGAGAAGAGTGAGAAGAAAAGCTGGGACTAAGAATCATATTGATGACATGGGGATGTTAACACTTACAGAAAATCGAGAAGATGGATCCAATGGTATCAATCTTCAGGCAAATGAGAATGTTATAAGCAAAAATTCTGGGGAAGACACTCTATTATGTAATAATATTTCAAAGAATGCTGTATTAGACACTAGCTCAATTGAATTCTCTATTCCCGGGAGTGTTGCTTTGCCTAGAGTGGTATTGTGCTTAGCTCACAATGGGAAGGTGGCATGGGATTTGAAATGGAAGCCATTTAATGCGCGTACTAGCAAATGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTCTGGGAGGTCCCTTTTCCTCACGTAGTGAAGGCAATCTATTCTAAATTCAATGGGGAGGGTATGGATCCTCGCTTTGTTAAGTTGAAGCCTATTTTCAAATGCTCGATGTTGAGAAGTGCAAATACACAGAGCATCCCTCTGACAGTGGAATGGTCGTCAACACCTCCTTATGATTATCTATTCGCTGGATGCCATGATGGAACAGTTGCCTTGTGGAAGTTTTCTGCAAATAGTACCTGTGAAGATACAAGGCCTTTACTTCGTTTTAGTGCAGATACAGTTCCCATAAGAGCGGTTGCATGGGCACCAAGTGAAAGCGATCCAGAAAGTGCAAATGTGGTACTTACTGCTGGTCATGGAGGTTTAAAGTTTTGGGACCTAAGAGATCCATTCCGTCCTTTGTGGGACCTTCATCCAGCGCCCAGGATGATATATAGCCTGGATTGGCTTCCTGATCCTAGATGCGTTATTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGACGTTCCAGTAACCGGTAAACCCTTCACAGGGAC
TAAACAACAAGGTTTACATAGTTACTGTTGTTCATCATTTGCTATCTGGAGTGTTCAAGTGTCACGGCAGACAGGCATGGTTGCATACTGCGGTGCTGATGGAGCTGTTCACCGTTTCCAGCTTACTACCAGAGCGGTGGAGAAAGATCATTCACGTAGTCGAACCCCGCATTTTGTATGCGAATACTTAACTGAGGAGCAATCAACTATTACAATCCACTCTCCAGCATCAGATGTTCCATTCCCTTTGAAGAAGCTGTCCAACAAATCCGACCCGCCATTGTCCATGCGAGCTATTTTATCTGATTCAATTCAGTCAAATGAAGAAAATCACAAGACGGCCATGGCTTCTGCATCGGAAAATGAGACATTAGCCCTTAGCTATGGCAATGATGTCAGTATTGAATCTGGGTCTGAGGATACACTGATGTCCATCAAGAAGAGAAACCAAACTCAATCAAAGGGCAAGAAGAAGGGAGTGGATAACCAAGCATTGGAATGTAGTGATGAGCCTAACGATGCACAAGCGAAAGCTGACGTATTGCCGGGTTCGGGCAATGATTTTGAAATTTTCCCTCCCAAATCAGTTGCAATGCATAGAGTGAGATGGAACATGAACACAGGGAGTGAAAGATGGTTGTGCTATGGTGGAGCAGCTGGAATTCTACGATGTCAGGAGATTGTGTTGTCTGACCTCGATAAGAAATCTGGTTTGCTTCTGTTGCTTAAAGGGAAGCCATTTCTAATCAGAACAGTGATCACGGTGCCTTCTTCATCCTTGAGGGGCCAAATTATGGGAGCCGCCAAAAAGAATGACAGAGCATGGGATAACTGA

Coding sequence (CDS)

ATGGAACATCATCATCAAGTGGAAGTGGAGCCTCCGGCAGCGGAAGCATCCACTGGCTGCAAGAGAGGGAAGAAGAAACCAGTGGGTCGGGAAAAGAAGGAAGCGCAGGGAAGAGCTACGAAGAAGAAGGCGGGGGCAACTTCGGTCGACGAAGAGCAATCTACAGGCCGTTTAGATGGCATGCCGATTAAGGTTTCGGAGTTTGATCATTGTGTTGAAAATCATTTTAGCGCCATTGATACAATTTCCAAGCTCTGTGGTGAGGCAGAGGATGGCGATGGCGGAATTGACGAAAGTGACATTCAGAGGTTTTCGTCATCCACCATTTTCTTAAGGGAATGGAGGTACTATAATTACGAATCGAAAACCGTCAAGTTTGCTAGTGACTCGAGAGGCCCTGAGGGTAAGGATGCTGACATCACGATAAACTTACCACAGTTTTCTTCTGCTGCTGTTCTGAAGAATGGAACACCATCTGAAGCCACTACATCTCTAGACTGGAGAAATTTTGTTATGTATGTTGGCGGGCCTGTTTGGGCTTTAGATTGGTGTCCTCAAGTTCATGAAAGAACTGACTCCCTAATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGTACCCCACTCACAGGAAGAGGTATGGTGCAGATATGGTGCTTACTCCATGGCACTGAAAACCATGAAGCAGAGCCAACCAATGCAGCAAAGGGCAAATCGAACCCTAAAAAGGTTGAGGTATCATCAGACTTATCATCTCAACCAAAGAGGCCTAGAGGAAGACCACCAGGGCCTAAGAAAAATGGGGCATCAGACTTGCCATCTCAACCGAAGAGGCCTAGAGGAAGACCTAAAAAGAAAAAAGAAGAATCCAATGATAACATGGGTGACAATTACCAATTTGTTCAGGCCCTTTCTATTGAATACCCAGTTGTCCCTAAAAATTCTGAAGAACTTGTATTACTGGAAAATAGTGTTGAAAGACAGAAGTGTACCTTACAAGAAGTTTCTACATGCAATTCTGAAGATGAAGTTCCTGCGCAGAAGAGAAGAGTGAGAAGAAAAGCTGGGACTAAGAATCATATTGATGACATGGGGATGTTAACACTTACAGAAAATCGAGAAGATGGATCCAATGGTATCAATCTTCAGGCAAATGAGAATGTTATAAGCAAAAATTCTGGGGAAGACACTCTATTATGTAATAATATTTCAAAGAATGCTGTATTAGACACTAGCTCAATTGAATTCTCTATTCCCGGGAGTGTTGCTTTGCCTAGAGTGGTATTGTGCTTAGCTCACAATGGGAAGGTGGCATGGGATTTGAAATGGAAGCCATTTAATGCGCGTACTAGCAAATGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTCTGGGAGGTCCCTTTTCCTCACGTAGTGAAGGCAATCTATTCTAAATTCAATGGGGAGGGTATGGATCCTCGCTTTGTTAAGTTGAAGCCTATTTTCAAATGCTCGATGTTGAGAAGTGCAAATACACAGAGCATCCCTCTGACAGTGGAATGGTCGTCAACACCTCCTTATGATTATCTATTCGCTGGATGCCATGATGGAACAGTTGCCTTGTGGAAGTTTTCTGCAAATAGTACCTGTGAAGATACAAGGCCTTTACTTCGTTTTAGTGCAGATACAGTTCCCATAAGAGCGGTTGCATGGGCACCAAGTGAAAGCGATCCAGAAAGTGCAAATGTGGTACTTACTGCTGGTCATGGAGGTTTAAAGTTTTGGGACCTAAGAGATCCATTCCGTCCTTTGTGGGACCTTCATCCAGCGCCCAGGATGATATATAGCCTGGATTGGCTTCCTGATCCTAGATGCGTTATTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGACGTTCCAGTAACCGGTAAACCCTTCACAGGGAC
TAAACAACAAGGTTTACATAGTTACTGTTGTTCATCATTTGCTATCTGGAGTGTTCAAGTGTCACGGCAGACAGGCATGGTTGCATACTGCGGTGCTGATGGAGCTGTTCACCGTTTCCAGCTTACTACCAGAGCGGTGGAGAAAGATCATTCACGTAGTCGAACCCCGCATTTTGTATGCGAATACTTAACTGAGGAGCAATCAACTATTACAATCCACTCTCCAGCATCAGATGTTCCATTCCCTTTGAAGAAGCTGTCCAACAAATCCGACCCGCCATTGTCCATGCGAGCTATTTTATCTGATTCAATTCAGTCAAATGAAGAAAATCACAAGACGGCCATGGCTTCTGCATCGGAAAATGAGACATTAGCCCTTAGCTATGGCAATGATGTCAGTATTGAATCTGGGTCTGAGGATACACTGATGTCCATCAAGAAGAGAAACCAAACTCAATCAAAGGGCAAGAAGAAGGGAGTGGATAACCAAGCATTGGAATGTAGTGATGAGCCTAACGATGCACAAGCGAAAGCTGACGTATTGCCGGGTTCGGGCAATGATTTTGAAATTTTCCCTCCCAAATCAGTTGCAATGCATAGAGTGAGATGGAACATGAACACAGGGAGTGAAAGATGGTTGTGCTATGGTGGAGCAGCTGGAATTCTACGATGTCAGGAGATTGTGTTGTCTGACCTCGATAAGAAATCTGGTTTGCTTCTGTTGCTTAAAGGGAAGCCATTTCTAATCAGAACAGTGATCACGGTGCCTTCTTCATCCTTGAGGGGCCAAATTATGGGAGCCGCCAAAAAGAATGACAGAGCATGGGATAACTGA

Protein sequence

MEHHHQVEVEPPAAEASTGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMGDNYQFVQALSIEYPVVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEIVLSDLDKKSGLLLLLKGKPFLIRTVITVPSSSLRGQIMGAAKKNDRAWDN
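
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that step, assuming the standard nuclear genetic code (the page does not say which translation table was used). `translate` is an illustrative helper, not a function of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed on this page.
print(translate("ATGGAACATCATCATCAAGTGGAAGTGGAG"))  # MEHHHQVEVE
```

Passing the full CDS string in place of the 30-nucleotide prefix would yield the complete protein under the same assumption.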
Homology
BLAST of Sgr022873 vs. NCBI nr
Match: XP_022144681.1 (uncharacterized protein LOC111014310 isoform X1 [Momordica charantia])

HSP 1 Score: 1524.6 bits (3946), Expect = 0.0e+00
Identity = 767/915 (83.83%), Positives = 809/915 (88.42%), Query Frame = 0

Query: 2   EHHHQVE-VEP--PAAEASTGCKRGKKK-PVGREKKEAQGRATKKKAGATSVDEEQSTGR 61
           EHHH VE VEP  PAA  STG KRGK+K PV R+KKEA GRA KKK G  S DEEQ TGR
Sbjct: 3   EHHHPVEVVEPPVPAASTSTGGKRGKRKQPVARQKKEAPGRA-KKKPGGASADEEQPTGR 62

Query: 62  LDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYY 121
           LDG+ IKV EFDHC ENHF A+DTI++LCGEAEDGDGGIDESDIQRFSSS  FLREWR+Y
Sbjct: 63  LDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSAFFLREWRFY 122

Query: 122 NYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGP 181
           NYE KTVKFASD RG EGKD DITINLPQFSSAAVLKNGTPS A TSLDWRNFVMYVGGP
Sbjct: 123 NYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGP 182

Query: 182 VWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHE 241
           VWALDWCPQV E+TD+LIKCEFIAVSAHPPGSSYHKMGTPL GRGMVQIWCL+HGTENHE
Sbjct: 183 VWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHE 242

Query: 242 AEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKE 301
            EP  A K KS PKK EVSSDLSSQPKRPRGRPPG KK GASDLPSQPKRPRGRPKKK+E
Sbjct: 243 PEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQE 302

Query: 302 ESNDNMGDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTC 361
            SNDNMGDN Q VQ+LS+EYP            PKNSEEL+LL NSVERQK TLQ VSTC
Sbjct: 303 GSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTC 362

Query: 362 NSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTL 421
           NS+DE PAQKRRVRRK GTKNHIDDMG L  T NREDGS+ I+ Q NENVIS+ SGEDTL
Sbjct: 363 NSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTL 422

Query: 422 LCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMG 481
           LCNNISKNA       EFSIP SVALPRVVLCLAHNGKVAWDLKWKP NA T+ CKHRMG
Sbjct: 423 LCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMG 482

Query: 482 YLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPL 541
           YLAVLLGNGSLEVWE+PFPHVVKAIYSKFN EG DPRFVKLKPIF+ +ML+SAN QSIPL
Sbjct: 483 YLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPL 542

Query: 542 TVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESD 601
           TVEWSSTPPYDYLFAGC+DGTVALWKFSANSTCEDTRPLLRFSADTVPIR VAWAP+ESD
Sbjct: 543 TVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEDTRPLLRFSADTVPIRRVAWAPNESD 602

Query: 602 PESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRL 661
           PESANVVLTA HGGLKFWDLRDPFRPLWD+HPAPRMIYSLDWLPDPRCVILSFDDGTLRL
Sbjct: 603 PESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRL 662

Query: 662 LSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQ 721
           LSLLKAAYDVPVTGKPFTGTKQQGLHSY  SSFAIWSVQVSRQTGMVAYC ADGAV RFQ
Sbjct: 663 LSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQ 722

Query: 722 LTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAIL 781
           LTTRAVEKDHSR+RTPHF+CEYLTEE+S ITIHSPAS VPFPLKK SNKSD PLS RAIL
Sbjct: 723 LTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAIL 782

Query: 782 SDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGV 841
           SDSI+SNE NHKTA A+ASENE LA+ + NDVS++SGSEDTLMS+KK+NQTQSK KKK V
Sbjct: 783 SDSIESNEGNHKTATATASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEV 842

Query: 842 DNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAG 901
           D+QALECSDEPNDAQ K D LPGSG++FE FPPKSVA+HRVRWNMNTGSERWLCYGG AG
Sbjct: 843 DSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAG 902

Query: 902 ILRCQEIVLSDLDKK 903
           I+RCQEIVLSD DKK
Sbjct: 903 IIRCQEIVLSDFDKK 909
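
The percentages in the HSP summary for the XP_022144681.1 match follow from the reported counts divided by the alignment length of 915. A small sketch of that arithmetic (the helper name is illustrative):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

identity = percent(767, 915)   # identical residues / alignment length
positives = percent(809, 915)  # identical plus conservative substitutions

print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```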

BLAST of Sgr022873 vs. NCBI nr
Match: XP_038903194.1 (uncharacterized protein LOC120089853 [Benincasa hispida])

HSP 1 Score: 1397.5 bits (3616), Expect = 0.0e+00
Identity = 712/918 (77.56%), Positives = 775/918 (84.42%), Query Frame = 0

Query: 8   EVEP-PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAG---ATSVDEEQSTGRLDGMPI 67
           E++P P     T  K+GKKKP  REKK+++  A  K       TSV++ Q TGRLDG  +
Sbjct: 3   ELQPQPQPSIGTSSKKGKKKPPAREKKKSEKTAQNKPGATTTTTSVNKHQPTGRLDGPKV 62

Query: 68  KVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKT 127
           KVSEFDHC+ENHF+A+DTI +LC EAE  DGGIDESDIQRF+SSTIFLREWR+YNYE K 
Sbjct: 63  KVSEFDHCIENHFNAMDTIVELCCEAE--DGGIDESDIQRFASSTIFLREWRFYNYEPKF 122

Query: 128 VKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDW 187
           +KFASDSRGPEGKDADITI LPQFSSAAVLKNG P  ATTSLD+RNF M+VGGPVWALDW
Sbjct: 123 IKFASDSRGPEGKDADITITLPQFSSAAVLKNGAPPGATTSLDFRNFAMHVGGPVWALDW 182

Query: 188 CPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNA 247
           CPQVHERTDSLIKCEFIAVSAHPPGSSYHKMG PLTGRGMVQIWC +HGTE++  EPTN 
Sbjct: 183 CPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCFVHGTESY--EPTNV 242

Query: 248 AKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNM 307
                     E  +DLSSQPKRPRGRP G KKNGAS LP QPKRPRGRPKKK+EESND  
Sbjct: 243 E---------EPPADLSSQPKRPRGRPSGRKKNGASGLPPQPKRPRGRPKKKQEESNDKK 302

Query: 308 GDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTCNSEDEV 367
           GD+   VQA SIE PV          VPKNSE +VLLENSVER++ TLQEVSTCNSEDEV
Sbjct: 303 GDSCPLVQAFSIENPVGSSNLLEMDGVPKNSENIVLLENSVERERSTLQEVSTCNSEDEV 362

Query: 368 PAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNIS 427
           PAQKRRVRRK   KNH+ D+GML+LTENREDGSN I+L+ANENV+ + SGED LLC NIS
Sbjct: 363 PAQKRRVRRKTEPKNHVGDVGMLSLTENREDGSNAISLEANENVVCEYSGEDNLLCKNIS 422

Query: 428 KNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLL 487
            NAVLDTSSIEFSIP SVALPRVVLCLAHNGKVAWDLKWKP NA T  CK RMGYLAVLL
Sbjct: 423 GNAVLDTSSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTNASTDNCKLRMGYLAVLL 482

Query: 488 GNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSS 547
           GNGSLEVWEVPFPH VKAIYSKFNGEG DPRFVKLKPIF+CSMLR+ANTQSIPLTVEWS 
Sbjct: 483 GNGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPIFRCSMLRNANTQSIPLTVEWSQ 542

Query: 548 TPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV 607
           TPPYDYL AGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAVAWAPSES  ESANV
Sbjct: 543 TPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESGSESANV 602

Query: 608 VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKA 667
           +LTAGHGGLKFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKA
Sbjct: 603 ILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKA 662

Query: 668 AYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAV 727
           AYDVPVTG+PFT  KQ+GLH+Y CSS+AIWS+QVSRQTGMVAYCGADGAV RFQLTT+A 
Sbjct: 663 AYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAA 722

Query: 728 EKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQS 787
           +K++SR RTPH+VCEYLTEE+STITIHSP  ++PF LKKLSNKS+ PLSMRAILSDS+QS
Sbjct: 723 DKENSRHRTPHYVCEYLTEEESTITIHSP-PNIPFSLKKLSNKSEHPLSMRAILSDSMQS 782

Query: 788 NEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALE 847
           NE NHKTA A A ENE+ AL    DV +ESG EDTLMSIKK+N+TQSK  KKGV+NQ L+
Sbjct: 783 NEGNHKTATAPALENES-ALCSDVDVGVESGIEDTLMSIKKKNRTQSK-CKKGVENQKLD 842

Query: 848 CSDEPN---------DAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGG 903
           CSDEPN         D Q  A V+PGS + FE  PPKSVAMHRVRWNMN GSERWLCYGG
Sbjct: 843 CSDEPNDDAQMDADVDGQTDAAVVPGSRDQFESLPPKSVAMHRVRWNMNIGSERWLCYGG 902

BLAST of Sgr022873 vs. NCBI nr
Match: XP_023528187.1 (uncharacterized protein LOC111791176 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1397.1 bits (3615), Expect = 0.0e+00
Identity = 706/909 (77.67%), Positives = 776/909 (85.37%), Query Frame = 0

Query: 7   VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIK 66
           +E  P  AEAS  T CK+GKKK V  E  E Q RA KKKAGATSV+E Q TGRLD   +K
Sbjct: 1   MEELPHQAEASMGTSCKKGKKKSVSLE--EPQKRA-KKKAGATSVNEVQPTGRLDDSRVK 60

Query: 67  VSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTV 126
           VSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRFSSST FLREW++YNYE KTV
Sbjct: 61  VSEFDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTV 120

Query: 127 KFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWC 186
           KF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  ATTSLD+RNF+M+VGGPVWA+DWC
Sbjct: 121 KFTSDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATTSLDFRNFIMHVGGPVWAIDWC 180

Query: 187 PQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA 246
           P VHERTDSLIKCEFIAVSAHPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E T+A 
Sbjct: 181 PLVHERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSAT 240

Query: 247 KGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM 306
           + K         SDL SQPKRPRGRPPG KKNGAS LPSQPKRPRGRPKKK+EE N DN 
Sbjct: 241 ECK--------DSDL-SQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNK 300

Query: 307 GDNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEV 366
             +YQ VQ LS+EYP           VP NSE+ V LENSVER   T++E+STCNSEDEV
Sbjct: 301 VASYQLVQPLSVEYPDVSSNLLEIDDVPHNSEKPVSLENSVERGSSTIEEISTCNSEDEV 360

Query: 367 PAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNIS 426
           P QKRRVRR A TKNH+DD+G L+L ENREDG N  N +ANENV S+ SGEDTLLC NIS
Sbjct: 361 PVQKRRVRRNADTKNHVDDVGTLSLIENREDGFNATNHEANENVTSEYSGEDTLLCKNIS 420

Query: 427 KNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLL 486
           +NA+LDT S  FSIP SVALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLL
Sbjct: 421 ENAILDTGSTGFSIPESVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLL 480

Query: 487 GNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSS 546
           GNGSLEVWEVPFPHVVKAIYSK NGEG DPRFV+LKP F+CSMLRSA+TQSIPLTVEWS 
Sbjct: 481 GNGSLEVWEVPFPHVVKAIYSKLNGEGTDPRFVRLKPTFRCSMLRSADTQSIPLTVEWSP 540

Query: 547 TPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV 606
           TPPYDYL AGCHDGTVALWKFSANST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV
Sbjct: 541 TPPYDYLLAGCHDGTVALWKFSANSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENV 600

Query: 607 VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKA 666
           +L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKA
Sbjct: 601 ILIASHGGIKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKA 660

Query: 667 AYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAV 726
           AYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGMVAYCGADGAV RFQLTT+AV
Sbjct: 661 AYDVPVTGQPFTAIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAV 720

Query: 727 EKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQS 786
           +K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKL+NKS+ PLSMRAILSDS+Q 
Sbjct: 721 DKENSRNRTPHFVCEYLTEEQSIITIHSPASDVPIPLKKLANKSEQPLSMRAILSDSMQP 780

Query: 787 NEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALE 846
           NE N K+A  SA ENE+ AL Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE
Sbjct: 781 NEGNDKSATTSALENES-ALCYDDDVGVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELE 840

Query: 847 CSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE 903
            S EP+D+Q   DV+PGSG+ FE FPPKSVA+HR+RWNMN GSERWLCYGGAAGILRCQE
Sbjct: 841 HSHEPSDSQTDDDVVPGSGDHFENFPPKSVALHRLRWNMNIGSERWLCYGGAAGILRCQE 896

BLAST of Sgr022873 vs. NCBI nr
Match: KAG7017442.1 (General transcription factor 3C polypeptide 2 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1382.9 bits (3578), Expect = 0.0e+00
Identity = 701/909 (77.12%), Positives = 770/909 (84.71%), Query Frame = 0

Query: 7   VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIK 66
           +E  P  AEAS  T CK+GKKK V     E Q RA KKKAGATSV+E Q TGRLD   +K
Sbjct: 1   MEELPHQAEASMGTSCKKGKKKSV--SLGEPQKRA-KKKAGATSVNEVQPTGRLDDSRVK 60

Query: 67  VSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTV 126
           VSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRFSSST FLREW++YNYE KTV
Sbjct: 61  VSEFDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTV 120

Query: 127 KFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWC 186
           KF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  ATTSLD+RNF+M+VGGPVWA+DWC
Sbjct: 121 KFTSDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATTSLDFRNFIMHVGGPVWAIDWC 180

Query: 187 PQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA 246
           P VHER+DSLIKCEFIAVSAHPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E T+A 
Sbjct: 181 PLVHERSDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSAT 240

Query: 247 KGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM 306
           + K         SDL SQPKRPRGRPPG KKNGAS LPSQPKRPRGRPK K+EE N DN 
Sbjct: 241 ECK--------DSDL-SQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKNKQEEPNDDNK 300

Query: 307 GDNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEV 366
             +YQ VQ LS+EYP           VP NSE+ V LENSVER   T++E+STCNSEDEV
Sbjct: 301 VASYQLVQPLSVEYPDVSSNLLEIDDVPHNSEKPVSLENSVERGSSTIEEISTCNSEDEV 360

Query: 367 PAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNIS 426
           P QKRRVRR A TKNH+DD+G L+L ENREDGSN  N +ANENV S+ SGEDTLLC NIS
Sbjct: 361 PVQKRRVRRNADTKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTLLCKNIS 420

Query: 427 KNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLL 486
           + A+LDT S  FSIP +VALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLL
Sbjct: 421 EKAILDTGSTGFSIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLL 480

Query: 487 GNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSS 546
           GNGSLEVWEVPFPHVVKAIYSK NGEG DPRFVKLKP F+CSMLRSA+TQSIPLTVEWS 
Sbjct: 481 GNGSLEVWEVPFPHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSP 540

Query: 547 TPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV 606
           TPPYDYL AGCHDGTVALWKFSANST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV
Sbjct: 541 TPPYDYLLAGCHDGTVALWKFSANSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENV 600

Query: 607 VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKA 666
           +L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKA
Sbjct: 601 ILIASHGGIKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKA 660

Query: 667 AYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAV 726
           AYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGMVAYCGADGAV RFQLTT+AV
Sbjct: 661 AYDVPVTGQPFTAIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAV 720

Query: 727 EKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQS 786
           +K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKLSNKS+ PLSMRAILSDS+Q 
Sbjct: 721 DKENSRNRTPHFVCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQP 780

Query: 787 NEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALE 846
           NE N K+A  SA  NE+ AL Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE
Sbjct: 781 NEGNDKSATTSALANES-ALGYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVFNQELE 840

Query: 847 CSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE 903
            S EP+D+Q   DV+PG G  FE FPPKSVA+HR+RWNMN GSERWL YGGAAGILRCQE
Sbjct: 841 HSHEPSDSQTDDDVVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQE 896

BLAST of Sgr022873 vs. NCBI nr
Match: XP_022934485.1 (uncharacterized protein LOC111441649 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1381.3 bits (3574), Expect = 0.0e+00
Identity = 700/909 (77.01%), Positives = 769/909 (84.60%), Query Frame = 0

Query: 7   VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIK 66
           +E  P  AEAS  T CK+GKKK V  E  E Q RA KKK GATSV+E Q TGRLD   +K
Sbjct: 1   MEELPHQAEASMGTSCKKGKKKSVSLE--EPQKRA-KKKGGATSVNEVQPTGRLDDSRVK 60

Query: 67  VSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTV 126
           VSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRFSSST FLREW++YNYE KTV
Sbjct: 61  VSEFDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTV 120

Query: 127 KFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWC 186
           KF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  AT SLD+RNF+M+VGGPVWA+DWC
Sbjct: 121 KFTSDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWC 180

Query: 187 PQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA 246
           P VHERTDSLIKCEFIAVSAHPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E T+A 
Sbjct: 181 PLVHERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSAT 240

Query: 247 KGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM 306
           + K         SDL SQPKRPRGRPPG KKNGAS LPSQPKRPRGRPKKK+EE N DN 
Sbjct: 241 ECK--------DSDL-SQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNK 300

Query: 307 GDNYQFVQALSIEYPVVPK----------NSEELVLLENSVERQKCTLQEVSTCNSEDEV 366
             +YQ VQ LS+EYP V            NSE+ V LENSVER   T++E+STCNSEDEV
Sbjct: 301 VASYQLVQPLSVEYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEV 360

Query: 367 PAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNIS 426
           P QKRRVRR A TKNH+DD+G L+L ENREDGSN  N +ANENV S+ SGEDT LC NIS
Sbjct: 361 PVQKRRVRRNADTKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNIS 420

Query: 427 KNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLL 486
           + A+LDT S  FSIP +VALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLL
Sbjct: 421 EKAILDTGSTGFSIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLL 480

Query: 487 GNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSS 546
           GNGSLEVWEVPFPHVVKAIYSK NGEG DPRFVKLKP F+CSMLRSA+TQSIPLTVEWS 
Sbjct: 481 GNGSLEVWEVPFPHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSP 540

Query: 547 TPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV 606
           TPPYDYL AGCHDGTVALWKFSA+ST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV
Sbjct: 541 TPPYDYLLAGCHDGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENV 600

Query: 607 VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKA 666
           +L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKA
Sbjct: 601 ILIASHGGIKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKA 660

Query: 667 AYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAV 726
           AYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGMVAYCGADGAV RFQLTT+AV
Sbjct: 661 AYDVPVTGQPFTAIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAV 720

Query: 727 EKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQS 786
           +K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKLSNKS+ PLSMRAILSDS+Q 
Sbjct: 721 DKENSRNRTPHFVCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQP 780

Query: 787 NEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALE 846
           NE N K+A  SA ENE+ AL Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE
Sbjct: 781 NEGNDKSATTSALENES-ALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELE 840

Query: 847 CSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE 903
            S EP+D+Q   DV+PG G  FE FPPKSVA+HR+RWNMN GSERWL YGGAAGILRCQE
Sbjct: 841 HSHEPSDSQTDDDVVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQE 896

BLAST of Sgr022873 vs. ExPASy Swiss-Prot
Match: Q8BL74 (General transcription factor 3C polypeptide 2 OS=Mus musculus OX=10090 GN=Gtf3c2 PE=2 SV=2)

HSP 1 Score: 52.0 bits (123), Expect = 4.3e-05
Identity = 25/72 (34.72%), Positives = 39/72 (54.17%), Query Frame = 0

Query: 173 YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHG 232
           + GGP+WALDWCP       S    +++A+ + P  +  H +    +G G++Q+W L  G
Sbjct: 388 FTGGPLWALDWCPVPEGSAAS----QYVALFSSPDMNETHPLSQLHSGPGLLQLWGL--G 447

Query: 233 TENHEAEPTNAA 245
           T   E+ P N A
Sbjct: 448 TLQQESCPGNRA 453

BLAST of Sgr022873 vs. ExPASy Swiss-Prot
Match: Q8WUA4 (General transcription factor 3C polypeptide 2 OS=Homo sapiens OX=9606 GN=GTF3C2 PE=1 SV=2)

HSP 1 Score: 51.2 bits (121), Expect = 7.4e-05
Identity = 26/72 (36.11%), Positives = 41/72 (56.94%), Query Frame = 0

Query: 173 YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHG 232
           + GGP+WALDWCP V E   +    +++A+ + P  +  H +    +G G++Q+W L  G
Sbjct: 392 FTGGPLWALDWCP-VPEGAGA---SQYVALFSSPDMNETHPLSQLHSGPGLLQLWGL--G 451

Query: 233 TENHEAEPTNAA 245
           T   E+ P N A
Sbjct: 452 TLQQESCPGNRA 457

BLAST of Sgr022873 vs. ExPASy Swiss-Prot
Match: Q5RDC3 (General transcription factor 3C polypeptide 2 OS=Pongo abelii OX=9601 GN=GTF3C2 PE=2 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 7.4e-05
Identity = 26/72 (36.11%), Positives = 41/72 (56.94%), Query Frame = 0

Query: 173 YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHG 232
           + GGP+WALDWCP V E   +    +++A+ + P  +  H +    +G G++Q+W L  G
Sbjct: 392 FTGGPLWALDWCP-VPEGAGA---SQYVALFSSPDMNETHPLSQLHSGPGLLQLWGL--G 451

Query: 233 TENHEAEPTNAA 245
           T   E+ P N A
Sbjct: 452 TLQQESCPGNRA 457

BLAST of Sgr022873 vs. ExPASy TrEMBL
Match: A0A6J1CU50 (uncharacterized protein LOC111014310 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111014310 PE=4 SV=1)

HSP 1 Score: 1524.6 bits (3946), Expect = 0.0e+00
Identity = 767/915 (83.83%), Positives = 809/915 (88.42%), Query Frame = 0

Query: 2   EHHHQVE-VEP--PAAEASTGCKRGKKK-PVGREKKEAQGRATKKKAGATSVDEEQSTGR 61
           EHHH VE VEP  PAA  STG KRGK+K PV R+KKEA GRA KKK G  S DEEQ TGR
Sbjct: 3   EHHHPVEVVEPPVPAASTSTGGKRGKRKQPVARQKKEAPGRA-KKKPGGASADEEQPTGR 62

Query: 62  LDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYY 121
           LDG+ IKV EFDHC ENHF A+DTI++LCGEAEDGDGGIDESDIQRFSSS  FLREWR+Y
Sbjct: 63  LDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSAFFLREWRFY 122

Query: 122 NYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGP 181
           NYE KTVKFASD RG EGKD DITINLPQFSSAAVLKNGTPS A TSLDWRNFVMYVGGP
Sbjct: 123 NYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGP 182

Query: 182 VWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHE 241
           VWALDWCPQV E+TD+LIKCEFIAVSAHPPGSSYHKMGTPL GRGMVQIWCL+HGTENHE
Sbjct: 183 VWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHE 242

Query: 242 AEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKE 301
            EP  A K KS PKK EVSSDLSSQPKRPRGRPPG KK GASDLPSQPKRPRGRPKKK+E
Sbjct: 243 PEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQE 302

Query: 302 ESNDNMGDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTC 361
            SNDNMGDN Q VQ+LS+EYP            PKNSEEL+LL NSVERQK TLQ VSTC
Sbjct: 303 GSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTC 362

Query: 362 NSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTL 421
           NS+DE PAQKRRVRRK GTKNHIDDMG L  T NREDGS+ I+ Q NENVIS+ SGEDTL
Sbjct: 363 NSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTL 422

Query: 422 LCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMG 481
           LCNNISKNA       EFSIP SVALPRVVLCLAHNGKVAWDLKWKP NA T+ CKHRMG
Sbjct: 423 LCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMG 482

Query: 482 YLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPL 541
           YLAVLLGNGSLEVWE+PFPHVVKAIYSKFN EG DPRFVKLKPIF+ +ML+SAN QSIPL
Sbjct: 483 YLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPL 542

Query: 542 TVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESD 601
           TVEWSSTPPYDYLFAGC+DGTVALWKFSANSTCEDTRPLLRFSADTVPIR VAWAP+ESD
Sbjct: 543 TVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEDTRPLLRFSADTVPIRRVAWAPNESD 602

Query: 602 PESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRL 661
           PESANVVLTA HGGLKFWDLRDPFRPLWD+HPAPRMIYSLDWLPDPRCVILSFDDGTLRL
Sbjct: 603 PESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRL 662

Query: 662 LSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQ 721
           LSLLKAAYDVPVTGKPFTGTKQQGLHSY  SSFAIWSVQVSRQTGMVAYC ADGAV RFQ
Sbjct: 663 LSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQ 722

Query: 722 LTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAIL 781
           LTTRAVEKDHSR+RTPHF+CEYLTEE+S ITIHSPAS VPFPLKK SNKSD PLS RAIL
Sbjct: 723 LTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAIL 782

Query: 782 SDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGV 841
           SDSI+SNE NHKTA A+ASENE LA+ + NDVS++SGSEDTLMS+KK+NQTQSK KKK V
Sbjct: 783 SDSIESNEGNHKTATATASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEV 842

Query: 842 DNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAG 901
           D+QALECSDEPNDAQ K D LPGSG++FE FPPKSVA+HRVRWNMNTGSERWLCYGG AG
Sbjct: 843 DSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAG 902

Query: 902 ILRCQEIVLSDLDKK 903
           I+RCQEIVLSD DKK
Sbjct: 903 IIRCQEIVLSDFDKK 909

BLAST of Sgr022873 vs. ExPASy TrEMBL
Match: A0A6J1F7U5 (uncharacterized protein LOC111441649 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441649 PE=4 SV=1)

HSP 1 Score: 1381.3 bits (3574), Expect = 0.0e+00
Identity = 700/909 (77.01%), Positives = 769/909 (84.60%), Query Frame = 0

Query: 7   VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIK 66
           +E  P  AEAS  T CK+GKKK V  E  E Q RA KKK GATSV+E Q TGRLD   +K
Sbjct: 1   MEELPHQAEASMGTSCKKGKKKSVSLE--EPQKRA-KKKGGATSVNEVQPTGRLDDSRVK 60

Query: 67  VSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTV 126
           VSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRFSSST FLREW++YNYE KTV
Sbjct: 61  VSEFDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTV 120

Query: 127 KFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWC 186
           KF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  AT SLD+RNF+M+VGGPVWA+DWC
Sbjct: 121 KFTSDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWC 180

Query: 187 PQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA 246
           P VHERTDSLIKCEFIAVSAHPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E T+A 
Sbjct: 181 PLVHERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSAT 240

Query: 247 KGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM 306
           + K         SDL SQPKRPRGRPPG KKNGAS LPSQPKRPRGRPKKK+EE N DN 
Sbjct: 241 ECK--------DSDL-SQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNK 300

Query: 307 GDNYQFVQALSIEYPVVPK----------NSEELVLLENSVERQKCTLQEVSTCNSEDEV 366
             +YQ VQ LS+EYP V            NSE+ V LENSVER   T++E+STCNSEDEV
Sbjct: 301 VASYQLVQPLSVEYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEV 360

Query: 367 PAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNIS 426
           P QKRRVRR A TKNH+DD+G L+L ENREDGSN  N +ANENV S+ SGEDT LC NIS
Sbjct: 361 PVQKRRVRRNADTKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNIS 420

Query: 427 KNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLL 486
           + A+LDT S  FSIP +VALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLL
Sbjct: 421 EKAILDTGSTGFSIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLL 480

Query: 487 GNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSS 546
           GNGSLEVWEVPFPHVVKAIYSK NGEG DPRFVKLKP F+CSMLRSA+TQSIPLTVEWS 
Sbjct: 481 GNGSLEVWEVPFPHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSP 540

Query: 547 TPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV 606
           TPPYDYL AGCHDGTVALWKFSA+ST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV
Sbjct: 541 TPPYDYLLAGCHDGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENV 600

Query: 607 VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKA 666
           +L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKA
Sbjct: 601 ILIASHGGIKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKA 660

Query: 667 AYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAV 726
           AYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGMVAYCGADGAV RFQLTT+AV
Sbjct: 661 AYDVPVTGQPFTAIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAV 720

Query: 727 EKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQS 786
           +K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKLSNKS+ PLSMRAILSDS+Q 
Sbjct: 721 DKENSRNRTPHFVCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQP 780

Query: 787 NEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALE 846
           NE N K+A  SA ENE+ AL Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE
Sbjct: 781 NEGNDKSATTSALENES-ALCYDDDVDVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELE 840

Query: 847 CSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE 903
            S EP+D+Q   DV+PG G  FE FPPKSVA+HR+RWNMN GSERWL YGGAAGILRCQE
Sbjct: 841 HSHEPSDSQTDDDVVPGLGEHFENFPPKSVALHRLRWNMNIGSERWLSYGGAAGILRCQE 896

BLAST of Sgr022873 vs. ExPASy TrEMBL
Match: A0A6J1J0H6 (uncharacterized protein LOC111481574 OS=Cucurbita maxima OX=3661 GN=LOC111481574 PE=4 SV=1)

HSP 1 Score: 1338.6 bits (3463), Expect = 0.0e+00
Identity = 678/908 (74.67%), Positives = 746/908 (82.16%), Query Frame = 0

Query: 7   VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIK 66
           +E  P  AEAS  T CK+GKKK V  E+     +  KKKAGATSV+E Q TGRLD   +K
Sbjct: 1   MEELPHQAEASMGTSCKKGKKKSVSLEEPL---KRAKKKAGATSVNEVQPTGRLDDFRVK 60

Query: 67  VSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTV 126
           VSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRFSSST FLREW++YNYE KTV
Sbjct: 61  VSEFDHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTV 120

Query: 127 KFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWC 186
           KF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  ATTSLD+RNF+M+VGGPVWA+DWC
Sbjct: 121 KFTSDSRVPEGKDADITMELPQFSSAAVLKNGAPPGATTSLDFRNFIMHVGGPVWAIDWC 180

Query: 187 PQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA 246
           P VHERTDSLIKCEFIAVSAHPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E TNA 
Sbjct: 181 PLVHERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTNAT 240

Query: 247 KGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMG 306
           + K        +SDL SQPKRPRGRPPG KKNGAS L SQ KRPRGRPKKK+EE NDN  
Sbjct: 241 ECK--------ASDL-SQPKRPRGRPPGRKKNGASALSSQQKRPRGRPKKKQEEPNDNEV 300

Query: 307 DNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVP 366
            +YQ VQ LS+EYP           VP NSE+LV LENSVER   T++E+STCNSEDEVP
Sbjct: 301 ASYQLVQPLSVEYPDVSSNLLEIDDVPHNSEKLVSLENSVERGSSTIEEISTCNSEDEVP 360

Query: 367 AQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNISK 426
            QKRR RR A TKNH+DD+G                                 LC NIS+
Sbjct: 361 VQKRRERRNADTKNHVDDVG--------------------------------TLCKNISE 420

Query: 427 NAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLG 486
           NA+LDT S  FSIP SVALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLLG
Sbjct: 421 NAILDTGSTGFSIPESVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLG 480

Query: 487 NGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSST 546
           NGSLEVWE+PFPHVVKAIYS  NGEG DPRFVKLKP F+CSMLRSA+TQSIPLTVEWS T
Sbjct: 481 NGSLEVWEIPFPHVVKAIYSNLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPT 540

Query: 547 PPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVV 606
           PPYDYL AGCHDGTVALWKFSANST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV+
Sbjct: 541 PPYDYLLAGCHDGTVALWKFSANSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVI 600

Query: 607 LTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAA 666
           L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAA
Sbjct: 601 LIASHGGIKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSLLKAA 660

Query: 667 YDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVE 726
           YDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGMVAYCGADGAV RFQLTT+AV+
Sbjct: 661 YDVPVTGQPFTAIKQKGLHTYCCSPFAIWSIQVSRQTGMVAYCGADGAVVRFQLTTKAVD 720

Query: 727 KDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSN 786
           K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKLSNKS+ PLSMRAILSDS+Q N
Sbjct: 721 KENSRNRTPHFVCEYLTEEQSIITIHSPASDVPIPLKKLSNKSEQPLSMRAILSDSMQPN 780

Query: 787 EENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALEC 846
           E N K+A  SA ENE+ AL Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE 
Sbjct: 781 EGNDKSATTSALENES-ALCYDDDVGVESGSEDTPMSIQNKNQTQSKSKKKGVVNQELEH 840

Query: 847 SDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEI 903
           S EP+D+Q   DV+PG G+ FE FPPKSVA+HR+RWNMN GSERWLCYGGAAGILRCQEI
Sbjct: 841 SHEPSDSQTDDDVVPGLGDHFENFPPKSVALHRLRWNMNIGSERWLCYGGAAGILRCQEI 863

BLAST of Sgr022873 vs. ExPASy TrEMBL
Match: A0A6J1KYL7 (uncharacterized protein LOC111498749 OS=Cucurbita maxima OX=3661 GN=LOC111498749 PE=4 SV=1)

HSP 1 Score: 1333.2 bits (3449), Expect = 0.0e+00
Identity = 679/893 (76.04%), Positives = 743/893 (83.20%), Query Frame = 0

Query: 12  PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHC 71
           P A   TGC +GKKKPV R+KKE++ RA KKK  ATSV+EEQ+ GR DG  IKV EFDHC
Sbjct: 10  PEASNGTGCVKGKKKPVSRKKKESENRA-KKKPEATSVNEEQAAGRSDGPGIKVLEFDHC 69

Query: 72  VENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTVKFASDSR 131
           VENHF AIDT+++LCG+ EDG G IDE DIQRFSSSTIFLREWRYYNYE KTVKFASDSR
Sbjct: 70  VENHFKAIDTMAELCGDREDGVGEIDERDIQRFSSSTIFLREWRYYNYEPKTVKFASDSR 129

Query: 132 GPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERT 191
           GPE KDADI++NLPQFSSAA LKNG P EA TSLD+RNFVMYVGGP+WALDWCPQ HERT
Sbjct: 130 GPEDKDADISVNLPQFSSAAFLKNGAPPEAATSLDFRNFVMYVGGPIWALDWCPQDHERT 189

Query: 192 DSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPK 251
           DSLIKCE+IAVSAHPP SSYHKMG PLTGRGMVQIWC++HGTE+HEAEPT         K
Sbjct: 190 DSLIKCEYIAVSAHPPCSSYHKMGIPLTGRGMVQIWCVVHGTESHEAEPT---------K 249

Query: 252 KVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMGDNYQFVQ 311
               SS+LSSQP++PRGRPPG KKN AS+LPSQPKRPRGRP KKK+ESND          
Sbjct: 250 NSVASSNLSSQPRKPRGRPPGRKKNEASNLPSQPKRPRGRP-KKKQESND---------- 309

Query: 312 ALSIEYPVVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHID 371
                               NSVER   TLQEV TCNS+DEV AQK+R+RRK  TKNH+D
Sbjct: 310 --------------------NSVERS--TLQEVPTCNSDDEVLAQKKRMRRKVETKNHVD 369

Query: 372 DMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSV 431
           D+G L LTENRE+ SN INLQANENVIS+ SGEDTLLCNN+S+NA LD SSI FSIP SV
Sbjct: 370 DVGTLALTENRENESNAINLQANENVISEYSGEDTLLCNNVSENAGLDPSSI-FSIPESV 429

Query: 432 ALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKA 491
           ALPRVVLCLAHNGKVAWDLKWKP NA  +KCKHRMGYLA+LLGNGSLEVWEVPFPHV++A
Sbjct: 430 ALPRVVLCLAHNGKVAWDLKWKPTNACDAKCKHRMGYLALLLGNGSLEVWEVPFPHVMRA 489

Query: 492 IYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVAL 551
           IYSK NGEG DPRFVKLKP F+CSMLRSAN QSIPLTVEWSSTPPYDYL AGCHDGTVAL
Sbjct: 490 IYSKCNGEGTDPRFVKLKPTFRCSMLRSANKQSIPLTVEWSSTPPYDYLLAGCHDGTVAL 549

Query: 552 WKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPF 611
           WKFSANS CEDTRPLLRFSADTVPIR VAWAP++SDPE  NV+LTAGHGGLKFWDLRDPF
Sbjct: 550 WKFSANSPCEDTRPLLRFSADTVPIRGVAWAPNDSDPECENVILTAGHGGLKFWDLRDPF 609

Query: 612 RPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQG 671
           RPLWDLHPAPR+IYSLDWL DPRC ILSFDDGTLRLLSL KAA DVPVTGKPFT  KQ+G
Sbjct: 610 RPLWDLHPAPRIIYSLDWLSDPRCTILSFDDGTLRLLSLPKAACDVPVTGKPFTRVKQKG 669

Query: 672 LHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLT 731
           LH+YCC+S AIWSVQVSRQTGMVAYCGADG V RFQLTT+AV K++SR+RTP FVC+Y T
Sbjct: 670 LHTYCCTSTAIWSVQVSRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFT 729

Query: 732 EEQSTITIHSPASDVPFPLKKLSNKSD-PPLSMRAILSDSIQSNEENHKTAMASASENET 791
           EEQSTITIH+P  DVPFPLKK+SN+ D PPLSMRAILS+ IQSNE NHKTA AS SEN T
Sbjct: 730 EEQSTITIHTPELDVPFPLKKMSNRPDPPPLSMRAILSE-IQSNEGNHKTAAASLSENGT 789

Query: 792 LALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKG-VDNQALECSDEPNDAQAKADVLP 851
           LA  + +D ++ESGSEDT  SI K+++TQSK KKKG  DNQ  ECSDE ND        P
Sbjct: 790 LAPCFDDDFNVESGSEDTPTSINKKSKTQSKCKKKGEKDNQESECSDEANDVPTNNGGAP 849

Query: 852 GSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEIVLSDLDKK 903
           GSG+  E  PPKSVA+HRVRWNMNTGSERWLCYGGAAG+LRCQEI+LS LDKK
Sbjct: 850 GSGDSPENLPPKSVAVHRVRWNMNTGSERWLCYGGAAGLLRCQEILLSALDKK 857

BLAST of Sgr022873 vs. ExPASy TrEMBL
Match: A0A6J1EW91 (uncharacterized protein LOC111437104 OS=Cucurbita moschata OX=3662 GN=LOC111437104 PE=4 SV=1)

HSP 1 Score: 1330.1 bits (3441), Expect = 0.0e+00
Identity = 677/893 (75.81%), Positives = 740/893 (82.87%), Query Frame = 0

Query: 12  PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHC 71
           P A   TGC +GKKKPV R+KKE++ RA KKK  ATSV+EEQ  GR DG  IKV EFD+C
Sbjct: 10  PEASNGTGCVKGKKKPVSRKKKESENRA-KKKPEATSVNEEQPAGRSDGPGIKVLEFDYC 69

Query: 72  VENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTVKFASDSR 131
           VENHF AIDT+++LCG+ EDG G IDESDIQRFSSSTIFLREWRYYNYE KTVKFASDS 
Sbjct: 70  VENHFKAIDTMAELCGDREDGVGEIDESDIQRFSSSTIFLREWRYYNYEPKTVKFASDSS 129

Query: 132 GPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERT 191
           GPE KDADI++NLPQFSSAA LKNG P EA TS+D+RNFVMYVGGP+WALDWCPQ HERT
Sbjct: 130 GPEDKDADISVNLPQFSSAAFLKNGAPPEAATSMDFRNFVMYVGGPIWALDWCPQDHERT 189

Query: 192 DSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPK 251
           DSLIKCE+IAVSAHPP SSYHKMG PLTGRGMVQIWC++HGTE HEAEPT         K
Sbjct: 190 DSLIKCEYIAVSAHPPCSSYHKMGIPLTGRGMVQIWCVVHGTECHEAEPT---------K 249

Query: 252 KVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMGDNYQFVQ 311
               SS+LSSQP++PRGRPPG KKN AS+LPSQPKRPRGRP KKK+ESND          
Sbjct: 250 NSVASSNLSSQPRKPRGRPPGRKKNEASNLPSQPKRPRGRP-KKKQESND---------- 309

Query: 312 ALSIEYPVVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHID 371
                               NSVER   TLQEV TCNS+DEV AQK+RVRRK  TKNH+D
Sbjct: 310 --------------------NSVERS--TLQEVPTCNSDDEVLAQKKRVRRKVETKNHVD 369

Query: 372 DMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSV 431
           D+GML+LTENRE+ SN INLQANENVIS+ SGEDTLLCNN+S+NA LD SSIEFSIP SV
Sbjct: 370 DVGMLSLTENRENESNAINLQANENVISEYSGEDTLLCNNVSENAGLDPSSIEFSIPESV 429

Query: 432 ALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKA 491
           ALPRVVLCLAHNGKVAWDLKWKP NA  +KCKHRMGYLAVLLGNGSLEVWEVPFPHV++A
Sbjct: 430 ALPRVVLCLAHNGKVAWDLKWKPTNACDAKCKHRMGYLAVLLGNGSLEVWEVPFPHVMRA 489

Query: 492 IYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVAL 551
           IYSK NGEG DPRFVKL P F+CSMLRSAN QSIPLTVEWSSTPPYDYL AGCHDGTVAL
Sbjct: 490 IYSKCNGEGTDPRFVKLNPTFRCSMLRSANKQSIPLTVEWSSTPPYDYLLAGCHDGTVAL 549

Query: 552 WKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPF 611
           WKF+ANS CEDTRPLLRFSADTVPIR VAWAP++SDPE  NV+LTAGHGGLKFWDLRDPF
Sbjct: 550 WKFAANSPCEDTRPLLRFSADTVPIRGVAWAPNDSDPECENVILTAGHGGLKFWDLRDPF 609

Query: 612 RPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQG 671
           RPLWDLHPAPR+IYSLDWL DP C ILSFDDGTLRLLSL KAA DVPVTGKPFT  KQ+G
Sbjct: 610 RPLWDLHPAPRIIYSLDWLSDPSCAILSFDDGTLRLLSLPKAACDVPVTGKPFTRVKQKG 669

Query: 672 LHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLT 731
           LH+YCC+S AIWSVQVSRQTGMVAYCGADG V RFQLTT+AV K++SR+RTP FVC+Y T
Sbjct: 670 LHTYCCTSTAIWSVQVSRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFT 729

Query: 732 EEQSTITIHSPASDVPFPLKKLSNKSD-PPLSMRAILSDSIQSNEENHKTAMASASENET 791
           EEQSTITIH+P  DVPFPLKK+SN+ D PPLSMRAILS+ IQSNE NHKTA AS SEN T
Sbjct: 730 EEQSTITIHTPELDVPFPLKKMSNRPDPPPLSMRAILSE-IQSNEGNHKTAAASLSENGT 789

Query: 792 LALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKG-VDNQALECSDEPNDAQAKADVLP 851
           LA    +D ++ESGSEDT   I K+++TQSK KKKG  DNQ  ECSDE ND        P
Sbjct: 790 LAPCSDDDFNVESGSEDTPTPINKKSKTQSKCKKKGEKDNQESECSDEANDVPTNNGGAP 849

Query: 852 GSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEIVLSDLDKK 903
           GSG+  E  PPKSVA+HRVRWNMNTGSERWLCYGGAAG+LRCQEI+LS LDKK
Sbjct: 850 GSGDSPENLPPKSVAVHRVRWNMNTGSERWLCYGGAAGLLRCQEILLSALDKK 858

BLAST of Sgr022873 vs. TAIR 10
Match: AT1G19485.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 737.3 bits (1902), Expect = 1.6e-212
Identity = 412/861 (47.85%), Positives = 545/861 (63.30%), Query Frame = 0

Query: 58  LDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYY 117
           +DG    +S FD+  E+H  A+++I+ LCGEA   +  IDE+DI   SSS  FLREWR+Y
Sbjct: 1   MDGEECNISLFDYSAESHLKAVESITDLCGEA---NADIDENDINILSSSVTFLREWRHY 60

Query: 118 NYESKTVKFASDS-RGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLD--WRNFVMYV 177
           N+E K+  F +++ +  + KD + +  LPQFSSA   K     + ++S     ++FVM+V
Sbjct: 61  NFEPKSFAFYNEAEKNHQPKDIN-SQTLPQFSSARAPKVKIHDDESSSSGEISKDFVMHV 120

Query: 178 GGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGT- 237
           GG VWA++WCP+VH   D+  KCEF+AV+ HPP S  HK+G PL GRG++QIWC+++ T 
Sbjct: 121 GGSVWAMEWCPRVHGNPDAQAKCEFLAVATHPPDSYSHKIGIPLIGRGIIQIWCIINATC 180

Query: 238 ENHEAEPTNAAKGKSNPKKVEVSSDL--SSQPKRPRGRPPGPKKNGASDLPSQPKRPRGR 297
           +    + ++  K  +   + + S +   +++PK+PRGR   P+K+      ++PK+PRGR
Sbjct: 181 KKDSGQVSDKGKKLTGKSRKQPSGETTETTEPKKPRGR---PRKHPVE--TTEPKKPRGR 240

Query: 298 PKKKK-EESNDNMGDNYQFVQALSIEYP---VVPKNSEELVLLENSVERQKC----TLQE 357
           P+KK   E    + D+  +V+ALS+ YP   VVP      +L E  V   K     + Q 
Sbjct: 241 PRKKSTAELPVELDDDVLYVEALSVRYPENSVVPATPLR-ILRETPVTETKVNNEGSGQV 300

Query: 358 VSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANE---NVISK 417
           +S+ N+  ++P +++R +                 T++ E+    + L+ +E   NV SK
Sbjct: 301 LSSDNANIKLPVRRKRQK-----------------TKSTEESCTPMILEYSEAVGNVPSK 360

Query: 418 NSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTS 477
            S       + IS++               VALPRVVLCLAHNGKV WD+KW+P  A  S
Sbjct: 361 PS-------SGISEDI--------------VALPRVVLCLAHNGKVVWDMKWRPSYAGDS 420

Query: 478 KCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSA 537
             KH MGYLAVLLGNGSLEVW+VP P    A+Y        DPRFVKL P+FKCS L+  
Sbjct: 421 LNKHSMGYLAVLLGNGSLEVWDVPMPKATSALYLSSKKAATDPRFVKLAPVFKCSNLKCG 480

Query: 538 NTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVA 597
           +T+SIPLTVEWS+    D+L AGCHDGTVALWKFS   + EDTRPLL FSADT PIRAVA
Sbjct: 481 DTKSIPLTVEWSTLGNPDFLLAGCHDGTVALWKFSTTKSSEDTRPLLFFSADTAPIRAVA 540

Query: 598 WAPSESDPESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSF 657
           WAP ESD ESAN+V TAGH GLKFWDLRDPFRPLWDLHP PR IYSLDWL DP CV+LSF
Sbjct: 541 WAPGESDQESANIVATAGHAGLKFWDLRDPFRPLWDLHPVPRFIYSLDWLQDPSCVLLSF 600

Query: 658 DDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGAD 717
           DDGTLR+LSL+K AYDVP TG+P+  TKQQGL  Y CS+F IWS+QVSR TG+ AYC AD
Sbjct: 601 DDGTLRILSLVKVAYDVPATGRPYPNTKQQGLSVYNCSTFPIWSIQVSRLTGIAAYCTAD 660

Query: 718 GAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKK-LSNKSDP 777
           G++  F+LTT+AVEKD +R+RTPH++C  LT + ST  +HSP  D+P  LKK +    + 
Sbjct: 661 GSIFHFELTTKAVEKD-TRNRTPHYLCGQLTMKDSTFIVHSPVPDIPIVLKKPVGETGEK 720

Query: 778 PLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQ 837
              +R++L++S      N        S+ + LA ++  D  +ES SE T     K    +
Sbjct: 721 QRCLRSLLNESPSRYASN-------VSDVQPLAFAHVEDPGLESESEGTNNKAAKSKAKK 780

Query: 838 SKGKKKGVDNQ---ALECSDE---PNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMN 895
            K   +  +++   AL C  E     + + KA     +G   E FPPK VAMHRVRWNMN
Sbjct: 781 GKNNARAEEDENSRALVCVKEDGGEEEGRRKAASNNSNGMKAEGFPPKMVAMHRVRWNMN 805

BLAST of Sgr022873 vs. TAIR 10
Match: AT1G19485.2 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 737.3 bits (1902), Expect = 1.6e-212
Identity = 412/861 (47.85%), Positives = 545/861 (63.30%), Query Frame = 0

Query: 58  LDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYY 117
           +DG    +S FD+  E+H  A+++I+ LCGEA   +  IDE+DI   SSS  FLREWR+Y
Sbjct: 1   MDGEECNISLFDYSAESHLKAVESITDLCGEA---NADIDENDINILSSSVTFLREWRHY 60

Query: 118 NYESKTVKFASDS-RGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLD--WRNFVMYV 177
           N+E K+  F +++ +  + KD + +  LPQFSSA   K     + ++S     ++FVM+V
Sbjct: 61  NFEPKSFAFYNEAEKNHQPKDIN-SQTLPQFSSARAPKVKIHDDESSSSGEISKDFVMHV 120

Query: 178 GGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGT- 237
           GG VWA++WCP+VH   D+  KCEF+AV+ HPP S  HK+G PL GRG++QIWC+++ T 
Sbjct: 121 GGSVWAMEWCPRVHGNPDAQAKCEFLAVATHPPDSYSHKIGIPLIGRGIIQIWCIINATC 180

Query: 238 ENHEAEPTNAAKGKSNPKKVEVSSDL--SSQPKRPRGRPPGPKKNGASDLPSQPKRPRGR 297
           +    + ++  K  +   + + S +   +++PK+PRGR   P+K+      ++PK+PRGR
Sbjct: 181 KKDSGQVSDKGKKLTGKSRKQPSGETTETTEPKKPRGR---PRKHPVE--TTEPKKPRGR 240

Query: 298 PKKKK-EESNDNMGDNYQFVQALSIEYP---VVPKNSEELVLLENSVERQKC----TLQE 357
           P+KK   E    + D+  +V+ALS+ YP   VVP      +L E  V   K     + Q 
Sbjct: 241 PRKKSTAELPVELDDDVLYVEALSVRYPENSVVPATPLR-ILRETPVTETKVNNEGSGQV 300

Query: 358 VSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANE---NVISK 417
           +S+ N+  ++P +++R +                 T++ E+    + L+ +E   NV SK
Sbjct: 301 LSSDNANIKLPVRRKRQK-----------------TKSTEESCTPMILEYSEAVGNVPSK 360

Query: 418 NSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTS 477
            S       + IS++               VALPRVVLCLAHNGKV WD+KW+P  A  S
Sbjct: 361 PS-------SGISEDI--------------VALPRVVLCLAHNGKVVWDMKWRPSYAGDS 420

Query: 478 KCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSA 537
             KH MGYLAVLLGNGSLEVW+VP P    A+Y        DPRFVKL P+FKCS L+  
Sbjct: 421 LNKHSMGYLAVLLGNGSLEVWDVPMPKATSALYLSSKKAATDPRFVKLAPVFKCSNLKCG 480

Query: 538 NTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVA 597
           +T+SIPLTVEWS+    D+L AGCHDGTVALWKFS   + EDTRPLL FSADT PIRAVA
Sbjct: 481 DTKSIPLTVEWSTLGNPDFLLAGCHDGTVALWKFSTTKSSEDTRPLLFFSADTAPIRAVA 540

Query: 598 WAPSESDPESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSF 657
           WAP ESD ESAN+V TAGH GLKFWDLRDPFRPLWDLHP PR IYSLDWL DP CV+LSF
Sbjct: 541 WAPGESDQESANIVATAGHAGLKFWDLRDPFRPLWDLHPVPRFIYSLDWLQDPSCVLLSF 600

Query: 658 DDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGAD 717
           DDGTLR+LSL+K AYDVP TG+P+  TKQQGL  Y CS+F IWS+QVSR TG+ AYC AD
Sbjct: 601 DDGTLRILSLVKVAYDVPATGRPYPNTKQQGLSVYNCSTFPIWSIQVSRLTGIAAYCTAD 660

Query: 718 GAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKK-LSNKSDP 777
           G++  F+LTT+AVEKD +R+RTPH++C  LT + ST  +HSP  D+P  LKK +    + 
Sbjct: 661 GSIFHFELTTKAVEKD-TRNRTPHYLCGQLTMKDSTFIVHSPVPDIPIVLKKPVGETGEK 720

Query: 778 PLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQ 837
              +R++L++S      N        S+ + LA ++  D  +ES SE T     K    +
Sbjct: 721 QRCLRSLLNESPSRYASN-------VSDVQPLAFAHVEDPGLESESEGTNNKAAKSKAKK 780

Query: 838 SKGKKKGVDNQ---ALECSDE---PNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMN 895
            K   +  +++   AL C  E     + + KA     +G   E FPPK VAMHRVRWNMN
Sbjct: 781 GKNNARAEEDENSRALVCVKEDGGEEEGRRKAASNNSNGMKAEGFPPKMVAMHRVRWNMN 805
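
The percentages in the HSP header above follow directly from the raw counts: identical and positive positions divided by the number of aligned columns. A minimal Python sketch of that arithmetic, assuming the header keeps the `n/m` fraction layout shown above; the function name `parse_hsp_stats` is hypothetical and not part of any BLAST tool:

```python
import re

def parse_hsp_stats(header: str) -> dict:
    """Recompute identity/positive percentages from the 'n/m' fractions in an HSP header.

    Assumes the first fraction is the identity count and the second the
    positives count, as in the header line shown above.
    """
    fractions = [(int(a), int(b)) for a, b in re.findall(r"(\d+)/(\d+)", header)]
    (identical, columns), (positive, _) = fractions[:2]
    return {
        "aligned_columns": columns,
        "identity_pct": round(100 * identical / columns, 2),
        "positive_pct": round(100 * positive / columns, 2),
    }

# Header text copied from the AT1G19485.2 HSP above.
header = "Identity = 412/861 (47.85%), Positives = 545/861 (63.30%), Query Frame = 0"
stats = parse_hsp_stats(header)
print(stats)
```

The recomputed values agree with the reported 47.85% identity and 63.30% positives.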

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022144681.1 | 0.0e+00 | 83.83 | uncharacterized protein LOC111014310 isoform X1 [Momordica charantia]
XP_038903194.1 | 0.0e+00 | 77.56 | uncharacterized protein LOC120089853 [Benincasa hispida]
XP_023528187.1 | 0.0e+00 | 77.67 | uncharacterized protein LOC111791176 [Cucurbita pepo subsp. pepo]
KAG7017442.1 | 0.0e+00 | 77.12 | General transcription factor 3C polypeptide 2 [Cucurbita argyrosperma subsp. arg...
XP_022934485.1 | 0.0e+00 | 77.01 | uncharacterized protein LOC111441649 isoform X1 [Cucurbita moschata]

Match Name | E-value | Identity | Description
Q8BL74 | 4.3e-05 | 34.72 | General transcription factor 3C polypeptide 2 OS=Mus musculus OX=10090 GN=Gtf3c2...
Q8WUA4 | 7.4e-05 | 36.11 | General transcription factor 3C polypeptide 2 OS=Homo sapiens OX=9606 GN=GTF3C2 ...
Q5RDC3 | 7.4e-05 | 36.11 | General transcription factor 3C polypeptide 2 OS=Pongo abelii OX=9601 GN=GTF3C2 ...

Match Name | E-value | Identity | Description
A0A6J1CU50 | 0.0e+00 | 83.83 | uncharacterized protein LOC111014310 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1F7U5 | 0.0e+00 | 77.01 | uncharacterized protein LOC111441649 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1J0H6 | 0.0e+00 | 74.67 | uncharacterized protein LOC111481574 OS=Cucurbita maxima OX=3661 GN=LOC111481574...
A0A6J1KYL7 | 0.0e+00 | 76.04 | uncharacterized protein LOC111498749 OS=Cucurbita maxima OX=3661 GN=LOC111498749...
A0A6J1EW91 | 0.0e+00 | 75.81 | uncharacterized protein LOC111437104 OS=Cucurbita moschata OX=3662 GN=LOC1114371...

Match Name | E-value | Identity | Description
AT1G19485.1 | 1.6e-212 | 47.85 | Transducin/WD40 repeat-like superfamily protein
AT1G19485.2 | 1.6e-212 | 47.85 | Transducin/WD40 repeat-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR017956 | AT hook, DNA-binding motif | PRINTS | PR00929 | ATHOOK | coord: 244..254, score: 28.41; coord: 262..273, score: 61.11; coord: 284..294, score: 63.64
IPR001680 | WD40 repeat | SMART | SM00320 | WD40_4 | coord: 562..606, e-value: 0.15, score: 21.2; coord: 513..553, e-value: 6.2, score: 12.0; coord: 668..707, e-value: 17.0, score: 9.2; coord: 610..649, e-value: 8.4, score: 11.2
IPR015943 | WD40/YVTN repeat-like-containing domain superfamily | GENE3D | 2.130.10.10 | | coord: 391..743, e-value: 5.1E-24, score: 87.1
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 19..48
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..60
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 814..838
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 238..305
None | No IPR available | PANTHER | PTHR15052 | RNA POLYMERASE III TRANSCRIPTION INITIATION FACTOR COMPLEX SUBUNIT | coord: 20..894
IPR036322 | WD40-repeat-containing domain superfamily | SUPERFAMILY | 50978 | WD40 repeat-like | coord: 467..711
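
The coordinate ranges in the InterPro table can be cross-checked against one another. The sketch below is a minimal illustration, assuming the `coord` values are 1-based, inclusive residue positions on the predicted protein; the constant names and the `contains` helper are hypothetical. It tests whether the three AT-hook matches fall inside the 238..305 disorder prediction and whether the four SMART WD40 repeats fall inside the two WD40 superfamily domains:

```python
# Residue ranges copied from the InterPro table above
# (assumed 1-based and inclusive).
AT_HOOKS = [(244, 254), (262, 273), (284, 294)]                   # PRINTS PR00929
WD40_REPEATS = [(513, 553), (562, 606), (610, 649), (668, 707)]   # SMART SM00320
DISORDER_REGION = (238, 305)     # one of the four MobiDB-lite predictions
WD40_GENE3D = (391, 743)         # Gene3D 2.130.10.10
WD40_SUPERFAMILY = (467, 711)    # SUPERFAMILY 50978

def contains(outer, inner):
    """Return True if `inner` lies entirely within `outer` (both inclusive)."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

hooks_in_disorder = all(contains(DISORDER_REGION, h) for h in AT_HOOKS)
repeats_in_gene3d = all(contains(WD40_GENE3D, r) for r in WD40_REPEATS)
repeats_in_superfamily = all(contains(WD40_SUPERFAMILY, r) for r in WD40_REPEATS)

print(hooks_in_disorder, repeats_in_gene3d, repeats_in_superfamily)
```

All three checks evaluate to `True` for the coordinates listed, so the predicted AT hooks sit within a disordered stretch and the WD40 repeats sit within both superfamily-level domain calls.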

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr022873.1 | Sgr022873.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006383 | transcription by RNA polymerase III
cellular_component | GO:0000127 | transcription factor TFIIIC complex
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0005515 | protein binding