Bhi06G000202 (gene) Wax gourd (B227) v1

Overview
Name: Bhi06G000202
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: SART-1 family protein DOT2
Location: chr6: 5291820 .. 5299383 (+)
RNA-Seq Expression: Bhi06G000202
Synteny: Bhi06G000202
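
The Location field gives a chromosome, start and end coordinates, and a strand. The sketch below parses that string and computes the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state. `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr6: 5291820 .. 5299383 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not stated on the page).
    return end - start + 1

chrom, start, end, strand = parse_location("chr6: 5291820 .. 5299383 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 7564 bp on the plus strand of chr6.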
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GATTTGGAGGATTTTAGCGTGCTGTCCGGAAAAAGTTTACGTTTTGGGGAATTTCCAGTAAGTATCGAAGTTAATATTGTAAATTGGGCTTTAATCTCTGAAATTTCATTATGCTGAGGTTTGGATCGTGTTATAGGTTTAGAGCAGTTTTGGATCTAAGATAGCTTGGGCAAATTCTTTAAGGTTTTTTGGAGTGAAACTACAGTATGTGTTTAGGTCGTAAATTTTGATACTAATTTAAGTGTATGCTTTAGGATGAGATTGGGTAAGCCAAAGGAGAATTATTCTGTTAATAGTGGGATTTAATTTTGAGGTAAGTGGCTTTACTAATGGAATGTTGGGGAGAGTTTGATCGAGCTTTTGGGTTTGCCAACCAATTTGTACATGATAGATTGTGGGACCTCATTATTGCATGTTGCATATCCTAGTTAGTCCAGTATAGTGACAATTACCCCTCCTACGTACTTCTCAATTTTATCAGAAATAGTATACATCTCAACTTCAAAGAATACTTCAAATAAAAATTAGAAGAATAGAGAGAAACTAACCCATCTTTACCCTTTAACCGAGGATAACCTCTAAATACTAGAACATTTACAGTACAGTAGAGGGAAAGGATAAAACATAAACGTTAACCCCGTACATACAATCTAAAAACAGAACAACTAGCTACATTATTTTTCCTCTCATCTCTGCTTCATATGTAATTTTCTATGCCCACCGTTCCGTCATTGTTTGCTATGAAGACTGCCCCTACTAGTATCACATATTCAATGTTTTTAACTAATGCTCAATGTGTTCTAAAATTTAGCTTTGGTTGTTGATATAGTTTGCTCCAATTTTCTCTTCGGTCTATATCTTTCTGGTTTAATTTATACTTTTTAGGTGTTAATGGAAATTTAGAGGCAAACTATTAAATATTTGCAGATGGATGGGGAACGGTCATCCGCACCTGATGAAAGAAATGGTCCAGATATCGCTTGGGCAAGAGAGCGTGGGGAAGGAGGACATGATGACTTTGGTTATAGTGGAGGAGAAAAGTCAAGTAAACATCGGAGTGAGGATCATCGAAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGAACGATCTAAGAGATCTAGTGATGATGCATCGAAAGAAAAGGAGAAAGAGGCAAAGGATTCAGAAAGGGATCGAGTTCGTAGTCGGGAGAAGAGGAAGGAAGATAGAGATGAGCATGAGAAAGAAAGGAGCAGGGGTAGCAAAGTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAAAGAGAGCTGGAGAAGGACAATGTTCGAGGACAAGACAAAGAGAAGGGAAAGGAGAGAGACAGAGACAGGGACAGAGACAGGGATAGGGATAGGGATAGGGATAGGAAGAAGAAGGACAAGGACAAGGACCGATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGACAAGGAAAGCTATAGGAACATTGACAAGGACAGAGGAAAAGAGAGAATTTTGGAAGATGATAGGAAAACAGATCAAAGCAAGGAGAAATCACAAGATAAAGAAGGAATTGGCAGCAAAATTGATGAGGAAAGAATTGGTTGGATTGCAGATGAGGGTAAGGATTATATGGTAGAAAGTGATGGTGATAATAACAGGAACAGAGATGTTGATCAAGGGAACATGGTCCAGGATTTGGGAGGTGAAGAAAATTTTGACGGGTTGAAAGTTGGAGCTCATGCGTCTTCGACTATGCTTGAGGAGCGCATTCGGAAGTGAGTAGATATTCACCCTTTACTTTCTATATATATGTTATTTTGTGTAAATATATTTGAATATCAAGTGCATGGATAGGTATTAATGAACAATGAAGTGTGTTCAAGCAGATTTTGACCATGTAGATTGCACTCAGTTTT
GCTTTATCTCATTATAATCTGATCATAAGATTTTCCTTCGGTAGATTAAACTTGAATCAATGGAATTGTGTTTGATCTGTATTTTATTCTATTTAGAGAAATTTTAGCCTTAACTTGGAAAATTTTAGTGATATAGTTACATTGCTTGCTTATTTGTTAACAAATAATGTTGATGACCAAGGCTTTTACTTCCTTCTACACATTTTAGCTGGACCAACATCATCTTGAAAAGATGGCCATAGGTAGTTATTTGAAAAGAAAATAAATGGTAAAATAATTGATGCAAACCCTCACCAAGGATTCCTATGGAAAACTCAGATGAGGTTAATCAAGAAGAGACTGTGATTGCCAAAAAAAATATTGTGATGGGAACACCAAAATACGTATTCAGATCATGCATCTTCCATGATTAAATACTAGCACATGTTAGTCTTGGATCAAATTCTAGAGATAAGGGACCCAGAAGTTCTTGATTTTTATACTCTTACATAACCTCCTATAAAAAGAAAACACTGTTACATAACTAAAAACTTTTTAACATTACTGAAAACATACTGTGCTCTCTTGGACATAGGGTTATATGATTATCTTAGATGCTATGATGGTCTGTGGATACTTCTTTCGTCATAATAAGCGGTTTAAAGTTTTGTTCACCCTACAAAGTGAACGTCTGACTTTATGTTGTGACTGGGAATTTACATGTAAATGTACTTTCTGTGTAATTGATACTTCAATTGACTTAAAAGCTACATTTCATTGAAATTCTTGATGCAGCATGAAAGAAGACAGGTTAAAGAAGCAAACTGAAGAATCTGAGGTTTTAGCATGGGTTAAAAGGAGTCGTAAACTTGAGGAAAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAAGTTAGTTAACTGCATAATACTGAGTCATTTGCAGTCTCTTCTTTTATAATGACAACACTCATGTGATAGTAATTGTGACATTTTGAAATTTCTCAGGATAACATTGATCAAGGTGTAAGCGATGATGATATTGCACCAGAAGATACAACTAGTATGAGTTCTTACTCGCCCCCTTCTTTTCTTGCACTTTCTCTTTTCTTTTTATTACTTCTTCCATGTCTGAATTAAAACCTAATGTGGTTTTAAAGCTCCAAAGAGGTATGTATGCTTCAAAAAGGCAAGGGAGGATTCCTAGAAGTGAAAAAGAGGTAGAGAGAGAGAGATTATTTGATGTTTCTATAATGGCAGCTTATCTAATTCTGTATTCAATTGTCAGATAATCATAATCTAGCTGGAGTTAAAGTACTCCATGGCATAGACAAAGTACTAGAAGGTGGTGCGGTTGTTTTAACCCTTAAGGATCAGAATATCTTAGCTGATGGCGACATTAATGAAGGTAACATACGTTGGTTCTTTTCTTTAATTCTTGGGCCAAAGTTCACTTATATATTTTTATGCCTCAGACGTAGATGTACTTGAGAATGTCGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGCATTTATGATGACAAGTAAGTCATTTAGTTAAGTGCTTTGTGAGCTGTTCTGTAGGTTGCCTCACTTCTGATCCATTTTTCTATTATAGGTTTAATGATGAAAATTATGGCGAGAAGAAGATGCTTCCACAGTATGATGATCCAGCAGCTGCAGATGAGGTGTTCTGTTTTTCTTTCTTTTGTCTGTCTATTTCTATGAACTTACATAAGGACATCGATGTCTTATTCATGATCTTTACCATGAACCATTTCTAGAATCGGAGCATAAGCATTCTCTAAGGACTCGAATTTGTGTTGCAGGGCCTAACCTTAGATGGAAGAGGAGGTTTTAATAATGATGCAGAAAAGAAGCTTGAGGAGGTAATTTGAACTTGTCTTTGAACACATGATGATTGTTTCTCCATTTTGAGTTAATGAGTTTCTTCCTACTGTCATTCCAGTA
ATTATTATTGTATTCTACGTTCTTCCCACCCATGATGATTGGAAATAGTTCAGTATGTTATGATGCTTCTTTAATCTTTGACTTCGTTTTGAGCTCTTTGCGCAATCTTTTTGCTCTTTGACAGCTTCGGAGAAGATTACAGGGATCTAATTCAGTCATGCACTTTGAAGATCTTAATGCATCAACAAAAGTCTCACATGATTATTACACTCAAGATGAGATGCTTAAATTTAAGAAGCCCAGGAAAAAGAAATCCCTCCGAAAGAAGGAAAAGCTAGATCTTGATGCCCTTGAAGCGGAAGCAATTTCTGCTGGATTGGGTGTTGGAGACCTTGGTTCTCGAAATGGTTCTAGAAGGCAAGCACAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGACATAATGCATATCAGTCAGCCTATGCTAAAGCAGATGAGGCATCAAGATCTCTAAAATTAGTTCAAACTAGCTCTGTCAGATTAGAGGACAACGACGATACGCTCATTGCAGATGATGATGAAGATTTCTATAAGTCGTTGGATAGAGCAAGAAAATTAGCTCTTAAGAGGCAGGAGGCAGCATCCGGACCAGGAGCCATTGCTCTTCTTGCCACAGCGACAACTAGCAGTCAGACAACTGATGATCAAAACACAAAAGCTGGAGAATTGCAGGAAAATAAGGTTGTATTTACAGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGGTATCATTTTCCTTGGTCCTTTTTTTGATGTACCACGTATTACTGTTCACATTGACTTCCCTTATAAGTCCCTCAATCAATCTTGAACACAGGGTACTTGTTGGTATTTATTCTCTTCCCCCTCCCCCACCTCATTGGGCTTTGGGCTTTGTTAATGGTTCTTGCAGGCTTCCTGCCTTGCCCCATTTTTTGGATGTTCAATTACACTATCATCCCGAAGAGGATTTTCTTTTCAAGTGCTTGCTAAATATGAGTTTTAACGGCTGTTGCTTGATTTAGGCAAAACAGAAATTCTGACGTGCTATTTTCTGTACTAAATGGATATGTCACATTTTAAACCTTTACCATTGCATATATGGTTTTCGAAAAATGAAGTAAATGAGGAAAACCATGAAGATTAACATTAGGTTTTTTCCATCAGATGCTCATAAACCAGAGGAGGAAGATGTCTTTATGGATGATGATGAAGTACCAAAAGAAGAATATCATGAAGAGATTAAGGATAAAGATGGTGGATGGACTGAAGTCAAAGATACTGCCAAAGAAGAAACCACTCCTGAGGAAAACGAGGCAATAGCTCCCGATGAAACGATCCATGAAGTTCCTGTTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGAGCGAGGAACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCAAAAGAATCTAAGTCAAAGGAATCCCGTTTATCTTCTTTGGTGGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGAGAATTGTAAGCTAGCTTGACTCTATAATTTACCTAGCCATGCCGTTCATATTTAGTTTACTTGTATCACATATTCACTTACTGGTTTTAATATTAGAGGCGTAGTCATTGTGCCTCCTGGTGATGCATGGTAGACTTTTTGTTTCTAGTGGGGTTTAGGGGAATTGGGAATTAGTTCTCCTGCTCTGGGTGTATATGCATGTGAGATAGAGAAAGTGTAGCAAAAGTGTAGTTAAAGTGTAGTGTTAATTTCACAGAAGACCTATGCCAGGTGATTGACAGATGATGATTGCAAGTTGAGAATGTGTATTTCGTGACAAACCTATTATAAAACTTTTACTGGTAGCATAGTGAGGAGCTGTGTGCTAAGTTAGAAACTTAAAATAAATCGATTTATTTTTAAATGTCTAGAATAACATCTAGGGAAATGTGTTT
GTTATCAAATTGCCATACCTAGGTCATGCTCGCCCTTTCTTTATATCTTGTTCAAACAATAAAATGCACCCTCCACCAGAAAACAATATTAACAAACACATTCAAATTGTTTTAAATATGGATCTCTCATTTCCGAGTTGTTGGCATTCTACATGCTTATCTTTATGCTGATTGTGTAGGGTTTCTGCCTGTAGTTGTTAAATTTTTATCTTCAATTATTTCAATGGATTTTGACTATAATTACTGGTCTATCCTTTCCATGGGTCTGTAGATGGTGATAATATTCCTATACTTATAATCAAGGAGATTTTTATTCCCATTTGTGTCCTTTTACTGCACCAGATGACTCCAAAGGAGTCGTTTCGTCAACTCTCTCACAAGTTCCATGGCAAGGGACCTGGAAAAATGAAGCAAGAAAAGCGGATGAAGCAATACCAAGAAGAGTTGAAATTGAAGCAGATGAAGAATGCTGATACTCCTTCGTTATCAGTAGAGAGAATGAGGGAAGCTCAAGCACAATTAAAGACCCCTTATCTTGTTCTCAGCGGCCATGTTAAACCTGGGTATGCTTGGTTTTCATTTGAACGAGTATTTTTCCTTGTAACTTGGTTTTTACCGATTCAGTTTATAACTATTTTGTCGGTTCTTCGATGTTTCTAGCCAAACGAGCGATCCAAGAAGTGGTTTTGCTACAGTTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTGAGTATTCTTGTATTATCCTTTTCTATTATTTTCTTTGTTATTGAATTTTTATGTTACGGAAGAAATTCTCTAGAGTTGGTAGCTATTTATAAATTGGAAACATATTCAATATGGGAGGGTTTCTAACATATTTAGAGCCTAATTTCTTGCCAGTAACAATTATTATTATATTAGGCAAGTTTGTCAGTTGCTTCTCTATTCATGTTTGCAACTTCAGCAACTACCTCCTCTACCCTACGCTGTTTTCAAATCCCATTTTTGTGAAATCTAAACCAAACACAAACATTTCATTCTTTTTATATTGTCATTCCGTTAAAGAATCTCATACTTACCACCCCTTCTTTCCGTAGTTTTTTCTGCAATGAAAAAAAAAAGAGAGAAATCTAATGGATTTCAAATTTATGCCAAATTATTTTCCTTTTTGAGAAGATGTAATATATGTTCCATTTTTGAATCGGTGTAGATTGTAGTAACATCTCATTTTCACTCGTACATGCTGAACAGTAAACGATACATTCTCTAATATCTGAATTTCATTTTGGCGCACTCTAAGCATATGGTCATTCAAACTTTCAATTGGCTTGTTACCAAACTTCTCATTTTTGAACTTTTTAGGCATACCAAACAGAAACACCTCCTTTGTATTAGCATTTGTTTGTTTATATTTATTTTACTTCATGACTCTTTCACCTCAATTTTTTATTGTCATGTTAATTTAATATAAGTTGGTCGTTTTATTTTCAGGTTGAGCATTTCTTGGGGATAAAGCGTAAAGGCGAAGCTTCAAATACAGGCACAAAGAAAGCAAAAGTTTGA

mRNA sequence

GATTTGGAGGATTTTAGCGTGCTGTCCGGAAAAAGTTTACGTTTTGGGGAATTTCCAATGGATGGGGAACGGTCATCCGCACCTGATGAAAGAAATGGTCCAGATATCGCTTGGGCAAGAGAGCGTGGGGAAGGAGGACATGATGACTTTGGTTATAGTGGAGGAGAAAAGTCAAGTAAACATCGGAGTGAGGATCATCGAAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGAACGATCTAAGAGATCTAGTGATGATGCATCGAAAGAAAAGGAGAAAGAGGCAAAGGATTCAGAAAGGGATCGAGTTCGTAGTCGGGAGAAGAGGAAGGAAGATAGAGATGAGCATGAGAAAGAAAGGAGCAGGGGTAGCAAAGTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAAAGAGAGCTGGAGAAGGACAATGACAAGGACCGATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGACAAGGAAAGCTATAGGAACATTGACAAGGACAGAGGAAAAGAGAGAATTTTGGAAGATGATAGGAAAACAGATCAAAGCAAGGAGAAATCACAAGATAAAGAAGGAATTGGCAGCAAAATTGATGAGGAAAGAATTGGTTGGATTGCAGATGAGGGTAAGGATTATATGGTAGAAAGTGATGGTGATAATAACAGGAACAGAGATGTTGATCAAGGGAACATGGTCCAGGATTTGGGAGGTGAAGAAAATTTTGACGGGTTGAAAGTTGGAGCTCATGCGTCTTCGACTATGCTTGAGGAGCGCATTCGGAACATGAAAGAAGACAGGTTAAAGAAGCAAACTGAAGAATCTGAGGTTTTAGCATGGGTTAAAAGGAGTCGTAAACTTGAGGAAAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAAGATAACATTGATCAAGGTGTAAGCGATGATGATATTGCACCAGAAGATACAACTAATAATCATAATCTAGCTGGAGTTAAAGTACTCCATGGCATAGACAAAGTACTAGAAGGTGGTGCGGTTGTTTTAACCCTTAAGGATCAGAATATCTTAGCTGATGGCGACATTAATGAAGACGTAGATGTACTTGAGAATGTCGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGCATTTATGATGACAAGTTTAATGATGAAAATTATGGCGAGAAGAAGATGCTTCCACAGTATGATGATCCAGCAGCTGCAGATGAGGGCCTAACCTTAGATGGAAGAGGAGGTTTTAATAATGATGCAGAAAAGAAGCTTGAGGAGCTTCGGAGAAGATTACAGGGATCTAATTCAGTCATGCACTTTGAAGATCTTAATGCATCAACAAAAGTCTCACATGATTATTACACTCAAGATGAGATGCTTAAATTTAAGAAGCCCAGGAAAAAGAAATCCCTCCGAAAGAAGGAAAAGCTAGATCTTGATGCCCTTGAAGCGGAAGCAATTTCTGCTGGATTGGGTGTTGGAGACCTTGGTTCTCGAAATGGTTCTAGAAGGCAAGCACAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGACATAATGCATATCAGTCAGCCTATGCTAAAGCAGATGAGGCATCAAGATCTCTAAAATTAGTTCAAACTAGCTCTGTCAGATTAGAGGACAACGACGATACGCTCATTGCAGATGATGATGAAGATTTCTATAAGTCGTTGGATAGAGCAAGAAAATTAGCTCTTAAGAGGCAGGAGGCAGCATCCGGACCAGGAGCCATTGCTCTTCTTGCCACAGCGACAACTAGCAGTCAGACAACTGATGATCAAAACACAAAAGCTGGAGAATTGCAGGAAAATAAGGTTGTATT
TACAGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGATGCTCATAAACCAGAGGAGGAAGATGTCTTTATGGATGATGATGAAGTACCAAAAGAAGAATATCATGAAGAGATTAAGGATAAAGATGGTGGATGGACTGAAGTCAAAGATACTGCCAAAGAAGAAACCACTCCTGAGGAAAACGAGGCAATAGCTCCCGATGAAACGATCCATGAAGTTCCTGTTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGAGCGAGGAACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCAAAAGAATCTAAGTCAAAGGAATCCCGTTTATCTTCTTTGGTGGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGAGAATTATGACTCCAAAGGAGTCGTTTCGTCAACTCTCTCACAAGTTCCATGGCAAGGGACCTGGAAAAATGAAGCAAGAAAAGCGGATGAAGCAATACCAAGAAGAGTTGAAATTGAAGCAGATGAAGAATGCTGATACTCCTTCGTTATCAGTAGAGAGAATGAGGGAAGCTCAAGCACAATTAAAGACCCCTTATCTTGTTCTCAGCGGCCATGTTAAACCTGGCCAAACGAGCGATCCAAGAAGTGGTTTTGCTACAGTTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGGGATAAAGCGTAAAGGCGAAGCTTCAAATACAGGCACAAAGAAAGCAAAAGTTTGA

Coding sequence (CDS)

GATTTGGAGGATTTTAGCGTGCTGTCCGGAAAAAGTTTACGTTTTGGGGAATTTCCAATGGATGGGGAACGGTCATCCGCACCTGATGAAAGAAATGGTCCAGATATCGCTTGGGCAAGAGAGCGTGGGGAAGGAGGACATGATGACTTTGGTTATAGTGGAGGAGAAAAGTCAAGTAAACATCGGAGTGAGGATCATCGAAAGAGTAGTCGAGGGGAGGAAAAAGACCATAGAAGTAAAGATCGAGAACGATCTAAGAGATCTAGTGATGATGCATCGAAAGAAAAGGAGAAAGAGGCAAAGGATTCAGAAAGGGATCGAGTTCGTAGTCGGGAGAAGAGGAAGGAAGATAGAGATGAGCATGAGAAAGAAAGGAGCAGGGGTAGCAAAGTTAAAGACAAAGATTATGACAGAGAGATTTACAAGGAGAAAGAATATGAGAGAGAGAGAGATAGAAAAGATCGAGGAAAGGATAGAGAGCGTGAGAGGGAAAGAGAGCTGGAGAAGGACAATGACAAGGACCGATCAAATGAAAATGAAAGGGAGAAGGGGAGAGAGAAACACAGAGATCAAGAGGACAAGGAAAGCTATAGGAACATTGACAAGGACAGAGGAAAAGAGAGAATTTTGGAAGATGATAGGAAAACAGATCAAAGCAAGGAGAAATCACAAGATAAAGAAGGAATTGGCAGCAAAATTGATGAGGAAAGAATTGGTTGGATTGCAGATGAGGGTAAGGATTATATGGTAGAAAGTGATGGTGATAATAACAGGAACAGAGATGTTGATCAAGGGAACATGGTCCAGGATTTGGGAGGTGAAGAAAATTTTGACGGGTTGAAAGTTGGAGCTCATGCGTCTTCGACTATGCTTGAGGAGCGCATTCGGAACATGAAAGAAGACAGGTTAAAGAAGCAAACTGAAGAATCTGAGGTTTTAGCATGGGTTAAAAGGAGTCGTAAACTTGAGGAAAAGAAACTTTCTGAAAAAGAGAAAGCCTTGCAGCTCTCAAAGATTTTTGAGGAACAAGATAACATTGATCAAGGTGTAAGCGATGATGATATTGCACCAGAAGATACAACTAATAATCATAATCTAGCTGGAGTTAAAGTACTCCATGGCATAGACAAAGTACTAGAAGGTGGTGCGGTTGTTTTAACCCTTAAGGATCAGAATATCTTAGCTGATGGCGACATTAATGAAGACGTAGATGTACTTGAGAATGTCGAAATTGGAGAACAGAAGCAGAGAGACATGGCCTATAAAGCAGCAAAAAAGAAAACCGGCATTTATGATGACAAGTTTAATGATGAAAATTATGGCGAGAAGAAGATGCTTCCACAGTATGATGATCCAGCAGCTGCAGATGAGGGCCTAACCTTAGATGGAAGAGGAGGTTTTAATAATGATGCAGAAAAGAAGCTTGAGGAGCTTCGGAGAAGATTACAGGGATCTAATTCAGTCATGCACTTTGAAGATCTTAATGCATCAACAAAAGTCTCACATGATTATTACACTCAAGATGAGATGCTTAAATTTAAGAAGCCCAGGAAAAAGAAATCCCTCCGAAAGAAGGAAAAGCTAGATCTTGATGCCCTTGAAGCGGAAGCAATTTCTGCTGGATTGGGTGTTGGAGACCTTGGTTCTCGAAATGGTTCTAGAAGGCAAGCACAAAAAGAGGAACAAGAGAAATCTGAGGCAGAAATGCGACATAATGCATATCAGTCAGCCTATGCTAAAGCAGATGAGGCATCAAGATCTCTAAAATTAGTTCAAACTAGCTCTGTCAGATTAGAGGACAACGACGATACGCTCATTGCAGATGATGATGAAGATTTCTATAAGTCGTTGGATAGAGCAAGAAAATTAGCTCTTAAGAGGCAGGAGGCAGCATCCGGACCAGGAGCCATTGCTCTTCTTGCCACAGCGACAACTAGCAGTCAGACAACTGATGATCAAAACACAAAAGCTGGAGAATTGCAGGAAAATAAGGTTGTATT
TACAGAAATGGAAGAATTTGTCTGGGGTCTCCAGCTTGATGAAGATGCTCATAAACCAGAGGAGGAAGATGTCTTTATGGATGATGATGAAGTACCAAAAGAAGAATATCATGAAGAGATTAAGGATAAAGATGGTGGATGGACTGAAGTCAAAGATACTGCCAAAGAAGAAACCACTCCTGAGGAAAACGAGGCAATAGCTCCCGATGAAACGATCCATGAAGTTCCTGTTGGAAAGGGATTATCCAGTGCACTGAAGCTGCTTAAAGAGCGAGGAACTCTGAAGGAAAGCATTGAATGGGGTGGCAGAAACATGGACAAGAGAAAGAGCAAACTTGTTGGTATAGTAGATGAAGATGAACCAAAAGAATCTAAGTCAAAGGAATCCCGTTTATCTTCTTTGGTGGATTACAAAAAGGAGATCCACATTGAGAGGACTGATGAATTTGGGAGAATTATGACTCCAAAGGAGTCGTTTCGTCAACTCTCTCACAAGTTCCATGGCAAGGGACCTGGAAAAATGAAGCAAGAAAAGCGGATGAAGCAATACCAAGAAGAGTTGAAATTGAAGCAGATGAAGAATGCTGATACTCCTTCGTTATCAGTAGAGAGAATGAGGGAAGCTCAAGCACAATTAAAGACCCCTTATCTTGTTCTCAGCGGCCATGTTAAACCTGGCCAAACGAGCGATCCAAGAAGTGGTTTTGCTACAGTTGAAAAGGATCTCCCAGGCGGCTTGACACCCATGCTTGGTGACAGAAAAGTTGAGCATTTCTTGGGGATAAAGCGTAAAGGCGAAGCTTCAAATACAGGCACAAAGAAAGCAAAAGTTTGA

Protein sequence

DLEDFSVLSGKSLRFGEFPMDGERSSAPDERNGPDIAWARERGEGGHDDFGYSGGEKSSKHRSEDHRKSSRGEEKDHRSKDRERSKRSSDDASKEKEKEAKDSERDRVRSREKRKEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRKDRGKDRERERERELEKDNDKDRSNENEREKGREKHRDQEDKESYRNIDKDRGKERILEDDRKTDQSKEKSQDKEGIGSKIDEERIGWIADEGKDYMVESDGDNNRNRDVDQGNMVQDLGGEENFDGLKVGAHASSTMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAPEDTTNNHNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNNDAEKKLEELRRRLQGSNSVMHFEDLNASTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKLDLDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLKLVQTSSVRLEDNDDTLIADDDEDFYKSLDRARKLALKRQEAASGPGAIALLATATTSSQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEEIKDKDGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALKLLKERGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
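
The listed protein begins DLEDFSVLSG, which matches a translation of the listed CDS from its first base rather than from the first ATG. As a minimal sketch, assuming the standard genetic code, the check below translates the first 30 bases of the CDS. `translate` is a hypothetical helper written for this illustration.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna):
    """Translate DNA in frame 0; unknown codons become 'X', stops become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 bases of the CDS listed above.
cds_start = "GATTTGGAGGATTTTAGCGTGCTGTCCGGA"
print(translate(cds_start))  # DLEDFSVLSG
```

Applying the same function to the full CDS should reproduce the protein sequence up to the terminal stop codon, provided the two wrapped CDS lines are joined first.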
Homology
BLAST of Bhi06G000202 vs. TAIR 10
Match: AT5G16780.1 (SART-1 family)

HSP 1 Score: 785.4 bits (2027), Expect = 5.1e-227
Identity = 486/852 (57.04%), Positives = 621/852 (72.89%), Query Frame = 0

Query: 102 DSERDRVRSREKR--------KEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRK 161
           +  + R   RE+R        +E RD   KE+   SK K+KDYDRE  ++K++ R+   K
Sbjct: 4   EKSKSRHEIREERADYEGSPVREHRDGRRKEKDHRSKDKEKDYDREKIRDKDHRRD---K 63

Query: 162 DRGKDRERERERELEKDNDKDRSNENEREKGREKHRDQEDKESYRNIDKDRGKERILEDD 221
           ++ +DR+R R+ + EK+  + R  E E++K R++ + ++DKE  RN  KDR  ER    D
Sbjct: 64  EKERDRKRSRDEDTEKEISRGRDKEREKDKSRDRVK-EKDKEKERNRHKDRENER----D 123

Query: 222 RKTDQSKEKSQDKEGIGSKIDEERIGWIADEGKDYMVESDGDNNRNRDVDQGNMVQDLGG 281
            + ++ K++++ KE    K  E       D+ + +      +++ NR +++G        
Sbjct: 124 NEKEKDKDRARVKERASKKSHE-------DDDETHKAAERYEHSDNRGLNEGG------- 183

Query: 282 EENFDGLKVGAHASSTMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKA 341
            +N D    G  AS+  L+ RI  M+E+R KK  + S+ L+WV RSRK+EEK+ +EK++A
Sbjct: 184 -DNVDAASSGKEASALDLQNRILKMREERKKKAEDASDALSWVARSRKIEEKRNAEKQRA 243

Query: 342 LQLSKIFEEQDNIDQGVSDDDIAPEDTTNNHNLAGVKVLHGIDKVLEGGAVVLTLKDQNI 401
            QLS+IFEEQDN++QG +      ED  +  +L+GVKVLHG++KV+EGGAV+LTLKDQ++
Sbjct: 244 QQLSRIFEEQDNLNQGEN------EDGEDGEHLSGVKVLHGLEKVVEGGAVILTLKDQSV 303

Query: 402 LADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPA 461
           L DGD+N ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ A
Sbjct: 304 LTDGDVNNEIDMLENVEIGEQKRRNEAYEAAKKKKGIYDDKFNDDPGAEKKMLPQYDE-A 363

Query: 462 AADEGLTLDGRGGFNNDAEKKLEELRRRLQGSNSVMHFEDLNASTKVSHDYYTQDEMLKF 521
           A DEG+ LD +G F  +AEKKLEELR+R+QG  +   FEDLN+S KVS DY++Q+EMLKF
Sbjct: 364 ATDEGIFLDAKGRFTGEAEKKLEELRKRIQG-QTTHTFEDLNSSAKVSSDYFSQEEMLKF 423

Query: 522 KKPRKKKSLRKKEKLDLDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNA 581
           KKP+KKK LRKK+KLDL  LEAEA+++GLG  DLGSR   RRQA KEE+E+ E E R NA
Sbjct: 424 KKPKKKKQLRKKDKLDLSMLEAEAVASGLGAEDLGSRKDGRRQAMKEEKERIEYEKRSNA 483

Query: 582 YQSAYAKADEASRSLKLVQTSSVRLEDNDDTLIADDDEDFYKSLDRARKLAL-KRQEAAS 641
           YQ A AKADEASR L+  Q    + ++++  ++ADD ED YKSL++AR+LAL K++EA S
Sbjct: 484 YQEAIAKADEASRLLRREQVQPFKRDEDESMVLADDAEDLYKSLEKARRLALIKKEEAGS 543

Query: 642 GPGAIALLATATTSSQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVF 701
           GP A+A L  A++++QTTDD  T   E QEN VVFTEM +FVWGLQ + D  KPE EDVF
Sbjct: 544 GPQAVAHL-VASSTNQTTDDNTTTGDETQENTVVFTEMGDFVWGLQRENDVRKPESEDVF 603

Query: 702 MDDDEVPKE--EYHEEIKDKDGGWTEVKDT---AKEETTPEENEAIAPDETIHEVPVGKG 761
           M++D  PK   E  EE  D   G TEV DT   A E+++  + + I PDE IHEV VGKG
Sbjct: 604 MEEDVAPKAPVEVKEEHPD---GLTEVNDTDMDAAEDSS--DTKEITPDENIHEVAVGKG 663

Query: 762 LSSALKLLKERGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKE 821
           LS ALKLLK+RGTLKE +EWGGRNMDK+KSKLVGIVD+D  KESK KES+     D  K+
Sbjct: 664 LSGALKLLKDRGTLKEKVEWGGRNMDKKKSKLVGIVDDDGGKESKDKESK-----DRFKD 723

Query: 822 IHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSL 881
           I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DTPS 
Sbjct: 724 IRIERTDEFGRTLTPKEAFRLLSHKFHGKGPGKMKEEKRMKQYQEELKLKQMKNSDTPSQ 783

Query: 882 SVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLG 940
           SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLG
Sbjct: 784 SVQRMREAQAQLKTPYLVLSGHVKPGQTSDPQSGFATVEKDVPGSLTPMLGDRKVEHFLG 813

BLAST of Bhi06G000202 vs. TAIR 10
Match: AT3G14700.1 (SART-1 family)

HSP 1 Score: 96.3 bits (238), Expect = 1.4e-19
Identity = 65/159 (40.88%), Positives = 96/159 (60.38%), Query Frame = 0

Query: 728 EENEAIAPDETIHEVPVGKGLSSALKLLKERGTLKESIEWGGRNMDKRKSKLVGIVDEDE 787
           + +  +  D  + E  VG GLS AL  L+E+GT KE            + K+VG+     
Sbjct: 72  DNHSRVRGDGIMREADVGTGLSGALNRLREQGTFKE------------EGKVVGV----- 131

Query: 788 PKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRM 847
            K++  ++ R     D  K+I I+R +++GRIMT KE++R L H FHGKGPGK KQEK+ 
Sbjct: 132 -KDNNHEDDRFK---DRFKDIQIQRVNKWGRIMTEKEAYRSLCHGFHGKGPGKKKQEKQR 191

Query: 848 KQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL 887
           K++++  K KQM++++    SVER+RE  A  KTPY+VL
Sbjct: 192 KKHED--KSKQMESSER---SVERIREIHAISKTPYIVL 204

BLAST of Bhi06G000202 vs. ExPASy Swiss-Prot
Match: Q9LFE0 (SART-1 family protein DOT2 OS=Arabidopsis thaliana OX=3702 GN=DOT2 PE=1 SV=1)

HSP 1 Score: 785.4 bits (2027), Expect = 7.1e-226
Identity = 486/852 (57.04%), Positives = 621/852 (72.89%), Query Frame = 0

Query: 102 DSERDRVRSREKR--------KEDRDEHEKERSRGSKVKDKDYDREIYKEKEYERERDRK 161
           +  + R   RE+R        +E RD   KE+   SK K+KDYDRE  ++K++ R+   K
Sbjct: 4   EKSKSRHEIREERADYEGSPVREHRDGRRKEKDHRSKDKEKDYDREKIRDKDHRRD---K 63

Query: 162 DRGKDRERERERELEKDNDKDRSNENEREKGREKHRDQEDKESYRNIDKDRGKERILEDD 221
           ++ +DR+R R+ + EK+  + R  E E++K R++ + ++DKE  RN  KDR  ER    D
Sbjct: 64  EKERDRKRSRDEDTEKEISRGRDKEREKDKSRDRVK-EKDKEKERNRHKDRENER----D 123

Query: 222 RKTDQSKEKSQDKEGIGSKIDEERIGWIADEGKDYMVESDGDNNRNRDVDQGNMVQDLGG 281
            + ++ K++++ KE    K  E       D+ + +      +++ NR +++G        
Sbjct: 124 NEKEKDKDRARVKERASKKSHE-------DDDETHKAAERYEHSDNRGLNEGG------- 183

Query: 282 EENFDGLKVGAHASSTMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKA 341
            +N D    G  AS+  L+ RI  M+E+R KK  + S+ L+WV RSRK+EEK+ +EK++A
Sbjct: 184 -DNVDAASSGKEASALDLQNRILKMREERKKKAEDASDALSWVARSRKIEEKRNAEKQRA 243

Query: 342 LQLSKIFEEQDNIDQGVSDDDIAPEDTTNNHNLAGVKVLHGIDKVLEGGAVVLTLKDQNI 401
            QLS+IFEEQDN++QG +      ED  +  +L+GVKVLHG++KV+EGGAV+LTLKDQ++
Sbjct: 244 QQLSRIFEEQDNLNQGEN------EDGEDGEHLSGVKVLHGLEKVVEGGAVILTLKDQSV 303

Query: 402 LADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPA 461
           L DGD+N ++D+LENVEIGEQK+R+ AY+AAKKK GIYDDKFND+   EKKMLPQYD+ A
Sbjct: 304 LTDGDVNNEIDMLENVEIGEQKRRNEAYEAAKKKKGIYDDKFNDDPGAEKKMLPQYDE-A 363

Query: 462 AADEGLTLDGRGGFNNDAEKKLEELRRRLQGSNSVMHFEDLNASTKVSHDYYTQDEMLKF 521
           A DEG+ LD +G F  +AEKKLEELR+R+QG  +   FEDLN+S KVS DY++Q+EMLKF
Sbjct: 364 ATDEGIFLDAKGRFTGEAEKKLEELRKRIQG-QTTHTFEDLNSSAKVSSDYFSQEEMLKF 423

Query: 522 KKPRKKKSLRKKEKLDLDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNA 581
           KKP+KKK LRKK+KLDL  LEAEA+++GLG  DLGSR   RRQA KEE+E+ E E R NA
Sbjct: 424 KKPKKKKQLRKKDKLDLSMLEAEAVASGLGAEDLGSRKDGRRQAMKEEKERIEYEKRSNA 483

Query: 582 YQSAYAKADEASRSLKLVQTSSVRLEDNDDTLIADDDEDFYKSLDRARKLAL-KRQEAAS 641
           YQ A AKADEASR L+  Q    + ++++  ++ADD ED YKSL++AR+LAL K++EA S
Sbjct: 484 YQEAIAKADEASRLLRREQVQPFKRDEDESMVLADDAEDLYKSLEKARRLALIKKEEAGS 543

Query: 642 GPGAIALLATATTSSQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVF 701
           GP A+A L  A++++QTTDD  T   E QEN VVFTEM +FVWGLQ + D  KPE EDVF
Sbjct: 544 GPQAVAHL-VASSTNQTTDDNTTTGDETQENTVVFTEMGDFVWGLQRENDVRKPESEDVF 603

Query: 702 MDDDEVPKE--EYHEEIKDKDGGWTEVKDT---AKEETTPEENEAIAPDETIHEVPVGKG 761
           M++D  PK   E  EE  D   G TEV DT   A E+++  + + I PDE IHEV VGKG
Sbjct: 604 MEEDVAPKAPVEVKEEHPD---GLTEVNDTDMDAAEDSS--DTKEITPDENIHEVAVGKG 663

Query: 762 LSSALKLLKERGTLKESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKE 821
           LS ALKLLK+RGTLKE +EWGGRNMDK+KSKLVGIVD+D  KESK KES+     D  K+
Sbjct: 664 LSGALKLLKDRGTLKEKVEWGGRNMDKKKSKLVGIVDDDGGKESKDKESK-----DRFKD 723

Query: 822 IHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSL 881
           I IERTDEFGR +TPKE+FR LSHKFHGKGPGKMK+EKRMKQYQEELKLKQMKN+DTPS 
Sbjct: 724 IRIERTDEFGRTLTPKEAFRLLSHKFHGKGPGKMKEEKRMKQYQEELKLKQMKNSDTPSQ 783

Query: 882 SVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLG 940
           SV+RMREAQAQLKTPYLVLSGHVKPGQTSDP+SGFATVEKD+PG LTPMLGDRKVEHFLG
Sbjct: 784 SVQRMREAQAQLKTPYLVLSGHVKPGQTSDPQSGFATVEKDVPGSLTPMLGDRKVEHFLG 813

BLAST of Bhi06G000202 vs. ExPASy Swiss-Prot
Match: Q9Z315 (U4/U6.U5 tri-snRNP-associated protein 1 OS=Mus musculus OX=10090 GN=Sart1 PE=1 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 8.6e-22
Identity = 218/802 (27.18%), Positives = 369/802 (46.01%), Query Frame = 0

Query: 141 YKEKEYERERDRKDRGKDRERERERELEKDNDKDRSNENEREKGREKHRDQEDKESYRNI 200
           +K+ ++         G+ R+R RER  E+ + + R  E E   G       + + S R +
Sbjct: 35  HKKHKHRSSGGGSSGGERRKRSRERGSERGSGR-RGAEAEARSGAHGRERSQAEPSERRV 94

Query: 201 DKDRGKERILEDDRKTDQSKEKSQDKEGIGSKIDE-----ERIGWIADEGKDYMVESDGD 260
            +++       DD     +  K+   +     I+E      ++G    E      E+ G 
Sbjct: 95  KREK------RDDGYEAAASSKASSGDASSLSIEETNKLRAKLGLKPLEVNAVKKEA-GT 154

Query: 261 NNRNRDVDQGNMVQDLGGEENFDGLKVGAHASSTMLEERIRNMKEDRLKKQTEE----SE 320
                  D  N +     EE  + L       +   E+R+ N K  ++K   E+     +
Sbjct: 155 KEEPVAADVINPMALRQREELREKL-------AAAKEKRLLNQKLGKIKTLGEDDPWLDD 214

Query: 321 VLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPEDTTNNHNL 380
             AW++RSR+L++    EK+ A + +K+ EE D  + GVS     + +   +D  +  +L
Sbjct: 215 TAAWIERSRQLQK----EKDLAEKRAKLLEEMDQ-EFGVSTLVEEEFEQRRQDLYSARDL 274

Query: 381 AGVKVLHGIDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKK 440
            G+ V H ID   EG  VVLTLKD+ +L DG+     DVL NV + ++++ D   +  KK
Sbjct: 275 QGLTVEHAIDSFREGETVVLTLKDKGVLQDGE-----DVLVNVNMVDKERADKNVELRKK 334

Query: 441 KTGIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLEEL 500
           K   Y     DE+  +      + +L +YD+    +      L+  G  +   E++LEE+
Sbjct: 335 KPD-YLPYAEDESVDDLAQQKPRSILAKYDEELEGERPHSFRLEQGGMADGLRERELEEI 394

Query: 501 RRRLQGSNSVMHFEDLNA-STKVSHDYYTQDEMLKFKKPRKK-KSLRKKEK-LDLDALEA 560
           R +L+     +  + L++   +++ +Y + +EM+ FKK +++ K +RKKEK + + A + 
Sbjct: 395 RTKLR-----LQAQSLSSVGPRLASEYLSPEEMVTFKKTKRRVKKIRKKEKEVIMRADDL 454

Query: 561 EAISAGLGVGDLGSR---NGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLKLVQ 620
             +      GD GSR    G RR  + EE+   + E    A           +  +   +
Sbjct: 455 LPLGDQTQDGDFGSRLRGRGRRRVPEVEEEALEDEEKDPVAQPPPSDDTRVENMDISDEE 514

Query: 621 TSSVRLEDNDDTLIADDDE-DFYKSLDRARKL-ALKRQEAASGPGAIALLATATTSSQTT 680
                   + + L  D+ E +  K L++ R+L  L++ +     G   L       S+  
Sbjct: 515 DGGALPPGSPEGLEEDEAELELQKQLEKGRRLRQLQQLQQLRDSGEKVLEIVKKLESRQR 574

Query: 681 DDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEDAHKPEEEDVFMD---DDEVP 740
             +  +  E ++  +VF    EF         +GL     A   EE++  MD   D+E  
Sbjct: 575 GWEEEEDPE-RKGTIVFNATSEFCRTLGEIPTYGL-----AGNREEQEELMDFERDEERS 634

Query: 741 KEEYHEEIKDKDGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALKLLKERG 800
                E   +++ GW+ V     EE   ++  A +      E  V +GL++AL L + +G
Sbjct: 635 ANGGSESDGEENIGWSTV--NLDEEKQHQDFSASSTTILDEEPIVNRGLAAALLLCQNKG 694

Query: 801 TLKESIEWGGRNMDKRKS--KLVGIVDED---EPKESKSKESR-----LSSLVDYKKEIH 860
            L+ +++   R     KS    V  +++    + K S+ +E R           YK ++ 
Sbjct: 695 LLETTVQKVARVKAPNKSLPSAVYCIEDKMAIDDKYSRREEYRGFTQDFKEKDGYKPDVK 754

Query: 861 IERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSV 892
           IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V
Sbjct: 755 IEYVDETGRKLTPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSDTPLGTV 797

BLAST of Bhi06G000202 vs. ExPASy Swiss-Prot
Match: Q5XIW8 (U4/U6.U5 tri-snRNP-associated protein 1 OS=Rattus norvegicus OX=10116 GN=Sart1 PE=1 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 2.5e-21
Identity = 216/801 (26.97%), Positives = 368/801 (45.94%), Query Frame = 0

Query: 141 YKEKEYERERDRKDRGKDRERERERELEKDNDKDRSNENERE--KGREKHRDQEDKESYR 200
           +K+ ++         G+ R+R RER  E+ + +  +    R    GRE+ + +  +   +
Sbjct: 35  HKKHKHRSSGGGSSGGERRKRSRERGAERGSGRRGAEAEARSGAHGRERSQAEPSERRVK 94

Query: 201 NIDKDRGKERILEDDRKT-DQSKEKSQDKEGIGSKIDEERIGWIADEGKDYMVESDGDNN 260
              +D G E        + D S    ++   + +K+  + +   A      + +  G   
Sbjct: 95  REKRDEGYEAAASSKASSGDASSLSIEETNKLRAKLGLKPLEVNA------VKKEAGTKE 154

Query: 261 RNRDVDQGNMVQDLGGEENFDGLKVGAHASSTMLEERIRNMKEDRLKKQTEE----SEVL 320
                D  N +     EE  + L       +   E+R+ N K  ++K   E+     +  
Sbjct: 155 EPVAADVINPMALRQREELREKL-------AAAKEKRLLNQKLGKIKTLGEDDPWLDDTA 214

Query: 321 AWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVS-----DDDIAPEDTTNNHNLAG 380
           AW++RSR+L++    EK+ A + +K+ EE D  + GVS     + +   +D  +  +L G
Sbjct: 215 AWIERSRQLQK----EKDLAEKRAKLLEEMDQ-EFGVSTLVEEEFEQRRQDLYSARDLQG 274

Query: 381 VKVLHGIDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAAKKKT 440
           + V H ID   EG  VVLTLKD+ +L +G+     DVL NV + ++++ D   +  KKK 
Sbjct: 275 LTVEHAIDSFREGETVVLTLKDKGVLQEGE-----DVLVNVNMVDKERADKNVELRKKKP 334

Query: 441 GIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLEELRR 500
             Y     DE+  +      + +L +YD+    +      L+  G  +   E++LEE+R 
Sbjct: 335 D-YLPYAEDESVDDLAQQKPRSILAKYDEELEGERPHSFRLEQGGMADGLRERELEEIRT 394

Query: 501 RLQGSNSVMHFEDLN-ASTKVSHDYYTQDEMLKFKKPRKK-KSLRKKEKLDLDALEAEAI 560
           +L+     +  + LN    +++ +Y + +EM+ FKK +++ K +RKKEK ++     + +
Sbjct: 395 KLR-----LQAQSLNTVGPRLASEYLSPEEMVTFKKTKRRVKKIRKKEK-EVIMRADDLL 454

Query: 561 SAG---LGVGDLGS--RNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLKLVQT 620
             G      GD GS  R   RR+  + E+E  E E +    Q   +            + 
Sbjct: 455 PLGEDQTQDGDFGSRLRGRGRRRVPEVEEEALEDEEKDPVAQPPPSDDTRVENMDISDEE 514

Query: 621 SSVRLEDNDDTLIADDDE-DFYKSLDRARKL-ALKRQEAASGPGAIALLATATTSSQTTD 680
               L      L  D+ E +  K L++ R+L  L++ +     G   L       S+   
Sbjct: 515 DGGALPSGPPELEEDEAELELQKQLEKGRRLRQLQQLQQLRDSGEKVLEIVKKLESRQRG 574

Query: 681 DQNTKAGELQENKVVFTEMEEF--------VWGLQLDEDAHKPEEEDVFMD---DDEVPK 740
            +  +  E ++  +VF    EF         +GL     A   EE++  MD   D+E   
Sbjct: 575 WEEEEDPE-RKGTIVFNATSEFCRTLGEIPTYGL-----AGNREEQEELMDFERDEERSA 634

Query: 741 EEYHEEIKDKDGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALKLLKERGT 800
               E   +++ GW+ V     EE   ++  A +      E  V +GL++AL L + +G 
Sbjct: 635 NGGSESDGEENIGWSTV--NLDEEKQHQDFSASSTTILDEEPIVNRGLAAALLLCQNKGL 694

Query: 801 LKESIEWGGRNMDKRKS--KLVGIVDED---EPKESKSKESR-----LSSLVDYKKEIHI 860
           L+ +++   R     KS    V  +++    + K S+ +E R           YK ++ I
Sbjct: 695 LETTVQKVARVKAPNKSLPSAVYCIEDKMAIDDKYSRREEYRGFTQDFKEKDGYKPDVKI 754

Query: 861 ERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVE 892
           E  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++DTP  +V 
Sbjct: 755 EYVDETGRKLTPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSDTPLGTVA 797

BLAST of Bhi06G000202 vs. ExPASy Swiss-Prot
Match: O43290 (U4/U6.U5 tri-snRNP-associated protein 1 OS=Homo sapiens OX=9606 GN=SART1 PE=1 SV=1)

HSP 1 Score: 100.5 bits (249), Expect = 1.1e-19
Identity = 210/808 (25.99%), Positives = 373/808 (46.16%), Query Frame = 0

Query: 142 KEKEYERERDRK-----DRGKDRERERERELEKDNDKDRSNENEREK--GREKHRDQEDK 201
           + +E+++ + R        G+ R+R RER  E+ + +  +    R    GRE+ + +  +
Sbjct: 31  RHREHKKHKHRSGGSGGSGGERRKRSRERGGERGSGRRGAEAEARSSTHGRERSQAEPSE 90

Query: 202 ESYRNIDKDRGKERILEDDRKT-DQSKEKSQDKEGIGSKIDEERIGWIADEGKDYMVESD 261
              +   +D G E        + D S    ++   + +K+  + +   A      + +  
Sbjct: 91  RRVKREKRDDGYEAAASSKTSSGDASSLSIEETNKLRAKLGLKPLEVNA------IKKEA 150

Query: 262 GDNNRNRDVDQGNMVQDLGGEENFDGLKVGAHASSTMLEERIRNMKEDRLKKQTEE---- 321
           G        D  N +     EE  + L       +   E+R+ N K  ++K   E+    
Sbjct: 151 GTKEEPVTADVINPMALRQREELREKL-------AAAKEKRLLNQKLGKIKTLGEDDPWL 210

Query: 322 SEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVS---DDDIAP--EDTTNNH 381
            +  AW++RSR+L++    EK+ A + +K+ EE D  + GVS   +++     +D  +  
Sbjct: 211 DDTAAWIERSRQLQK----EKDLAEKRAKLLEEMDQ-EFGVSTLVEEEFGQRRQDLYSAR 270

Query: 382 NLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVEIGEQKQRDMAYKAA 441
           +L G+ V H ID   EG  ++LTLKD+ +L      E+ DVL NV + ++++ +   +  
Sbjct: 271 DLQGLTVEHAIDSFREGETMILTLKDKGVL-----QEEEDVLVNVNLVDKERAEKNVELR 330

Query: 442 KKKTGIYDDKFNDENYGE------KKMLPQYDDPAAAD--EGLTLDGRGGFNNDAEKKLE 501
           KKK   Y     DE+  +      + +L +YD+    +      L+  G  +   E++LE
Sbjct: 331 KKKPD-YLPYAEDESVDDLAQQKPRSILSKYDEELEGERPHSFRLEQGGTADGLRERELE 390

Query: 502 ELRRRLQGSNSVMHFEDLN-ASTKVSHDYYTQDEMLKFKKPRKK-KSLRKKEK-LDLDAL 561
           E+R +L+     +  + L+    +++ +Y T +EM+ FKK +++ K +RKKEK + + A 
Sbjct: 391 EIRAKLR-----LQAQSLSTVGPRLASEYLTPEEMVTFKKTKRRVKKIRKKEKEVVVRAD 450

Query: 562 EAEAISAGLGVGDLGSR---NGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLKL 621
           +   +      GD GSR    G RR ++ EE+++   +             D    ++ +
Sbjct: 451 DLLPLGDQTQDGDFGSRLRGRGRRRVSEVEEEKEPVPQ--------PLPSDDTRVENMDI 510

Query: 622 VQTSSVRLEDNDDTLIADDDE---DFYKSLD---RARKLALKRQEAASGPGAIALLATAT 681
                          + ++DE   +  K L+   R R+L   +Q   SG   + ++    
Sbjct: 511 SDEEEGGAPPPGSPQVLEEDEAELELQKQLEKGRRLRQLQQLQQLRDSGEKVVEIVKKLE 570

Query: 682 TSSQTTDDQNTKAGELQENKVVFTEMEEF--------VWGLQLDEDAHKPEEEDVFMD-- 741
           +  +  ++        ++  +VF    EF         +GL     A   EE++  MD  
Sbjct: 571 SRQRGWEEDEDPE---RKGAIVFNATSEFCRTLGEIPTYGL-----AGNREEQEELMDFE 630

Query: 742 -DDEVPKEEYHEEIKDKDGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALK 801
            D+E       E   +++ GW+ V     EE   ++  A +      E  V +GL++AL 
Sbjct: 631 RDEERSANGGSESDGEENIGWSTV--NLDEEKQQQDFSASSTTILDEEPIVNRGLAAALL 690

Query: 802 LLKERGTLKESIEWGGRNMDKRKS--KLVGIVDED---EPKESKSKESR-----LSSLVD 861
           L + +G L+ +++   R     KS    V  +++    + K S+ +E R           
Sbjct: 691 LCQNKGLLETTVQKVARVKAPNKSLPSAVYCIEDKMAIDDKYSRREEYRGFTQDFKEKDG 750

Query: 862 YKKEIHIERTDEFGRIMTPKESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNAD 892
           YK ++ IE  DE GR +TPKE+FRQLSH+FHGKG GKMK E+RMK+  EE  LK+M ++D
Sbjct: 751 YKPDVKIEYVDETGRKLTPKEAFRQLSHRFHGKGSGKMKTERRMKKLDEEALLKKMSSSD 791

BLAST of Bhi06G000202 vs. ExPASy TrEMBL
Match: A0A1S4E2I4 (SART-1 family protein DOT2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498705 PE=3 SV=1)

HSP 1 Score: 1542.7 bits (3993), Expect = 0.0e+00
Identity = 854/958 (89.14%), Positives = 889/958 (92.80%), Query Frame = 0

Query: 18   FPMDGERSSAPDERNGPDIAWARERGEGGHDDFGYSGGEKSSKHRSEDHRKSSRGEEKDH 77
            F MD ERSSAPDERNG              DD GYSG EKSSKHRSEDHRKSSRGEEKDH
Sbjct: 69   FQMDWERSSAPDERNG--------------DDLGYSGAEKSSKHRSEDHRKSSRGEEKDH 128

Query: 78   RSKDRERSKRSSDDASKEKEKEAKDSERDRVRSREKRKEDRDEHEKERSRGSKVKDKDYD 137
            RSKDRERSKRSSDDASKEKEKE KDSERDRVRSREKRKEDRDEHEKER RGSKVKDKDYD
Sbjct: 129  RSKDRERSKRSSDDASKEKEKEVKDSERDRVRSREKRKEDRDEHEKERGRGSKVKDKDYD 188

Query: 138  REIYKEKEYERERDRKDRGKDRERERERELEKDN-------------------------- 197
            REIYK+KEYERERDRKDRGKDRERERERELEKDN                          
Sbjct: 189  REIYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDRDKDRDRDRDR 248

Query: 198  -----DKDRSNENEREKGREKHRDQEDKESYRNIDKDRGKERILEDDRKTDQSKEKSQDK 257
                 DKDRSNE EREKGREKHRDQEDKESYRN+DK+RGKERILEDDRKTDQ+K+K QDK
Sbjct: 249  KKKDKDKDRSNEIEREKGREKHRDQEDKESYRNVDKERGKERILEDDRKTDQTKQKLQDK 308

Query: 258  EGIGSKIDEERIGWIADEGKDYMVESDGDNNRNRDVDQGNMVQDLGGEENFDGLKVGAHA 317
            EGIGSK DEER GWIADEGKDYM+ESDG+NNR+RDV+QGNMVQ LGGEENFDGLKVG+H 
Sbjct: 309  EGIGSKNDEERTGWIADEGKDYMLESDGENNRDRDVNQGNMVQHLGGEENFDGLKVGSHP 368

Query: 318  SSTMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNI 377
            SSTMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNI
Sbjct: 369  SSTMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNI 428

Query: 378  DQGVSDDDIAPEDTTNNHNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDINEDVDVL 437
            DQ VSDDDIAPE+TTNNH+L GVKVLHG+DKVLEGGAVVLTLKDQ+ILADGD+NE++D+L
Sbjct: 429  DQDVSDDDIAPENTTNNHDLTGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDML 488

Query: 438  ENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGG 497
            ENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDGRGG
Sbjct: 489  ENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENDGEKKMLPQYDDPAEADEGLTLDGRGG 548

Query: 498  FNNDAEKKLEELRRRLQGSNSVMHFEDLNASTKVSHDYYTQDEMLKFKKPRKKKSLRKKE 557
            FNNDAEKKLEELRRRLQG++SV HFEDLN STKVSHDYYTQDEMLKFKKPRKKKSLRKKE
Sbjct: 549  FNNDAEKKLEELRRRLQGTSSVKHFEDLNVSTKVSHDYYTQDEMLKFKKPRKKKSLRKKE 608

Query: 558  KLDLDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASR 617
            KLD+DALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASR
Sbjct: 609  KLDIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLNAYQSAYAKADEASR 668

Query: 618  SLKLVQTSSVRLEDNDDTLIADDDEDFYKSLDRARKLALKRQEAASGPGAIALLATATTS 677
            SL+LVQTSS RLEDNDD LIADDDEDFYKSL+RARKLALK+Q+AASGPGAIALLATATTS
Sbjct: 669  SLQLVQTSSTRLEDNDDALIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTS 728

Query: 678  SQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEE 737
            SQ TDDQNTKAGELQENKV+FTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHE+
Sbjct: 729  SQATDDQNTKAGELQENKVIFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHED 788

Query: 738  IKDKDGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALKLLKERGTLKESIE 797
            +KDKDGGWTEVKDTAKEE+ P+EN+A+APDETIHEVPVGKGLSSALKLLK+RGTLKESIE
Sbjct: 789  VKDKDGGWTEVKDTAKEESIPDENKAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIE 848

Query: 798  WGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESF 857
            WGGRNMDKRKSKLVGIVDEDEPKESKSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESF
Sbjct: 849  WGGRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESF 908

Query: 858  RQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL 917
            RQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
Sbjct: 909  RQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL 968

Query: 918  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV 945
            SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Sbjct: 969  SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV 1012
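
The percentages on each HSP summary line follow from the match counts divided by the alignment length. The following is a minimal sketch of that arithmetic, assuming Python and the summary-line layout used in this report; the function name `check_hsp_percentages` is hypothetical and not part of any BLAST tool.

```python
import re

# Matches count fields such as "Identity = 854/958 (89.14%)".
# "Query Frame = 0" has no "hits/length" part, so it is skipped.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")

def check_hsp_percentages(summary):
    """Return {label: (recomputed_percent, reported_percent)} per count field."""
    results = {}
    for label, hits, length, reported in FIELD.findall(summary):
        recomputed = round(100 * int(hits) / int(length), 2)
        results[label] = (recomputed, float(reported))
    return results

# Summary line for the A0A1S4E2I4 hit, copied from this report.
line = "Identity = 854/958 (89.14%), Positives = 889/958 (92.80%), Query Frame = 0"
for label, (recomputed, reported) in check_hsp_percentages(line).items():
    print(label, recomputed, reported)
```

For this hit the recomputed values agree with the reported ones (89.14 and 92.8). The regular expression accepts any single-word label, so it does not depend on how a field name is spelled.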

BLAST of Bhi06G000202 vs. ExPASy TrEMBL
Match: A0A1S3CAS5 (SART-1 family protein DOT2 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498705 PE=3 SV=1)

HSP 1 Score: 1540.4 bits (3987), Expect = 0.0e+00
Identity = 853/956 (89.23%), Positives = 888/956 (92.89%), Query Frame = 0

Query: 20  MDGERSSAPDERNGPDIAWARERGEGGHDDFGYSGGEKSSKHRSEDHRKSSRGEEKDHRS 79
           MD ERSSAPDERNG              DD GYSG EKSSKHRSEDHRKSSRGEEKDHRS
Sbjct: 1   MDWERSSAPDERNG--------------DDLGYSGAEKSSKHRSEDHRKSSRGEEKDHRS 60

Query: 80  KDRERSKRSSDDASKEKEKEAKDSERDRVRSREKRKEDRDEHEKERSRGSKVKDKDYDRE 139
           KDRERSKRSSDDASKEKEKE KDSERDRVRSREKRKEDRDEHEKER RGSKVKDKDYDRE
Sbjct: 61  KDRERSKRSSDDASKEKEKEVKDSERDRVRSREKRKEDRDEHEKERGRGSKVKDKDYDRE 120

Query: 140 IYKEKEYERERDRKDRGKDRERERERELEKDN---------------------------- 199
           IYK+KEYERERDRKDRGKDRERERERELEKDN                            
Sbjct: 121 IYKDKEYERERDRKDRGKDRERERERELEKDNVRGHDKERGKEKDRDRDKDRDRDRDRKK 180

Query: 200 ---DKDRSNENEREKGREKHRDQEDKESYRNIDKDRGKERILEDDRKTDQSKEKSQDKEG 259
              DKDRSNE EREKGREKHRDQEDKESYRN+DK+RGKERILEDDRKTDQ+K+K QDKEG
Sbjct: 181 KDKDKDRSNEIEREKGREKHRDQEDKESYRNVDKERGKERILEDDRKTDQTKQKLQDKEG 240

Query: 260 IGSKIDEERIGWIADEGKDYMVESDGDNNRNRDVDQGNMVQDLGGEENFDGLKVGAHASS 319
           IGSK DEER GWIADEGKDYM+ESDG+NNR+RDV+QGNMVQ LGGEENFDGLKVG+H SS
Sbjct: 241 IGSKNDEERTGWIADEGKDYMLESDGENNRDRDVNQGNMVQHLGGEENFDGLKVGSHPSS 300

Query: 320 TMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ 379
           TMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ
Sbjct: 301 TMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQ 360

Query: 380 GVSDDDIAPEDTTNNHNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDINEDVDVLEN 439
            VSDDDIAPE+TTNNH+L GVKVLHG+DKVLEGGAVVLTLKDQ+ILADGD+NE++D+LEN
Sbjct: 361 DVSDDDIAPENTTNNHDLTGVKVLHGVDKVLEGGAVVLTLKDQSILADGDVNEELDMLEN 420

Query: 440 VEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFN 499
           VEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPA ADEGLTLDGRGGFN
Sbjct: 421 VEIGEQKQRDMAYKAAKKKTGIYDDKFNDENDGEKKMLPQYDDPAEADEGLTLDGRGGFN 480

Query: 500 NDAEKKLEELRRRLQGSNSVMHFEDLNASTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKL 559
           NDAEKKLEELRRRLQG++SV HFEDLN STKVSHDYYTQDEMLKFKKPRKKKSLRKKEKL
Sbjct: 481 NDAEKKLEELRRRLQGTSSVKHFEDLNVSTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKL 540

Query: 560 DLDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSL 619
           D+DALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASRSL
Sbjct: 541 DIDALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLNAYQSAYAKADEASRSL 600

Query: 620 KLVQTSSVRLEDNDDTLIADDDEDFYKSLDRARKLALKRQEAASGPGAIALLATATTSSQ 679
           +LVQTSS RLEDNDD LIADDDEDFYKSL+RARKLALK+Q+AASGPGAIALLATATTSSQ
Sbjct: 601 QLVQTSSTRLEDNDDALIADDDEDFYKSLERARKLALKKQDAASGPGAIALLATATTSSQ 660

Query: 680 TTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEEIK 739
            TDDQNTKAGELQENKV+FTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHE++K
Sbjct: 661 ATDDQNTKAGELQENKVIFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEDVK 720

Query: 740 DKDGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALKLLKERGTLKESIEWG 799
           DKDGGWTEVKDTAKEE+ P+EN+A+APDETIHEVPVGKGLSSALKLLK+RGTLKESIEWG
Sbjct: 721 DKDGGWTEVKDTAKEESIPDENKAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWG 780

Query: 800 GRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQ 859
           GRNMDKRKSKLVGIVDEDEPKESKSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQ
Sbjct: 781 GRNMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQ 840

Query: 860 LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG 919
           LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG
Sbjct: 841 LSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSG 900

Query: 920 HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV 945
           HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Sbjct: 901 HVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV 942

BLAST of Bhi06G000202 vs. ExPASy TrEMBL
Match: A0A0A0KXY6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G650610 PE=3 SV=1)

HSP 1 Score: 1532.3 bits (3966), Expect = 0.0e+00
Identity = 851/954 (89.20%), Positives = 890/954 (93.29%), Query Frame = 0

Query: 20  MDGERSSAPDERNGPDIAWARERGEGGHDDFGYSGGEKSSKHRSEDHRKSSRGEEKDHRS 79
           MD ERSSAPDER+G              DDFGYSG EKSSKHRSEDHRKSSRGEEKDHRS
Sbjct: 1   MDWERSSAPDERSG--------------DDFGYSGAEKSSKHRSEDHRKSSRGEEKDHRS 60

Query: 80  KDRERSKRSSDDASKEKEKEAKDSERDRVRSREKRKEDRDEHEKERSRGSKVKDKDYDRE 139
           KDRERSKRSSDDASKEKEKEAKDSERDR+RSREKRKEDRDEHEKERSRG KVKDKDYDR+
Sbjct: 61  KDRERSKRSSDDASKEKEKEAKDSERDRIRSREKRKEDRDEHEKERSRG-KVKDKDYDRD 120

Query: 140 IYKEKEYERERDRKDRGKDRERERERELE-----------------------------KD 199
           IYK+KEYERERDRKDRGKDRERERERELE                             KD
Sbjct: 121 IYKDKEYERERDRKDRGKDRERERERELEKDTVRGHDKERGKEKDRDRDKDRDRDRKKKD 180

Query: 200 NDKDRSNENEREKGREKHRDQEDKESYRNIDKDRGKERILEDDRKTDQSKEKSQDKEGIG 259
            DKDRSNE EREKGR+KHRDQEDKESYRNIDKDRGKERILEDDRKTDQ+K+K QDKEGIG
Sbjct: 181 KDKDRSNEIEREKGRDKHRDQEDKESYRNIDKDRGKERILEDDRKTDQNKQKLQDKEGIG 240

Query: 260 SKIDEERIGWIADEGKDYMVESDGDNNRNRDVDQGNMVQDLGGEENFDGLKVGAHASSTM 319
           SK DEERIG I DEGKDYM+ESDG+NNR+RDV+QGNMVQ LG EENFDGLKVG+HASSTM
Sbjct: 241 SKNDEERIGRIGDEGKDYMLESDGENNRDRDVNQGNMVQHLGVEENFDGLKVGSHASSTM 300

Query: 320 LEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGV 379
           LEERIRNMKEDRLKKQTEESEVL+WVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGV
Sbjct: 301 LEERIRNMKEDRLKKQTEESEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGV 360

Query: 380 SDDDIAPEDTTNNHNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDINEDVDVLENVE 439
           SDDDIAPEDTTNNH+LAGVKVLHG+DKVLEGGAVVLTLKDQ+ILADG++NE++DVLENVE
Sbjct: 361 SDDDIAPEDTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGNVNEELDVLENVE 420

Query: 440 IGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGGFNND 499
           IGEQKQRD+AYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPA ADEGLTLDGRGGFNND
Sbjct: 421 IGEQKQRDIAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNND 480

Query: 500 AEKKLEELRRRLQGSNSVMHFEDLNASTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKLDL 559
           AEKKLEELRRRLQG++SV HFEDLN STKVSHDYYTQDEMLKFKKPRKKKSLRKKEKLD+
Sbjct: 481 AEKKLEELRRRLQGASSVKHFEDLNVSTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKLDI 540

Query: 560 DALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASRSLKL 619
           DALEAEAISAGLGVGDLGSRN SRRQA+KEEQEKSEAEMR NAYQSAYAKADEASRSL+L
Sbjct: 541 DALEAEAISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLNAYQSAYAKADEASRSLQL 600

Query: 620 VQTSSVRLEDNDDTLIADDDEDFYKSLDRARKLALKRQEAASGPGAIALLATATTSSQTT 679
           VQ SS RLEDNDD LIADDDEDFYKSL+RARKLALK+Q+AASGPGA+ALLATATTSSQ T
Sbjct: 601 VQNSSARLEDNDDALIADDDEDFYKSLERARKLALKKQDAASGPGAVALLATATTSSQAT 660

Query: 680 DDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEEIKDK 739
           DDQ+TKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEE+DVFMDDDE+PKEEYHE++KDK
Sbjct: 661 DDQSTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEDDVFMDDDEIPKEEYHEDVKDK 720

Query: 740 DGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALKLLKERGTLKESIEWGGR 799
           DGGWTEVKDTA EE+TPEENEA+APDETIHEVPVGKGLSSALKLLK+RGTLKESIEWGGR
Sbjct: 721 DGGWTEVKDTAMEESTPEENEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGR 780

Query: 800 NMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLS 859
           NMDKRKSKLVGIVDEDEPKESKSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLS
Sbjct: 781 NMDKRKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLS 840

Query: 860 HKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV 919
           HKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV
Sbjct: 841 HKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHV 900

Query: 920 KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV 945
           KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV
Sbjct: 901 KPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV 939

BLAST of Bhi06G000202 vs. ExPASy TrEMBL
Match: A0A6J1FR42 (SART-1 family protein DOT2 OS=Cucurbita moschata OX=3662 GN=LOC111447438 PE=3 SV=1)

HSP 1 Score: 1431.4 bits (3704), Expect = 0.0e+00
Identity = 809/962 (84.10%), Positives = 859/962 (89.29%), Query Frame = 0

Query: 20  MDGERSSAP--DERNGPDIAWARERGEGGHDDFGYSGGEKSSKHRSEDHRKSSRGEEKDH 79
           MD + SS P  DERNG +   AR+RGE G DDFG SG EKSSKHRSEDHRKSSRGEEKDH
Sbjct: 1   MDADGSSGPEHDERNGHE---ARDRGE-GQDDFGCSGAEKSSKHRSEDHRKSSRGEEKDH 60

Query: 80  RSKDRERSKRSSDDASKEKEKEAKDSERDRVRSREKRKEDRDEHEKERSRGSKVKDKDYD 139
           RSKDR+RSKR SDDASKEKEKE KDSERDRV  RE+RKEDRDEH+KER+R  KVKDKDYD
Sbjct: 61  RSKDRDRSKRRSDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKDYD 120

Query: 140 REIYKEKEYERERDRKDRGKDRERERERELEKDN-------------------------- 199
           RE+YKEKEYERERDRKDRGKD+ER RERELEKDN                          
Sbjct: 121 REVYKEKEYERERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERERER 180

Query: 200 ---------DKDRSNENEREKGREKHRDQEDKESYRNIDKDRGKERILEDDRKTDQSKEK 259
                    DKDRSNENEREKGREK RDQE+KESYRNIDKDRGKE+ L DD+K DQ+KEK
Sbjct: 181 DRDRKKKEKDKDRSNENEREKGREKRRDQEEKESYRNIDKDRGKEKNLVDDKKGDQNKEK 240

Query: 260 SQDKEGIGSKIDEERIGWIADEGKDYMVESDGDNNRNRDVDQGNMVQDLGGEENFDGLKV 319
            +DKEG G K +EERI WIA   KDYM+ESDG++NR+R VDQGN VQ LGGEEN DGLKV
Sbjct: 241 LRDKEGTGGKNEEERIDWIAHGAKDYMLESDGEDNRDRGVDQGNAVQQLGGEENSDGLKV 300

Query: 320 GAHASSTMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEE 379
           GA +SS MLEERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEE
Sbjct: 301 GAQSSSAMLEERIRTMKEDRLKKQTEESEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEE 360

Query: 380 QDNIDQGVSDDDIAPEDTTNNHNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDINED 439
           QDNIDQG SDDDIA ED T+  NLAGVKVLHGIDKVL GGAVVLTLKDQNILADGD+NED
Sbjct: 361 QDNIDQGASDDDIAAEDITS--NLAGVKVLHGIDKVLGGGAVVLTLKDQNILADGDVNED 420

Query: 440 VDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLD 499
           +DVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPAAADEGLTLD
Sbjct: 421 MDVLENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENDGEKKMLPQYDDPAAADEGLTLD 480

Query: 500 GRGGFNNDAEKKLEELRRRLQGSNSVMHFEDLNASTKVSHDYYTQDEMLKFKKPRKKKSL 559
           G G F+NDAEKKLEELR+RLQG++SV HFEDLNAS KVSHDYYTQDEML+FKKP+KKKSL
Sbjct: 481 GTGRFSNDAEKKLEELRKRLQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSL 540

Query: 560 RKKEKLDLDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKAD 619
           RKKEKLD+DALEAEAIS+GLGVGDLGSRN S RQA+K EQE+SEAEMR NAYQSAYAKAD
Sbjct: 541 RKKEKLDIDALEAEAISSGLGVGDLGSRNDSSRQARKTEQERSEAEMRQNAYQSAYAKAD 600

Query: 620 EASRSLKLVQTSSVRLEDNDDTLIADDDEDFYKSLDRARKLALKRQEAASGPGAIALLAT 679
           EASRSL+LVQ SSVRL+DN+DTLI DDDED YKSL+RARKLALK+QEAASGP A+ALLAT
Sbjct: 601 EASRSLQLVQ-SSVRLDDNEDTLIEDDDEDLYKSLERARKLALKKQEAASGPEAVALLAT 660

Query: 680 ATTSSQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEE 739
            TTS QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDE++HKPEEEDVFMDDDE PKEE
Sbjct: 661 TTTSGQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEE 720

Query: 740 YHEEIKDKDGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALKLLKERGTLK 799
           YHE+ KDKDGGWTEVKDTAKEE TPE+NE IAPDETIHEVPVGKGLSS LKLLK+RGTLK
Sbjct: 721 YHEDEKDKDGGWTEVKDTAKEEPTPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLK 780

Query: 800 ESIEWGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTP 859
           ESIEWGGRNMDKRKSKLVGI+DEDEPKE+KSK+SRLSSLVDYKKEIHIERTDEFGRIMTP
Sbjct: 781 ESIEWGGRNMDKRKSKLVGIIDEDEPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRIMTP 840

Query: 860 KESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTP 919
           KESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTP
Sbjct: 841 KESFRQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTP 900

Query: 920 YLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKA 945
           YLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+ SNTGTKK 
Sbjct: 901 YLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGDPSNTGTKKP 955

BLAST of Bhi06G000202 vs. ExPASy TrEMBL
Match: A0A6J1IPE4 (SART-1 family protein DOT2 OS=Cucurbita maxima OX=3661 GN=LOC111478749 PE=3 SV=1)

HSP 1 Score: 1413.7 bits (3658), Expect = 0.0e+00
Identity = 802/958 (83.72%), Positives = 853/958 (89.04%), Query Frame = 0

Query: 20  MDGERSSAP--DERNGPDIAWARERGEGGHDDFGYSGGEKSSKHRSEDHRKSSRGEEKDH 79
           MD + SS P  DERNG +   AR+RGE G DDFGYSG EKSSKHRSEDHRKSSRGEEKDH
Sbjct: 1   MDADGSSVPEHDERNGHE---ARDRGE-GQDDFGYSGAEKSSKHRSEDHRKSSRGEEKDH 60

Query: 80  RSKDRERSKRSSDDASKEKEKEAKDSERDRVRSREKRKEDRDEHEKERSRGSKVKDKDYD 139
           RSKDR+RSKR SDDASKEKEKE KDSERDRV  RE+RKEDRDEH+KER+R  KVKDKDYD
Sbjct: 61  RSKDRDRSKRRSDDASKEKEKEVKDSERDRVHIRERRKEDRDEHDKERTREKKVKDKDYD 120

Query: 140 REIYKEKEYERERDRKDRGKDRERERERELEKDN-------------------------- 199
           RE+YKEKEY+RERDRKDRGKD+ER RERELEKDN                          
Sbjct: 121 REVYKEKEYDRERDRKDRGKDKERGRERELEKDNVRGQDKERGKEKDRDRERERERDRDR 180

Query: 200 -----DKDRSNENEREKGREKHRDQEDKESYRNIDKDRGKERILEDDRKTDQSKEKSQDK 259
                DKDRSNENEREKGREK RDQE+KESYRNIDKDRGKE+ L DD+K DQ+KEK +DK
Sbjct: 181 KKKEKDKDRSNENEREKGREKRRDQEEKESYRNIDKDRGKEKNLVDDKKGDQNKEKLRDK 240

Query: 260 EGIGSKIDEERIGWIADEGKDYMVESDGDNNRNRDVDQGNMVQDLGGEENFDGLKVGAHA 319
           EGIG K DEERI W+A        ESDG++NR+R VDQGN VQ LGGE+N DGLKVGA +
Sbjct: 241 EGIGGKNDEERIDWLAHG------ESDGEDNRDRGVDQGNAVQHLGGEDNSDGLKVGAQS 300

Query: 320 SSTMLEERIRNMKEDRLKKQTEESEVLAWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNI 379
           SS MLEERIR MKEDRLKKQTEESEVL WVKRSRKLEEKKL+EKEKALQLSKIFEEQDNI
Sbjct: 301 SSAMLEERIRTMKEDRLKKQTEESEVLDWVKRSRKLEEKKLTEKEKALQLSKIFEEQDNI 360

Query: 380 DQGVSDDDIAPEDTTNNHNLAGVKVLHGIDKVLEGGAVVLTLKDQNILADGDINEDVDVL 439
           DQG SDDDIA ED T+  NLAGVKVLHGIDKVL GGAVVLTLKDQNILADGD+NED+DVL
Sbjct: 361 DQGASDDDIAAEDITS--NLAGVKVLHGIDKVLGGGAVVLTLKDQNILADGDVNEDMDVL 420

Query: 440 ENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPAAADEGLTLDGRGG 499
           ENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDEN GEKKMLPQYDDPAAADEGLTLDG G 
Sbjct: 421 ENVEIGEQKQRDMAYKAAKKKTGIYDDKFNDENDGEKKMLPQYDDPAAADEGLTLDGTGR 480

Query: 500 FNNDAEKKLEELRRRLQGSNSVMHFEDLNASTKVSHDYYTQDEMLKFKKPRKKKSLRKKE 559
           F+NDAEKKLEELR+RLQG++SV HFEDLNAS KVSHDYYTQDEML+FKKP+KKKSLRKKE
Sbjct: 481 FSNDAEKKLEELRKRLQGASSVKHFEDLNASVKVSHDYYTQDEMLRFKKPKKKKSLRKKE 540

Query: 560 KLDLDALEAEAISAGLGVGDLGSRNGSRRQAQKEEQEKSEAEMRHNAYQSAYAKADEASR 619
           KLD+DALEAEAIS+GLGVGDLGSRN S RQA+K EQE+SEAEMR NAYQSAYAKADEASR
Sbjct: 541 KLDIDALEAEAISSGLGVGDLGSRNDSSRQARKTEQERSEAEMRQNAYQSAYAKADEASR 600

Query: 620 SLKLVQTSSVRLEDNDDTLIADDDEDFYKSLDRARKLALKRQEAASGPGAIALLATATTS 679
           SL+LVQ SSVRL+ N+DTLI DDDED YKSL+RARKLALK+QEAASGP A+ALLAT TTS
Sbjct: 601 SLQLVQ-SSVRLDGNEDTLIEDDDEDLYKSLERARKLALKKQEAASGPEAVALLATTTTS 660

Query: 680 SQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEEDVFMDDDEVPKEEYHEE 739
            QTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDE++HKPEEEDVFMDDDE PKEEYHE+
Sbjct: 661 GQTTDDQNTKAGELQENKVVFTEMEEFVWGLQLDEESHKPEEEDVFMDDDEAPKEEYHED 720

Query: 740 IKDKDGGWTEVKDTAKEETTPEENEAIAPDETIHEVPVGKGLSSALKLLKERGTLKESIE 799
            KDKDGGWTEVKDTAKEE  PE+NE IAPDETIHEVPVGKGLSS LKLLK+RGTLKESIE
Sbjct: 721 EKDKDGGWTEVKDTAKEEPAPEDNETIAPDETIHEVPVGKGLSSVLKLLKDRGTLKESIE 780

Query: 800 WGGRNMDKRKSKLVGIVDEDEPKESKSKESRLSSLVDYKKEIHIERTDEFGRIMTPKESF 859
           WGGRNMDKRKSKLVGI+DEDEPKE+KSK+SRLSSLVDYKKEIHIERTDEFGRIMTPKESF
Sbjct: 781 WGGRNMDKRKSKLVGIIDEDEPKEAKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESF 840

Query: 860 RQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL 919
           RQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL
Sbjct: 841 RQLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVL 900

Query: 920 SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTGTKKAKV 945
           SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKG+  NTGTKK K+
Sbjct: 901 SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGDPLNTGTKKPKI 945

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
AT5G16780.1 | 5.1e-227 | 57.04 | SART-1 family
AT3G14700.1 | 1.4e-19 | 40.88 | SART-1 family
Match Name | E-value | Identity | Description
Q9LFE0 | 7.1e-226 | 57.04 | SART-1 family protein DOT2 OS=Arabidopsis thaliana OX=3702 GN=DOT2 PE=1 SV=1
Q9Z315 | 8.6e-22 | 27.18 | U4/U6.U5 tri-snRNP-associated protein 1 OS=Mus musculus OX=10090 GN=Sart1 PE=1 S...
Q5XIW8 | 2.5e-21 | 26.97 | U4/U6.U5 tri-snRNP-associated protein 1 OS=Rattus norvegicus OX=10116 GN=Sart1 P...
O43290 | 1.1e-19 | 25.99 | U4/U6.U5 tri-snRNP-associated protein 1 OS=Homo sapiens OX=9606 GN=SART1 PE=1 SV...
Match Name | E-value | Identity | Description
A0A1S4E2I4 | 0.0e+00 | 89.14 | SART-1 family protein DOT2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498705 PE...
A0A1S3CAS5 | 0.0e+00 | 89.23 | SART-1 family protein DOT2 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498705 PE...
A0A0A0KXY6 | 0.0e+00 | 89.20 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G650610 PE=3 SV=1
A0A6J1FR42 | 0.0e+00 | 84.10 | SART-1 family protein DOT2 OS=Cucurbita moschata OX=3662 GN=LOC111447438 PE=3 SV...
A0A6J1IPE4 | 0.0e+00 | 83.72 | SART-1 family protein DOT2 OS=Cucurbita maxima OX=3661 GN=LOC111478749 PE=3 SV=1
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 157..177
None | No IPR available | COILS | Coil | Coil | coord: 550..577
None | No IPR available | COILS | Coil | Coil | coord: 93..113
None | No IPR available | COILS | Coil | Coil | coord: 468..488
None | No IPR available | COILS | Coil | Coil | coord: 840..860
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..280
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 684..736
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 553..569
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 888..917
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 23..260
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 819..847
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 778..798
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 697..736
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 544..569
None | No IPR available | PANTHER | PTHR14152:SF5 | U4/U6.U5 TRI-SNRNP-ASSOCIATED PROTEIN 1 | coord: 285..907
IPR005011 | SNU66/SART1 family | PFAM | PF03343 | SART-1 | coord: 296..853; e-value: 4.6E-69; score: 233.6
IPR005011 | SNU66/SART1 family | PANTHER | PTHR14152 | SQUAMOUS CELL CARCINOMA ANTIGEN RECOGNISED BY CYTOTOXIC T LYMPHOCYTES | coord: 285..907
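
Several of the mobidb-lite coordinate ranges listed above overlap (for example 1..280 and 23..260). The following is a minimal sketch, assuming Python, of how the ranges could be collapsed into non-redundant regions. The helper `merge_intervals` is hypothetical, and treating directly adjacent ranges as one region is a design choice made here, not something InterPro prescribes.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]

# mobidb-lite coordinates copied from the InterPro annotations of this feature.
disorder = [(1, 280), (684, 736), (553, 569), (888, 917), (23, 260),
            (819, 847), (778, 798), (697, 736), (544, 569)]

regions = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in regions)
print(regions)
print(covered)
```

With the coordinates above, this gives six merged regions (1..280, 544..569, 684..736, 778..798, 819..847, 888..917) covering 439 residues in total.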

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Bhi06M000202 | Bhi06M000202 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0010588 | cotyledon vascular tissue pattern formation
biological_process | GO:0009908 | flower development
biological_process | GO:0010305 | leaf vascular tissue pattern formation
biological_process | GO:0000481 | maturation of 5S rRNA
biological_process | GO:0009933 | meristem structural organization
biological_process | GO:0045292 | mRNA cis splicing, via spliceosome
biological_process | GO:0010087 | phloem or xylem histogenesis
biological_process | GO:0048528 | post-embryonic root development
biological_process | GO:0000398 | mRNA splicing, via spliceosome
cellular_component | GO:0046540 | U4/U6 x U5 tri-snRNP complex