HG10000172 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10000172
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DNA repair protein RAD16
Location: Chr09: 1760884 .. 1768082 (-)
RNA-Seq Expression: HG10000172
Synteny: HG10000172
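
The Location field packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be parsed is given here; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr09: 1760884 .. 1768082 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Chr09: 1760884 .. 1768082 (-)")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

The `(-)` strand means the gene is read from the reverse strand, so the sequences listed on this page would be the reverse complement of the reference chromosome over that interval.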
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGCTTCGTCCTCGTAAAACTACTTCCAACGTATTGATCGAAGGTACGGCTTCGATTTTGTGCGTTTTGTTCATGTGTCGCATGGTGTGTATTTCTACAGAATTGTGTTTCTTGAAAACCTAATGTTTAATTTACTAGAGAACAAGGATGTATTGCTGTGATTTTAAATCCATTTCGGCATTGTAGGATTTAAATTTATGTTTGGCGTGTTTTTGAATTAAGATTGCTCGAGTTATTTCATACGGTTCTGGCGGAGGAAATTCTTTTGAGATGTTGTGGATTTCTTGGAGTGGAATAAGCGATTACGGCGAAACGAAGATGTATGGCTATGATTTTAATTCGATTGCGGCATTGTAGAATCTAAGATTGTGTTGAGCGTGTTTTTGAAAAAAAAAAAAACGTTTTGATTTTTTTTTGATACTGTTGTGCTGGAGGAAGTTTTATTGAGATGTCATTAGTCTTTCGAACTTGAAATACACAATCTCTGCACATTCTGTGTTCTCTGTTTCCTCTTTTTTCCACTGTTTTAGGGGGAAAGATCGATGAATTGTGAAATAGCTGTGTGATTACATTGTTTTGTTATTGAAGCAGGAAACGGAGACGGAGACGGAGATGCCTTTGATGATATAGAGGTGTCATCTCTTTTTTCTGACAGTGGAAGTGAAGGTATCGTGGTTTTTTTTTCTTTTGCGCTGTAGATATTTTAATTATGGCAGTCCTGGTTAGAAAATATGTCGCAAGCTGGATCAGTTCTGGTGTCAACTATTTCTTTATAGGTTTCTTTTTCTTTCTTGTTTCATTGGTGTACGTGCATGTTCGTTCGGTTAAGCGAACTTTTCTTTCGAACAATCTCTTCAACAATCTACTTTTCTTCACATTAGCTGTGCCTGCAATGCCTTCCTTCGGTTTTACTCATTTACCTGACATGTTATAATCTCATGAACAAAGAGTATAACTGGGAACTATGTTAAACCATGAGTTGACAGTTTTAACTGGGCAGTTCTTTCCTCGAGTTCTGAGGACTCCAGTGAGCCTTCAACAAAGAAGTCTAGAGCAAAGACACAGAGAAAACGTATTAAAAAGGAGGGGCCTAGCATTGAGCAGGAAGTCGGAAGCAATGTAGGTAATGATGAAAACCTACACAATCAGAAACCAGAAATTGCCGACTACCAGGGTGTGGATGATATAGAGAAGCCAAAGACCAAATACTCAAGAAAGAAGAAGCCAAAACCTACCCTTTTGTGGAATGTCTGGGAGGAAGAATATGAGAGGTGGATCGATGAAAATATTGATAAAGATTTTGATTTGGCTAGTCAAAATGAAGTATTGACTGAAGCTGTTGAAACACCCTCTGCACTTACGATGCCCCTACTACGGTACCAGAAAGAGTGGCTAGCTTGGGCACTGAAGCAGGAAGATTCTTCAATTAAAGGAGGGATACTTGCAGATGAAATGGGAATGGGAAAAACCATCCAAGCTATTGCCCTGGTACTTGCTAAACGTCAACTATCTGGAACTGCTGGACTGAAGAGACCCTCACCATATCCAAGTTCTTCCAAGGACTTGCCTTTGATAAAAGCAACACTTGTGATATGTCCCGTGGTTGCTGTGAGCCAGTGGGTCAGTGAGATTGATCGTTTCACATCAAAAGGAAGTTACAAGGTGCTTGTGTATCATGGTCCAAAACGAGTACGGAATCTTGAGATTTTATCAGAATATGATTTTGTTATTACCACATACTCTGTCGTTGAGGCTGATTACAGGAAACATCTGATGCCTCCCAAGGATAGGTGCCCTTACTGTAGTAAACTATTTTATAAGAAGAATTTGAAGATTCACTTGAGGTACATTTGTGGGCCTGATGCTGTCAAAACAGAGAAGCAGGCTAAGCAACAAAGAAAAAGGCCTATACAGCCACAAATATCTAAAAAGGAAGAATCTGTTAATGATAAGAACAATACTGTTCACAAGAGTGGCAGCCAGAAAAGTGCTCTTGG
ACAGACAATGGGGCAGCATGAGAATGATGAAAAACCTTGTGGGAAATCAATATTGCATTCTGTGATATGGGACCGTGTCATTTTGGATGAGGTGAGGTTCTAAAACCATCTGTTTTCTTTTTGTTTATATGTTTTTTTTTCCTTTCCTTTTTCCTTTTTTTTTTTGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGTTATGGTGCACAGATTTGTTTTGTCACTCCAAAATGGTTAAGAACAAGGGAAAAAGAAACCTACAGAAGTTGATTTCTATCAGTAGTTTGATTGGTTCTGGTTGCTGTTGAAATATAACAAAATTTTATGCAAACGTAGGCGCATTTCATAAAAGATAGGCTGTCTAATACTGCAAAAGCTGTTCTTGCGATTTCTTCTTCATTTAGATGGGCTTTAAGTGGCACGCCTATCCAGAATCGCGTAGGGGAGCTTTACTCTCTTGTAAGTTTTGCTTTGTTTAGAACATTGTCTACCACCGAGGAAAGTGGTCCTTCTCAAGTTAACATGTGTTTTTGGATACTCTCAGGTTCGCTTCCTGCAAATTGTCCCTTATTCTTTCTACTTTTGTAAGGACTGTGATTGTAGAACACTCGATCATAGGTACTAGTTAAGATTCAAGATTGTTCTGCATTGTAAGCTTCCTAGCTTTTGGACTCACTAATTTTCTTATATTAATCACGTGATGTTGGACCCCGACGGGCAATTACAGTTCTCTTACCTGTCCTAACTGCCCTCATAAACGTGTGCGGCATTTTTGCTGGTGGAACAAGGTAAGTTCTGCAATGTGTAGCATTTTTATATCTTTGTAACATTGTTAAACTATGTTTTTTGATTATTATGGTTTGATTTGGAAGAATATTACTGTACGGATTCAAAATTTTGGGAGAGGTCCAGAGTTTCAAAGAGGTATGATATTGCTGAAGCATAAGATTTTAAGTAGCATTGTACTCCGTCGCACCAAAAAGGGTAGAGCTGCCGATCTTGCACTTCCTCCAAGTATTGTAAGTGGACATACACATGGTTAAGTTTGTTAGTTGTTCTTATTGATGGAGAAACTAACTTTAGTTCCTATCCATCCTCTCATAGGTTTCAATTAGGCGAGATACCCTTGACATTCAAGAAGAAGACTTTTATGAATCATTGTATAATGATAGTCGAGCAAAATTTAATACGTGAGTGACCATCATTATCTACTGCCTTTTTTTTCTGTTCTTTATTAACCTGTTTGTTGAAGTATCTTTGATCCATATTAAAAGATCAAAGTCTCTTCAGTGCATATGATTGCAGCTGCATTTGATGCAAGTGATTAAAATGTGGAGACTAATTTGGAAAATGTTTTTTTACTATCTGAGGTCTTCTAGTTTTGGAATTTAATTTAATGATGCGAATATAGAAAATGGTTGCTTTTGTATTTGGAACTATTCTCCAAGTTTTTGCTATCAGTTTATTTTTTTACCTAGGCTAAATTTTAGGTGTTGAAGCTTGAGGCTTTGTCGAGTTCAAGACTTGGTAATTTGTGTCATTTGAATTGCATTCTTTTGTTTTGGTTTCATTTGAATTTTTTTCCGTTAGATAGCTATGGTGGAACTTACTTTGTGTGGGTGAATATGGTATTACTGTATATTATTTGGAAGGAAAATATTATTGTACTGTGAAAAAAAGATCAGGGGACAGGAAGTAACTCAACTGAGAATTCGTAGTGTTATTGTACTACTCTGAATGCATATTCAATGTTACCAATAAAAATGTTGATTATTCAGGAAGCTATTCTCTGAAGCAGCACTATAATTGAGAAAGAACTGAAGTGTGTATGAAAATTTCAGTTTAGAATTGGTTTTACCAAACACTCAAATTAGTTTTTATTTTTATTTTATTTTATTGAGGAAATCAGACATAAATTAGTTAAAATGATTGAAGGGGCAATTTTCAAGAAATTCAAAATAATTCCTAAATCCAATTTTCCTACTTGACACAGTCTGC
TAAAGTGAAAGAATTAGGGGAATGGACTGTTATTTAACCTTGATACTCAATTTTCCTACTTGACAAATGTCTGAAAAAATGCCATTGGTCTAATCCGCCATTTCATTGTCTGTGTGCTTTAGAGGTTATTTTCTAATCATTTTTTTTTTTGTTGTTGAAAATATTATGTATTCTTCCACTATTCCGCCAATGTGAAAATGCAAACATTGATGATAGATGCCTTGAGAGATTGTTTGATGTATTTTTTCTTTTAGTTTTGTGGCAGCTGGAACGGCGACAAGTAATTATGCACACATATTTGACCTTTTGATTCGCTTGAGACAGGTCTGTGGCTTCTGGTCAAGCTTAGAAATTTATATCGATTGCTAGTCTTTGGAGTTTGAGGAAGTACTAGAATTTAATTGAGCATTAGATTCACAAGTTTGAAATGTAAATTGCCCAACATCGTTTAGAAGTGGAATCAAGGACAACATGTCTACTGGAGATTTTTGGAGTAAAGTTATGTTAAAGAATAAAACCAAGCTAGCTTTAAGTTTCAACATGGACAATTTCATATAATGGAGATGTTAATCTTTACAGAACTTAAATAAAATGATTAAAGAACTTAGTACTTATCTTATGATAATTTCTTACTGCTCTCACTAGCAATTTAAGTGAATGTATTTTATCATGTTAACCTCAATTTAACAAGAACTTTATCCAATCGATTTGTTCCACAAAACATTCGAAAGTTGATCTGCTAATGATTTTACTTTACTTCATTTTCTCATAATAATTCCAGGCAGTTAATCATCCATATCTCGTGGTGTATTCTAAAACTAATGCCATAAGTTGTGGAAGCATTGGTGATTCTGATAATAATAACAAACAAGTATGTGGAATTTGTCATGAGCCAGCAGAAGAACCTGTGGTGAGTTTCTTTGGTGTGACAAGAATCTTTGGCGAATCCCCACTATCCCTCCCTTTCCTTTTATCTTTGTTTTGCTGTTTGAGAATTTAAATTTTCATTGAATGTATGAGGTTTCAAATGTTCATCTTTTGTGTGCTTTAGTGTGCTCAAGGGAGTTTGAGATCTCCCTCTTCCCTTCTAATGCTCAAAACAGCTACAAAACAATATGCCCAAAATATAAGAACTGACTCCTCTGGGCATAAGATAAAAACTGACTTCCCCCCTCCTCGGCCTCTATAATACTTTTCTCCAACTAACCGAATCCCATGGACTCCACACAATCATACACACCCACGTAAAACAACTTTCACTCAAGTATTATATCATTTTGCTCACCCCTTTTCTCCTTCTTATTCCTCCGAGTGTATACCTTAGTAACTGGGGGTCGAAAGGATGCTGCTGGAAGTGCACATCTTGTATTTGATCACTACCTTACTAAACTGAATCTGTAAGGTAGCAAATTACTTAGAATAAATCCCTCAATTATTCTTGGAATAACTAATGTGTTGAGATTGAATCTCTAACAGTTAATATTGAATGTGATGTCATTTAATATTTATGGTTAATAAGTATGTTAGAATTTAATACCATGATTATTGCTTCTGTTCACTTTATATATGATTTACAAGCCTCTTTGGTAATCTCCTATATTCTATCAGGTTACCTCTTGCGAGCACACATTTTGTAAGGCTTGCATAATTGATCATACCAATGATTTTTTGAAGAGTGTCGCATGTCCTTCTTGCTCAAAGATGCTCACCATTGACTTTCGCACAAGTCTGGCTGTTGGAGATCAAACCATTAAAAATACAATCAAGGGGTTTAAATCTTCAAGTATACTTAACAGAATACAGCTGGAGAATTTTCAGACGAGCACTAAAATAGAAGCTTTGGTACGTTTTATTCAAGACTACCCTGTTTGTATTTAGTCTTTGAATGCGAATGAAGAAATGAGCATGTTGTCATTTAGTATTTTATTTATTTGGTGACACTTTGACTTGGTAGCTGCTAGCTGTCTATTATACTTGAAAATTTGTACTTTGCTGTTTTC
TTTCAAATTGAGCAAGTATTTCTGGAGTTCTTTATCATTGCAAGAAAAACGTATCTCTGTTGATGAAAGAGTTTGTAGTACAAGACTCTGGAGTCTTGTCTTGACAAAGATCAATCCAGTTGAAATTATGTACTTGGTTTTGTGTGTTCTGTTATTTTTATCTTCACAAGGTTAGCTAGTACAAGACTCTGGAGTCTTGTTTTGATCTCGTGTCTTGATTTTTTTCCTACTCCAGAGAGAAGAAATTAGATTCATGTTTGAAAGAGATGGTTCTGCCAAAGGAATCGTTTTTAGCCAATTCACGTCATTTTTGGATCTCATGAACTATTCCCTAACCAAGGTGAGTAGAAGTAAGCATTTAGAGTCTTCTTCTGATTTCAAGGCAAATAAGTCACTTTATGTCATGCAACGATCCCTTTGCAGTCTGGTATTACCTGCGTTCAATTAGTTGGAAGCATGTCCTTGAGTCAAAGAGGAGATGCTATTAATAGATTCATTGAGGATCCAGATTGCAAGATTTTTCTAATGAGCTTAAAAGCTGGAGGGGTTGCCCTCAATCTCACTGTCGCATCGCATGTAAGCAGTTTCTTTCGATTGATTATGCAATCATGCACATGACCTCAATTATTATGCATATATGCTGTTATCACACCATCTATAAGGGTGTAGTGCGGTTCAGTTATTTGTCGTGCATAAATATGGTTTTGCTCATCAAATTAATCGTTTATTTTCGGTGATGAACACGCTTTATTTTCCGTCTTTGATCTTCACTTCAGGTCTTCATCATGGACCCTTGGTGGAATCCTGCTGTGGAAAGGCAAGCACAAGACAGAATCCATCGAATTGGGCAATATAAACCTATCAGGTGTAGTATTTTGATAAATATATGTGGCATTTTTCTATTGTCTCAAGACAGGACTAACACGTTTGACATTAACAGAATAACGAGATTCATTATTGAAAACTCTATCGAGGAGAGGATTTTGAAGCTGCAAGAGAGGAAAGAACTGGTATTTGAAGGGTACGTTAGAAAAGTGTAACATTATAAACCACTAACTAATCGACACATCGTTCCTTGTTGTGCCTGCCTTTTGATCTTACAAAAATGCTTTATTGCCTCTCTATGCAGAACTGTAGGTGGCTCTAATGAGGCATTGGGAAAATTATCCTTGGATGACATGAGATTTCTGTTTATTTGA

mRNA sequence

ATGAAGCTTCGTCCTCGTAAAACTACTTCCAACGTATTGATCGAAGGAAACGGAGACGGAGACGGAGATGCCTTTGATGATATAGAGGTGTCATCTCTTTTTTCTGACAGTGGAAGTGAAGTTCTTTCCTCGAGTTCTGAGGACTCCAGTGAGCCTTCAACAAAGAAGTCTAGAGCAAAGACACAGAGAAAACGTATTAAAAAGGAGGGGCCTAGCATTGAGCAGGAAGTCGGAAGCAATGTAGGTAATGATGAAAACCTACACAATCAGAAACCAGAAATTGCCGACTACCAGGGTGTGGATGATATAGAGAAGCCAAAGACCAAATACTCAAGAAAGAAGAAGCCAAAACCTACCCTTTTGTGGAATGTCTGGGAGGAAGAATATGAGAGGTGGATCGATGAAAATATTGATAAAGATTTTGATTTGGCTAGTCAAAATGAAGTATTGACTGAAGCTGTTGAAACACCCTCTGCACTTACGATGCCCCTACTACGGTACCAGAAAGAGTGGCTAGCTTGGGCACTGAAGCAGGAAGATTCTTCAATTAAAGGAGGGATACTTGCAGATGAAATGGGAATGGGAAAAACCATCCAAGCTATTGCCCTGGTACTTGCTAAACGTCAACTATCTGGAACTGCTGGACTGAAGAGACCCTCACCATATCCAAGTTCTTCCAAGGACTTGCCTTTGATAAAAGCAACACTTGTGATATGTCCCGTGGTTGCTGTGAGCCAGTGGGTCAGTGAGATTGATCGTTTCACATCAAAAGGAAGTTACAAGGTGCTTGTGTATCATGGTCCAAAACGAGTACGGAATCTTGAGATTTTATCAGAATATGATTTTGTTATTACCACATACTCTGTCGTTGAGGCTGATTACAGGAAACATCTGATGCCTCCCAAGGATAGGTGCCCTTACTGTAGTAAACTATTTTATAAGAAGAATTTGAAGATTCACTTGAGGTACATTTGTGGGCCTGATGCTGTCAAAACAGAGAAGCAGGCTAAGCAACAAAGAAAAAGGCCTATACAGCCACAAATATCTAAAAAGGAAGAATCTGTTAATGATAAGAACAATACTGTTCACAAGAGTGGCAGCCAGAAAAGTGCTCTTGGACAGACAATGGGGCAGCATGAGAATGATGAAAAACCTTGTGGGAAATCAATATTGCATTCTGTGATATGGGACCGTGTCATTTTGGATGAGGCGCATTTCATAAAAGATAGGCTGTCTAATACTGCAAAAGCTGTTCTTGCGATTTCTTCTTCATTTAGATGGGCTTTAAGTGGCACGCCTATCCAGAATCGCGTAGGGGAGCTTTACTCTCTTGTTCGCTTCCTGCAAATTGTCCCTTATTCTTTCTACTTTTGTAAGGACTGTGATTGTAGAACACTCGATCATAGTTCTCTTACCTGTCCTAACTGCCCTCATAAACGTGTGCGGCATTTTTGCTGGTGGAACAAGAATATTACTGTACGGATTCAAAATTTTGGGAGAGGTCCAGAGTTTCAAAGAGGTATGATATTGCTGAAGCATAAGATTTTAAGTAGCATTGTACTCCGTCGCACCAAAAAGGGTAGAGCTGCCGATCTTGCACTTCCTCCAAGTATTGTTTCAATTAGGCGAGATACCCTTGACATTCAAGAAGAAGACTTTTATGAATCATTGTATAATGATAGTCGAGCAAAATTTAATACTTTTGTGGCAGCTGGAACGGCGACAAGTAATTATGCACACATATTTGACCTTTTGATTCGCTTGAGACAGGCAGTTAATCATCCATATCTCGTGGTGTATTCTAAAACTAATGCCATAAGTTGTGGAAGCATTGGTGATTCTGATAATAATAACAAACAAGTATGTGGAATTTGTCATGAGCCAGCAGAAGAACCTGTGGTTACCTCTTGCGAGCACACATTTTGTAAGGCTTGCATAATTGATCATACCAATGATTTTTTGAAGAGTGTCGCATGTCCTTCTTGCTCAAAGATGCTCAC
CATTGACTTTCGCACAAGTCTGGCTGTTGGAGATCAAACCATTAAAAATACAATCAAGGGGTTTAAATCTTCAAGTATACTTAACAGAATACAGCTGGAGAATTTTCAGACGAGCACTAAAATAGAAGCTTTGAGAGAAGAAATTAGATTCATGTTTGAAAGAGATGGTTCTGCCAAAGGAATCGTTTTTAGCCAATTCACGTCATTTTTGGATCTCATGAACTATTCCCTAACCAAGTCTGGTATTACCTGCGTTCAATTAGTTGGAAGCATGTCCTTGAGTCAAAGAGGAGATGCTATTAATAGATTCATTGAGGATCCAGATTGCAAGATTTTTCTAATGAGCTTAAAAGCTGGAGGGGTTGCCCTCAATCTCACTGTCGCATCGCATGTCTTCATCATGGACCCTTGGTGGAATCCTGCTGTGGAAAGGCAAGCACAAGACAGAATCCATCGAATTGGGCAATATAAACCTATCAGGTGTAGTATTTTGATAAATATATGTGGCATTTTTCTATTGTCTCAAGACAGGACTAACACGTTTGACATTAACAGAATAACGAGATTCATTATTGAAAACTCTATCGAGGAGAGGATTTTGAAGCTGCAAGAGAGGAAAGAACTGGTATTTGAAGGAACTGTAGGTGGCTCTAATGAGGCATTGGGAAAATTATCCTTGGATGACATGAGATTTCTGTTTATTTGA
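
Comparing the start of the gene sequence with the start of the mRNA sequence shows where the first intron begins. The sketch below uses only the leading bases copied from the two sequences above; the helper name `common_prefix_length` is hypothetical.

```python
def common_prefix_length(a, b):
    """Return the number of leading characters shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

# Leading bases of the gene (with intron) and mRNA sequences from this page.
gene_prefix = "ATGAAGCTTCGTCCTCGTAAAACTACTTCCAACGTATTGATCGAAGGTACGGCTTCGATTTTG"
mrna_prefix = "ATGAAGCTTCGTCCTCGTAAAACTACTTCCAACGTATTGATCGAAGGAAACGGAGACGGAGAC"

shared = common_prefix_length(gene_prefix, mrna_prefix)
print(shared)                    # position of the first mismatch
print(gene_prefix[46:48])        # dinucleotide at the putative intron start
```

The two sequences first differ at position 47 (0-based), but the base just before the mismatch is a G in both, so the exon boundary cannot be placed by string comparison alone. Placing the boundary one base earlier makes the intron start with GT, the usual splice donor dinucleotide. That placement is an inference, not something this page states.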

Coding sequence (CDS)

ATGAAGCTTCGTCCTCGTAAAACTACTTCCAACGTATTGATCGAAGGAAACGGAGACGGAGACGGAGATGCCTTTGATGATATAGAGGTGTCATCTCTTTTTTCTGACAGTGGAAGTGAAGTTCTTTCCTCGAGTTCTGAGGACTCCAGTGAGCCTTCAACAAAGAAGTCTAGAGCAAAGACACAGAGAAAACGTATTAAAAAGGAGGGGCCTAGCATTGAGCAGGAAGTCGGAAGCAATGTAGGTAATGATGAAAACCTACACAATCAGAAACCAGAAATTGCCGACTACCAGGGTGTGGATGATATAGAGAAGCCAAAGACCAAATACTCAAGAAAGAAGAAGCCAAAACCTACCCTTTTGTGGAATGTCTGGGAGGAAGAATATGAGAGGTGGATCGATGAAAATATTGATAAAGATTTTGATTTGGCTAGTCAAAATGAAGTATTGACTGAAGCTGTTGAAACACCCTCTGCACTTACGATGCCCCTACTACGGTACCAGAAAGAGTGGCTAGCTTGGGCACTGAAGCAGGAAGATTCTTCAATTAAAGGAGGGATACTTGCAGATGAAATGGGAATGGGAAAAACCATCCAAGCTATTGCCCTGGTACTTGCTAAACGTCAACTATCTGGAACTGCTGGACTGAAGAGACCCTCACCATATCCAAGTTCTTCCAAGGACTTGCCTTTGATAAAAGCAACACTTGTGATATGTCCCGTGGTTGCTGTGAGCCAGTGGGTCAGTGAGATTGATCGTTTCACATCAAAAGGAAGTTACAAGGTGCTTGTGTATCATGGTCCAAAACGAGTACGGAATCTTGAGATTTTATCAGAATATGATTTTGTTATTACCACATACTCTGTCGTTGAGGCTGATTACAGGAAACATCTGATGCCTCCCAAGGATAGGTGCCCTTACTGTAGTAAACTATTTTATAAGAAGAATTTGAAGATTCACTTGAGGTACATTTGTGGGCCTGATGCTGTCAAAACAGAGAAGCAGGCTAAGCAACAAAGAAAAAGGCCTATACAGCCACAAATATCTAAAAAGGAAGAATCTGTTAATGATAAGAACAATACTGTTCACAAGAGTGGCAGCCAGAAAAGTGCTCTTGGACAGACAATGGGGCAGCATGAGAATGATGAAAAACCTTGTGGGAAATCAATATTGCATTCTGTGATATGGGACCGTGTCATTTTGGATGAGGCGCATTTCATAAAAGATAGGCTGTCTAATACTGCAAAAGCTGTTCTTGCGATTTCTTCTTCATTTAGATGGGCTTTAAGTGGCACGCCTATCCAGAATCGCGTAGGGGAGCTTTACTCTCTTGTTCGCTTCCTGCAAATTGTCCCTTATTCTTTCTACTTTTGTAAGGACTGTGATTGTAGAACACTCGATCATAGTTCTCTTACCTGTCCTAACTGCCCTCATAAACGTGTGCGGCATTTTTGCTGGTGGAACAAGAATATTACTGTACGGATTCAAAATTTTGGGAGAGGTCCAGAGTTTCAAAGAGGTATGATATTGCTGAAGCATAAGATTTTAAGTAGCATTGTACTCCGTCGCACCAAAAAGGGTAGAGCTGCCGATCTTGCACTTCCTCCAAGTATTGTTTCAATTAGGCGAGATACCCTTGACATTCAAGAAGAAGACTTTTATGAATCATTGTATAATGATAGTCGAGCAAAATTTAATACTTTTGTGGCAGCTGGAACGGCGACAAGTAATTATGCACACATATTTGACCTTTTGATTCGCTTGAGACAGGCAGTTAATCATCCATATCTCGTGGTGTATTCTAAAACTAATGCCATAAGTTGTGGAAGCATTGGTGATTCTGATAATAATAACAAACAAGTATGTGGAATTTGTCATGAGCCAGCAGAAGAACCTGTGGTTACCTCTTGCGAGCACACATTTTGTAAGGCTTGCATAATTGATCATACCAATGATTTTTTGAAGAGTGTCGCATGTCCTTCTTGCTCAAAGATGCTCAC
CATTGACTTTCGCACAAGTCTGGCTGTTGGAGATCAAACCATTAAAAATACAATCAAGGGGTTTAAATCTTCAAGTATACTTAACAGAATACAGCTGGAGAATTTTCAGACGAGCACTAAAATAGAAGCTTTGAGAGAAGAAATTAGATTCATGTTTGAAAGAGATGGTTCTGCCAAAGGAATCGTTTTTAGCCAATTCACGTCATTTTTGGATCTCATGAACTATTCCCTAACCAAGTCTGGTATTACCTGCGTTCAATTAGTTGGAAGCATGTCCTTGAGTCAAAGAGGAGATGCTATTAATAGATTCATTGAGGATCCAGATTGCAAGATTTTTCTAATGAGCTTAAAAGCTGGAGGGGTTGCCCTCAATCTCACTGTCGCATCGCATGTCTTCATCATGGACCCTTGGTGGAATCCTGCTGTGGAAAGGCAAGCACAAGACAGAATCCATCGAATTGGGCAATATAAACCTATCAGGTGTAGTATTTTGATAAATATATGTGGCATTTTTCTATTGTCTCAAGACAGGACTAACACGTTTGACATTAACAGAATAACGAGATTCATTATTGAAAACTCTATCGAGGAGAGGATTTTGAAGCTGCAAGAGAGGAAAGAACTGGTATTTGAAGGAACTGTAGGTGGCTCTAATGAGGCATTGGGAAAATTATCCTTGGATGACATGAGATTTCTGTTTATTTGA

Protein sequence

MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAKTQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTLLWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQEDSSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICPVVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMPPKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNNTVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLAISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKRVRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVSIRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVYSKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACPSCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFERDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFLMSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLLSQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLFI
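
The protein sequence should follow from the CDS by codon-by-codon translation. The sketch below checks this on the first 60 and last 24 bases of the CDS listed above, assuming the standard genetic code; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGCTTCGTCCTCGTAAAACTACTTCCAACGTATTGATCGAAGGAAACGGAGACGGA"
cds_end = "GACATGAGATTTCTGTTTATTTGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Both fragments agree with the listed protein: it begins MKLRPRKTTSNVLIEGNGDG and ends DMRFLFI, with TGA as the stop codon.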
Homology
BLAST of HG10000172 vs. NCBI nr
Match: XP_038902046.1 (ATP-dependent helicase rhp16 [Benincasa hispida])

HSP 1 Score: 1608.6 bits (4164), Expect = 0.0e+00
Identity = 813/901 (90.23%), Positives = 844/901 (93.67%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRKTTSNV IEGN  GD DA DDI+VSSL SDSG EVLSSSSEDS EPS KKSRAK
Sbjct: 1   MKLRPRKTTSNVFIEGN--GDRDASDDIDVSSLVSDSGCEVLSSSSEDSGEPSIKKSRAK 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
           T+RKRIKKEGPSIEQEVGSNVGNDEN+HNQKPEIA+ QGV DIEKPKTKYSRKKKPKPTL
Sbjct: 61  TRRKRIKKEGPSIEQEVGSNVGNDENIHNQKPEIANSQGVVDIEKPKTKYSRKKKPKPTL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LWNVWEEEYERWIDENI+KDFDLA+QNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWNVWEEEYERWIDENIEKDFDLANQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           SSIKGGILADEMGMGKTIQAIALVLAKRQLSGT+GL+RPS +PSSSKDLP IKATLVICP
Sbjct: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTSGLRRPSTHPSSSKDLPSIKATLVICP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRV++LEILSEYDFVITTYSVVEADYRKHLMP
Sbjct: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVQSLEILSEYDFVITTYSVVEADYRKHLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYC+KLFYKK LK HL YICGPDAVKTEKQAKQQRKRPIQPQI K+EES   KNN
Sbjct: 301 PKDRCPYCNKLFYKKKLKFHLMYICGPDAVKTEKQAKQQRKRPIQPQIYKQEESAKGKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
            VHK G QKS LGQTMGQ+ENDEKPCGKS+LHSVIWDRVILDEAHFIKDRLSNTAKAVLA
Sbjct: 361 NVHKRGGQKSTLGQTMGQNENDEKPCGKSVLHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSL+RFLQIVPYSFYFCKDCDCRTLDHSS TCPNCPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLIRFLQIVPYSFYFCKDCDCRTLDHSSPTCPNCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           VRHFCWWNKNIT+RIQNFGRGPEF+RGMILLKHKILSS VLRRTKKGRAA+LALPPSIVS
Sbjct: 481 VRHFCWWNKNITLRIQNFGRGPEFKRGMILLKHKILSSTVLRRTKKGRAAELALPPSIVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           SKTNAISCGSI DSDNNN Q+CGICHEPAEEPVV+SCEHTFCKACIID+TNDF K V+CP
Sbjct: 601 SKTNAISCGSIADSDNNN-QLCGICHEPAEEPVVSSCEHTFCKACIIDYTNDFSKRVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLTIDF TSLAV DQT+KNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE
Sbjct: 661 SCSKMLTIDFSTSLAVRDQTVKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSLTKSGITCVQL+GSMSL+QRGDAINRFI+DPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLTKSGITCVQLIGSMSLTQRGDAINRFIDDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGG+ALNLTVAS+VFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGIALNLTVASNVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF
Sbjct: 841 -----------RITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 873

Query: 901 I 902
           +
Sbjct: 901 L 873
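
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them for the first hit; `hsp_percentages` is a hypothetical helper. The denominator (901 here) appears to be the number of alignment columns, gaps included, rather than the length of either protein; that reading is an assumption.

```python
import re

def hsp_percentages(summary):
    """Recompute 'label = hits/total' ratios in a BLAST HSP summary line."""
    result = {}
    for label, hits, total in re.findall(r"(\w+) = (\d+)/(\d+)", summary):
        result[label] = round(100 * int(hits) / int(total), 2)
    return result

line = "Identity = 813/901 (90.23%), Positives = 844/901 (93.67%), Query Frame = 0"
print(hsp_percentages(line))
```

The recomputed values match the figures printed in the report (90.23% identity, 93.67% positives).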

BLAST of HG10000172 vs. NCBI nr
Match: XP_004151894.2 (DNA repair protein RAD16 isoform X1 [Cucumis sativus] >KGN63243.1 hypothetical protein Csa_021783 [Cucumis sativus])

HSP 1 Score: 1540.8 bits (3988), Expect = 0.0e+00
Identity = 782/901 (86.79%), Positives = 824/901 (91.45%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK  SNVLIE  G+ DGD  DDI+VSSL SD GSE LSSSSED SE STKKSRA+
Sbjct: 1   MKLRPRKPASNVLIE-EGNVDGDFSDDIDVSSLVSDCGSEDLSSSSEDFSEHSTKKSRAR 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
           TQ+KRIKK+GPSIEQEVGSNVGNDENL+N +PEIAD QGV DIEKPKTKYSRKKK KPTL
Sbjct: 61  TQKKRIKKDGPSIEQEVGSNVGNDENLNNPRPEIADSQGVVDIEKPKTKYSRKKKTKPTL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LWN+WEEEYERWIDENI+KDFDLA+QNEV  EAVETP+ALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWNIWEEEYERWIDENIEKDFDLANQNEVFAEAVETPAALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGL+RPS  PSSSKDLPLIKATLVICP
Sbjct: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLRRPSSNPSSSKDLPLIKATLVICP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           VVAVSQWVSEIDRFTS+GSYKVLVYHGPKR R+LE+LSEYDFVITTYSVVEADYRK+LMP
Sbjct: 241 VVAVSQWVSEIDRFTSEGSYKVLVYHGPKRERSLEVLSEYDFVITTYSVVEADYRKYLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLF+KKNLK HL YICGPDAVKTEKQ+KQQRKRPIQPQI K+E+S  DKNN
Sbjct: 301 PKDRCPYCSKLFHKKNLKFHLMYICGPDAVKTEKQSKQQRKRPIQPQICKQEKSDKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
            VHKSG QKS LGQT+ +HENDEK  G SILHSVIWDRVILDEAHFIKDRLSNTAKAVLA
Sbjct: 361 NVHKSGGQKSTLGQTVEEHENDEKHRGNSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           VRHFCWWNKNI+ RIQNFGRGPEF+RGMILLKHKILS+IVLRRTKKGRAADLALPPS VS
Sbjct: 481 VRHFCWWNKNISQRIQNFGRGPEFKRGMILLKHKILSTIVLRRTKKGRAADLALPPSTVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGT TSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTVTSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           SKTNAI+ G+I DSD+NNKQVCGIC+EPAEEPV TSC+HTFCKAC+ID+  DF K V+CP
Sbjct: 601 SKTNAINSGNIDDSDSNNKQVCGICYEPAEEPVDTSCKHTFCKACLIDYAGDFSKPVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLT DF TS+A  DQT+KN IKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE
Sbjct: 661 SCSKMLTSDFITSMAFKDQTVKNKIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSL+KSGITCVQLVGSMSL+QR DAINRFIEDPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLSKSGITCVQLVGSMSLTQRADAINRFIEDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RI RF IENSIEERILKLQERKELVFEGTVG SNEALG+L+LDDMR+LF
Sbjct: 841 -----------RIMRFFIENSIEERILKLQERKELVFEGTVGRSNEALGRLTLDDMRYLF 875

Query: 901 I 902
           +
Sbjct: 901 L 875

BLAST of HG10000172 vs. NCBI nr
Match: XP_008455894.1 (PREDICTED: DNA repair protein RAD16 [Cucumis melo] >KAA0034825.1 DNA repair protein RAD16 [Cucumis melo var. makuwa] >TYK28904.1 DNA repair protein RAD16 [Cucumis melo var. makuwa])

HSP 1 Score: 1537.3 bits (3979), Expect = 0.0e+00
Identity = 783/901 (86.90%), Positives = 825/901 (91.56%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK  SNVLIE  G+ DGD+ DDI+V    SD GSE  SSSSED SE STKKSRA+
Sbjct: 1   MKLRPRKPASNVLIE-EGNVDGDSSDDIDV----SDCGSEDHSSSSEDFSEHSTKKSRAR 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
           TQ+KRIKK+GPSIEQEVGSNVGNDENL+NQKPEIAD QGV +IEKPKTKYSR KKPKPTL
Sbjct: 61  TQKKRIKKDGPSIEQEVGSNVGNDENLNNQKPEIADSQGVVEIEKPKTKYSR-KKPKPTL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LWN+WEEEYERWIDENI+KDFDLA+QNEVL E+VETP+ALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWNIWEEEYERWIDENIEKDFDLANQNEVLAESVETPAALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGL+RPS  PSSSK+LPLIKATLVICP
Sbjct: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLRRPSSNPSSSKELPLIKATLVICP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           VVAVSQWVSEIDRFTS+GSYKVLVYHGPKRVR+LEILSEYDFVITTYSVVEADYRK+LMP
Sbjct: 241 VVAVSQWVSEIDRFTSEGSYKVLVYHGPKRVRSLEILSEYDFVITTYSVVEADYRKYLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLF+KKNLK HL YICGPDAVKTEKQ+KQQRKRPIQPQI K+E+S  DKNN
Sbjct: 301 PKDRCPYCSKLFHKKNLKFHLMYICGPDAVKTEKQSKQQRKRPIQPQICKQEKSDKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
            VHKSG+QKS LGQT+G+HENDEKP G SILHSVIWDRVILDEAHFIKDRLSNTAKAVLA
Sbjct: 361 NVHKSGAQKSTLGQTLGEHENDEKPRGNSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSS TCPNCPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSPTCPNCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           VRHFCWWNKNIT RIQNFGRGPEF+RGMILLKHKILSSIVLRRTKKGRAADLALPPS VS
Sbjct: 481 VRHFCWWNKNITQRIQNFGRGPEFKRGMILLKHKILSSIVLRRTKKGRAADLALPPSTVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGT TSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTVTSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           SKT AI+ G+I DSD+NNKQVCG+CHEPAEEPV TSC+H FCKACIID+  DF K V+CP
Sbjct: 601 SKTKAINSGNIDDSDSNNKQVCGLCHEPAEEPVDTSCKHAFCKACIIDYAGDFSKPVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLT DF TS+A  DQT+KNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE
Sbjct: 661 SCSKMLTSDFITSMAFKDQTVKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSL+KSGITCVQLVGSMSL+QR DAINRFIEDPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLSKSGITCVQLVGSMSLTQRADAINRFIEDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RI RF IENSIEERILKLQERKELVFEGTVG SNEALG+L+LDDMR+LF
Sbjct: 841 -----------RIMRFFIENSIEERILKLQERKELVFEGTVGRSNEALGRLTLDDMRYLF 870

Query: 901 I 902
           +
Sbjct: 901 L 870

BLAST of HG10000172 vs. NCBI nr
Match: XP_023512492.1 (DNA repair protein RAD16 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023512493.1 DNA repair protein RAD16 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1511.5 bits (3912), Expect = 0.0e+00
Identity = 766/901 (85.02%), Positives = 809/901 (89.79%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK TSN+LI+GN   DGDA DDI+VSSL+SDS SE  SSSSED  EPSTKKSRAK
Sbjct: 1   MKLRPRKPTSNILIQGN--ADGDASDDIDVSSLYSDSESEDPSSSSEDFCEPSTKKSRAK 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
            +RK IK+EGPSIEQEV   VGNDEN HNQ PE+   QGV DI KPKTKYSRKKK KP L
Sbjct: 61  KKRKGIKEEGPSIEQEVWRKVGNDENPHNQTPEVIPVQGVVDIGKPKTKYSRKKKQKPIL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LW+VW EE+ERWIDENI+KDFD+ASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWDVWAEEHERWIDENIEKDFDMASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           S ++GGILADEMGMGKTIQAIALVLAKR+L G AGL+RPSPYPSSSKD PLIKATLV+CP
Sbjct: 181 SPVRGGILADEMGMGKTIQAIALVLAKRELPG-AGLRRPSPYPSSSKDFPLIKATLVVCP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           V+AVSQWVSEIDRFT KGS KV V+HGPKR ++LE L E+DFVITTYSVVEA+YRKHLMP
Sbjct: 241 VIAVSQWVSEIDRFTLKGSNKVHVFHGPKRAQSLETLFEFDFVITTYSVVEAEYRKHLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLFYKKNLKIHL+YICGPDAVKTEKQAKQ RKRPIQPQISK E S  DKNN
Sbjct: 301 PKDRCPYCSKLFYKKNLKIHLKYICGPDAVKTEKQAKQIRKRPIQPQISKGEVSAKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
             H SGSQKS  GQTMGQHEN+E PCGKSILHSVIWDRVILDEAHFIKDR SNTAKAVLA
Sbjct: 361 NFHNSGSQKSTFGQTMGQHENEENPCGKSILHSVIWDRVILDEAHFIKDRQSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSS++CP+CPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSVSCPDCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           +RHFCWWNK IT+RIQN GRGPEF+RGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS
Sbjct: 481 MRHFCWWNKYITLRIQNVGRGPEFKRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           SKTN ISCGSI  +DNNN+Q CGICHEPAEEPVVTSCEHTFCKACII   NDF K V+CP
Sbjct: 601 SKTNVISCGSIDGTDNNNEQACGICHEPAEEPVVTSCEHTFCKACIIGFANDFSKLVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLTIDF T+LA  D+TIKNTIKGFK +SILNRIQLENFQTSTKIEALREEIRFM E
Sbjct: 661 SCSKMLTIDFSTNLAGRDRTIKNTIKGFKCTSILNRIQLENFQTSTKIEALREEIRFMLE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSLTKSGITCVQL+GSMSL QR DAI RFI+DPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLTKSGITCVQLIGSMSLPQRDDAIKRFIDDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGG+ALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGIALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RITRF+IENSIEERILKLQERKELVFEGTVG SN+ALGKL+LDDMRFLF
Sbjct: 841 -----------RITRFVIENSIEERILKLQERKELVFEGTVGRSNDALGKLTLDDMRFLF 873

Query: 901 I 902
           I
Sbjct: 901 I 873

BLAST of HG10000172 vs. NCBI nr
Match: XP_022943925.1 (DNA repair protein RAD16 [Cucurbita moschata] >XP_022943926.1 DNA repair protein RAD16 [Cucurbita moschata])

HSP 1 Score: 1510.4 bits (3909), Expect = 0.0e+00
Identity = 764/901 (84.79%), Positives = 810/901 (89.90%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK TSN+LI+GN   DGDA D+I+VSSL+SDS SE  SSSSED  EPSTKKSRAK
Sbjct: 1   MKLRPRKPTSNILIQGN--ADGDASDEIDVSSLYSDSESEDPSSSSEDFCEPSTKKSRAK 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
            +RK IK+EGPSIEQEV   VGNDEN HNQ PE+   QGV DI KPKTKYSRKKK KP L
Sbjct: 61  KKRKGIKEEGPSIEQEVWRKVGNDENPHNQTPEVIPVQGVVDIGKPKTKYSRKKKQKPIL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LW+VW EE+ERWIDENI+KDFD+ASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWDVWAEEHERWIDENIEKDFDMASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           S ++GGILADEMGMGKTIQAIALVLAKR+LSG AGL+RPSPYPSSSKD PLIKATLV+CP
Sbjct: 181 SPVRGGILADEMGMGKTIQAIALVLAKRELSG-AGLRRPSPYPSSSKDFPLIKATLVVCP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           V+AVSQWVSEIDRFT KGS KV V+HGPKR ++LE L E+DFVITTYSVVEA+YRKHLMP
Sbjct: 241 VIAVSQWVSEIDRFTLKGSNKVHVFHGPKRAQSLETLFEFDFVITTYSVVEAEYRKHLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLFYKKNLKIHL+YICGPDAVKTEKQAKQ RKRPIQPQISK E S  DKNN
Sbjct: 301 PKDRCPYCSKLFYKKNLKIHLKYICGPDAVKTEKQAKQIRKRPIQPQISKGEVSAKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
             H SGSQKS  GQTMGQHENDE PCGKSILHSVIWDR+ILDEAHFIKDR SNTAKAVLA
Sbjct: 361 NFHNSGSQKSTFGQTMGQHENDENPCGKSILHSVIWDRIILDEAHFIKDRQSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSS++CP+CPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSVSCPDCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           +RHFCWWNK IT++IQN GRGPEF+RGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS
Sbjct: 481 MRHFCWWNKYITLQIQNVGRGPEFKRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           S+TN ISCGSI  +DNNN+Q CGICHEPAEEPVVTSCEHTFCKACII   NDF K V+CP
Sbjct: 601 SRTNVISCGSIDGTDNNNEQACGICHEPAEEPVVTSCEHTFCKACIIGFANDFSKLVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLTIDF T+LA  D+TIKNTIKGFK +SILNRIQLENFQTSTKIEALREEIRFM E
Sbjct: 661 SCSKMLTIDFSTNLAGRDRTIKNTIKGFKCTSILNRIQLENFQTSTKIEALREEIRFMLE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSLTKSGITCVQL+GSMSL QR DAI RFI+DPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLTKSGITCVQLIGSMSLPQRDDAIKRFIDDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGG+ALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGIALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RITRF+IENSIEERILKLQERKELVFEGTVG SN+ALGKL+LDDMRFLF
Sbjct: 841 -----------RITRFVIENSIEERILKLQERKELVFEGTVGRSNDALGKLTLDDMRFLF 873

Query: 901 I 902
           I
Sbjct: 901 I 873

BLAST of HG10000172 vs. ExPASy Swiss-Prot
Match: P79051 (ATP-dependent helicase rhp16 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=rhp16 PE=3 SV=2)

HSP 1 Score: 487.6 bits (1254), Expect = 2.9e-136
Identity = 326/912 (35.75%), Positives = 478/912 (52.41%), Query Frame = 0

Query: 5   PRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEV------LSSSSEDSSEPSTKKSR 64
           P ++  +  I+ +   +  +  DI+    F DS  E+       S+ S++ S P + +S+
Sbjct: 129 PEESNESEFIDDDESDEVASIIDIKEDETF-DSKVEIPEAAPSSSTESDEESIPLSYQSK 188

Query: 65  AKTQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKP 124
            +    R      S  +    ++ + E  H        Y+ +            ++ P+ 
Sbjct: 189 RRRVSARASSSASSSSRTQAKSIPSHERTH--------YRLI------------RQHPEL 248

Query: 125 TLLWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQ 184
             +W   EEE  R + +                  +E P  L + LL +Q+E + W  +Q
Sbjct: 249 EHVWEKLEEEAPREVKQ------------------IEQPKELVLNLLPFQREGVYWLKRQ 308

Query: 185 EDSSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVI 244
           EDSS  GGILADEMGMGKTIQ IAL+L++                      P  K TLV+
Sbjct: 309 EDSSFGGGILADEMGMGKTIQTIALLLSE----------------------PRGKPTLVV 368

Query: 245 CPVVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHL 304
            PVVA+ QW  EID  T+K +    +Y+G  R  + E LS YD V+T+Y+V+E+ YRK  
Sbjct: 369 APVVAIMQWKEEIDTHTNK-ALSTYLYYGQARDISGEELSSYDVVLTSYNVIESVYRK-- 428

Query: 305 MPPKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDK 364
                                                                     ++
Sbjct: 429 ----------------------------------------------------------ER 488

Query: 365 NNTVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAV 424
           +    K+G  K                  KS+LH + + R+ILDEAH IK R  NTA+AV
Sbjct: 489 SGFRRKNGVVKE-----------------KSLLHQMEFYRIILDEAHGIKSRTCNTARAV 548

Query: 425 LAISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTL-----DHSSLTC 484
             + ++ +  LSGTP+QNR+GEL+SL+RFL+  P+++Y+C  C+C++L     D S+  C
Sbjct: 549 CGLRTTRKICLSGTPLQNRIGELFSLLRFLRADPFAYYYCLQCECKSLHWRFSDRSN--C 608

Query: 485 PNCPHKRVRHFCWWNKNITVRIQNFG-RGPEFQRGMILLK--HKILSSIVLRRTKKGRAA 544
             C HK + H C++N  +   IQ FG  GP    G +  K  H +L  I+LRRTK  RA 
Sbjct: 609 DECGHKPMSHTCYFNAEMLKPIQKFGYEGP----GKLAFKKVHSLLKHIMLRRTKLERAD 668

Query: 545 DLALPPSIVSIRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQ 604
           DL LPP +V +R+D  + +EED Y+SLY DS+ KFNT++A G   +NYA+IF L+ R+RQ
Sbjct: 669 DLGLPPRVVEVRKDLFNEEEEDVYQSLYMDSKRKFNTYLAEGVVLNNYANIFQLITRMRQ 728

Query: 605 AVNHPYLVVYSKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHT 664
             +HP LV+ SK   +      D +N    VC IC E A++ + + C HTFC+ C+ ++ 
Sbjct: 729 MADHPDLVLASKRKTV------DIENQENIVCKICDEVAQDAIESRCHHTFCRLCVTEYI 788

Query: 665 N--DFLKSVACPSCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKI 724
           N     ++V CPSC   L+ID  ++ A+ D + +     FK++SILNRI + ++++STKI
Sbjct: 789 NAAGDGENVNCPSCFIPLSIDL-SAPALEDFSEEK----FKNASILNRIDMNSWRSSTKI 848

Query: 725 EALREEIRFMFERDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAIN 784
           EAL EE+  + ++D + K IVFSQFTS LDL+++ L K+G  CV+L G M+   R   I 
Sbjct: 849 EALVEELYLLRKKDRTLKSIVFSQFTSMLDLIHWRLRKAGFNCVKLDGGMTPKARAATIE 859

Query: 785 RFIEDPDCKIFLMSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRC 844
            F  D +  IFL+SLKAGGVALNLT AS VF+MDPWWN AV+ QA DRIHRIGQ +PI+ 
Sbjct: 909 AFSNDINITIFLVSLKAGGVALNLTEASQVFMMDPWWNGAVQWQAMDRIHRIGQKRPIK- 859

Query: 845 SILINICGIFLLSQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEAL 901
             +I +C                      IENSIE +I++LQE+K  +   T+    +AL
Sbjct: 969 --VITLC----------------------IENSIESKIIELQEKKAQMIHATIDQDEKAL 859
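
The percentages in each HSP header are the stated counts divided by the alignment length. The sketch below recomputes them from a header line as a consistency check. The function name `check_hsp_header` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern only expects the "label = count/length (percent%)" layout shown on this page.

```python
import re

# Matches fragments such as "Identity = 326/912 (35.75%)".
# The label is captured as any word, so the pattern does not depend on its spelling.
HSP_FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_header(line):
    """Return {label: (count, length, recomputed_percent, reported_percent)}."""
    result = {}
    for label, count, length, reported in HSP_FRACTION.findall(line):
        recomputed = round(100 * int(count) / int(length), 2)
        result[label] = (int(count), int(length), recomputed, float(reported))
    return result


# Header of the HSP for P79051, copied from this page.
header = "Identity = 326/912 (35.75%), Positives = 478/912 (52.41%), Query Frame = 0"
stats = check_hsp_header(header)
for label, (count, length, recomputed, reported) in stats.items():
    print(label, count, length, recomputed, reported)
```

The "Query Frame = 0" field is skipped because it carries no count/length fraction.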

BLAST of HG10000172 vs. ExPASy Swiss-Prot
Match: P31244 (DNA repair protein RAD16 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=RAD16 PE=1 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 2.7e-134
Identity = 309/844 (36.61%), Positives = 444/844 (52.61%), Query Frame = 0

Query: 84  DENLHNQKP------EIADYQGVDDIEKPKTK-------------YSRKKKPKPTLLWNV 143
           DEN H  K       EI + + V D ++P TK              ++KK PK T     
Sbjct: 86  DENTHAIKNDNDEIIEIKEERDVSDDDEPLTKKRKTTARKKKKKTSTKKKSPKVTPYERN 145

Query: 144 WEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQEDSSIK 203
               YE    E  +   DL +    + +  + P  +T+ LL +Q E L W + QE+S   
Sbjct: 146 TLRLYEHH-PELRNVFTDLKNAPPYVPQRSKQPDGMTIKLLPFQLEGLHWLISQEESIYA 205

Query: 204 GGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICPVVAV 263
           GG+LADEMGMGKTIQ IAL++           K PS               LV+ P VA+
Sbjct: 206 GGVLADEMGMGKTIQTIALLMNDL-------TKSPS---------------LVVAPTVAL 265

Query: 264 SQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMPPKDR 323
            QW +EI++ T KG  K+ +YHG  R  +++ L  YD V+TTY+V+E+ +RK        
Sbjct: 266 MQWKNEIEQHT-KGQLKIYIYHGASRTTDIKDLQGYDVVLTTYAVLESVFRKQ------- 325

Query: 324 CPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNNTVHK 383
               +  F +KN                        K+P                     
Sbjct: 326 ----NYGFRRKNGLF---------------------KQP--------------------- 385

Query: 384 SGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLAISSS 443
                                   S+LH++ + RVILDEAH IKDR SNTA+AV  + + 
Sbjct: 386 ------------------------SVLHNIDFYRVILDEAHNIKDRQSNTARAVNNLKTQ 445

Query: 444 FRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLD---HSSLTCPNCPHKRV 503
            RW LSGTP+QNR+GE+YSL+RFL I P++ YFC  CDC + D      + C +C H  +
Sbjct: 446 KRWCLSGTPLQNRIGEMYSLIRFLNINPFTKYFCTKCDCASKDWKFTDRMHCDHCSHVIM 505

Query: 504 RHFCWWNKNITVRIQNFG-RGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 563
           +H  ++N  +   IQ FG  GP  +    +    +L +I+LRRTK  RA DL LPP IV+
Sbjct: 506 QHTNFFNHFMLKNIQKFGVEGPGLESFNNI--QTLLKNIMLRRTKVERADDLGLPPRIVT 565

Query: 564 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 623
           +RRD  + +E+D Y SLY DS+ K+N+FV  G   +NYA+IF L+ R+RQ  +HP LV+ 
Sbjct: 566 VRRDFFNEEEKDLYRSLYTDSKRKYNSFVEEGVVLNNYANIFTLITRMRQLADHPDLVLK 625

Query: 624 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKS---V 683
              N          D+    +C +C++ AEEP+ + C H FC+ CI ++   F+++   +
Sbjct: 626 RLNNF-------PGDDIGVVICQLCNDEAEEPIESKCHHKFCRLCIKEYVESFMENNNKL 685

Query: 684 ACPSCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLE-NFQTSTKIEALREEIR 743
            CP C   L+ID      +    ++  +  FK  SI++R+ +   +Q+STKIEAL EE+ 
Sbjct: 686 TCPVCHIGLSID------LSQPALEVDLDSFKKQSIVSRLNMSGKWQSSTKIEALVEELY 745

Query: 744 FMFERDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDC 803
            +     + K IVFSQFTS LDL+ + L ++G   V+L GSMS +QR + I  F+ +  C
Sbjct: 746 KLRSNKRTIKSIVFSQFTSMLDLVEWRLKRAGFQTVKLQGSMSPTQRDETIKYFMNNIQC 788

Query: 804 KIFLMSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICG 863
           ++FL+SLKAGGVALNL  AS VFI+DPWWNP+VE Q+ DR+HRIGQY+P+          
Sbjct: 806 EVFLVSLKAGGVALNLCEASQVFILDPWWNPSVEWQSGDRVHRIGQYRPV---------- 788

Query: 864 IFLLSQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDM 901
                          +ITRF IE+SIE RI++LQE+K  +   T+     A+ +L+  D+
Sbjct: 866 ---------------KITRFCIEDSIEARIIELQEKKANMIHATINQDEAAISRLTPADL 788
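
The "Match:" lines on this page carry an accession followed by a description that uses the UniProt-style tags OS (organism), OX (taxonomy ID), GN (gene name), PE (protein existence level) and SV (sequence version). A minimal sketch for splitting such a line into fields is given below. The function name `parse_match_line` and the returned keys `accession` and `name` are hypothetical choices, and the code assumes the tags appear exactly as written on this page.

```python
import re

MATCH_LINE = re.compile(r"^Match: (\S+) \((.*)\)\s*$")
TAG_SPLIT = re.compile(r"\s(OS|OX|GN|PE|SV)=")


def parse_match_line(line):
    """Split a 'Match: ACC (description OS=... OX=... GN=... PE=... SV=...)' line."""
    m = MATCH_LINE.match(line)
    if m is None:
        raise ValueError("not a Match line: %r" % line)
    accession, description = m.groups()
    # re.split with a capturing group yields [name, tag, value, tag, value, ...].
    parts = TAG_SPLIT.split(description)
    record = {"accession": accession, "name": parts[0]}
    record.update(zip(parts[1::2], parts[2::2]))
    return record


# Match line for the Saccharomyces cerevisiae hit, copied from this page.
line = ("Match: P31244 (DNA repair protein RAD16 OS=Saccharomyces cerevisiae "
        "(strain ATCC 204508 / S288c) OX=559292 GN=RAD16 PE=1 SV=1)")
record = parse_match_line(line)
print(record)
```

Splitting on the tag names rather than on parentheses keeps organism strings with their own parentheses, such as the strain annotation here, intact.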

BLAST of HG10000172 vs. ExPASy Swiss-Prot
Match: Q9LHE4 (Helicase-like transcription factor CHR27 OS=Arabidopsis thaliana OX=3702 GN=CHR27 PE=1 SV=1)

HSP 1 Score: 325.1 bits (832), Expect = 2.5e-87
Identity = 257/904 (28.43%), Positives = 395/904 (43.69%), Query Frame = 0

Query: 134  DENIDKDFDLASQ------NEVLTEAVETPSALTMPLLRYQKEWLAWALKQEDSSIK--G 193
            D N D D  L  Q      N+ +TE+   P  L++PL+R+QK  LAW  ++E SS    G
Sbjct: 245  DRNPDNDERLVYQAALQVLNQPMTESDLPPGTLSVPLMRHQKIALAWMFQKETSSFNCPG 304

Query: 194  GILADEMGMGKTIQAIALVLAKR---QLSGTAGLK------------------------- 253
            GILAD+ G+GKT+  IAL+L ++   QL   +  K                         
Sbjct: 305  GILADDQGLGKTVSTIALILKQKIVSQLKSESSCKQETEALVLDADDESDNAKHESGSHV 364

Query: 254  RPSPYPSSSKDLPLIKA----------------------------------TLVICPVVA 313
            +P    SS+ +  ++ A                                  TL++CP   
Sbjct: 365  KPELKVSSNSETSVLSACGNDENDSSDMEKAEDEEANSSTRAFQWKRPAAGTLIVCPASV 424

Query: 314  VSQWVSEIDRFTSKGS-YKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMPPK 373
            V QW  E+D   S+ S   VLVYHG  R ++   L+EYD V+TTY++V  +     +  +
Sbjct: 425  VRQWARELDEKVSEESKLSVLVYHGSNRTKDPNELAEYDVVVTTYAIVTNEAPNKFLVDE 484

Query: 374  DRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNNTV 433
            D                             E   K   +  +    S      N K   V
Sbjct: 485  D-----------------------------ENDEKNTDRYGLASGFSN-----NKKRKVV 544

Query: 434  HKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLAIS 493
              +  +    G+      + E  CG   L  V W R++LDEA  IK+  +  A++   + 
Sbjct: 545  VGASKKSKRRGRKSTNDTSSEPDCGP--LGKVGWFRIVLDEAQTIKNYRTQMARSCCTLR 604

Query: 494  SSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKRVR 553
            +  RW LSGTPIQN + +LYS  RFL+  PY+ Y           +S++  P       R
Sbjct: 605  AKRRWCLSGTPIQNTIDDLYSYFRFLRYDPYAVY--------KSFYSTIKVPIS-----R 664

Query: 554  HFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAAD----LALPPSI 613
            + C   K +                       +L +I+LRRT KG   D    + LPP +
Sbjct: 665  NSCQGYKKL---------------------QAVLRAIMLRRT-KGTLLDGKPIINLPPKV 724

Query: 614  VSIRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLV 673
            V++ +    + E  FY+ L  DSR++F  +  AGT + NYA+I  LL+RLRQA +HP LV
Sbjct: 725  VNLSQVDFSVAERSFYKKLEADSRSQFKAYADAGTLSQNYANILLLLLRLRQACDHPQLV 784

Query: 674  VYSKTNAISCGSIGDSD----------------NNNKQVCGICHEPAEEPVVTSCEHTFC 733
               + N+   G + ++                  ++  +C  C+EP E+PVVT C H FC
Sbjct: 785  --KRYNSDPVGKVSEAAVRRLPREARSRLINRLESSSAICYECNEPPEKPVVTLCGHIFC 844

Query: 734  KACIIDHTNDFLKSVACPSCSKMLTIDFRTSLAVGDQTIKNTIKGFK--SSSILNRIQLE 793
              C++++      +   P C + L  D    +   + +++N        SSS  N +   
Sbjct: 845  YECVLEYITGDENTCPVPRCKQQLARD----VVFSESSLRNCTSDDSGCSSSHDNGLDRS 904

Query: 794  NFQ----TSTKIEALREEIRFMFERD---------------------------------- 853
             FQ     S+KI+A+ + ++ + + D                                  
Sbjct: 905  VFQKRDFCSSKIKAVLDILQSLSQPDSPNSAQHGQMPSSSRPYDDDDVTIVEPMRLHSSS 964

Query: 854  ---GSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIF 902
               G+ K I+FSQ+T  LDL+   + +SGI   +L G+MSL+ R  A+  F + PD K+ 
Sbjct: 965  PSQGAVKTIIFSQWTGMLDLVELRILESGIEFRRLDGTMSLAARDRAVKEFSKKPDVKVM 1024

BLAST of HG10000172 vs. ExPASy Swiss-Prot
Match: Q94BR5 (Helicase-like transcription factor CHR28 OS=Arabidopsis thaliana OX=3702 GN=CHR28 PE=1 SV=1)

HSP 1 Score: 318.2 bits (814), Expect = 3.1e-85
Identity = 250/892 (28.03%), Positives = 384/892 (43.05%), Query Frame = 0

Query: 127 EEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQEDSSI--K 186
           EE     DE +     L   N+  +E       L++PL+++QK  LAW  ++E +S+   
Sbjct: 189 EERNSENDERLIYQAALQELNQPKSEVDLPAGLLSVPLMKHQKIALAWMFQKETNSLHCM 248

Query: 187 GGILADEMGMGKTIQAIALVLAKRQ----------------------------------- 246
           GGILAD+ G+GKT+  IAL+L +                                     
Sbjct: 249 GGILADDQGLGKTVSTIALILKQMHEAKLKSKNSGNQEAEALDLDADDESENAFEKPESK 308

Query: 247 ------LSGTAGLKRPSPYPSSSKDLPLIK-----ATLVICPVVAVSQWVSEID-RFTSK 306
                 ++G +G+K+     +S+      +      TL++CP   V QW  E+D + T +
Sbjct: 309 ASNGSGVNGDSGIKKAKGEEASTSTRKFNRKRPAAGTLIVCPASVVRQWARELDEKVTDE 368

Query: 307 GSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMPPKDRCPYCSKLFYKKNL 366
               VL+YHG  R ++   L++YD V+TTY++V  +  K  +   D              
Sbjct: 369 AKLSVLIYHGGNRTKDPIELAKYDVVMTTYAIVSNEVPKQPLVDDD-------------- 428

Query: 367 KIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNNTVHKSGSQKSALGQTMG 426
                          E   K   K  +         S+N K   V   G+ K +  +   
Sbjct: 429 ---------------ENDEKNSEKYGLASGF-----SINKKRKNV--VGTTKKSKKKKGN 488

Query: 427 QHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLAISSSFRWALSGTPIQNR 486
            +  D        L  V W RV+LDEA  IK+  +  A+A   + +  RW LSGTPIQN 
Sbjct: 489 NNAGDSSDPDSGTLAKVGWFRVVLDEAQTIKNHRTQVARACCGLRAKRRWCLSGTPIQNT 548

Query: 487 VGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKRVRHFCWWNKNITVRIQN 546
           + +LYS  RFL+  PY+ Y                         + FC   K        
Sbjct: 549 IDDLYSYFRFLKYDPYAVY-------------------------KSFCHQIK-------- 608

Query: 547 FGRGPEFQRGMILLK--HKILSSIVLRRTKKGRAAD----LALPPSIVSIRRDTLDIQEE 606
              GP  +  +   K    +L +I+LRRT KG   D    + LPP  +++ +    ++E 
Sbjct: 609 ---GPISRNSLQGYKKLQAVLRAIMLRRT-KGTLLDGQPIINLPPKTINLSQVDFSVEER 668

Query: 607 DFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVYSKTNAISCGSI 666
            FY  L +DSR++F  + AAGT   NYA+I  +L+RLRQA +HP LV   + N+ S G +
Sbjct: 669 SFYVKLESDSRSQFKAYAAAGTLNQNYANILLMLLRLRQACDHPQLV--KRYNSDSVGKV 728

Query: 667 GD---------------SDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKS 726
            +               S   +  +C +CH+P E+PVVT C H FC  C+ D+      +
Sbjct: 729 SEEAVKKLPKEDLVSLLSRLESSPICCVCHDPPEDPVVTLCGHIFCYQCVSDYITGDEDT 788

Query: 727 VACPSCSKMLTID-------FRTSLA-----------VGDQTI--KNTIKGFKSSSILNR 786
              P C + L  D        R+ +A             D+++         K  ++L+ 
Sbjct: 789 CPAPRCREQLAHDVVFSKSTLRSCVADDLGCSSSEDNSHDKSVFQNGEFSSSKIKAVLDI 848

Query: 787 IQ-LENFQTSTKIE------------------------ALREEIRFMFERDGSAKGIVFS 846
           +Q L N  TS   +                          +  ++      G  K I+FS
Sbjct: 849 LQSLSNQGTSNSTQNGQMASSSQQPNDDDDDDDDDVTIVEKTSLKSTPSNGGPIKTIIFS 908

Query: 847 QFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFLMSLKAGGVALN 902
           Q+T  LDL+  SL ++ I   +L G+MSL  R  A+  F  DPD K+ +MSLKAG + LN
Sbjct: 909 QWTGMLDLVELSLIENSIEFRRLDGTMSLIARDRAVKEFSNDPDVKVMIMSLKAGNLGLN 968

BLAST of HG10000172 vs. ExPASy Swiss-Prot
Match: Q9FIY7 (DNA repair protein RAD5B OS=Arabidopsis thaliana OX=3702 GN=RAD5B PE=3 SV=1)

HSP 1 Score: 302.4 bits (773), Expect = 1.7e-80
Identity = 221/760 (29.08%), Positives = 339/760 (44.61%), Query Frame = 0

Query: 184  KGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSK--------DLPLIKA- 243
            +GGILAD MG+GKT+  IAL+LA+                ++ K         L  +KA 
Sbjct: 681  RGGILADAMGLGKTVMTIALILARPGRGNPENEDVLVADVNADKRNRKEIHMALTTVKAK 740

Query: 244  --TLVICPVVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEA 303
              TL+ICP+  +SQW  E++  +   +  VLVY+G  R  + + ++ +D V+TTY V+ +
Sbjct: 741  GGTLIICPMALLSQWKDELETHSKPDTVSVLVYYGGDRTHDAKAIASHDVVLTTYGVLTS 800

Query: 304  DYRKHLMPPKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKE 363
             Y++ +                                                      
Sbjct: 801  AYKQDM------------------------------------------------------ 860

Query: 364  ESVNDKNNTVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLS 423
                                                SI H + W R++LDEAH IK   +
Sbjct: 861  ----------------------------------ANSIFHRIDWYRIVLDEAHTIKSWKT 920

Query: 424  NTAKAVLAISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLT 483
              AKA   +SS  RW L+GTP+QN++ +LYSL+ FL + P+                   
Sbjct: 921  QAAKATFELSSHCRWCLTGTPLQNKLEDLYSLLCFLHVEPWC------------------ 980

Query: 484  CPNCPHKRVRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAAD- 543
                      ++ WW+K I    +N        RG+ L+K  IL  ++LRRTK+ R  + 
Sbjct: 981  ----------NWAWWSKLIQKPYENGD-----PRGLKLIK-AILRPLMLRRTKETRDKEG 1040

Query: 544  ---LALPPSIVSIRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRL 603
               L LPP+ V +        E DFY +L+  S+ +F+ FVA G    NYA+I +LL+RL
Sbjct: 1041 SLILELPPTDVQVIECEQSEAERDFYTALFKRSKVQFDQFVAQGKVLHNYANILELLLRL 1100

Query: 604  RQAVNHPYLVVY-------------------SKTNAISCGS---------IGDSDNNNKQ 663
            RQ  NHP+LV+                    +  +++S  +         I D  + N +
Sbjct: 1101 RQCCNHPFLVMSRADSQQYADLDSLARRFLDNNPDSVSQNAPSRAYIEEVIQDLRDGNSK 1160

Query: 664  VCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACPSCSKMLTIDFRTSLAVGDQT 723
             C IC E A++PV+T C H  C+ C++       +S +C  C    TI  RT L      
Sbjct: 1161 ECPICLESADDPVLTPCAHRMCRECLLTS----WRSPSCGLCPICRTILKRTELI----- 1220

Query: 724  IKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFERDGSAKGIVFSQFTSFLDLM 783
                     + SI     ++N++ S+K+  L + +  + +     K IVFSQ+TSFLDL+
Sbjct: 1221 ------SCPTDSIFRVDVVKNWKESSKVSELLKCLEKIKKSGSGEKSIVFSQWTSFLDLL 1276

Query: 784  NYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFLMSLKAGGVALNLTVASHVFI 843
               L + G   ++  G ++   R   +  F E     I LMSLKAGGV LNLT AS VF+
Sbjct: 1281 EIPLRRRGFEFLRFDGKLAQKGREKVLKEFNETKQKTILLMSLKAGGVGLNLTAASSVFL 1276

Query: 844  MDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLLSQDRTNTFDINRITRFIIEN 901
            MDPWWNPAVE QA  RIHRIGQ + +                          + RFI+++
Sbjct: 1341 MDPWWNPAVEEQAIMRIHRIGQKRTV-------------------------FVRRFIVKD 1276

BLAST of HG10000172 vs. ExPASy TrEMBL
Match: A0A0A0LN53 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G416820 PE=3 SV=1)

HSP 1 Score: 1540.8 bits (3988), Expect = 0.0e+00
Identity = 782/901 (86.79%), Positives = 824/901 (91.45%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK  SNVLIE  G+ DGD  DDI+VSSL SD GSE LSSSSED SE STKKSRA+
Sbjct: 1   MKLRPRKPASNVLIE-EGNVDGDFSDDIDVSSLVSDCGSEDLSSSSEDFSEHSTKKSRAR 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
           TQ+KRIKK+GPSIEQEVGSNVGNDENL+N +PEIAD QGV DIEKPKTKYSRKKK KPTL
Sbjct: 61  TQKKRIKKDGPSIEQEVGSNVGNDENLNNPRPEIADSQGVVDIEKPKTKYSRKKKTKPTL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LWN+WEEEYERWIDENI+KDFDLA+QNEV  EAVETP+ALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWNIWEEEYERWIDENIEKDFDLANQNEVFAEAVETPAALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGL+RPS  PSSSKDLPLIKATLVICP
Sbjct: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLRRPSSNPSSSKDLPLIKATLVICP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           VVAVSQWVSEIDRFTS+GSYKVLVYHGPKR R+LE+LSEYDFVITTYSVVEADYRK+LMP
Sbjct: 241 VVAVSQWVSEIDRFTSEGSYKVLVYHGPKRERSLEVLSEYDFVITTYSVVEADYRKYLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLF+KKNLK HL YICGPDAVKTEKQ+KQQRKRPIQPQI K+E+S  DKNN
Sbjct: 301 PKDRCPYCSKLFHKKNLKFHLMYICGPDAVKTEKQSKQQRKRPIQPQICKQEKSDKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
            VHKSG QKS LGQT+ +HENDEK  G SILHSVIWDRVILDEAHFIKDRLSNTAKAVLA
Sbjct: 361 NVHKSGGQKSTLGQTVEEHENDEKHRGNSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           VRHFCWWNKNI+ RIQNFGRGPEF+RGMILLKHKILS+IVLRRTKKGRAADLALPPS VS
Sbjct: 481 VRHFCWWNKNISQRIQNFGRGPEFKRGMILLKHKILSTIVLRRTKKGRAADLALPPSTVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGT TSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTVTSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           SKTNAI+ G+I DSD+NNKQVCGIC+EPAEEPV TSC+HTFCKAC+ID+  DF K V+CP
Sbjct: 601 SKTNAINSGNIDDSDSNNKQVCGICYEPAEEPVDTSCKHTFCKACLIDYAGDFSKPVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLT DF TS+A  DQT+KN IKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE
Sbjct: 661 SCSKMLTSDFITSMAFKDQTVKNKIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSL+KSGITCVQLVGSMSL+QR DAINRFIEDPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLSKSGITCVQLVGSMSLTQRADAINRFIEDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RI RF IENSIEERILKLQERKELVFEGTVG SNEALG+L+LDDMR+LF
Sbjct: 841 -----------RIMRFFIENSIEERILKLQERKELVFEGTVGRSNEALGRLTLDDMRYLF 875

Query: 901 I 902
           +
Sbjct: 901 L 875

BLAST of HG10000172 vs. ExPASy TrEMBL
Match: A0A5D3DZG8 (DNA repair protein RAD16 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold3695G00010 PE=3 SV=1)

HSP 1 Score: 1537.3 bits (3979), Expect = 0.0e+00
Identity = 783/901 (86.90%), Positives = 825/901 (91.56%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK  SNVLIE  G+ DGD+ DDI+V    SD GSE  SSSSED SE STKKSRA+
Sbjct: 1   MKLRPRKPASNVLIE-EGNVDGDSSDDIDV----SDCGSEDHSSSSEDFSEHSTKKSRAR 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
           TQ+KRIKK+GPSIEQEVGSNVGNDENL+NQKPEIAD QGV +IEKPKTKYSR KKPKPTL
Sbjct: 61  TQKKRIKKDGPSIEQEVGSNVGNDENLNNQKPEIADSQGVVEIEKPKTKYSR-KKPKPTL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LWN+WEEEYERWIDENI+KDFDLA+QNEVL E+VETP+ALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWNIWEEEYERWIDENIEKDFDLANQNEVLAESVETPAALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGL+RPS  PSSSK+LPLIKATLVICP
Sbjct: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLRRPSSNPSSSKELPLIKATLVICP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           VVAVSQWVSEIDRFTS+GSYKVLVYHGPKRVR+LEILSEYDFVITTYSVVEADYRK+LMP
Sbjct: 241 VVAVSQWVSEIDRFTSEGSYKVLVYHGPKRVRSLEILSEYDFVITTYSVVEADYRKYLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLF+KKNLK HL YICGPDAVKTEKQ+KQQRKRPIQPQI K+E+S  DKNN
Sbjct: 301 PKDRCPYCSKLFHKKNLKFHLMYICGPDAVKTEKQSKQQRKRPIQPQICKQEKSDKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
            VHKSG+QKS LGQT+G+HENDEKP G SILHSVIWDRVILDEAHFIKDRLSNTAKAVLA
Sbjct: 361 NVHKSGAQKSTLGQTLGEHENDEKPRGNSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSS TCPNCPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSPTCPNCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           VRHFCWWNKNIT RIQNFGRGPEF+RGMILLKHKILSSIVLRRTKKGRAADLALPPS VS
Sbjct: 481 VRHFCWWNKNITQRIQNFGRGPEFKRGMILLKHKILSSIVLRRTKKGRAADLALPPSTVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGT TSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTVTSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           SKT AI+ G+I DSD+NNKQVCG+CHEPAEEPV TSC+H FCKACIID+  DF K V+CP
Sbjct: 601 SKTKAINSGNIDDSDSNNKQVCGLCHEPAEEPVDTSCKHAFCKACIIDYAGDFSKPVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLT DF TS+A  DQT+KNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE
Sbjct: 661 SCSKMLTSDFITSMAFKDQTVKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSL+KSGITCVQLVGSMSL+QR DAINRFIEDPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLSKSGITCVQLVGSMSLTQRADAINRFIEDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RI RF IENSIEERILKLQERKELVFEGTVG SNEALG+L+LDDMR+LF
Sbjct: 841 -----------RIMRFFIENSIEERILKLQERKELVFEGTVGRSNEALGRLTLDDMRYLF 870

Query: 901 I 902
           +
Sbjct: 901 L 870

BLAST of HG10000172 vs. ExPASy TrEMBL
Match: A0A1S3C1J5 (DNA repair protein RAD16 OS=Cucumis melo OX=3656 GN=LOC103495970 PE=3 SV=1)

HSP 1 Score: 1537.3 bits (3979), Expect = 0.0e+00
Identity = 783/901 (86.90%), Positives = 825/901 (91.56%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK  SNVLIE  G+ DGD+ DDI+V    SD GSE  SSSSED SE STKKSRA+
Sbjct: 1   MKLRPRKPASNVLIE-EGNVDGDSSDDIDV----SDCGSEDHSSSSEDFSEHSTKKSRAR 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
           TQ+KRIKK+GPSIEQEVGSNVGNDENL+NQKPEIAD QGV +IEKPKTKYSR KKPKPTL
Sbjct: 61  TQKKRIKKDGPSIEQEVGSNVGNDENLNNQKPEIADSQGVVEIEKPKTKYSR-KKPKPTL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LWN+WEEEYERWIDENI+KDFDLA+QNEVL E+VETP+ALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWNIWEEEYERWIDENIEKDFDLANQNEVLAESVETPAALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGL+RPS  PSSSK+LPLIKATLVICP
Sbjct: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLRRPSSNPSSSKELPLIKATLVICP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           VVAVSQWVSEIDRFTS+GSYKVLVYHGPKRVR+LEILSEYDFVITTYSVVEADYRK+LMP
Sbjct: 241 VVAVSQWVSEIDRFTSEGSYKVLVYHGPKRVRSLEILSEYDFVITTYSVVEADYRKYLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLF+KKNLK HL YICGPDAVKTEKQ+KQQRKRPIQPQI K+E+S  DKNN
Sbjct: 301 PKDRCPYCSKLFHKKNLKFHLMYICGPDAVKTEKQSKQQRKRPIQPQICKQEKSDKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
            VHKSG+QKS LGQT+G+HENDEKP G SILHSVIWDRVILDEAHFIKDRLSNTAKAVLA
Sbjct: 361 NVHKSGAQKSTLGQTLGEHENDEKPRGNSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSS TCPNCPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSPTCPNCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           VRHFCWWNKNIT RIQNFGRGPEF+RGMILLKHKILSSIVLRRTKKGRAADLALPPS VS
Sbjct: 481 VRHFCWWNKNITQRIQNFGRGPEFKRGMILLKHKILSSIVLRRTKKGRAADLALPPSTVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGT TSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTVTSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           SKT AI+ G+I DSD+NNKQVCG+CHEPAEEPV TSC+H FCKACIID+  DF K V+CP
Sbjct: 601 SKTKAINSGNIDDSDSNNKQVCGLCHEPAEEPVDTSCKHAFCKACIIDYAGDFSKPVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLT DF TS+A  DQT+KNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE
Sbjct: 661 SCSKMLTSDFITSMAFKDQTVKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSL+KSGITCVQLVGSMSL+QR DAINRFIEDPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLSKSGITCVQLVGSMSLTQRADAINRFIEDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RI RF IENSIEERILKLQERKELVFEGTVG SNEALG+L+LDDMR+LF
Sbjct: 841 -----------RIMRFFIENSIEERILKLQERKELVFEGTVGRSNEALGRLTLDDMRYLF 870

Query: 901 I 902
           +
Sbjct: 901 L 870

BLAST of HG10000172 vs. ExPASy TrEMBL
Match: A0A6J1FYD3 (DNA repair protein RAD16 OS=Cucurbita moschata OX=3662 GN=LOC111448500 PE=3 SV=1)

HSP 1 Score: 1510.4 bits (3909), Expect = 0.0e+00
Identity = 764/901 (84.79%), Positives = 810/901 (89.90%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK TSN+LI+GN   DGDA D+I+VSSL+SDS SE  SSSSED  EPSTKKSRAK
Sbjct: 1   MKLRPRKPTSNILIQGN--ADGDASDEIDVSSLYSDSESEDPSSSSEDFCEPSTKKSRAK 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
            +RK IK+EGPSIEQEV   VGNDEN HNQ PE+   QGV DI KPKTKYSRKKK KP L
Sbjct: 61  KKRKGIKEEGPSIEQEVWRKVGNDENPHNQTPEVIPVQGVVDIGKPKTKYSRKKKQKPIL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LW+VW EE+ERWIDENI+KDFD+ASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWDVWAEEHERWIDENIEKDFDMASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           S ++GGILADEMGMGKTIQAIALVLAKR+LSG AGL+RPSPYPSSSKD PLIKATLV+CP
Sbjct: 181 SPVRGGILADEMGMGKTIQAIALVLAKRELSG-AGLRRPSPYPSSSKDFPLIKATLVVCP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           V+AVSQWVSEIDRFT KGS KV V+HGPKR ++LE L E+DFVITTYSVVEA+YRKHLMP
Sbjct: 241 VIAVSQWVSEIDRFTLKGSNKVHVFHGPKRAQSLETLFEFDFVITTYSVVEAEYRKHLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLFYKKNLKIHL+YICGPDAVKTEKQAKQ RKRPIQPQISK E S  DKNN
Sbjct: 301 PKDRCPYCSKLFYKKNLKIHLKYICGPDAVKTEKQAKQIRKRPIQPQISKGEVSAKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
             H SGSQKS  GQTMGQHENDE PCGKSILHSVIWDR+ILDEAHFIKDR SNTAKAVLA
Sbjct: 361 NFHNSGSQKSTFGQTMGQHENDENPCGKSILHSVIWDRIILDEAHFIKDRQSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSS++CP+CPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSVSCPDCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           +RHFCWWNK IT++IQN GRGPEF+RGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS
Sbjct: 481 MRHFCWWNKYITLQIQNVGRGPEFKRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           S+TN ISCGSI  +DNNN+Q CGICHEPAEEPVVTSCEHTFCKACII   NDF K V+CP
Sbjct: 601 SRTNVISCGSIDGTDNNNEQACGICHEPAEEPVVTSCEHTFCKACIIGFANDFSKLVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSKMLTIDF T+LA  D+TIKNTIKGFK +SILNRIQLENFQTSTKIEALREEIRFM E
Sbjct: 661 SCSKMLTIDFSTNLAGRDRTIKNTIKGFKCTSILNRIQLENFQTSTKIEALREEIRFMLE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSLTKSGITCVQL+GSMSL QR DAI RFI+DPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLTKSGITCVQLIGSMSLPQRDDAIKRFIDDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGG+ALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGIALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RITRF+IENSIEERILKLQERKELVFEGTVG SN+ALGKL+LDDMRFLF
Sbjct: 841 -----------RITRFVIENSIEERILKLQERKELVFEGTVGRSNDALGKLTLDDMRFLF 873

Query: 901 I 902
           I
Sbjct: 901 I 873

BLAST of HG10000172 vs. ExPASy TrEMBL
Match: A0A6J1J723 (DNA repair protein RAD16 OS=Cucurbita maxima OX=3661 GN=LOC111484062 PE=3 SV=1)

HSP 1 Score: 1508.0 bits (3903), Expect = 0.0e+00
Identity = 765/901 (84.91%), Positives = 806/901 (89.46%), Query Frame = 0

Query: 1   MKLRPRKTTSNVLIEGNGDGDGDAFDDIEVSSLFSDSGSEVLSSSSEDSSEPSTKKSRAK 60
           MKLRPRK TSN+LI+GN   DGDA DDI+VSSLFSDS SE  SSSSED  EPSTKKSRAK
Sbjct: 1   MKLRPRKPTSNILIQGN--ADGDASDDIDVSSLFSDSESEDPSSSSEDFCEPSTKKSRAK 60

Query: 61  TQRKRIKKEGPSIEQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKKKPKPTL 120
            +RK IK+EGPSIEQEV   VGND N HNQ PE+   QGV DI KPK KYSRKKK KP L
Sbjct: 61  KKRKGIKEEGPSIEQEVWRKVGNDGNPHNQTPEVIPVQGVVDIGKPKAKYSRKKKQKPIL 120

Query: 121 LWNVWEEEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180
           LW+VW EE+ERWIDENI+KDFD+ASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED
Sbjct: 121 LWDVWAEEHERWIDENIEKDFDMASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQED 180

Query: 181 SSIKGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICP 240
           S ++GGILADEMGMGKTIQAIALVLAKR+LSG AGL+RPSPYPSSSKD PLIKATLV+CP
Sbjct: 181 SPVRGGILADEMGMGKTIQAIALVLAKRELSG-AGLRRPSPYPSSSKDFPLIKATLVVCP 240

Query: 241 VVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMP 300
           V+AVSQWVSEIDRFT KGS KV V+HGPKR ++LE L E+DFVITTYSVVEA+YRKHLMP
Sbjct: 241 VIAVSQWVSEIDRFTLKGSNKVHVFHGPKRAQSLETLFEFDFVITTYSVVEAEYRKHLMP 300

Query: 301 PKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNN 360
           PKDRCPYCSKLFYKKNLKIHL+YICGPDAVKTEKQAKQ RKRPIQPQ+SK E S  DKNN
Sbjct: 301 PKDRCPYCSKLFYKKNLKIHLKYICGPDAVKTEKQAKQIRKRPIQPQLSKGEVSAKDKNN 360

Query: 361 TVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLA 420
             H SGSQKS  GQTMGQHENDE PCGKSILHSVIWDRVILDEAHFIKDR SNTAKAVLA
Sbjct: 361 NFHNSGSQKSTFGQTMGQHENDENPCGKSILHSVIWDRVILDEAHFIKDRQSNTAKAVLA 420

Query: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKR 480
           ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSS++CP+CPHKR
Sbjct: 421 ISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSVSCPDCPHKR 480

Query: 481 VRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540
           +RHFCWWNK IT+RIQN GRGPEF+RGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS
Sbjct: 481 MRHFCWWNKYITLRIQNVGRGPEFKRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVS 540

Query: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600
           IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY
Sbjct: 541 IRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVY 600

Query: 601 SKTNAISCGSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACP 660
           S+TN ISCGSI  +DNNN+  CGICHEPAEEPVVTSCEHTFCKACII   NDF K V+CP
Sbjct: 601 SRTNVISCGSIDGTDNNNEHACGICHEPAEEPVVTSCEHTFCKACIIGFANDFSKLVSCP 660

Query: 661 SCSKMLTIDFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFE 720
           SCSK LTIDF T+LA  DQTIKNTIKGFK +SILNRIQLENFQTSTKIEALREEIRFM E
Sbjct: 661 SCSKKLTIDFSTNLAGRDQTIKNTIKGFKCTSILNRIQLENFQTSTKIEALREEIRFMLE 720

Query: 721 RDGSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFL 780
           RDGSAKGIVFSQFTSFLDL+NYSLTKSGITCVQL+GSMSL QR DAI RFI+DPDCKIFL
Sbjct: 721 RDGSAKGIVFSQFTSFLDLINYSLTKSGITCVQLIGSMSLPQRDDAIKRFIDDPDCKIFL 780

Query: 781 MSLKAGGVALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLL 840
           MSLKAGG+ALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI              
Sbjct: 781 MSLKAGGIALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPI-------------- 840

Query: 841 SQDRTNTFDINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 900
                      RITRF+IENSIEERILKLQERKELVFEGTVG SNEALGKL+LDDMRFLF
Sbjct: 841 -----------RITRFVIENSIEERILKLQERKELVFEGTVGRSNEALGKLTLDDMRFLF 873

Query: 901 I 902
           I
Sbjct: 901 I 873
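
The percentages on the Identity and Positives lines of each BLAST record are the matched positions divided by the alignment length. As a minimal sketch of how to recheck them, the snippet below parses one such line and recomputes both values. The helper name `check_stats` and the regular expression are illustrative assumptions, not part of the database export; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Hypothetical pattern for the statistics line of a BLAST record.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_stats(line):
    """Return (recomputed identity %, reported identity %,
    recomputed positives %, reported positives %)."""
    m = STATS.search(line)
    if m is None:
        raise ValueError("not a BLAST identity line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    # Percentage = matched positions / alignment length, to two decimals.
    recomputed_ident = round(100 * int(ident) / int(ident_len), 2)
    recomputed_pos = round(100 * int(pos) / int(pos_len), 2)
    return recomputed_ident, float(ident_pct), recomputed_pos, float(pos_pct)


line = "Identity = 765/901 (84.91%), Positives = 806/901 (89.46%), Query Frame = 0"
print(check_stats(line))
```

For the Cucurbita maxima hit (A0A6J1J723), the recomputed values agree with the reported 84.91% identity and 89.46% positives.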

BLAST of HG10000172 vs. TAIR 10
Match: AT1G05120.1 (Helicase protein with RING/U-box domain )

HSP 1 Score: 945.3 bits (2442), Expect = 3.6e-275
Identity = 486/832 (58.41%), Positives = 609/832 (73.20%), Query Frame = 0

Query: 74  EQEVGSNVGNDENLHNQKPEIADYQGVDDIEKPKTKYSRKK----KPKPTLLWNVWEEEY 133
           E+E+   V ND+ L N  P +A       +  P+    RKK    K K  LLW  WE+E 
Sbjct: 52  EEELEEVVANDD-LPNPVPVLA------IVNLPRASKKRKKPDARKEKVVLLWETWEKEQ 111

Query: 134 ERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQEDSSIKGGILA 193
             WIDE++ +D DL   N V+ E  E PS L MPLLRYQKE+LAWA KQE  S+ GGILA
Sbjct: 112 NSWIDEHMSEDVDLDQHNAVIAETAEPPSDLIMPLLRYQKEFLAWATKQE-QSVAGGILA 171

Query: 194 DEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSKDLPLIKATLVICPVVAVSQWVS 253
           DEMGMGKTIQAI+LVLA+R++               ++       TLV+CP+VAVSQW++
Sbjct: 172 DEMGMGKTIQAISLVLARREV-------------DRAQFGEAAGCTLVLCPLVAVSQWLN 231

Query: 254 EIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMPPKDRCPYCS 313
           EI RFTS GS KVLVYHG KR +N++    YDFV+TTYS VE++YR+++MP K +C YCS
Sbjct: 232 EIARFTSPGSTKVLVYHGAKRAKNIKEFMNYDFVLTTYSTVESEYRRNIMPSKVQCAYCS 291

Query: 314 KLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNNTVHKSGSQK 373
           K FY K L IHLRY CGP AVKT KQ+KQ+RK+       + +E+   ++  + KS   K
Sbjct: 292 KSFYPKKLVIHLRYFCGPSAVKTAKQSKQKRKKTSDSSSQQGKEADAGEDKKLKKS---K 351

Query: 374 SALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLAISSSFRWAL 433
               QT+ + +       KS+LHSV W+R+ILDEAH+IK+R SNTA+AV A+ +++RWAL
Sbjct: 352 KKTKQTVEKDQLGSDDKEKSLLHSVKWNRIILDEAHYIKERRSNTARAVFALEATYRWAL 411

Query: 434 SGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDH-SSLTCPNCPHKRVRHFCWWN 493
           SGTP+QNRVGELYSL+RFLQI PYS+YFCKDCDCR LD+ +  +CP+CPH  VRHFCWWN
Sbjct: 412 SGTPLQNRVGELYSLIRFLQIRPYSYYFCKDCDCRILDYVAHQSCPHCPHNAVRHFCWWN 471

Query: 494 KNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAADLALPPSIVSIRRDTLDI 553
           K +   I  +G     +R MILLKHK+L  I+LRRTK GRAADLALPP I+++RRDTLD+
Sbjct: 472 KYVAKPITVYGSFGLGKRAMILLKHKVLKDILLRRTKLGRAADLALPPRIITLRRDTLDV 531

Query: 554 QEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVYSKTNAISC 613
           +E D+YESLY +S+A+FNT++ AGT  +NYAHIFDLL RLRQAV+HPYLVVYS ++  + 
Sbjct: 532 KEFDYYESLYKNSQAEFNTYIEAGTLMNNYAHIFDLLTRLRQAVDHPYLVVYSNSSGANA 591

Query: 614 GSIGDSDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACPSCSKMLTI 673
             +   +N ++Q CG+CH+PAE+ VVTSC H FCKAC+I  +   L  V CP+CSK+LT+
Sbjct: 592 NLV--DENKSEQECGLCHDPAEDYVVTSCAHVFCKACLIGFSAS-LGKVTCPTCSKLLTV 651

Query: 674 DFRTSLAVGDQTIKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFERDGSAKGI 733
           D+ T      +  K T+KGF++SSILNRI+L++FQTSTKIEALREEIRFM ERDGSAK I
Sbjct: 652 DWTTKADTEHKASKTTLKGFRASSILNRIKLDDFQTSTKIEALREEIRFMVERDGSAKAI 711

Query: 734 VFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFLMSLKAGGV 793
           VFSQFTSFLDL+NY+L K G++CVQLVGSM+++ R  AIN+F EDPDC++FLMSLKAGGV
Sbjct: 712 VFSQFTSFLDLINYTLGKCGVSCVQLVGSMTMAARDTAINKFKEDPDCRVFLMSLKAGGV 771

Query: 794 ALNLTVASHVFIMDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLLSQDRTNTF 853
           ALNLTVASHVF+MDPWWNPAVERQAQDRIHRIGQYKPI                      
Sbjct: 772 ALNLTVASHVFMMDPWWNPAVERQAQDRIHRIGQYKPI---------------------- 831

Query: 854 DINRITRFIIENSIEERILKLQERKELVFEGTVGGSNEALGKLSLDDMRFLF 901
              R+ RFIIEN++EERIL+LQ++KELVFEGTVGGS EA+GKL+ +DMRFLF
Sbjct: 832 ---RVVRFIIENTVEERILRLQKKKELVFEGTVGGSQEAIGKLTEEDMRFLF 831

BLAST of HG10000172 vs. TAIR 10
Match: AT1G02670.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )

HSP 1 Score: 515.4 bits (1326), Expect = 9.3e-146
Identity = 346/874 (39.59%), Positives = 464/874 (53.09%), Query Frame = 0

Query: 32  SLFSDSGSEVLSSSSEDSSEPSTKKSRAKTQRKRIKKEG-PSIEQEVGSNVGNDENLHNQ 91
           +L++ +    L  + E  S        + +Q + +K+E  P  +  VG  V  + N +  
Sbjct: 25  TLWTGTAQGDLGVAMEPHSHHKNAILPSSSQDENLKEEEVPDDDDSVGGEVQGEVNAN-- 84

Query: 92  KPEIADYQGVDDIEKPKTKYSRKKKPKPTLLWNVWEEEYERWIDENIDKDFDLASQNEVL 151
                     D I  P    + K+K      W + +E+ +   D++ D+      QN V+
Sbjct: 85  ----------DYIPNPAAPANTKRK------WQIMKEKVQMTEDDDFDE------QNAVI 144

Query: 152 TEAVETPSALTMPLLRYQKEWLAWALKQEDSSIKGGILADEMGMGKTIQAIALVLAKRQL 211
            EA E P  L +PLL+YQKE+LAWA  QE S+++GGILADEMGMGKTIQAI+LVLA+R++
Sbjct: 145 AEAAEQPLDLIIPLLKYQKEFLAWATIQELSAVRGGILADEMGMGKTIQAISLVLARREV 204

Query: 212 SGTAGLKRPSPYPSSSKDLPLIKATLVICPVVAVSQWVSEIDRFTSKGSYKVLVYHGPKR 271
                          +K    +  TLV+ P VA+SQW+ EI R TS GS +VL YHGPKR
Sbjct: 205 -------------DRAKSREAVGHTLVLVPPVALSQWLDEISRLTSPGSTRVLQYHGPKR 264

Query: 272 VRNLEILSEYDFVITTYSVVEADYRKHLMPPKDRCPYCSKLFYKKNLKIHLRYICGPDAV 331
            +N++ L  YDFV+TT  +VE +YR                                   
Sbjct: 265 DKNVQKLMNYDFVLTTSPIVENEYR----------------------------------- 324

Query: 332 KTEKQAKQQRKRPIQPQISKKEESVNDKNNTVHKSGSQKSALGQTMGQHENDEKPCGKSI 391
                               K+E V+                 +TM            S 
Sbjct: 325 --------------------KDEGVD-----------------ETM------------SP 384

Query: 392 LHSVIWDRVILDEAHFIKDRLSNTAKAVLAISSSFRWALSGTPIQNRVGELYSLVRFLQI 451
           LHS+ W+R+I+DEAH IK+R S TAKAV A+ +++RWALSGTP+QN V ELYSL      
Sbjct: 385 LHSIKWNRIIVDEAHDIKNRSSRTAKAVFALEATYRWALSGTPLQNDVDELYSL------ 444

Query: 452 VPYSF--YFCKDCDCRTLDHSSLTCPNCPHKRVRHFCWWNKNITVRIQNFGRGPEFQRGM 511
           V YSF  +F          H+ +T              + +N+TV+              
Sbjct: 445 VSYSFLNFFYSTYASFAFRHTHIT--------------FARNVTVK-------------- 504

Query: 512 ILLKHKILS-SIVLRRTKKGRAADLALPPSIVSIRRDTLDIQEEDFYESLYNDSRAKFNT 571
            L+   IL  SI +R         + +  S+   RRD L + E DFYESLY  S+  F+ 
Sbjct: 505 FLIGGNILPLSIPVRIENVPAVLIMQINTSLGGKRRDALSVVEADFYESLYKVSKTTFDG 564

Query: 572 FVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVYSKTNAISCGSIGDSDNNNKQVCGICHE 631
           ++ AGT  +NYAHIF LLIRLRQAV+HPYLV YS  +  +   +    N N++ CG  H+
Sbjct: 565 YIQAGTLMNNYAHIFGLLIRLRQAVDHPYLVSYSSPSGANANLL--DANKNEKECGFGHD 624

Query: 632 PAEEPVVTSCEHTFCKACIIDHTNDFLKSVACPSCSKMLTIDFRTSLAVGDQTIKNTIKG 691
           P+++  VTS EH                                       Q  K  +KG
Sbjct: 625 PSKDYFVTSSEH---------------------------------------QASKTKLKG 677

Query: 692 FKSSSILNRIQLENFQTSTKIEALREEIRFMFERDGSAKGIVFSQFTSFLDLMNYSLTKS 751
           F++SSILNRI L++F+TSTKIEALREEIRFM ERD SAK IVFSQFTSFLDL++Y+L KS
Sbjct: 685 FRASSILNRINLDDFKTSTKIEALREEIRFMVERDWSAKAIVFSQFTSFLDLISYALGKS 677

Query: 752 GITCVQLVGSMSLSQRGDAINRFIEDPDCKIFLMSLKAGGVALNLTVASHVFIMDPWWNP 811
           G++CVQLVGSMS + +  A+  F E+PDC++ LMSL+AGGVALNLT ASHVF+MDPWWNP
Sbjct: 745 GVSCVQLVGSMSKAAKDAALKNFKEEPDCRVLLMSLQAGGVALNLTAASHVFMMDPWWNP 677

Query: 812 AVERQAQDRIHRIGQYKPIRCSILINICGIFLLSQDRTNTFDINRITRFIIENSIEERIL 871
           AVERQAQDRIHRIGQ KP+                         R+ RFI+E ++EE+IL
Sbjct: 805 AVERQAQDRIHRIGQCKPV-------------------------RVVRFIMEKTVEEKIL 677

Query: 872 KLQERKELVFEGTVGGSNEA-LGKLSLDDMRFLF 901
            LQ++KE +FE T+G S EA + KL  DD++ LF
Sbjct: 865 TLQKKKEDLFESTLGDSEEAVVQKLGEDDIKSLF 677

BLAST of HG10000172 vs. TAIR 10
Match: AT3G20010.1 (SNF2 domain-containing protein / helicase domain-containing protein / zinc finger protein-related )

HSP 1 Score: 325.1 bits (832), Expect = 1.8e-88
Identity = 257/904 (28.43%), Positives = 395/904 (43.69%), Query Frame = 0

Query: 134  DENIDKDFDLASQ------NEVLTEAVETPSALTMPLLRYQKEWLAWALKQEDSSIK--G 193
            D N D D  L  Q      N+ +TE+   P  L++PL+R+QK  LAW  ++E SS    G
Sbjct: 245  DRNPDNDERLVYQAALQVLNQPMTESDLPPGTLSVPLMRHQKIALAWMFQKETSSFNCPG 304

Query: 194  GILADEMGMGKTIQAIALVLAKR---QLSGTAGLK------------------------- 253
            GILAD+ G+GKT+  IAL+L ++   QL   +  K                         
Sbjct: 305  GILADDQGLGKTVSTIALILKQKIVSQLKSESSCKQETEALVLDADDESDNAKHESGSHV 364

Query: 254  RPSPYPSSSKDLPLIKA----------------------------------TLVICPVVA 313
            +P    SS+ +  ++ A                                  TL++CP   
Sbjct: 365  KPELKVSSNSETSVLSACGNDENDSSDMEKAEDEEANSSTRAFQWKRPAAGTLIVCPASV 424

Query: 314  VSQWVSEIDRFTSKGS-YKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMPPK 373
            V QW  E+D   S+ S   VLVYHG  R ++   L+EYD V+TTY++V  +     +  +
Sbjct: 425  VRQWARELDEKVSEESKLSVLVYHGSNRTKDPNELAEYDVVVTTYAIVTNEAPNKFLVDE 484

Query: 374  DRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNNTV 433
            D                             E   K   +  +    S      N K   V
Sbjct: 485  D-----------------------------ENDEKNTDRYGLASGFSN-----NKKRKVV 544

Query: 434  HKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLAIS 493
              +  +    G+      + E  CG   L  V W R++LDEA  IK+  +  A++   + 
Sbjct: 545  VGASKKSKRRGRKSTNDTSSEPDCGP--LGKVGWFRIVLDEAQTIKNYRTQMARSCCTLR 604

Query: 494  SSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKRVR 553
            +  RW LSGTPIQN + +LYS  RFL+  PY+ Y           +S++  P       R
Sbjct: 605  AKRRWCLSGTPIQNTIDDLYSYFRFLRYDPYAVY--------KSFYSTIKVPIS-----R 664

Query: 554  HFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAAD----LALPPSI 613
            + C   K +                       +L +I+LRRT KG   D    + LPP +
Sbjct: 665  NSCQGYKKL---------------------QAVLRAIMLRRT-KGTLLDGKPIINLPPKV 724

Query: 614  VSIRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLV 673
            V++ +    + E  FY+ L  DSR++F  +  AGT + NYA+I  LL+RLRQA +HP LV
Sbjct: 725  VNLSQVDFSVAERSFYKKLEADSRSQFKAYADAGTLSQNYANILLLLLRLRQACDHPQLV 784

Query: 674  VYSKTNAISCGSIGDSD----------------NNNKQVCGICHEPAEEPVVTSCEHTFC 733
               + N+   G + ++                  ++  +C  C+EP E+PVVT C H FC
Sbjct: 785  --KRYNSDPVGKVSEAAVRRLPREARSRLINRLESSSAICYECNEPPEKPVVTLCGHIFC 844

Query: 734  KACIIDHTNDFLKSVACPSCSKMLTIDFRTSLAVGDQTIKNTIKGFK--SSSILNRIQLE 793
              C++++      +   P C + L  D    +   + +++N        SSS  N +   
Sbjct: 845  YECVLEYITGDENTCPVPRCKQQLARD----VVFSESSLRNCTSDDSGCSSSHDNGLDRS 904

Query: 794  NFQ----TSTKIEALREEIRFMFERD---------------------------------- 853
             FQ     S+KI+A+ + ++ + + D                                  
Sbjct: 905  VFQKRDFCSSKIKAVLDILQSLSQPDSPNSAQHGQMPSSSRPYDDDDVTIVEPMRLHSSS 964

Query: 854  ---GSAKGIVFSQFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIF 902
               G+ K I+FSQ+T  LDL+   + +SGI   +L G+MSL+ R  A+  F + PD K+ 
Sbjct: 965  PSQGAVKTIIFSQWTGMLDLVELRILESGIEFRRLDGTMSLAARDRAVKEFSKKPDVKVM 1024

BLAST of HG10000172 vs. TAIR 10
Match: AT1G50410.1 (SNF2 domain-containing protein / helicase domain-containing protein / zinc finger protein-related )

HSP 1 Score: 318.2 bits (814), Expect = 2.2e-86
Identity = 250/892 (28.03%), Positives = 384/892 (43.05%), Query Frame = 0

Query: 127 EEYERWIDENIDKDFDLASQNEVLTEAVETPSALTMPLLRYQKEWLAWALKQEDSSI--K 186
           EE     DE +     L   N+  +E       L++PL+++QK  LAW  ++E +S+   
Sbjct: 189 EERNSENDERLIYQAALQELNQPKSEVDLPAGLLSVPLMKHQKIALAWMFQKETNSLHCM 248

Query: 187 GGILADEMGMGKTIQAIALVLAKRQ----------------------------------- 246
           GGILAD+ G+GKT+  IAL+L +                                     
Sbjct: 249 GGILADDQGLGKTVSTIALILKQMHEAKLKSKNSGNQEAEALDLDADDESENAFEKPESK 308

Query: 247 ------LSGTAGLKRPSPYPSSSKDLPLIK-----ATLVICPVVAVSQWVSEID-RFTSK 306
                 ++G +G+K+     +S+      +      TL++CP   V QW  E+D + T +
Sbjct: 309 ASNGSGVNGDSGIKKAKGEEASTSTRKFNRKRPAAGTLIVCPASVVRQWARELDEKVTDE 368

Query: 307 GSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEADYRKHLMPPKDRCPYCSKLFYKKNL 366
               VL+YHG  R ++   L++YD V+TTY++V  +  K  +   D              
Sbjct: 369 AKLSVLIYHGGNRTKDPIELAKYDVVMTTYAIVSNEVPKQPLVDDD-------------- 428

Query: 367 KIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKEESVNDKNNTVHKSGSQKSALGQTMG 426
                          E   K   K  +         S+N K   V   G+ K +  +   
Sbjct: 429 ---------------ENDEKNSEKYGLASGF-----SINKKRKNV--VGTTKKSKKKKGN 488

Query: 427 QHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLSNTAKAVLAISSSFRWALSGTPIQNR 486
            +  D        L  V W RV+LDEA  IK+  +  A+A   + +  RW LSGTPIQN 
Sbjct: 489 NNAGDSSDPDSGTLAKVGWFRVVLDEAQTIKNHRTQVARACCGLRAKRRWCLSGTPIQNT 548

Query: 487 VGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLTCPNCPHKRVRHFCWWNKNITVRIQN 546
           + +LYS  RFL+  PY+ Y                         + FC   K        
Sbjct: 549 IDDLYSYFRFLKYDPYAVY-------------------------KSFCHQIK-------- 608

Query: 547 FGRGPEFQRGMILLK--HKILSSIVLRRTKKGRAAD----LALPPSIVSIRRDTLDIQEE 606
              GP  +  +   K    +L +I+LRRT KG   D    + LPP  +++ +    ++E 
Sbjct: 609 ---GPISRNSLQGYKKLQAVLRAIMLRRT-KGTLLDGQPIINLPPKTINLSQVDFSVEER 668

Query: 607 DFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRLRQAVNHPYLVVYSKTNAISCGSI 666
            FY  L +DSR++F  + AAGT   NYA+I  +L+RLRQA +HP LV   + N+ S G +
Sbjct: 669 SFYVKLESDSRSQFKAYAAAGTLNQNYANILLMLLRLRQACDHPQLV--KRYNSDSVGKV 728

Query: 667 GD---------------SDNNNKQVCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKS 726
            +               S   +  +C +CH+P E+PVVT C H FC  C+ D+      +
Sbjct: 729 SEEAVKKLPKEDLVSLLSRLESSPICCVCHDPPEDPVVTLCGHIFCYQCVSDYITGDEDT 788

Query: 727 VACPSCSKMLTID-------FRTSLA-----------VGDQTI--KNTIKGFKSSSILNR 786
              P C + L  D        R+ +A             D+++         K  ++L+ 
Sbjct: 789 CPAPRCREQLAHDVVFSKSTLRSCVADDLGCSSSEDNSHDKSVFQNGEFSSSKIKAVLDI 848

Query: 787 IQ-LENFQTSTKIE------------------------ALREEIRFMFERDGSAKGIVFS 846
           +Q L N  TS   +                          +  ++      G  K I+FS
Sbjct: 849 LQSLSNQGTSNSTQNGQMASSSQQPNDDDDDDDDDVTIVEKTSLKSTPSNGGPIKTIIFS 908

Query: 847 QFTSFLDLMNYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFLMSLKAGGVALN 902
           Q+T  LDL+  SL ++ I   +L G+MSL  R  A+  F  DPD K+ +MSLKAG + LN
Sbjct: 909 QWTGMLDLVELSLIENSIEFRRLDGTMSLIARDRAVKEFSNDPDVKVMIMSLKAGNLGLN 968

BLAST of HG10000172 vs. TAIR 10
Match: AT5G43530.1 (Helicase protein with RING/U-box domain )

HSP 1 Score: 302.4 bits (773), Expect = 1.2e-81
Identity = 221/760 (29.08%), Positives = 339/760 (44.61%), Query Frame = 0

Query: 184  KGGILADEMGMGKTIQAIALVLAKRQLSGTAGLKRPSPYPSSSK--------DLPLIKA- 243
            +GGILAD MG+GKT+  IAL+LA+                ++ K         L  +KA 
Sbjct: 681  RGGILADAMGLGKTVMTIALILARPGRGNPENEDVLVADVNADKRNRKEIHMALTTVKAK 740

Query: 244  --TLVICPVVAVSQWVSEIDRFTSKGSYKVLVYHGPKRVRNLEILSEYDFVITTYSVVEA 303
              TL+ICP+  +SQW  E++  +   +  VLVY+G  R  + + ++ +D V+TTY V+ +
Sbjct: 741  GGTLIICPMALLSQWKDELETHSKPDTVSVLVYYGGDRTHDAKAIASHDVVLTTYGVLTS 800

Query: 304  DYRKHLMPPKDRCPYCSKLFYKKNLKIHLRYICGPDAVKTEKQAKQQRKRPIQPQISKKE 363
             Y++ +                                                      
Sbjct: 801  AYKQDM------------------------------------------------------ 860

Query: 364  ESVNDKNNTVHKSGSQKSALGQTMGQHENDEKPCGKSILHSVIWDRVILDEAHFIKDRLS 423
                                                SI H + W R++LDEAH IK   +
Sbjct: 861  ----------------------------------ANSIFHRIDWYRIVLDEAHTIKSWKT 920

Query: 424  NTAKAVLAISSSFRWALSGTPIQNRVGELYSLVRFLQIVPYSFYFCKDCDCRTLDHSSLT 483
              AKA   +SS  RW L+GTP+QN++ +LYSL+ FL + P+                   
Sbjct: 921  QAAKATFELSSHCRWCLTGTPLQNKLEDLYSLLCFLHVEPWC------------------ 980

Query: 484  CPNCPHKRVRHFCWWNKNITVRIQNFGRGPEFQRGMILLKHKILSSIVLRRTKKGRAAD- 543
                      ++ WW+K I    +N        RG+ L+K  IL  ++LRRTK+ R  + 
Sbjct: 981  ----------NWAWWSKLIQKPYENGD-----PRGLKLIK-AILRPLMLRRTKETRDKEG 1040

Query: 544  ---LALPPSIVSIRRDTLDIQEEDFYESLYNDSRAKFNTFVAAGTATSNYAHIFDLLIRL 603
               L LPP+ V +        E DFY +L+  S+ +F+ FVA G    NYA+I +LL+RL
Sbjct: 1041 SLILELPPTDVQVIECEQSEAERDFYTALFKRSKVQFDQFVAQGKVLHNYANILELLLRL 1100

Query: 604  RQAVNHPYLVVY-------------------SKTNAISCGS---------IGDSDNNNKQ 663
            RQ  NHP+LV+                    +  +++S  +         I D  + N +
Sbjct: 1101 RQCCNHPFLVMSRADSQQYADLDSLARRFLDNNPDSVSQNAPSRAYIEEVIQDLRDGNSK 1160

Query: 664  VCGICHEPAEEPVVTSCEHTFCKACIIDHTNDFLKSVACPSCSKMLTIDFRTSLAVGDQT 723
             C IC E A++PV+T C H  C+ C++       +S +C  C    TI  RT L      
Sbjct: 1161 ECPICLESADDPVLTPCAHRMCRECLLTS----WRSPSCGLCPICRTILKRTELI----- 1220

Query: 724  IKNTIKGFKSSSILNRIQLENFQTSTKIEALREEIRFMFERDGSAKGIVFSQFTSFLDLM 783
                     + SI     ++N++ S+K+  L + +  + +     K IVFSQ+TSFLDL+
Sbjct: 1221 ------SCPTDSIFRVDVVKNWKESSKVSELLKCLEKIKKSGSGEKSIVFSQWTSFLDLL 1276

Query: 784  NYSLTKSGITCVQLVGSMSLSQRGDAINRFIEDPDCKIFLMSLKAGGVALNLTVASHVFI 843
               L + G   ++  G ++   R   +  F E     I LMSLKAGGV LNLT AS VF+
Sbjct: 1281 EIPLRRRGFEFLRFDGKLAQKGREKVLKEFNETKQKTILLMSLKAGGVGLNLTAASSVFL 1276

Query: 844  MDPWWNPAVERQAQDRIHRIGQYKPIRCSILINICGIFLLSQDRTNTFDINRITRFIIEN 901
            MDPWWNPAVE QA  RIHRIGQ + +                          + RFI+++
Sbjct: 1341 MDPWWNPAVEEQAIMRIHRIGQKRTV-------------------------FVRRFIVKD 1276

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038902046.1 | 0.0e+00 | 90.23 | ATP-dependent helicase rhp16 [Benincasa hispida]
XP_004151894.2 | 0.0e+00 | 86.79 | DNA repair protein RAD16 isoform X1 [Cucumis sativus] >KGN63243.1 hypothetical p...
XP_008455894.1 | 0.0e+00 | 86.90 | PREDICTED: DNA repair protein RAD16 [Cucumis melo] >KAA0034825.1 DNA repair prot...
XP_023512492.1 | 0.0e+00 | 85.02 | DNA repair protein RAD16 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023512493.1...
XP_022943925.1 | 0.0e+00 | 84.79 | DNA repair protein RAD16 [Cucurbita moschata] >XP_022943926.1 DNA repair protein...

Match Name | E-value | Identity | Description
P79051 | 2.9e-136 | 35.75 | ATP-dependent helicase rhp16 OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P31244 | 2.7e-134 | 36.61 | DNA repair protein RAD16 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c...
Q9LHE4 | 2.5e-87 | 28.43 | Helicase-like transcription factor CHR27 OS=Arabidopsis thaliana OX=3702 GN=CHR2...
Q94BR5 | 3.1e-85 | 28.03 | Helicase-like transcription factor CHR28 OS=Arabidopsis thaliana OX=3702 GN=CHR2...
Q9FIY7 | 1.7e-80 | 29.08 | DNA repair protein RAD5B OS=Arabidopsis thaliana OX=3702 GN=RAD5B PE=3 SV=1

Match Name | E-value | Identity | Description
A0A0A0LN53 | 0.0e+00 | 86.79 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G416820 PE=3 SV=1
A0A5D3DZG8 | 0.0e+00 | 86.90 | DNA repair protein RAD16 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A1S3C1J5 | 0.0e+00 | 86.90 | DNA repair protein RAD16 OS=Cucumis melo OX=3656 GN=LOC103495970 PE=3 SV=1
A0A6J1FYD3 | 0.0e+00 | 84.79 | DNA repair protein RAD16 OS=Cucurbita moschata OX=3662 GN=LOC111448500 PE=3 SV=1
A0A6J1J723 | 0.0e+00 | 84.91 | DNA repair protein RAD16 OS=Cucurbita maxima OX=3661 GN=LOC111484062 PE=3 SV=1

Match Name | E-value | Identity | Description
AT1G05120.1 | 3.6e-275 | 58.41 | Helicase protein with RING/U-box domain
AT1G02670.1 | 9.3e-146 | 39.59 | P-loop containing nucleoside triphosphate hydrolases superfamily protein
AT3G20010.1 | 1.8e-88 | 28.43 | SNF2 domain-containing protein / helicase domain-containing protein / zinc finge...
AT1G50410.1 | 2.2e-86 | 28.03 | SNF2 domain-containing protein / helicase domain-containing protein / zinc finge...
AT5G43530.1 | 1.2e-81 | 29.08 | Helicase protein with RING/U-box domain
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 694..714
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 334..359
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..100
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 32..50
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 334..385
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 360..377
None | No IPR available | PANTHER | PTHR45626 | TRANSCRIPTION TERMINATION FACTOR 2-RELATED | coord: 847..898; coord: 381..712
None | No IPR available | PANTHER | PTHR45626:SF33 | SUBFAMILY NOT NAMED | coord: 847..898
None | No IPR available | PANTHER | PTHR45626:SF33 | SUBFAMILY NOT NAMED | coord: 102..352
None | No IPR available | PANTHER | PTHR45626:SF33 | SUBFAMILY NOT NAMED | coord: 714..827; coord: 381..712
None | No IPR available | PANTHER | PTHR45626 | TRANSCRIPTION TERMINATION FACTOR 2-RELATED | coord: 102..352
None | No IPR available | PANTHER | PTHR45626 | TRANSCRIPTION TERMINATION FACTOR 2-RELATED | coord: 714..827
None | No IPR available | CDD | cd18008 | DEXDc_SHPRH-like | coord: 164..463; e-value: 9.26079E-81; score: 259.142
None | No IPR available | CDD | cd18793 | SF2_C_SNF | coord: 705..832; e-value: 1.69574E-50; score: 171.891
None | No IPR available | SUPERFAMILY | 57850 | RING/U-box | coord: 615..667
IPR001650 | Helicase, C-terminal | SMART | SM00490 | helicmild6 | coord: 738..821; e-value: 9.0E-15; score: 65.1
IPR001650 | Helicase, C-terminal | PFAM | PF00271 | Helicase_C | coord: 709..821; e-value: 1.5E-12; score: 47.8
IPR001650 | Helicase, C-terminal | PROSITE | PS51194 | HELICASE_CTER | coord: 703..873; score: 13.212744
IPR014001 | Helicase superfamily 1/2, ATP-binding domain | SMART | SM00487 | ultradead3 | coord: 160..469; e-value: 4.4E-25; score: 99.3
IPR014001 | Helicase superfamily 1/2, ATP-binding domain | PROSITE | PS51192 | HELICASE_ATP_BIND_1 | coord: 177..451; score: 15.098885
IPR001841 | Zinc finger, RING-type | SMART | SM00184 | ring_2 | coord: 622..662; e-value: 1.0E-6; score: 38.3
IPR001841 | Zinc finger, RING-type | PROSITE | PS50089 | ZF_RING_2 | coord: 622..663; score: 11.766381
IPR000330 | SNF2, N-terminal | PFAM | PF00176 | SNF2_N | coord: 179..598; e-value: 1.6E-52; score: 178.4
IPR038718 | SNF2-like, N-terminal domain superfamily | GENE3D | 3.40.50.10810 | | coord: 330..542; e-value: 9.1E-24; score: 85.6
IPR038718 | SNF2-like, N-terminal domain superfamily | GENE3D | 3.40.50.10810 | | coord: 133..305; e-value: 9.5E-25; score: 88.8
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 | | coord: 543..897; e-value: 1.3E-56; score: 194.0
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 525..900
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 118..500
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 617..693; e-value: 9.9E-12; score: 46.3
IPR018957 | Zinc finger, C3HC4 RING-type | PFAM | PF00097 | zf-C3HC4 | coord: 622..662; e-value: 3.8E-7; score: 29.9
IPR017907 | Zinc finger, RING-type, conserved site | PROSITE | PS00518 | ZF_RING_1 | coord: 637..646

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10000172.1 | HG10000172.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0016567 | protein ubiquitination
molecular_function | GO:0005524 | ATP binding
molecular_function | GO:0140658 | ATP-dependent chromatin remodeler activity
molecular_function | GO:0046872 | metal ion binding
molecular_function | GO:0004842 | ubiquitin-protein transferase activity