Moc10g04410 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc10g04410
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: homeobox protein HAT3.1
Location: chr10: 3067746 .. 3074082 (+)
RNA-Seq Expression: Moc10g04410
Synteny: Moc10g04410
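
The Location field above can be turned into a genomic span length with a few lines of Python. This is only a sketch: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the GFF-style convention), which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'chr10: 3067746 .. 3074082 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr10: 3067746 .. 3074082 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the gene, introns included, spans 6337 bp on the plus strand of chr10.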
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGAGAGACATGAATATACAGAACCAAGACCTAATAATAACTGTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTGCTAACTTGTTTTTCAAATGAACAAATGCATTCAATACCTGACAATCAGGAACTGGGAACAACTCCAGAATGTACCAGCAAAACTGCTGGTCCAGATGATGAAAAATCGGGGGTCCAGCAGAATATGGAGGAAGAGACTAAGGAACTTGGTTCTGGAGATGTGCTTAGTGAGTTACCGGAAAAGAATAATCAGACTATCTCTAAGCTTGCTGAAATTGATCAAGTTGAAGCTGGCAATTTATTATCCAGCGATATAGAAACTGAAAATTTAATATTACCTATTGAATTAGAGACAACAACTCTTAATGAGTGCTCTGAACTTCCACCAGAAGATGCCAACAAAAACTCTATCAAACAGGTGAACCCTCCCATTGAAGATTTAACCCAGAATACTTCTATCCAAAGGTTAGAAACAGTCCCCATAACTAGTGTCAGTATTTCCCAACAGTTGGGCCACAAGGATAAGAAAATTTTGAAATCAAAAAAGAAAAACTATATGTTAAGGTCCCTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAATGAGTTGAATAAGTTGACTGCCGGAGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAAGGAAAGGGAGCTAGTGGTGATGAATTTTCATCAATCAGGAATCGTTTGAGATATTTAGTGAACCGCATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGGTATGTATTTTCTTCTTTAATGGGTTCTAAATTGACTGTGTGCTGATACTTTCTTGCAGTACTTTATTTTTAAGATGGTCATTGTGTTCACGCCTACGTTGATTTTAATTCTGGTATTTGTATTCCTTGAAATGTCGAGGGAACTAAACACTACGCAGCACACTAGATTTAAAAGGTTGTTACTGGTCTCCAACCCATGCTTGACAATGGATCTGGGTAAAGGCTTGTTACAGGTTTACCGGCTTATGTACTTTCTAGCTTTTGGGTTTACTACCGTTGTTTATAGTCTTTCTTATTTTCAGCAGCTATCACGTACAAGTGCTCACTTGATCTTCTTATGGCTTTAACTAAGTCAAACTTGTAGAATTCCTACCTACAAATAATTTGAATGATAAAAAAGGTAGAAGCAATTCTATTAATTCATTCTGTTATGGAGTGGATTAGTGGATAACAGTTTCTGTGCTCCATATTTTTGGACCAATATGATTTGACCTTTTTGTTATAATTTTTCCTTTTGTTTCAGCCATTAATGTTTCCTACGATTATTTATCTCTTCATTTCATCTATCATATTGGATTGTAATGAAAAGCATGTCATCTGCTGATCATCACTATAACCATGCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGAGCATCAAGTGAAATAATGCGGGGTAAATTGAAAATAAGAGATCTATTTCAACATCTTGATTCACTTTGTGCTGAAGGAAGGCTTTCTGAGTCTCTATTCGATTCTGAAGGACAGATAGACAGTGAGGATGTATGGAGATATTTTCTTAAGATGAATTTTTATTAGATGTTATCTTTTGATTGATTCTCCATAAATTTCTACTCTGTTAGAGCATCTGGTTTTTGTGATAAATGACTTCTATATTGTGGACTTCTGGCCTTGTATCAGATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCACCAATTCTGTTTAGAACCACCTTTGCTAAATACAGACAGTAATATTCTTACTACGGACTAAAAATTAAAAAGTTGATTTGCGTTTCTCAAGTAACTTTTATTCATGCTCCCTCCTTCACTCTCACAGTTCCGCCGGATGAT
GAGGGCTGGCTATGTCCTGGATGTGATTGCAAAGATGACTGCTTGGATCTGCTTAATGAATTTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGGTAATTAAAAATTTGCAACATTTGTTCAGTAGGTGCTTTTCTGGTTGTTTGTTCTTTTGTTATTTTCCTCTAACTGTGTGCAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGGCAAAATTCTGATCACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGCTCCAGACACTATTAACCAGGAAGATGAATCGAGTTCTGATCAATCAAGTTCTGATGAATCTGGGTATGCTTCTGCTTCTGAGGAATTGGAGGCTGCACCCAATGATGACCAATACTTAGGTCTTCCTTCTGATGACTCGGAGGATGATGACTATGAACCTGGTGCTCCAGAACTTGATGAAGGTGTTAAACAGGAAAGTTCAGGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATGGCACTACGCCTGTGAGAAACTCTAATGGGCAAGGTTCTGGATGCGGTCCTCGCACGAGTGTACTACATAATGAGTTACAAAGTCTTCTAGAGTCAGGTCCTGATAAGGATGGTCTTGAACCTGTTTCAGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTGCATGATGTGAGTATTCTCTTATGTTTCACACTATGTTTTATTCTTATAATAAATTATATTGCTATTTTCTAATTTAGTCTTTGAAGTATGCAGGTGATTCTGTCTCTACCATATGCAAACATAGTTGATTGACCATATTCTTCTTATATGCTTCTTTTTAGAAAATAACTTTTCAAATGTAGGAGTGAGACTTTGAATCTTTGATAGGGAGGTTACTAATATTTTAATCAATTAGTTAAACTAGGCTCAACCTTTTATTTTGATAATTTTCAGTGTGCTACTTTGTATTTTATTGTGATAGAAAGATATTCCCTCTCGCTGACAGTTGTATTTATTGTGACAGTAGTATATGATTAAAAAAAAAGGCAAAATCTCATTTAACTTTAACTTTGGTTCGTAAAATTTAAAATTTATTATTTAAATCACAATTCCATAAATGGGTCAAAATAGTCCGTGCCTTTAGTTTTTTGTTAGCTAACAAATGAAAAGTTGACAAAGACACTTTTTAAATTTTGTTTTATATTTTACAATATTAGTTACGCAGATGCGGTGCTACAAAATTAATGATTTTTTTTTTTAACTCCTTTTGTGGAAAAAGTTAATATACTTCATTAAACGCCAAAATGATTAAACATAAAACAGATCTTAATTGAAGACAGAATTGGACACCAAATCTCAAATTTCTTCTAAATGTCAGCATAATTAATATTATAAAATTAATTAAAAGAAATGTCAAGATTTCTGTAGCTAAAAGAAATAGAACCATTTTGACCTATATTTTAAAATTCATAATTTAGATCTTAAGATTTGGAAACTTAGAAGGCCAAATAGCTTCAAAGGTAAAAGCTTGGGGACTTAAAATTCATAATTTTGTTATTTTAGTTGTCGCATTAAACATCAGGCAGTATTCCCTTTCTTTGGTTGCTATAATATTATTTCTGTGATTTTTTTTAGTTTCTAGGTGAGAGTGTGTTCTATGGTTTTTGCTAGATTGTAGAATATAAAATCAGTAGTAGAGGATTTCTCATTCTTTTTTCTTCTTTATTGAACAAGATACGAACTTTTCATTCAATGAAAAGGAAAAAAACTATTCAAATGATACAATCTCCCAAGGGGGAGAGGGAAGAAAATAAAAAGCAAGAACCCAAAGAGGGAAATACAACCCCACATAAAAAGAAAGATAAAAGACAAACAAGAACATTGAATAAAAGCTTTAGAAAAAGTCGGATTTGCAAGCATGGTAAACCTCCCACTTGCATGGAACCAGCCTTCGAAAGACTTAATGAACTTTTAACT
TCAGGAACCACTTCAAAAATAAACCTCTTCTTCTAAATTATTTTTTGAAATTAAAATAAAATGGAAAATTTTTCTACTGGCCTTGTCCCCTCCAAGATGTATTTATGTAGCTTTAGTACAACATGTTTATGTGTCTTGTGAGAACAAGTGTGAATGTACTTATGCATTAAATTTCCTATAAAACAAGCTCTAAATATCCAAAAGAGCTCATTTCAGACTAAATGAGTTATACTATTTCCCTTTTACATTTTTGAAGGAGACATACGGGAATGTTCCATCCGACTCAAGCGATGACACATTCGGGAGTATTTCTATTGATTCAAGCGATGACAGAGGTCGGGGTAGTAGAACAAGGAAGAGAAGCCCTAAAAACCTGGTTCCTGCATTAAATGGAACTAATGATGATTTGAAAAATAAAAAAACTAAACGTAGTTATAAGAGGAGAACTCATCAAAAGCCAGGTGCCGAAAATATGAAGAATTCTGTGACTAGGACTCCTGAAGACTCTGTGAAATCTAGTTCTTCTGTTAGACGAACTGCGTCATCATCAAATAGAAGACTCAGTCAACCAGCGTTGGAGGTATTTTTTATATTTTTCAGCTCTGTTATGATTCCCAAACATTATCTCCTTAAAATAGGTTATTTTATTTTCCCCTTTTGTTTGAACTGGGATTTCTTTTTTCCTTACTTTATCTTATTTCTCGTGCATCTGTGTTTTTCTTTTCCTTTCTTTCTATCCCCGTTTTTTTAGGGTGGGGTTGTGACATAAGGTCTTATTTTAAGATTTCTCTGATCTTGCTAACATATTTAAGAATGCACACTCTGGTGCTTAAAGACTTATATTGTGGCAAAGCAGATGAATATTGGTGTTACCTAATGTTAACTATGCGATCTGTTAGATTTGTTGTTTCAAAATTTCTTATTAAATTAAATGTGTCATTATGAATGTCAACTCTGAATCTGGTCGAGTGGTCTCATAAATGCTGAGACCCTCAAACAGCTTTCTTCTGTCTCTCATCCTCTATCCCATAAAAATTCAAAGAGTTTCAGACTGAATCTCGTGTTATGAGTTTAAATTCTCTCATTAGGTCAGTTATATTTATGAAATTGAACTGTCATGTGTTAATTTATATTAAAATGCTATTCCTTTTTGGGTTTGAATCAAGATTTTTCTTGATTATTGCAAATGATTTTGAGCATTTTTGCTCTTTATGTTTCCTGGTTTTCTTTTCTAATTTTTTTTTGCAACAGGCAGTAACAATTTTGAACTATTTATTTGAAGAGCACACTCTTGGTTTTAACTTTCAACTTGGATGCTTTACTATAGGTCTATAATTTATCAGCTTCCATGACATTGTTAGGTTCATTAGTTATCGAAACCCTTCATCTTCTTTAGATGCTTTATTGTGAAATGTCAGTTTCTCCAAATGCTTGTATGGTTGACTTGGTTTTTTAAACAAACGGGCCTTAGAATTTGGAACATGGTTATGATTTCCTTAAGAAGTACAAAATAGTTAGTTTTGAGTAAGAAGGCCACACATATAGGACGGAAAGGAGACAAGGAGGTCTTTATTTCTCTATTGTGGGAAAAAGAGTGTATACAATTAACTGTGAAAGATCAACTTTTTTAATCTGCAGGTCCAACATTTTAATGGGACAAACATTAATAATTATTAATTTAATGCCATTACATGTTCTTAATATTGTTGCAAACGTTCCAGAGACTTCTTGCATCGTTCCAAGAAAATCAGTATCCTAAACGAGCTACAAAAGAGAGTTTGGCACAAGAACTAGGACTCAGTCTGAAGCAGGTTATGCATTGGGTTCCTTGGAAGTTTAGCTAACATTTCTACTTTAGTGAAAGAACTTGGGCTCATTTATTTATTCCATTATTAGGTTAGCAAATGGTTTGAGAACACACGATGGAGCACACGCCATCCTTCAAGCATTGAATCCAATAAAGCAAAGAGTGCTTTAAGAATGGGGATTCAGTCATCTGA
GACGAGTGGAAAGCTGCCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACACCGATAACAATGGTGCTCAACACCAAGTATCACCAAACACAGATGGTGCTGTGGCCCCATGTCAGAGTGGAGATACAAGGGATGACAAATTGGCGACTCAGAAAACTACTAGACCAGAATCTACTGCTACAAAATCCAGAAAACGGAAGGGCAGGTCAGATCATGTGGCATCACATTCAAAGGACAGAAAGGAATCACAAAAGCCTCCTGCTAAGTCACCAAAAGTTAATCAAATACAAACAGCGGATAAGGTTAGGACAAGGAGGAGGAGATCCATTTAG

mRNA sequence

ATGGAAGAGAGACATGAATATACAGAACCAAGACCTAATAATAACTGTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTGCTAACTTGTTTTTCAAATGAACAAATGCATTCAATACCTGACAATCAGGAACTGGGAACAACTCCAGAATGTACCAGCAAAACTGCTGGTCCAGATGATGAAAAATCGGGGGTCCAGCAGAATATGGAGGAAGAGACTAAGGAACTTGGTTCTGGAGATGTGCTTAGTGAGTTACCGGAAAAGAATAATCAGACTATCTCTAAGCTTGCTGAAATTGATCAAGTTGAAGCTGGCAATTTATTATCCAGCGATATAGAAACTGAAAATTTAATATTACCTATTGAATTAGAGACAACAACTCTTAATGAGTGCTCTGAACTTCCACCAGAAGATGCCAACAAAAACTCTATCAAACAGGTGAACCCTCCCATTGAAGATTTAACCCAGAATACTTCTATCCAAAGGTTAGAAACAGTCCCCATAACTAGTGTCAGTATTTCCCAACAGTTGGGCCACAAGGATAAGAAAATTTTGAAATCAAAAAAGAAAAACTATATGTTAAGGTCCCTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAATGAGTTGAATAAGTTGACTGCCGGAGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAAGGAAAGGGAGCTAGTGGTGATGAATTTTCATCAATCAGGAATCGTTTGAGATATTTAGTGAACCGCATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGAGCATCAAGTGAAATAATGCGGGGTAAATTGAAAATAAGAGATCTATTTCAACATCTTGATTCACTTTGTGCTGAAGGAAGGCTTTCTGAGTCTCTATTCGATTCTGAAGGACAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCACCAATTCTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAGGGCTGGCTATGTCCTGGATGTGATTGCAAAGATGACTGCTTGGATCTGCTTAATGAATTTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGGCAAAATTCTGATCACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGCTCCAGACACTATTAACCAGGAAGATGAATCGAGTTCTGATCAATCAAGTTCTGATGAATCTGGGTATGCTTCTGCTTCTGAGGAATTGGAGGCTGCACCCAATGATGACCAATACTTAGGTCTTCCTTCTGATGACTCGGAGGATGATGACTATGAACCTGGTGCTCCAGAACTTGATGAAGGTGTTAAACAGGAAAGTTCAGGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATGGCACTACGCCTGTGAGAAACTCTAATGGGCAAGGTTCTGGATGCGGTCCTCGCACGAGTGTACTACATAATGAGTTACAAAGTCTTCTAGAGTCAGGTCCTGATAAGGATGGTCTTGAACCTGTTTCAGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTGCATGATGAGACATACGGGAATGTTCCATCCGACTCAAGCGATGACACATTCGGGAGTATTTCTATTGATTCAAGCGATGACAGAGGTCGGGGTAGTAGAACAAGGAAGAGAAGCCCTAAAAACCTGGTTCCTGCATTAAATGGAACTAATGATGATTTGAAAAATAAAAAAACTAAACGTAGTTATAAGAGGAGAACTCATCAAAAGCCAGGTGCCGAAAATATGAAGAA
TTCTGTGACTAGGACTCCTGAAGACTCTGTGAAATCTAGTTCTTCTGTTAGACGAACTGCGTCATCATCAAATAGAAGACTCAGTCAACCAGCGTTGGAGAGACTTCTTGCATCGTTCCAAGAAAATCAGTATCCTAAACGAGCTACAAAAGAGAGTTTGGCACAAGAACTAGGACTCAGTCTGAAGCAGGTTAGCAAATGGTTTGAGAACACACGATGGAGCACACGCCATCCTTCAAGCATTGAATCCAATAAAGCAAAGAGTGCTTTAAGAATGGGGATTCAGTCATCTGAGACGAGTGGAAAGCTGCCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACACCGATAACAATGGTGCTCAACACCAAGTATCACCAAACACAGATGGTGCTGTGGCCCCATGTCAGAGTGGAGATACAAGGGATGACAAATTGGCGACTCAGAAAACTACTAGACCAGAATCTACTGCTACAAAATCCAGAAAACGGAAGGGCAGGTCAGATCATGTGGCATCACATTCAAAGGACAGAAAGGAATCACAAAAGCCTCCTGCTAAGTCACCAAAAGTTAATCAAATACAAACAGCGGATAAGGTTAGGACAAGGAGGAGGAGATCCATTTAG

Coding sequence (CDS)

ATGGAAGAGAGACATGAATATACAGAACCAAGACCTAATAATAACTGTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTGCTAACTTGTTTTTCAAATGAACAAATGCATTCAATACCTGACAATCAGGAACTGGGAACAACTCCAGAATGTACCAGCAAAACTGCTGGTCCAGATGATGAAAAATCGGGGGTCCAGCAGAATATGGAGGAAGAGACTAAGGAACTTGGTTCTGGAGATGTGCTTAGTGAGTTACCGGAAAAGAATAATCAGACTATCTCTAAGCTTGCTGAAATTGATCAAGTTGAAGCTGGCAATTTATTATCCAGCGATATAGAAACTGAAAATTTAATATTACCTATTGAATTAGAGACAACAACTCTTAATGAGTGCTCTGAACTTCCACCAGAAGATGCCAACAAAAACTCTATCAAACAGGTGAACCCTCCCATTGAAGATTTAACCCAGAATACTTCTATCCAAAGGTTAGAAACAGTCCCCATAACTAGTGTCAGTATTTCCCAACAGTTGGGCCACAAGGATAAGAAAATTTTGAAATCAAAAAAGAAAAACTATATGTTAAGGTCCCTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAATGAGTTGAATAAGTTGACTGCCGGAGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAAGGAAAGGGAGCTAGTGGTGATGAATTTTCATCAATCAGGAATCGTTTGAGATATTTAGTGAACCGCATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGAGCATCAAGTGAAATAATGCGGGGTAAATTGAAAATAAGAGATCTATTTCAACATCTTGATTCACTTTGTGCTGAAGGAAGGCTTTCTGAGTCTCTATTCGATTCTGAAGGACAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCACCAATTCTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAGGGCTGGCTATGTCCTGGATGTGATTGCAAAGATGACTGCTTGGATCTGCTTAATGAATTTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGGCAAAATTCTGATCACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGCTCCAGACACTATTAACCAGGAAGATGAATCGAGTTCTGATCAATCAAGTTCTGATGAATCTGGGTATGCTTCTGCTTCTGAGGAATTGGAGGCTGCACCCAATGATGACCAATACTTAGGTCTTCCTTCTGATGACTCGGAGGATGATGACTATGAACCTGGTGCTCCAGAACTTGATGAAGGTGTTAAACAGGAAAGTTCAGGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATGGCACTACGCCTGTGAGAAACTCTAATGGGCAAGGTTCTGGATGCGGTCCTCGCACGAGTGTACTACATAATGAGTTACAAAGTCTTCTAGAGTCAGGTCCTGATAAGGATGGTCTTGAACCTGTTTCAGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTGCATGATGAGACATACGGGAATGTTCCATCCGACTCAAGCGATGACACATTCGGGAGTATTTCTATTGATTCAAGCGATGACAGAGGTCGGGGTAGTAGAACAAGGAAGAGAAGCCCTAAAAACCTGGTTCCTGCATTAAATGGAACTAATGATGATTTGAAAAATAAAAAAACTAAACGTAGTTATAAGAGGAGAACTCATCAAAAGCCAGGTGCCGAAAATATGAAGAA
TTCTGTGACTAGGACTCCTGAAGACTCTGTGAAATCTAGTTCTTCTGTTAGACGAACTGCGTCATCATCAAATAGAAGACTCAGTCAACCAGCGTTGGAGAGACTTCTTGCATCGTTCCAAGAAAATCAGTATCCTAAACGAGCTACAAAAGAGAGTTTGGCACAAGAACTAGGACTCAGTCTGAAGCAGGTTAGCAAATGGTTTGAGAACACACGATGGAGCACACGCCATCCTTCAAGCATTGAATCCAATAAAGCAAAGAGTGCTTTAAGAATGGGGATTCAGTCATCTGAGACGAGTGGAAAGCTGCCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACACCGATAACAATGGTGCTCAACACCAAGTATCACCAAACACAGATGGTGCTGTGGCCCCATGTCAGAGTGGAGATACAAGGGATGACAAATTGGCGACTCAGAAAACTACTAGACCAGAATCTACTGCTACAAAATCCAGAAAACGGAAGGGCAGGTCAGATCATGTGGCATCACATTCAAAGGACAGAAAGGAATCACAAAAGCCTCCTGCTAAGTCACCAAAAGTTAATCAAATACAAACAGCGGATAAGGTTAGGACAAGGAGGAGGAGATCCATTTAG

Protein sequence

MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
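
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. The sketch below assumes the standard genetic code; `translate` and the table names are hypothetical helpers, not part of the database. It is applied only to the first 21 and last 12 nucleotides of the CDS rather than to the full sequence.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGGAAGAGAGACATGAATAT"  # first seven codons of the CDS
cds_end = "AGATCCATTTAG"             # last four codons, including the stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MEERHEY and RSI*, matching the first seven and last three residues of the protein sequence, followed by the TAG stop codon.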
Homology
BLAST of Moc10g04410 vs. NCBI nr
Match: XP_022149322.1 (homeobox protein HAT3.1 isoform X1 [Momordica charantia] >XP_022149323.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia] >XP_022149324.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia] >XP_022149325.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia] >XP_022149326.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia])

HSP 1 Score: 1669.1 bits (4321), Expect = 0.0e+00
Identity = 876/876 (100.00%), Positives = 876/876 (100.00%), Query Frame = 0

Query: 1   MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPD 60
           MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPD
Sbjct: 1   MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPD 60

Query: 61  DEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENLIL 120
           DEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENLIL
Sbjct: 61  DEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENLIL 120

Query: 121 PIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGH 180
           PIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGH
Sbjct: 121 PIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGH 180

Query: 181 KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK 240
           KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK
Sbjct: 181 KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK 240

Query: 241 GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM 300
           GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
Sbjct: 241 GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM 300

Query: 301 RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI 360
           RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI
Sbjct: 301 RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI 360

Query: 361 CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP 420
           CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP
Sbjct: 361 CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP 420

Query: 421 EAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEEL 480
           EAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEEL
Sbjct: 421 EAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEEL 480

Query: 481 EAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVR 540
           EAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVR
Sbjct: 481 EAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVR 540

Query: 541 NSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS 600
           NSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS
Sbjct: 541 NSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS 600

Query: 601 DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKP 660
           DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKP
Sbjct: 601 DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKP 660

Query: 661 GAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESL 720
           GAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESL
Sbjct: 661 GAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESL 720

Query: 721 AQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGAC 780
           AQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGAC
Sbjct: 721 AQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGAC 780

Query: 781 FRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVA 840
           FRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVA
Sbjct: 781 FRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVA 840

Query: 841 SHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI 877
           SHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
Sbjct: 841 SHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI 876

BLAST of Moc10g04410 vs. NCBI nr
Match: KAG7030959.1 (Homeobox protein HAZ1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1232.6 bits (3188), Expect = 0.0e+00
Identity = 691/906 (76.27%), Positives = 759/906 (83.77%), Query Frame = 0

Query: 1   MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAG 60
           MEER EYTE R  N   AVQEAKA+  VEVLT  +NEQMHS P+  ELGT  + TSKT  
Sbjct: 1   MEERDEYTESRTINKSAAVQEAKANVEVEVLTSLANEQMHSAPNYLELGTIRDWTSKTGS 60

Query: 61  PDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENL 120
           PD+EK GV+QNMEE+ KELG G+  S LPE+++QTISKLA+ DQ EAGNLLSSD +TENL
Sbjct: 61  PDEEKPGVKQNMEEDRKELGLGEAHSGLPERSSQTISKLADNDQGEAGNLLSSDKDTENL 120

Query: 121 ILPIELET-TTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQ 180
           ILPIE+ET   LNECSE P ED NKN I+Q NPPIED  QNTSI+ L  VP      S +
Sbjct: 121 ILPIEVETMALLNECSEPPTEDDNKNYIEQANPPIEDSIQNTSIKNLNMVP----DNSPE 180

Query: 181 LGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKK 240
           LG KDK++L+SKKKNY+LRSLVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK+KK
Sbjct: 181 LGCKDKRVLRSKKKNYILRSLVSSDRVLRSRTQDKAKAPEPSNDLSNVTAGEEGKGKKKK 240

Query: 241 KKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR 300
           K R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Sbjct: 241 KNRKIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQR 300

Query: 301 ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDI 360
           AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDI
Sbjct: 301 ASNEIMRRKLKIRDLFQRIDALCSEGRFSEALFDSEGQIDSEDIFCGKCGSKELSLENDI 360

Query: 361 ILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDG 420
           ILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDG
Sbjct: 361 ILCDGVCDRGFHQFCLEPPLLNSDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDG 420

Query: 421 WEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSSD 480
           WEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+     D SSSDQSSSD
Sbjct: 421 WEKVFPEAAAAAAGRSSDHTMSLPSDDSDDGDYDPDVPDAIDQDGESSSDHSSSDQSSSD 480

Query: 481 ESGY--ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSE 540
           +SGY  ASASEELEA PNDDQYLGLPSDDSEDDDY+PGAP  DEGV+QESS SDFTSDSE
Sbjct: 481 KSGYASASASEELEAPPNDDQYLGLPSDDSEDDDYDPGAPVRDEGVEQESSSSDFTSDSE 540

Query: 541 DLAALDD---------------GTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG 600
           DLAAL D                T PVRNSNGQ SG GP  +  HN+L SL+ SGPD+ G
Sbjct: 541 DLAALVDNGSSKDDNIASSPLNNTVPVRNSNGQSSGRGPNKNAQHNKLSSLVGSGPDEGG 600

Query: 601 LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNL 660
           LE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN 
Sbjct: 601 LELVSGRRHVERLDYKKLHDETFGNVPTDSSDDTYGSDSIDSSDDRGRGRSTRKGSPKNP 660

Query: 661 VPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSS 720
           VPAL  NGT DDLKN KTKRS K RT QKP AENM NSVT+TPE ++KSSSSVRRT SSS
Sbjct: 661 VPALSRNGT-DDLKNIKTKRSSK-RTRQKPAAENMDNSVTKTPEGTLKSSSSVRRTTSSS 720

Query: 721 NRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIES 780
           +RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+
Sbjct: 721 HRRLSQPTLERLLASFQENQYPERATKESLARELGLSLKQVSKWFENTRWSTRHPSS-EA 780

Query: 781 NKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTR 840
           NKAKS  RMG QSS+TS K PKPEQESGACFRD  +NGAQHQ SP     VAPCQSG T 
Sbjct: 781 NKAKSGSRMGTQSSQTSRKPPKPEQESGACFRDICSNGAQHQESPKAISVVAPCQSGVTG 840

Query: 841 DDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT 877
           DDKLA QK  RPESTATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADKV+ 
Sbjct: 841 DDKLANQKPKRPESTATKSRKRKGRSDQVASRSKDRKKSRKPPAKSSKVDEIQTADKVKK 899
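
The percentages in an HSP summary line are the identical and positive (conservatively substituted) column counts divided by the alignment length, which includes gap columns and can therefore exceed the 876-residue query. A minimal sketch of reading such a line, using the figures from the Cucurbita argyrosperma hit; the regular expression and helper names are hypothetical, and the pattern also tolerates the "Postives" spelling that some BLAST reports print:

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def percent(count, total):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100 * count / total, 2)

line = "Identity = 691/906 (76.27%), Positives = 759/906 (83.77%), Query Frame = 0"
m = HSP_STATS.search(line)
identity = percent(int(m.group(1)), int(m.group(2)))
positives = percent(int(m.group(4)), int(m.group(5)))
print(identity, positives)
```

Recomputing the values this way gives a quick consistency check between the reported counts and the reported percentages.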

BLAST of Moc10g04410 vs. NCBI nr
Match: XP_022942376.1 (homeobox protein HAT3.1 [Cucurbita moschata] >XP_022942377.1 homeobox protein HAT3.1 [Cucurbita moschata])

HSP 1 Score: 1229.9 bits (3181), Expect = 0.0e+00
Identity = 692/909 (76.13%), Positives = 757/909 (83.28%), Query Frame = 0

Query: 1   MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAG 60
           MEER EYTE R  N   AVQEAKAS  VEVLT  +NEQMHS P+  ELGT  + TSKT  
Sbjct: 1   MEERDEYTESRTINKSAAVQEAKASVEVEVLTSLANEQMHSAPNYLELGTIRDWTSKTGS 60

Query: 61  PDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENL 120
           PD+EK GV+QNMEE+ KELG G+    LPEK++QTISKLA+ DQ EAGNLLSSD +TENL
Sbjct: 61  PDEEKPGVKQNMEEDRKELGLGEAHRGLPEKSSQTISKLADNDQDEAGNLLSSDKDTENL 120

Query: 121 ILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQ 180
           ILPIE+ETT  LNECSE P ED NKN I+Q NPPIED  QNTSI  L  VP      S +
Sbjct: 121 ILPIEVETTALLNECSEPPTEDNNKNYIEQANPPIEDSIQNTSITNLNMVP----DNSPE 180

Query: 181 LGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAG-EGKRKKKK 240
           +G KDK++LKSKKKNY+LRSL+SSDRVLRSRTQ+KAKAPEPSN+L+ +TAG EGK KKK 
Sbjct: 181 VGCKDKRVLKSKKKNYILRSLISSDRVLRSRTQDKAKAPEPSNDLSNVTAGEEGKGKKKN 240

Query: 241 RNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS 300
           R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS
Sbjct: 241 RKIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS 300

Query: 301 SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIIL 360
           +EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIIL
Sbjct: 301 NEIMRRKLKIRDLFQRIDALCSEGRFSEALFDSEGQIDSEDIFCGKCGSKELSLENDIIL 360

Query: 361 CDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWE 420
           CDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWE
Sbjct: 361 CDGVCDRGFHQFCLEPPLLNSDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWE 420

Query: 421 KVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSDQS 480
           KV+PEAAAAAAGQ+SDH + LPSDDS+DGDYDPD PD I+Q+          D+SSSD S
Sbjct: 421 KVFPEAAAAAAGQSSDHTMSLPSDDSDDGDYDPDVPDAIDQDGESRSDHSSSDQSSSDLS 480

Query: 481 SSDESGY--ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTS 540
           SSD+SGY  ASASEELEA PNDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTS
Sbjct: 481 SSDKSGYASASASEELEAPPNDDQYLGLPSDDSEDDDYDPGAPVRDEGVGQESSSSDFTS 540

Query: 541 DSEDLAALDD---------------GTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPD 600
           DSEDLAAL D                T PVRNS+GQ SG GP  +  HN+L SL+ SGPD
Sbjct: 541 DSEDLAALVDNGSSKDDNIASSPLNNTVPVRNSDGQSSGRGPNKNAQHNKLSSLVGSGPD 600

Query: 601 KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSP 660
           + GLE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SP
Sbjct: 601 EGGLELVSGRRHVERLDYKKLHDETFGNVPTDSSDDTYGSDSIDSSDDRGRGRSTRKGSP 660

Query: 661 KNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTA 720
           KN VPAL  NGT DDLKN KTKRS K RT QKP AENM NSVT+TPE ++KSSSSVRRT 
Sbjct: 661 KNPVPALSRNGT-DDLKNIKTKRSSK-RTRQKPAAENMDNSVTKTPEGTLKSSSSVRRTT 720

Query: 721 SSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSS 780
           SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS
Sbjct: 721 SSSHRRLSQPTLERLLASFQENQYPERATKESLARELGLSLKQVSKWFENTRWSTRHPSS 780

Query: 781 IESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSG 840
            E+NKAKSA RMG QSS+TS K PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG
Sbjct: 781 -EANKAKSASRMGTQSSQTSRKPPKPEQESGACFRDTCSNGAQHQESPKAISVVAPCQSG 840

Query: 841 DTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK 877
            T DDKLA QK  RPES ATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADK
Sbjct: 841 VTGDDKLANQKPKRPESAATKSRKRKGRSDQVASRSKDRKKSRKPPAKSSKVDEIQTADK 900

BLAST of Moc10g04410 vs. NCBI nr
Match: XP_038876083.1 (homeobox protein HAT3.1 isoform X1 [Benincasa hispida] >XP_038876090.1 homeobox protein HAT3.1 isoform X1 [Benincasa hispida] >XP_038876099.1 homeobox protein HAT3.1 isoform X1 [Benincasa hispida])

HSP 1 Score: 1229.9 bits (3181), Expect = 0.0e+00
Identity = 697/906 (76.93%), Positives = 753/906 (83.11%), Query Frame = 0

Query: 1    MEERHEY--TEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKT 60
            MEER E   TE RPNN+ E VQEAKAS  VEVLTC SNE MHS    QELGTTPE +SKT
Sbjct: 146  MEERDENTDTESRPNNSAEPVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKT 205

Query: 61   AGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETE 120
             GPD+EK GVQQNM     ELGSG +LSEL EK+NQT+S  A+ DQVEAGNLLSSD +TE
Sbjct: 206  DGPDEEKPGVQQNM-----ELGSGYLLSELLEKDNQTVSNHADNDQVEAGNLLSSDKDTE 265

Query: 121  NLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSIS 180
            NL LPIE+ETTT LNECSELP ED NKN I+Q+NPPIEDLTQN SIQ LE +P    S S
Sbjct: 266  NLKLPIEVETTTLLNECSELPVEDVNKNHIEQMNPPIEDLTQNNSIQNLEKIP----SNS 325

Query: 181  QQLGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKR--K 240
            QQLG KDK ILKSKK NY LRSLVSSDRVLRSRTQEKAKAPEPSN LN  TA EGKR  K
Sbjct: 326  QQLGRKDKGILKSKKTNYRLRSLVSSDRVLRSRTQEKAKAPEPSNYLNNFTAEEGKRKKK 385

Query: 241  KKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ 300
            KKKRNI+GK A  DE+SSIR +LRYL+NRI YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Sbjct: 386  KKKRNIQGKEARVDEYSSIRKQLRYLLNRIGYEQSLIEAYSSEGWKGFSSDKLKPEKELQ 445

Query: 301  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEND 360
            RAS+EIM+ KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEND
Sbjct: 446  RASNEIMQRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEND 505

Query: 361  IILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD 420
            IILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD
Sbjct: 506  IILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD 565

Query: 421  GWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDE-------SSSDQS 480
             WEKVYPEAAAAAAGQNSDH LGLPSDDSEDGDYDPD PDTI+Q++E       SSSDQS
Sbjct: 566  TWEKVYPEAAAAAAGQNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSSSSDQS 625

Query: 481  SSDESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDS 540
            +SD SGYASASE LE  PNDDQYLGLPSDDSEDDDY+P  PELDEGV++ESS SDFTSDS
Sbjct: 626  NSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDS 685

Query: 541  EDLAALD--------------DGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG 600
            EDLAALD              + T  V+NSNGQ SGCGP  S LHNEL SL      KDG
Sbjct: 686  EDLAALDNNRPSKDDDFVSSLNNTLSVKNSNGQSSGCGPSKSALHNELSSL------KDG 745

Query: 601  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNL 660
            LEPVSGRRQVERLDYKKLHDETYGNVP+DSSDDT+GS S+DSS DRG  S TRKR P+NL
Sbjct: 746  LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTSMDSSHDRGWDSSTRKRGPENL 805

Query: 661  VPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSS 720
            V AL  NGTNDDL N KTKRS+K RT QK  A N+ NSVT TP D+ KSSSS R+T SSS
Sbjct: 806  VLALSNNGTNDDLTNVKTKRSHK-RTRQKAAAINVNNSVTETPVDTAKSSSSARQTTSSS 865

Query: 721  NRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIES 780
            NRRLSQPALERL ASFQEN+YPKRATKESLAQELGLSLKQVS+WFENTRWSTRHPSS   
Sbjct: 866  NRRLSQPALERLFASFQENEYPKRATKESLAQELGLSLKQVSRWFENTRWSTRHPSS-GG 925

Query: 781  NKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTR 840
            N+AKS+ RM   SS+ SG+LPK EQESGACFRDTD+NGAQHQ  P  +    PCQSGDT 
Sbjct: 926  NRAKSSSRMSNLSSKASGELPKNEQESGACFRDTDSNGAQHQDLPTANSFATPCQSGDTG 985

Query: 841  DDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT 877
            D KL T+KT R ES+ATKSRKRK  SDH+ASH+KD++ SQ+PPAKSPKVN+IQTAD+ +T
Sbjct: 986  DKKLVTRKTKRAESSATKSRKRKRPSDHMASHAKDKEISQRPPAKSPKVNEIQTADRFKT 1032

BLAST of Moc10g04410 vs. NCBI nr
Match: XP_023531864.1 (homeobox protein HAT3.1 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023531873.1 homeobox protein HAT3.1 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023531880.1 homeobox protein HAT3.1 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1226.5 bits (3172), Expect = 0.0e+00
Identity = 692/911 (75.96%), Positives = 759/911 (83.32%), Query Frame = 0

Query: 1   MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAG 60
           MEER EYTE R  N   AVQEAKAS  VEVLT  +NEQMHS P+  ELGT  + TSKT  
Sbjct: 1   MEERDEYTESRTINKSAAVQEAKASVEVEVLTSLANEQMHSAPNYLELGTIRDWTSKTGS 60

Query: 61  PDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENL 120
           PD+EK GV+QNMEE+ +ELG G+  S LPEK++QTISKLA+ DQ EAGNLLSSD +TENL
Sbjct: 61  PDEEKPGVKQNMEEDRRELGLGEAHSGLPEKSSQTISKLADNDQGEAGNLLSSDKDTENL 120

Query: 121 ILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQ 180
           ILPIE+ETT  LNEC E P ED NKN I+Q NPPIED  QNT I+ L  VP      S +
Sbjct: 121 ILPIEVETTALLNECLEPPTEDDNKNYIEQANPPIEDSIQNTYIKNLNMVP----DNSPE 180

Query: 181 LGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKK 240
           LG KDK++LKSKKKNY+LRSLVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK+KK
Sbjct: 181 LGCKDKRVLKSKKKNYILRSLVSSDRVLRSRTQDKAKAPEPSNDLSNVTAGEEGKGKKKK 240

Query: 241 KKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR 300
           K R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Sbjct: 241 KNRKIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQR 300

Query: 301 ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDI 360
           AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDI
Sbjct: 301 ASNEIMRRKLKIRDLFQRIDALCSEGRFSEALFDSEGQIDSEDIFCGKCGSKELSLENDI 360

Query: 361 ILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDG 420
           ILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDG
Sbjct: 361 ILCDGVCDRGFHQFCLEPPLLNSDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDG 420

Query: 421 WEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSD 480
           WEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+          D+SSSD
Sbjct: 421 WEKVFPEAAAAAAGRSSDHTMSLPSDDSDDGDYDPDVPDAIDQDGESSSDHSSSDQSSSD 480

Query: 481 QSSSDESGY--ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDF 540
            SSSD+SGY  ASASEELEA PNDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDF
Sbjct: 481 LSSSDKSGYASASASEELEAPPNDDQYLGLPSDDSEDDDYDPGAPVRDEGVGQESSSSDF 540

Query: 541 TSDSEDLAAL-------DD--------GTTPVRNSNGQGSGCGPRTSVLHNELQSLLESG 600
           TSDSEDLAAL       DD         T PVRNSNGQ SG GP  S  HN+L SL+ SG
Sbjct: 541 TSDSEDLAALVNNGSSKDDNIASSPLNNTVPVRNSNGQSSGRGPNKSAQHNKLSSLVGSG 600

Query: 601 PDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKR 660
           PD+ GLE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK 
Sbjct: 601 PDEGGLELVSGRRHVERLDYKKLHDETFGNVPTDSSDDTYGSDSIDSSDDRGRGRSTRKG 660

Query: 661 SPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRR 720
           SPKN VPAL  NGT DDLKN KTK S K RT QKP AENM NSVT+TPE ++KSSSSVRR
Sbjct: 661 SPKNPVPALSRNGT-DDLKNIKTKCSSK-RTRQKPAAENMDNSVTKTPEGTLKSSSSVRR 720

Query: 721 TASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHP 780
           T SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHP
Sbjct: 721 TTSSSHRRLSQPTLERLLASFQENQYPERATKESLARELGLSLKQVSKWFENTRWSTRHP 780

Query: 781 SSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQ 840
           SS E+NKA+SA RMG QSS+TS K PKPEQESGACFRDT +NGAQHQ SP     VAPCQ
Sbjct: 781 SS-EANKARSASRMGTQSSQTSRKPPKPEQESGACFRDTCSNGAQHQESPKAISVVAPCQ 840

Query: 841 SGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTA 877
           SG T DDK A Q+T RPESTATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTA
Sbjct: 841 SGVTGDDKSANQRTKRPESTATKSRKRKGRSDQVASRSKDRKKSRKPPAKSSKVDEIQTA 900

BLAST of Moc10g04410 vs. ExPASy Swiss-Prot
Match: Q04996 (Homeobox protein HAT3.1 OS=Arabidopsis thaliana OX=3702 GN=HAT3.1 PE=1 SV=3)

HSP 1 Score: 437.6 bits (1124), Expect = 3.4e-121
Identity = 280/557 (50.27%), Positives = 359/557 (64.45%), Query Frame = 0

Query: 208 RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYE 267
           R Q   +   PS+ + N    G  K+K K  N KG+    DE++ I+ +LRY +NRI YE
Sbjct: 136 RAQRSKEDAGPSSVVANSTPVGRPKKKNKTMN-KGQVREDDEYTRIKKKLRYFLNRINYE 195

Query: 268 QSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESL 327
           QSLIDAYS EGWKG S +K++PEKEL+RA+ EI+R KLKIRDLFQHLD+LCAEG L ESL
Sbjct: 196 QSLIDAYSLEGWKGSSLEKIRPEKELERATKEILRRKLKIRDLFQHLDTLCAEGSLPESL 255

Query: 328 FDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGW 387
           FD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGW
Sbjct: 256 FDTDGEISSEDIFCAKCGSKDLSVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDEGW 315

Query: 388 LCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGD 447
           LCPGCDCKDD LDLLN+  G+  S++D WEK++PEAAAA  G   +    LPSDDS+D +
Sbjct: 316 LCPGCDCKDDSLDLLNDSLGTKFSVSDSWEKIFPEAAAALVGGGQNLDCDLPSDDSDDEE 375

Query: 448 YDPDA-PDTINQED------ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLPS 507
           YDPD   D  N ED      ES ++  SSDE+ + SAS+E+     E        + LPS
Sbjct: 376 YDPDCLNDNENDEDGSDDNEESENEDGSSDETEFTSASDEMIESFKEGKDIMKDVMALPS 435

Query: 508 DDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPRT 567
           DDSEDDDY+P AP  D+   +ESS SD TSD+EDL     G  T  +  +      G +T
Sbjct: 436 DDSEDDDYDPDAPTCDD--DKESSNSDCTSDTEDLETSFKGDETNQQAEDTPLEDPGRQT 495

Query: 568 SVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSIS 627
           S L  +  ++LES  G D DG   VS RR VERLDYKKL+DE Y NVP+ SSDD      
Sbjct: 496 SQLQGD--AILESDVGLD-DGPAGVSRRRNVERLDYKKLYDEEYDNVPTSSSDD------ 555

Query: 628 IDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAEN 687
            D  D   R  +    S    + VP    +N +D  +K    K+KR+ K+ T + P    
Sbjct: 556 -DDWDKTARMGKEDSESEDEGDTVPLKQSSNAEDHTSKKLIRKSKRADKKDTLEMPQEGP 615

Query: 688 MKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQEL 742
            +N            S  + +++SS+ ++ + P  +RL  SFQENQYP +ATKESLA+EL
Sbjct: 616 GENG----------GSGEIEKSSSSACKQ-TDPKTQRLYISFQENQYPDKATKESLAKEL 668

BLAST of Moc10g04410 vs. ExPASy Swiss-Prot
Match: P48786 (Pathogenesis-related homeodomain protein OS=Petroselinum crispum OX=4043 GN=PRH PE=2 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 4.9e-120
Identity = 310/727 (42.64%), Positives = 409/727 (56.26%), Query Frame = 0

Query: 116  ENLILPIELETTTLN-ECSELPPEDANKN-SIKQVNPPIEDLTQ---------------- 175
            E+L +P + ++ T N + SELPPE+A KN +  Q     +D T+                
Sbjct: 302  ESLTIPTDNQSRTYNSDQSELPPENAAKNCNHAQFGHQSDDTTKISGFKELVIGQETVAK 361

Query: 176  -------------------NTSIQRLETVPITSVSISQQLGHKDK--------------- 235
                                T +++L  V  T+   S QLG   K               
Sbjct: 362  SPSQLVDAGKRGRGRPRKVQTGLEQLVPVQETAAKSSSQLGDTGKRSRGRPRKVQDSPTS 421

Query: 236  -----KILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEG---KRKKK 295
                 K++  K K+    S V+S R LRSR+QEK+  P    ++N + A EG   ++ +K
Sbjct: 422  LGGNVKVVPEKGKDSQELS-VNSSRSLRSRSQEKSIEP----DVNNIVADEGADREKPRK 481

Query: 296  KRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRA 355
            KR  + +    DEF  IR  LRYL++RIKYE++ +DAYS EGWKG S DK+KPEKEL+RA
Sbjct: 482  KRKKRMEENRVDEFCRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRA 541

Query: 356  SSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDII 415
             +EI   KLKIRDLFQ LD   +EGRL E LFDS G+IDSEDIFCAKCGSK+++L NDII
Sbjct: 542  KAEIFGRKLKIRDLFQRLDLARSEGRLPEILFDSRGEIDSEDIFCAKCGSKDVTLSNDII 601

Query: 416  LCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGW 475
            LCDG CDRGFHQFCL+PPLL   IPPDDEGWLCPGC+CK DC+ LLN+ Q +N+ + D W
Sbjct: 602  LCDGACDRGFHQFCLDPPLLKEYIPPDDEGWLCPGCECKIDCIKLLNDSQETNILLGDSW 661

Query: 476  EKVY-PEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYA 535
            EKV+  EAAAAA+G+N D   GLPSDDSED DYDP  PD    +++   D SS+DES Y 
Sbjct: 662  EKVFAEEAAAAASGKNLDDNSGLPSDDSEDDDYDPGGPDL---DEKVQGDDSSTDESDYQ 721

Query: 536  SASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD 595
            S S++++     +   GLPSDDSEDD+Y+P     D+  K +SS SDFTSDSED   + D
Sbjct: 722  SESDDMQVIRQKNS-RGLPSDDSEDDEYDPSGLVTDQMYK-DSSCSDFTSDSEDFTGVFD 781

Query: 596  GTTPVRNSNGQGSGCGPRTSVLHNELQSLLESG-PDKDGLEPVSGRRQVERLDYKKLHD- 655
                  +    G   GP  S   +   +    G P++    P+  RRQVE LDYKKL+D 
Sbjct: 782  ------DYKDTGKAQGPLASTPDHVRNNEEGCGHPEQGDTAPLYPRRQVESLDYKKLNDI 841

Query: 656  -------------------------ETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKR 715
                                     E YGN  SDSSD+ +    + SS D+    +    
Sbjct: 842  EFSKMCDILDILSSQLDVIICTGNQEEYGNTSSDSSDEDY---MVTSSPDKNNSDKEATA 901

Query: 716  SPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTA 755
              +         + +L  K  + ++ RR  +K   E   + ++R+ ED   S++ V  + 
Sbjct: 902  MER----GRESGDLELDQKARESTHNRRYIKKFAVEGTDSFLSRSCED---SAAPVAGSK 961

BLAST of Moc10g04410 vs. ExPASy Swiss-Prot
Match: P46605 (Homeobox protein HOX1A OS=Zea mays OX=4577 GN=HOX1A PE=2 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 8.1e-107
Identity = 271/655 (41.37%), Positives = 374/655 (57.10%), Query Frame = 0

Query: 193 YMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSI 252
           Y L S  S  RVLRS +  K  + E    +        KR+K  R      +S DEFS I
Sbjct: 70  YTLMSSNSDVRVLRSTSSSKTTSTE---HVQAPVQPAAKRRKMSR--ASNKSSTDEFSQI 129

Query: 253 RNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQH 312
           R R+RY++NR+ YEQSLI+AY+SEGWK  S DK++PEKEL+RA SEI+R KL+IR++F++
Sbjct: 130 RKRVRYILNRMNYEQSLIEAYASEGWKNQSLDKIRPEKELERAKSEILRCKLRIREVFRN 189

Query: 313 LDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEP 372
           +DSL ++G++ E+LFDSEG+I  EDIFC+ CGS + +L NDIILCDG CDRGFHQ CL P
Sbjct: 190 IDSLLSKGKIDETLFDSEGEISCEDIFCSTCGSNDATLGNDIILCDGACDRGFHQNCLNP 249

Query: 373 PLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSD 432
           PL   DIP  DEGWLCP CDCK DC+DL+NE  GSN+SI D WEKV+P+AAA A     D
Sbjct: 250 PLRTEDIPMGDEGWLCPACDCKIDCIDLINELHGSNISIEDSWEKVFPDAAAMANDSKQD 309

Query: 433 HALGLPSDDSEDGDYDPDAPD--TINQEDESSSDQ----SSSDESGYASASEELEAAPN- 492
            A  LPSDDS+D D+DP+ P+   + +++ESS +     S SD+S + + S++ E   + 
Sbjct: 310 DAFDLPSDDSDDNDFDPNMPEEHVVGKDEESSEEDEDGGSDSDDSDFLTCSDDSEPLIDK 369

Query: 493 --DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG--SDFTSDSEDL------AALDDG 552
             DD  L LPS+DSEDDDY+P  P+ D+ V+++SS   SDFTSDS+D       +  D+ 
Sbjct: 370 KVDD--LRLPSEDSEDDDYDPAGPDSDKDVEKKSSSDESDFTSDSDDFCKEISKSGHDEV 429

Query: 553 TTPVRNSNGQG-----SGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKL 612
           ++P+      G     +     TS   + +++ ++ G     + P S RRQ ERLDYKKL
Sbjct: 430 SSPLLPDAKVGDMEKITAQAKTTSSADDPMETEIDQGV----VLPDSRRRQAERLDYKKL 489

Query: 613 HDETYGNVPSDSSDDTFGS---ISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKK 672
           +DE YG   SDSSDD   S     I  S++ G  +     SP      +   ND+L  + 
Sbjct: 490 YDEAYGEASSDSSDDEEWSGKNTPIIKSNEEGEAN-----SPAGKGSRVVHHNDELTTQS 549

Query: 673 TKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPAL-ERLLASF 732
           TK+S            ++  SV   P D   + S+     S++ +    P + ++L   F
Sbjct: 550 TKKS----------LHSIHGSVDEKPGDLTSNGSN-----STARKGHFGPVINQKLHEHF 609

Query: 733 QENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSET 792
           +   YP R+ KESLA+ELGL+ +QV+KWFE  R S R  SS +             S  T
Sbjct: 610 KTQPYPSRSVKESLAEELGLTFRQVNKWFETRRHSARVASSRKGISLDKHSPQNTNSQVT 669

Query: 793 SGKLPK-PE----QESGACFR---DTDNNGAQHQVSPNTDGAVAPCQSGDTRDDK 814
           +   PK PE    +ES  C              +V   T G+       D+ +D+
Sbjct: 670 ASMEPKEPEGTVVEESNVCLNGGTTISKEAVSSKVGSRTPGSDVGGSKVDSAEDQ 693

BLAST of Moc10g04410 vs. ExPASy Swiss-Prot
Match: Q8H991 (Homeobox protein HAZ1 OS=Oryza sativa subsp. japonica OX=39947 GN=HAZ1 PE=2 SV=1)

HSP 1 Score: 387.9 bits (995), Expect = 3.1e-106
Identity = 267/620 (43.06%), Positives = 368/620 (59.35%), Query Frame = 0

Query: 183 KKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAG---EGKRKKKKRNI 242
           +++ K +K++  LR   S  RVLRS +++K KA    NEL    AG     K++K  R  
Sbjct: 93  QRVAKKRKRSKPLRPAPS--RVLRSTSEKKNKA---HNELLNDGAGVQPAEKKRKVGRPP 152

Query: 243 KGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEI 302
           KG G   D++  IR R+RY++NR+ YEQSLI AY+SEGWKG S +K++PEKEL+RA  EI
Sbjct: 153 KG-GTPKDDYLMIRKRVRYVLNRMNYEQSLIQAYASEGWKGQSLEKIRPEKELERAKVEI 212

Query: 303 MRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG 362
           +R K +IR+ F++LDSL +EG+L ES+FDS G+I SEDIFCA CGSK+++L+NDIILCDG
Sbjct: 213 LRCKSRIREAFRNLDSLLSEGKLDESMFDSAGEISSEDIFCAACGSKDVTLKNDIILCDG 272

Query: 363 ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVY 422
           ICDRGFHQ+CL PPLL  DIP  DEGWLCP CDCK DC+D+LNE QG  LSI D WEKV+
Sbjct: 273 ICDRGFHQYCLNPPLLAEDIPQGDEGWLCPACDCKIDCIDVLNELQGVKLSIHDSWEKVF 332

Query: 423 PEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQ---------SSSDE 482
           PEAA+   G     A  LPSDDS D DYDP        ++E SS +          SS E
Sbjct: 333 PEAASFLNGSKQIDASDLPSDDSADNDYDPTLAQGHKVDEEKSSGEDGGEGLDSDDSSSE 392

Query: 483 SGYASASEELEAAPN----DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG-----SD 542
              +S  E+ + + N    DD  LGLPS+DSED D++P  P+ D+    ES+      SD
Sbjct: 393 DSESSEKEKSKTSQNGRTVDD--LGLPSEDSEDGDFDPAGPDSDKEQNDESNSDQSDESD 452

Query: 543 FTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVL-----------------HNELQSLL 602
           FTSDS+D  A       +  S GQ    GP +S +                  N   + +
Sbjct: 453 FTSDSDDFCA------EIAKSCGQDEISGPSSSQIRTVDRTDGSGFDGEPNAENSNLAFM 512

Query: 603 ESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDT--FGSISIDSSDDRGRGS 662
           E+  ++D + P+S +RQVERLDYKKL++E YG   SDSSDD   +G    +S+ ++G   
Sbjct: 513 ETELEQDMVLPISSKRQVERLDYKKLYNEAYGKASSDSSDDEEWYG----NSTPEKG--- 572

Query: 663 RTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQ--KPGAENMKNSVTRTPEDSVKSS 722
                   +L  +  G     +    +      T Q  +PG      SV+    + + S+
Sbjct: 573 NLEDSETDSLAESPQGGKGFSRRAPVRYHNNEHTPQNVRPG-----GSVSDQQTEVLCSN 632

Query: 723 SSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRW 756
           S+    +++ NR       ++L A F+E+ YP RATKE+LAQELGL+  QV+KWF +TR 
Sbjct: 633 SN---GSTAKNRHFGPAINQKLKAHFKEDPYPSRATKENLAQELGLTFNQVTKWFSSTRH 683

BLAST of Moc10g04410 vs. ExPASy Swiss-Prot
Match: P48785 (Pathogenesis-related homeodomain protein OS=Arabidopsis thaliana OX=3702 GN=PRH PE=2 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 1.3e-51
Identity = 183/626 (29.23%), Positives = 284/626 (45.37%), Query Frame = 0

Query: 181 KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK 240
           K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K
Sbjct: 34  KKGKEVSNKRNSKQNKRKAEEELCSKSRTKKYSRGWVRCEEMEEEKVKKTRKRKSKRQQK 93

Query: 241 GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM 300
                 D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG S +K++P+KEL+RA  EI+
Sbjct: 94  DNKVEVDDSLRLQRRTRYLLIKMKMQQNLIDAYATEGWKGQSREKIRPDKELERARKEIL 153

Query: 301 RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI 360
             KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG 
Sbjct: 154 NCKLGLRDAIRQLDLLSSVGSMEEKVIASDGSIHHDHIFCAECNSREAFPDNDIILCDGT 213

Query: 361 CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP 420
           C+R FHQ CL+PPL    IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ 
Sbjct: 214 CNRAFHQKCLDPPLETESIPPGDQGWFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQDIFN 273

Query: 421 EAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE 480
           E A+   G  +  ++    PSDDS+D DYDP        E   +   +SS+ SG      
Sbjct: 274 EEASLPIGSEATVNNEADWPSDDSKDDDYDP--------EMRENGGGNSSNVSG------ 333

Query: 481 ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTP 540
                           D   D+D E                    S S  L+   DG   
Sbjct: 334 ----------------DGGGDNDEE--------------------SISTSLSLSSDGV-- 393

Query: 541 VRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNV 600
              +   GS  G R S +  + ++  E        E V G RQ   +DY +L+ E +G  
Sbjct: 394 ---ALSTGSWEGHRLSNMVEQCETSNE--------ETVCGPRQRRTVDYTQLYYEMFG-- 453

Query: 601 PSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQ 660
             D+     GS   D   +  R  +    +   LV     +  D                
Sbjct: 454 -KDAVLQEQGSEDEDWGPNDRRKRKRESDAGSTLVTMCESSKKD---------------- 513

Query: 661 KPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATK 720
                     V  T E S + S SV          RL + A+E+L   F E + P +A +
Sbjct: 514 --------QDVVETLEQSERDSVSVENKGGRRRMFRLPRNAVEKLRQVFAETELPSKAVR 556

Query: 721 ESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES 780
           + LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    PE   
Sbjct: 574 DRLAKELSLDPEKVNKWFKNTRY-----MALRNRKTESVKQPG-DSKTVSGGDSGPEAV- 556

Query: 781 GACFRDTDNNGAQHQVSPNTDGAVAP 804
                  +NN   ++V    D  V P
Sbjct: 634 ------MENNTETNEVQDTLDDTVPP 556

BLAST of Moc10g04410 vs. ExPASy TrEMBL
Match: A0A6J1D6Q5 (homeobox protein HAT3.1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111017765 PE=3 SV=1)

HSP 1 Score: 1669.1 bits (4321), Expect = 0.0e+00
Identity = 876/876 (100.00%), Positives = 876/876 (100.00%), Query Frame = 0

Query: 1   MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPD 60
           MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPD
Sbjct: 1   MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPD 60

Query: 61  DEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENLIL 120
           DEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENLIL
Sbjct: 61  DEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENLIL 120

Query: 121 PIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGH 180
           PIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGH
Sbjct: 121 PIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGH 180

Query: 181 KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK 240
           KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK
Sbjct: 181 KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK 240

Query: 241 GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM 300
           GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
Sbjct: 241 GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM 300

Query: 301 RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI 360
           RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI
Sbjct: 301 RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI 360

Query: 361 CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP 420
           CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP
Sbjct: 361 CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP 420

Query: 421 EAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEEL 480
           EAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEEL
Sbjct: 421 EAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEEL 480

Query: 481 EAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVR 540
           EAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVR
Sbjct: 481 EAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVR 540

Query: 541 NSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS 600
           NSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS
Sbjct: 541 NSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS 600

Query: 601 DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKP 660
           DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKP
Sbjct: 601 DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKP 660

Query: 661 GAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESL 720
           GAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESL
Sbjct: 661 GAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESL 720

Query: 721 AQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGAC 780
           AQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGAC
Sbjct: 721 AQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGAC 780

Query: 781 FRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVA 840
           FRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVA
Sbjct: 781 FRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVA 840

Query: 841 SHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI 877
           SHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
Sbjct: 841 SHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI 876

BLAST of Moc10g04410 vs. ExPASy TrEMBL
Match: A0A6J1FNP3 (homeobox protein HAT3.1 OS=Cucurbita moschata OX=3662 GN=LOC111447439 PE=3 SV=1)

HSP 1 Score: 1229.9 bits (3181), Expect = 0.0e+00
Identity = 692/909 (76.13%), Positives = 757/909 (83.28%), Query Frame = 0

Query: 1   MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAG 60
           MEER EYTE R  N   AVQEAKAS  VEVLT  +NEQMHS P+  ELGT  + TSKT  
Sbjct: 1   MEERDEYTESRTINKSAAVQEAKASVEVEVLTSLANEQMHSAPNYLELGTIRDWTSKTGS 60

Query: 61  PDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENL 120
           PD+EK GV+QNMEE+ KELG G+    LPEK++QTISKLA+ DQ EAGNLLSSD +TENL
Sbjct: 61  PDEEKPGVKQNMEEDRKELGLGEAHRGLPEKSSQTISKLADNDQDEAGNLLSSDKDTENL 120

Query: 121 ILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQ 180
           ILPIE+ETT  LNECSE P ED NKN I+Q NPPIED  QNTSI  L  VP      S +
Sbjct: 121 ILPIEVETTALLNECSEPPTEDNNKNYIEQANPPIEDSIQNTSITNLNMVP----DNSPE 180

Query: 181 LGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAG-EGKRKKKK 240
           +G KDK++LKSKKKNY+LRSL+SSDRVLRSRTQ+KAKAPEPSN+L+ +TAG EGK KKK 
Sbjct: 181 VGCKDKRVLKSKKKNYILRSLISSDRVLRSRTQDKAKAPEPSNDLSNVTAGEEGKGKKKN 240

Query: 241 RNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS 300
           R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS
Sbjct: 241 RKIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS 300

Query: 301 SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIIL 360
           +EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIIL
Sbjct: 301 NEIMRRKLKIRDLFQRIDALCSEGRFSEALFDSEGQIDSEDIFCGKCGSKELSLENDIIL 360

Query: 361 CDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWE 420
           CDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWE
Sbjct: 361 CDGVCDRGFHQFCLEPPLLNSDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWE 420

Query: 421 KVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSDQS 480
           KV+PEAAAAAAGQ+SDH + LPSDDS+DGDYDPD PD I+Q+          D+SSSD S
Sbjct: 421 KVFPEAAAAAAGQSSDHTMSLPSDDSDDGDYDPDVPDAIDQDGESRSDHSSSDQSSSDLS 480

Query: 481 SSDESGY--ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTS 540
           SSD+SGY  ASASEELEA PNDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTS
Sbjct: 481 SSDKSGYASASASEELEAPPNDDQYLGLPSDDSEDDDYDPGAPVRDEGVGQESSSSDFTS 540

Query: 541 DSEDLAALDD---------------GTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPD 600
           DSEDLAAL D                T PVRNS+GQ SG GP  +  HN+L SL+ SGPD
Sbjct: 541 DSEDLAALVDNGSSKDDNIASSPLNNTVPVRNSDGQSSGRGPNKNAQHNKLSSLVGSGPD 600

Query: 601 KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSP 660
           + GLE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SP
Sbjct: 601 EGGLELVSGRRHVERLDYKKLHDETFGNVPTDSSDDTYGSDSIDSSDDRGRGRSTRKGSP 660

Query: 661 KNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTA 720
           KN VPAL  NGT DDLKN KTKRS K RT QKP AENM NSVT+TPE ++KSSSSVRRT 
Sbjct: 661 KNPVPALSRNGT-DDLKNIKTKRSSK-RTRQKPAAENMDNSVTKTPEGTLKSSSSVRRTT 720

Query: 721 SSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSS 780
           SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS
Sbjct: 721 SSSHRRLSQPTLERLLASFQENQYPERATKESLARELGLSLKQVSKWFENTRWSTRHPSS 780

Query: 781 IESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSG 840
            E+NKAKSA RMG QSS+TS K PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG
Sbjct: 781 -EANKAKSASRMGTQSSQTSRKPPKPEQESGACFRDTCSNGAQHQESPKAISVVAPCQSG 840

Query: 841 DTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK 877
            T DDKLA QK  RPES ATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADK
Sbjct: 841 VTGDDKLANQKPKRPESAATKSRKRKGRSDQVASRSKDRKKSRKPPAKSSKVDEIQTADK 900

BLAST of Moc10g04410 vs. ExPASy TrEMBL
Match: A0A6J1IPM8 (homeobox protein HAT3.1-like OS=Cucurbita maxima OX=3661 GN=LOC111478790 PE=3 SV=1)

HSP 1 Score: 1225.3 bits (3169), Expect = 0.0e+00
Identity = 691/906 (76.27%), Positives = 759/906 (83.77%), Query Frame = 0

Query: 1   MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAG 60
           MEER EYTE R  +   AVQEAKAS  VEVLT  +NEQ+ S P+  ELGT  + TSKT  
Sbjct: 1   MEERDEYTESRTISKSAAVQEAKASVEVEVLTSLANEQIDSAPNYLELGTIRDWTSKTGS 60

Query: 61  PDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENL 120
           PD+EK GV+QNMEE++KEL  G+  SELPEK++QTISKLAE DQ EAGNLLSSD +TENL
Sbjct: 61  PDEEKPGVKQNMEEDSKELCLGEAHSELPEKSSQTISKLAENDQGEAGNLLSSDKDTENL 120

Query: 121 ILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQ 180
           ILPIE+ETT  LNECSE P ED NKN I+Q NPPIE   QNTSI+ L  VP      S +
Sbjct: 121 ILPIEVETTALLNECSEPPTEDDNKNYIEQANPPIEASIQNTSIKNLNMVP----DNSPE 180

Query: 181 LGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKK 240
           LG KDK++LKSKKKNY+LRSLVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK++K
Sbjct: 181 LGCKDKRVLKSKKKNYILRSLVSSDRVLRSRTQDKAKAPEPSNDLSNVTAGEEGKGKKRK 240

Query: 241 KKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR 300
           K R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Sbjct: 241 KNRKIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQR 300

Query: 301 ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDI 360
           AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDI
Sbjct: 301 ASNEIMRRKLKIRDLFQRIDALCSEGRFSEALFDSEGQIDSEDIFCGKCGSKELSLENDI 360

Query: 361 ILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDG 420
           ILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDG
Sbjct: 361 ILCDGVCDRGFHQFCLEPPLLNSDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDG 420

Query: 421 WEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSSD 480
           WEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+     D SSSDQSSSD
Sbjct: 421 WEKVFPEAAAAAAGRSSDHTMSLPSDDSDDGDYDPDVPDAIDQDGESSSDHSSSDQSSSD 480

Query: 481 ESGY--ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSE 540
           +SGY  ASASEELEA PNDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTSDSE
Sbjct: 481 KSGYASASASEELEAPPNDDQYLGLPSDDSEDDDYDPGAPVRDEGVGQESSSSDFTSDSE 540

Query: 541 DLAALDD---------------GTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG 600
           DLAAL D                T PVRNSNGQ SG GP  +  HN+L SL+ SGPD+ G
Sbjct: 541 DLAALVDNGSSKDDNIASSPLNNTAPVRNSNGQSSGRGPNKNAQHNKLSSLVGSGPDEGG 600

Query: 601 LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNL 660
           LE VSGRR VERLDYKKLHDET+GNVPS+SSDDT+GS SIDSSDDRGRG  TRK SPKNL
Sbjct: 601 LELVSGRRHVERLDYKKLHDETFGNVPSNSSDDTYGSDSIDSSDDRGRGRSTRKGSPKNL 660

Query: 661 VPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSS 720
           VPAL  NGT DD KN KTK S  RRT QKP AENM NSVT+TPE ++KSSSSVRRT SSS
Sbjct: 661 VPALSRNGT-DDSKNIKTKCS-SRRTRQKPAAENMDNSVTKTPEGTLKSSSSVRRTTSSS 720

Query: 721 NRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIES 780
           +RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+
Sbjct: 721 HRRLSQPTLERLLASFQENQYPERATKESLARELGLSLKQVSKWFENTRWSTRHPSS-EA 780

Query: 781 NKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTR 840
           NKAKSA RMG QSS+TS K PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG T 
Sbjct: 781 NKAKSASRMGTQSSQTSRKSPKPEQESGACFRDTCSNGAQHQESPKAITVVAPCQSGVTG 840

Query: 841 DDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT 877
           DDKLA  KT RPESTATKSRKRKGRSD VAS SK+RK+S+KPPAKS KV++IQTADKV+ 
Sbjct: 841 DDKLAYHKTKRPESTATKSRKRKGRSDQVASRSKNRKKSRKPPAKSSKVDEIQTADKVKK 899

BLAST of Moc10g04410 vs. ExPASy TrEMBL
Match: A0A1S3C283 (pathogenesis-related homeodomain protein OS=Cucumis melo OX=3656 GN=LOC103496194 PE=3 SV=1)

HSP 1 Score: 1201.8 bits (3108), Expect = 0.0e+00
Identity = 687/904 (76.00%), Positives = 751/904 (83.08%), Query Frame = 0

Query: 1    MEERHEY--TEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKT 60
            MEER E   TE RPN   EAVQEAKAS  VEV TC SNE M+S    QELGTTPE + KT
Sbjct: 175  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKT 234

Query: 61   AGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETE 120
             GPD+EK+GVQQNM     ELGSG +LSEL EK+NQTIS  A+ DQVEAGN LS D +T+
Sbjct: 235  DGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTK 294

Query: 121  NLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSIS 180
            NL L IE ETTT LNECSELP ED  KN I+++NPPIEDLTQ TSIQ LET+P    S S
Sbjct: 295  NLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP----SNS 354

Query: 181  QQLGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTA-GEGKR-K 240
            QQL HKD++  KSKKKNY LRSLVSSDRVLRSRTQEKAKAPEPSN+LN  TA  EGKR K
Sbjct: 355  QQLDHKDERFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKK 414

Query: 241  KKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ 300
            KKKRNI+GKGA  DE+SSIRN LRYL+NRI+YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Sbjct: 415  KKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ 474

Query: 301  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEND 360
            RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEND
Sbjct: 475  RASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEND 534

Query: 361  IILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD 420
            IILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD
Sbjct: 535  IILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD 594

Query: 421  GWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSS 480
            GWEKVYPE AAAAAG+NSD  LGLPSDDSEDGDYDPD PDTI+Q+     DESSSDQS+S
Sbjct: 595  GWEKVYPE-AAAAAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNS 654

Query: 481  DESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSED 540
            D SGYASASE LE  PNDDQYLGLPSDDSED+DY+P  PELDEG +QESS SDFTSDSED
Sbjct: 655  DTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSED 714

Query: 541  LAALD--------------DGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLE 600
            LAAL+              + T PV+N+NG+ S  GP  S LHNEL SLL+SG DKDGLE
Sbjct: 715  LAALENNCSSKDDDLVSSLNNTLPVKNTNGRSS--GPSKSTLHNELSSLLDSGLDKDGLE 774

Query: 601  PVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVP 660
            P+SGRRQVERLDYKKLHDETYGNVP++SSDDT+GS ++DSSDDRG  S TRKR PK LV 
Sbjct: 775  PISGRRQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVL 834

Query: 661  AL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNR 720
            AL  NG+NDDL N KTKRSYKRRT QKPGA N+ NSVT TP D+ KSSSSVR+  SSSNR
Sbjct: 835  ALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNR 894

Query: 721  RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNK 780
            RLSQPALERL ASFQEN+YPKRATKESLAQELGL+LKQVSKWFENTRWSTRHPSS    K
Sbjct: 895  RLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKK 954

Query: 781  AKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDD 840
            AKS+ RM I  S+ SG+L K EQES  CFRDTD+NGA+HQ  P  +  VA CQSGDT D 
Sbjct: 955  AKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDK 1014

Query: 841  KLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRR 877
            KL T+KT R ES+ATKSRKRKGRSD+ AS+SKDR+ S +PPAKSPKVN+ QTAD+ +TRR
Sbjct: 1015 KLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRR 1062

BLAST of Moc10g04410 vs. ExPASy TrEMBL
Match: A0A6J1E4I6 (homeobox protein HAT3.1-like OS=Cucurbita moschata OX=3662 GN=LOC111430686 PE=3 SV=1)

HSP 1 Score: 1173.3 bits (3034), Expect = 0.0e+00
Identity = 659/897 (73.47%), Positives = 728/897 (81.16%), Query Frame = 0

Query: 1   MEERHEYTEPRPNNNCEAVQEAKASV--EVLTCFSNEQMHSIPDNQELGTTPECTSKTAG 60
           MEER EYTE R NNN EAVQEAK SV  E+ TC SNEQ HS+PD  EL  TP  ++KT G
Sbjct: 1   MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGG 60

Query: 61  PDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLSSDIETENL 120
            D+EK  VQQNMEEE +ELGSGDVL EL EK+NQT S LA+ DQVEAGNLL  D +TENL
Sbjct: 61  SDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLADNDQVEAGNLLCCDKDTENL 120

Query: 121 ILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQ 180
           I+PIE+ETTT L +CSELPPE  NKN I+Q+NPP E LTQNT  Q LETVP    S S+Q
Sbjct: 121 IVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVP----SNSEQ 180

Query: 181 LGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKR 240
             HKDK+ILKS K N +LRSLVSSDR LRS+TQEK K PEPSN+LN  TA EGK KKK+R
Sbjct: 181 SDHKDKRILKSIKINSILRSLVSSDRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKER 240

Query: 241 NIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASS 300
           NI+GKGA  DEFSSIRN LRYL+NRIKYEQ+LI+AYSSEGWKGFSSDKLKPEKELQRAS+
Sbjct: 241 NIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN 300

Query: 301 EIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILC 360
           EIMR KLKIRD+FQ +D+LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILC
Sbjct: 301 EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILC 360

Query: 361 DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEK 420
           DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCL+LLNEFQGS LSITDGWEK
Sbjct: 361 DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEK 420

Query: 421 VYPEAAAAAAGQNSDHALGLPSDDS-EDGDYDPDAPDTINQEDESSSDQSSSDESGYASA 480
           VYPEAAA+AAG+N DHA GLPSDDS +D DYDPD PDTI Q+DE     SS + SGYASA
Sbjct: 421 VYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDE-----SSPETSGYASA 480

Query: 481 SEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD--- 540
           SEELE+ PN DQYLGLPSDDSEDDDY+P APE DE V+QESS SDFTSDSEDLAALD   
Sbjct: 481 SEELESPPNVDQYLGLPSDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNP 540

Query: 541 ------------DGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQ 600
                       + TT ++N +G+ SG GPR S L+NEL SLLESGPDKDG EPV GRRQ
Sbjct: 541 SSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQ 600

Query: 601 VERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNG--T 660
           VERLDYKKLHDETYGNVP+DSSDDT+ S+S+DSSDD+G  S TRKRSPK LV AL    T
Sbjct: 601 VERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRT 660

Query: 661 NDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPAL 720
           NDDL N KTK S KR T QK  A NM  SV++TPED+ K+SSSVRRT  SS RRLS+ AL
Sbjct: 661 NDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYRRLSRLAL 720

Query: 721 ERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRM 780
           ERLLASFQENQYP+RATKESLAQELGLS+KQVSKWF NTRWSTRHPSS+E NKAKS+ RM
Sbjct: 721 ERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRM 780

Query: 781 GIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKT 840
           GI SS+ SG+L +PEQE           GAQHQ  P  D  VAPCQSGDT D KLATQ+T
Sbjct: 781 GIHSSQASGELHQPEQEF----------GAQHQELPTADSVVAPCQSGDTGDVKLATQET 840

Query: 841 TRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI 877
            R E +ATKSRKRKGRSDH AS SKD KESQ+PPAKSPKVN+IQTA  ++TRRR S+
Sbjct: 841 KRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL 878

BLAST of Moc10g04410 vs. TAIR 10
Match: AT3G19510.1 (Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain )

HSP 1 Score: 437.6 bits (1124), Expect = 2.4e-122
Identity = 280/557 (50.27%), Positives = 359/557 (64.45%), Query Frame = 0

Query: 208 RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYE 267
           R Q   +   PS+ + N    G  K+K K  N KG+    DE++ I+ +LRY +NRI YE
Sbjct: 136 RAQRSKEDAGPSSVVANSTPVGRPKKKNKTMN-KGQVREDDEYTRIKKKLRYFLNRINYE 195

Query: 268 QSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESL 327
           QSLIDAYS EGWKG S +K++PEKEL+RA+ EI+R KLKIRDLFQHLD+LCAEG L ESL
Sbjct: 196 QSLIDAYSLEGWKGSSLEKIRPEKELERATKEILRRKLKIRDLFQHLDTLCAEGSLPESL 255

Query: 328 FDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGW 387
           FD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGW
Sbjct: 256 FDTDGEISSEDIFCAKCGSKDLSVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDEGW 315

Query: 388 LCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGD 447
           LCPGCDCKDD LDLLN+  G+  S++D WEK++PEAAAA  G   +    LPSDDS+D +
Sbjct: 316 LCPGCDCKDDSLDLLNDSLGTKFSVSDSWEKIFPEAAAALVGGGQNLDCDLPSDDSDDEE 375

Query: 448 YDPDA-PDTINQED------ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLPS 507
           YDPD   D  N ED      ES ++  SSDE+ + SAS+E+     E        + LPS
Sbjct: 376 YDPDCLNDNENDEDGSDDNEESENEDGSSDETEFTSASDEMIESFKEGKDIMKDVMALPS 435

Query: 508 DDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPRT 567
           DDSEDDDY+P AP  D+   +ESS SD TSD+EDL     G  T  +  +      G +T
Sbjct: 436 DDSEDDDYDPDAPTCDD--DKESSNSDCTSDTEDLETSFKGDETNQQAEDTPLEDPGRQT 495

Query: 568 SVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSIS 627
           S L  +  ++LES  G D DG   VS RR VERLDYKKL+DE Y NVP+ SSDD      
Sbjct: 496 SQLQGD--AILESDVGLD-DGPAGVSRRRNVERLDYKKLYDEEYDNVPTSSSDD------ 555

Query: 628 IDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAEN 687
            D  D   R  +    S    + VP    +N +D  +K    K+KR+ K+ T + P    
Sbjct: 556 -DDWDKTARMGKEDSESEDEGDTVPLKQSSNAEDHTSKKLIRKSKRADKKDTLEMPQEGP 615

Query: 688 MKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQEL 742
            +N            S  + +++SS+ ++ + P  +RL  SFQENQYP +ATKESLA+EL
Sbjct: 616 GENG----------GSGEIEKSSSSACKQ-TDPKTQRLYISFQENQYPDKATKESLAKEL 668
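
The percentages in an HSP statistics line follow directly from the residue counts (for the hit above, 280 identical and 359 positive residues over an alignment length of 557). A minimal sketch, assuming the statistics line has the layout shown on this page; the function name `parse_hsp_stats` and the returned keys are hypothetical and are not part of any BLAST tool:

```python
import re

# Matches the HSP statistics line. "Posi?tives" accepts both the standard
# spelling and the "Postives" spelling that appears in this page's output.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, reported_id, positives, pos_length, reported_pos = m.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "length": int(length),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(identical) / int(length), 2),
        "positive_pct": round(100 * int(positives) / int(pos_length), 2),
        # Percentages as printed in the line, kept for comparison.
        "reported_identity_pct": float(reported_id),
        "reported_positive_pct": float(reported_pos),
    }

stats = parse_hsp_stats(
    "Identity = 280/557 (50.27%), Postives = 359/557 (64.45%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when the alignment text has been copied or reflowed.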

BLAST of Moc10g04410 vs. TAIR 10
Match: AT4G29940.1 (pathogenesis related homeodomain protein A )

HSP 1 Score: 206.5 bits (524), Expect = 9.0e-53
Identity = 183/626 (29.23%), Positives = 284/626 (45.37%), Query Frame = 0

Query: 181 KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK 240
           K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K
Sbjct: 34  KKGKEVSNKRNSKQNKRKAEEELCSKSRTKKYSRGWVRCEEMEEEKVKKTRKRKSKRQQK 93

Query: 241 GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM 300
                 D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG S +K++P+KEL+RA  EI+
Sbjct: 94  DNKVEVDDSLRLQRRTRYLLIKMKMQQNLIDAYATEGWKGQSREKIRPDKELERARKEIL 153

Query: 301 RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI 360
             KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG 
Sbjct: 154 NCKLGLRDAIRQLDLLSSVGSMEEKVIASDGSIHHDHIFCAECNSREAFPDNDIILCDGT 213

Query: 361 CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP 420
           C+R FHQ CL+PPL    IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ 
Sbjct: 214 CNRAFHQKCLDPPLETESIPPGDQGWFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQDIFN 273

Query: 421 EAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE 480
           E A+   G  +  ++    PSDDS+D DYDP        E   +   +SS+ SG      
Sbjct: 274 EEASLPIGSEATVNNEADWPSDDSKDDDYDP--------EMRENGGGNSSNVSG------ 333

Query: 481 ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTP 540
                           D   D+D E                    S S  L+   DG   
Sbjct: 334 ----------------DGGGDNDEE--------------------SISTSLSLSSDGV-- 393

Query: 541 VRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNV 600
              +   GS  G R S +  + ++  E        E V G RQ   +DY +L+ E +G  
Sbjct: 394 ---ALSTGSWEGHRLSNMVEQCETSNE--------ETVCGPRQRRTVDYTQLYYEMFG-- 453

Query: 601 PSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQ 660
             D+     GS   D   +  R  +    +   LV     +  D                
Sbjct: 454 -KDAVLQEQGSEDEDWGPNDRRKRKRESDAGSTLVTMCESSKKD---------------- 513

Query: 661 KPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATK 720
                     V  T E S + S SV          RL + A+E+L   F E + P +A +
Sbjct: 514 --------QDVVETLEQSERDSVSVENKGGRRRMFRLPRNAVEKLRQVFAETELPSKAVR 556

Query: 721 ESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES 780
           + LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    PE   
Sbjct: 574 DRLAKELSLDPEKVNKWFKNTRY-----MALRNRKTESVKQPG-DSKTVSGGDSGPEAV- 556

Query: 781 GACFRDTDNNGAQHQVSPNTDGAVAP 804
                  +NN   ++V    D  V P
Sbjct: 634 ------MENNTETNEVQDTLDDTVPP 556

BLAST of Moc10g04410 vs. TAIR 10
Match: AT4G29940.2 (pathogenesis related homeodomain protein A )

HSP 1 Score: 206.5 bits (524), Expect = 9.0e-53
Identity = 183/626 (29.23%), Positives = 284/626 (45.37%), Query Frame = 0

Query: 181 KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIK 240
           K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K
Sbjct: 34  KKGKEVSNKRNSKQNKRKAEEELCSKSRTKKYSRGWVRCEEMEEEKVKKTRKRKSKRQQK 93

Query: 241 GKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM 300
                 D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG S +K++P+KEL+RA  EI+
Sbjct: 94  DNKVEVDDSLRLQRRTRYLLIKMKMQQNLIDAYATEGWKGQSREKIRPDKELERARKEIL 153

Query: 301 RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGI 360
             KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG 
Sbjct: 154 NCKLGLRDAIRQLDLLSSVGSMEEKVIASDGSIHHDHIFCAECNSREAFPDNDIILCDGT 213

Query: 361 CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYP 420
           C+R FHQ CL+PPL    IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ 
Sbjct: 214 CNRAFHQKCLDPPLETESIPPGDQGWFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQDIFN 273

Query: 421 EAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE 480
           E A+   G  +  ++    PSDDS+D DYDP        E   +   +SS+ SG      
Sbjct: 274 EEASLPIGSEATVNNEADWPSDDSKDDDYDP--------EMRENGGGNSSNVSG------ 333

Query: 481 ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTP 540
                           D   D+D E                    S S  L+   DG   
Sbjct: 334 ----------------DGGGDNDEE--------------------SISTSLSLSSDGV-- 393

Query: 541 VRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNV 600
              +   GS  G R S +  + ++  E        E V G RQ   +DY +L+ E +G  
Sbjct: 394 ---ALSTGSWEGHRLSNMVEQCETSNE--------ETVCGPRQRRTVDYTQLYYEMFG-- 453

Query: 601 PSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQ 660
             D+     GS   D   +  R  +    +   LV     +  D                
Sbjct: 454 -KDAVLQEQGSEDEDWGPNDRRKRKRESDAGSTLVTMCESSKKD---------------- 513

Query: 661 KPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATK 720
                     V  T E S + S SV          RL + A+E+L   F E + P +A +
Sbjct: 514 --------QDVVETLEQSERDSVSVENKGGRRRMFRLPRNAVEKLRQVFAETELPSKAVR 556

Query: 721 ESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES 780
           + LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    PE   
Sbjct: 574 DRLAKELSLDPEKVNKWFKNTRY-----MALRNRKTESVKQPG-DSKTVSGGDSGPEAV- 556

Query: 781 GACFRDTDNNGAQHQVSPNTDGAVAP 804
                  +NN   ++V    D  V P
Sbjct: 634 ------MENNTETNEVQDTLDDTVPP 556

BLAST of Moc10g04410 vs. TAIR 10
Match: AT5G09790.1 (ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5 )

HSP 1 Score: 47.4 bits (111), Expect = 7.0e-05
Identity = 26/68 (38.24%), Positives = 37/68 (54.41%), Query Frame = 0

Query: 328 DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWL 387
           + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WL
Sbjct: 55  EEEDEDSYSNVTCEKCGSGE--GDDELLLCDK-CDRGFHMKCLRPIVVRVPIGT----WL 113

Query: 388 CPGCDCKD 396
           C   DC D
Sbjct: 115 C--VDCSD 113

BLAST of Moc10g04410 vs. TAIR 10
Match: AT5G09790.2 (ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5 )

HSP 1 Score: 47.4 bits (111), Expect = 7.0e-05
Identity = 26/68 (38.24%), Positives = 37/68 (54.41%), Query Frame = 0

Query: 328 DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWL 387
           + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WL
Sbjct: 55  EEEDEDSYSNVTCEKCGSGE--GDDELLLCDK-CDRGFHMKCLRPIVVRVPIGT----WL 113

Query: 388 CPGCDCKD 396
           C   DC D
Sbjct: 115 C--VDCSD 113

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022149322.1 | 0.0e+00 | 100.00 | homeobox protein HAT3.1 isoform X1 [Momordica charantia] >XP_022149323.1 homeobo...
KAG7030959.1 | 0.0e+00 | 76.27 | Homeobox protein HAZ1 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022942376.1 | 0.0e+00 | 76.13 | homeobox protein HAT3.1 [Cucurbita moschata] >XP_022942377.1 homeobox protein HA...
XP_038876083.1 | 0.0e+00 | 76.93 | homeobox protein HAT3.1 isoform X1 [Benincasa hispida] >XP_038876090.1 homeobox ...
XP_023531864.1 | 0.0e+00 | 75.96 | homeobox protein HAT3.1 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023531873.1 ...

Match Name | E-value | Identity | Description
Q04996 | 3.4e-121 | 50.27 | Homeobox protein HAT3.1 OS=Arabidopsis thaliana OX=3702 GN=HAT3.1 PE=1 SV=3
P48786 | 4.9e-120 | 42.64 | Pathogenesis-related homeodomain protein OS=Petroselinum crispum OX=4043 GN=PRH ...
P46605 | 8.1e-107 | 41.37 | Homeobox protein HOX1A OS=Zea mays OX=4577 GN=HOX1A PE=2 SV=1
Q8H991 | 3.1e-106 | 43.06 | Homeobox protein HAZ1 OS=Oryza sativa subsp. japonica OX=39947 GN=HAZ1 PE=2 SV=1
P48785 | 1.3e-51 | 29.23 | Pathogenesis-related homeodomain protein OS=Arabidopsis thaliana OX=3702 GN=PRH ...

Match Name | E-value | Identity | Description
A0A6J1D6Q5 | 0.0e+00 | 100.00 | homeobox protein HAT3.1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC11101776...
A0A6J1FNP3 | 0.0e+00 | 76.13 | homeobox protein HAT3.1 OS=Cucurbita moschata OX=3662 GN=LOC111447439 PE=3 SV=1
A0A6J1IPM8 | 0.0e+00 | 76.27 | homeobox protein HAT3.1-like OS=Cucurbita maxima OX=3661 GN=LOC111478790 PE=3 SV...
A0A1S3C283 | 0.0e+00 | 76.00 | pathogenesis-related homeodomain protein OS=Cucumis melo OX=3656 GN=LOC103496194...
A0A6J1E4I6 | 0.0e+00 | 73.47 | homeobox protein HAT3.1-like OS=Cucurbita moschata OX=3662 GN=LOC111430686 PE=3 ...

Match Name | E-value | Identity | Description
AT3G19510.1 | 2.4e-122 | 50.27 | Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain
AT4G29940.1 | 9.0e-53 | 29.23 | pathogenesis related homeodomain protein A
AT4G29940.2 | 9.0e-53 | 29.23 | pathogenesis related homeodomain protein A
AT5G09790.1 | 7.0e-05 | 38.24 | ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5
AT5G09790.2 | 7.0e-05 | 38.24 | ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001356 | Homeobox domain | SMART | SM00389 | HOX_1 | coord: 688..749; e-value: 4.5E-12; score: 56.1
IPR001356 | Homeobox domain | PFAM | PF00046 | Homeodomain | coord: 693..741; e-value: 1.4E-10; score: 40.8
IPR001356 | Homeobox domain | PROSITE | PS50071 | HOMEOBOX_2 | coord: 685..745; score: 14.041749
IPR001356 | Homeobox domain | CDD | cd00086 | homeodomain | coord: 692..746; e-value: 1.81313E-13; score: 63.8016
IPR001965 | Zinc finger, PHD-type | SMART | SM00249 | PHD_3 | coord: 339..392; e-value: 1.8E-9; score: 47.5
None | No IPR available | GENE3D | 1.10.10.60 |  | coord: 686..756; e-value: 1.8E-14; score: 54.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 572..595
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 736..771
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 38..79
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 492..506
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 596..615
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 811..830
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 536..561
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 780..803
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 838..853
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 662..696
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 458..474
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 441..457
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 424..698
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 736..876
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 38..63
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 205..246
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 65..79
None | No IPR available | PANTHER | PTHR12628 | POLYCOMB-LIKE TRANSCRIPTION FACTOR | coord: 128..757
None | No IPR available | PANTHER | PTHR12628:SF13 | HOMEOBOX PROTEIN HAT3.1 | coord: 128..757
None | No IPR available | CDD | cd15504 | PHD_PRHA_like | coord: 339..391; e-value: 1.19571E-27; score: 104.054
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 333..396; e-value: 9.5E-13; score: 49.5
IPR019787 | Zinc finger, PHD-finger | PFAM | PF00628 | PHD | coord: 339..394; e-value: 1.5E-10; score: 40.8
IPR019787 | Zinc finger, PHD-finger | PROSITE | PS50016 | ZF_PHD_2 | coord: 337..394; score: 10.9496
IPR017970 | Homeobox, conserved site | PROSITE | PS00027 | HOMEOBOX_1 | coord: 720..743
IPR019786 | Zinc finger, PHD-type, conserved site | PROSITE | PS01359 | ZF_PHD_1 | coord: 340..391
IPR011011 | Zinc finger, FYVE/PHD-type | SUPERFAMILY | 57903 | FYVE/PHD zinc finger | coord: 329..397
IPR009057 | Homeobox-like domain superfamily | SUPERFAMILY | 46689 | Homeodomain-like | coord: 681..747
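
Several member databases report the same two domains with slightly different boundaries. A minimal sketch of how the `coord:` values listed above might be combined into one envelope per domain, assuming an envelope is simply the smallest start and largest end among the hits; the helper names `parse_coord` and `envelope` are hypothetical, and the groupings below are chosen by hand from the SMART, PFAM, PROSITE and CDD rows:

```python
import re

# Matches the "coord: START..END" field used in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coord(text):
    """Return (start, end) as integers from a 'coord: START..END' string."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coord field in %r" % text)
    return int(m.group(1)), int(m.group(2))

def envelope(coord_strings):
    """Return the smallest start and largest end over all given hits."""
    spans = [parse_coord(s) for s in coord_strings]
    return min(start for start, _ in spans), max(end for _, end in spans)

# Homeobox hits: SM00389, PF00046, PS50071, cd00086.
homeobox = ["coord: 688..749", "coord: 693..741",
            "coord: 685..745", "coord: 692..746"]
# PHD finger hits: SM00249, PF00628, PS50016, cd15504, PS01359.
phd = ["coord: 339..392", "coord: 339..394", "coord: 337..394",
       "coord: 339..391", "coord: 340..391"]

print("homeobox", envelope(homeobox))  # (685, 749)
print("PHD", envelope(phd))            # (337, 394)
```

Under this assumption the PHD finger occupies roughly residues 337..394 and the homeobox roughly 685..749, which agrees with the broader GENE3D and SUPERFAMILY ranges in the table.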

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc10g04410.1 | Moc10g04410.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006357 | regulation of transcription by RNA polymerase II
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0000981 | DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function | GO:0046872 | metal ion binding