Bhi04G001041 (gene) Wax gourd (B227) v1

Overview
Name: Bhi04G001041
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Plastid envelope DNA binding protein
Location: chr4: 33026477 .. 33031909 (+)
RNA-Seq Expression: Bhi04G001041
Synteny: Bhi04G001041
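
The Location field packs the chromosome, start, end and strand into one string. As a minimal sketch, the Python below splits that string and computes the span of the gene. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names parse_location and span_length are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string such as 'chr4: 33026477 .. 33031909 (+)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr4: 33026477 .. 33031909 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 5433 bp on the plus strand of chr4. If the coordinates were 0-based and half-open, the length would be end - start instead.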
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATTCATTCTCCCACACTCCGCCATTCCCATCCCCCACTGCCCATTTCTGTTGGAATTTTACAGAATGTATCACTGCTTCGCTCTCTCTAATCCATCGGTAGGGTTTTAGGCTTCTTTATCTTCAATTCTCCTTCCACCCTTTCATTTCGCGCTCTCTCTGCTTCTAGGTAAACTTCTCTCAGCTTTTGCATGCCATTCTTATATTTTTGGCTCCTTCCCCTTCAATGGCCATCATTTTTTTTTTCTCTTTTTTGTGTTTAAGTGTATCATTTGCTCTCATGGGTGCCATAGTTTCCTGCAGTGTTAGATTTTGGCCACTGGGTTTTTCTTCTTTAGTTGTTTTGTTATCTGGGATGCACTTACATGGGCTTCCTCTGGTTCTTGGAAATTTGTTGGGACATGAACTGTTAACGTGTATTTGGGTTGTTCATCGTGGTTAAGGATGTGGTGTTTGATTGTATCCGGGCTTTGATCCTAATTTAGTTTTTATGTGAGATTTTAATTTTTAATTGTGCGTCTCGTTAATATGTAATCCAAATGACTATGCCGGAAGGGACAATGTGTATTTGTAGATACGGTTCAACAGTGTTTTAACCAATGGAGTACTTGGTTGGTATGAGTTAAACCTAAGATTATTTTGTTTTAATATGAGTTGCAAATTTATGGTTAGAAATTCGCTGCAAGCAGCAATTTATTCATTTTCATATATGATCTTTCAACTTCAACTAATTTGTAGCTCAAAAAGTGGATAGTTGTATGTGCTATATCAGAGGTGTGACTTGTGATAAGGAAAGTTAGCTTCCAGTATACTTATATGATATCAGACTTGTTAAAATTTAGTGTGGAGCGACATTAAGCAGTCACTCCACATGTTGCAGGTTGTTTATAATAATTTCTAAGCTGGTCTAAAAGGGTAGTCAATCTGTTTCTGTATAATCTATTAAAAGATCTTTTTCTTTTCTTTTTTTAATTCTTTTTATTATAGGCAACAGAAATTTTTCGTTGAAATCTGTTGAATGATCTCGTCAATCCATTTTTGAAATTCCAATGGCTCTTAGCTTATCCAGTTCATTCTGGAACACAAACGTTATATGGGCTGAGTCATCTTATTACTGCTCTGTTATCTTGTAGGCTTGTACTCTTGGAAAAGCTGAACTTTGTGGATTTCATGCATGCTATAAAGGGTGGGTGGACGGGGCATCCTCTTGCCCTAGCCCAGAACAATGAGGCTGAAGGGAGGAAGACCAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTAACCACTCAATGGATTTAAATTCAATTCATATTGTCTATACGATGCATGGATCAGTTAGGTTGTTGAAATTATTCATTGTTTCTGCGAGACACCTGCCATTATGTTGAATTACCAAGTGCATGATATCTTTCTTGAGAAAACCTTGTCCTCCTAGATGACGTTCTCACTCCATAATTTATCTGGCTGTTCACGTTTAAAAATGGATCATGGATCGTTGTTGTTTGGCACTTGTTCTTGGATTTATTTACGTTTCTGTTTTCTAGTTGAGATTTTGTGTAACCTTTTGGATAGGAAAATTCTAGAAACTGTTAGGTTGATAAATTCTCCTGCAGCTTAGTAGTCATGGTTATTTCCCCTTAGCATGAAGTGGGATAAAAACATAATCCAGTTAGTTGAGCTGTCAACAGGGAAATAATTATGGATGTGTTAGCTTCCACTAATCAGGATGGATAACATTTTGTTAATAGCATGTATATTGGACGAACATTTGTATTTTTGAACAATAAATAATGGATGAGCATTTTTTTTTTCTTTTTTTGTTCTCAGGTAGAACCATTGAAAATTGAGATCCTTGTGAAACAATGTTGTGTATAAGCAAAATAAAATTAATGCTAAGGAGTATAACCTTCCAATGTTGAGAGTTCAAGGAACTCAACTCTTGAAAAAAAAAAAAAGAAAAGAAATAAGAAGAAGAA
GAAGTAGAAGAAGAAGAAGAAGAAGAAAAGAAGCGAGTTGTTCATTTCTTCCACTTGTTTCATGAAAAATCTCTTAAACCGAACAAACCTTGCAATAAGTGAGTTTGTTGACTGATATCCACACCACATACTAATAAAGATGAATTAAAGAGGTTAAAAACCTTTCAGAATACCTAGTATCTAAAAGCATAATTTAATTAATAACATAAAATAGGATTATGGCCTTCCCTCTCCCTTACATGACCAACCTTTTCCTAGAGGGGGAGATATTTGAAGATGATTCTGGTATTATGTGACTTGTCGTTGGGTCTTTCAATAGCAAAATTCACCAACTCCTTTGATGTGTTTTCTACAAATCTCTGTATTGATGCCTTTGAAAATGTTAGAGCTCTCTTGGACGAGGTGAAAAGGCTCTAGTTCTTTTCTTAGGTCCAGAACTGGTCTTCTTCGTTGAAACCAGCTATAGTACCAATTCACGTGGTATGTTTTGGAAAAAAAAGATATTGAAACAGAAATTATAAAGTTATTCAAGAGAGTTTAGACCTCTATGGTTGAACTTTGGAGAATATGATACTTCCCAAATTCCTCTTTAATGTCATGATTTAAATAAATACTAGGATATGAAAACAATTTCCTGTTAGAAATCCTAAAATCAAAGAAATTACTAGACCACGAGAACTCCCATTACAATAGGAAACTTATTTAACTATTAATGACTAAATCTAGACACCAATGTAATTTTTTTAATTTGTAGAAGTATGAAATCACATCATAACATCTTTGCTTGCAGGTACCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTGCGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTATTAGAAGAGGAAGAACACAGAATTGATCATTCACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACGTTATCGGCTAAGGAAGTCCATTTTCCAATCAACTACGACCAAGATATAAATGAAGAACCAATCTTTGTTTCAGATGAGCAATGCACTACAACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGTAGCCTGGTGGACATAAACGACAAGGAACCTGGGGAATTTATCGAGTCAGAGTTGCTAGTAAATGAACACAAGAAAGTAGAGGAAGTGGTAAAAGAGGAATCAGGAATGCCAATTAATCATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGGTTCTTGGGGTGTTAATGGTTCAGATGTAAGATCTGAGATATTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGGAAAACTTTGCAGGTCCATTATCAGAATCTGATATGGTGGAGGCAGCACAAATTGTTGAAACATCTAATGGATCTACTGTGAAAGAAGGTATCATTTATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTGAAATGAAGGTGAGTGTTCTACTTTTTTCTGGCCAAGCATATAATGACTTTAATTTTCTTTTTTCTTTTTAATCTACCATCTTTTAAATTTCTGGGGCATAGGTATAACTTCTTAATGTTGACTTTTAGATTATGACAAAATAGTGTGGGGACTATTACTTGAGTAGTTTCTGGGCCAGTTGAAACTTTAGATTTAAGAAAAATAATAGAAGAATTCCTTTTATCTAAAACTTTATTGATGACATTTGGACGTTCATTTGCATGGTGATCTTGTTTCCATGTTTTTGAAATACCTACTTTTCTCGTTTGAAAAGTGCACCCTGGATTAGGCCAACCAGTTGTCTAATCAAAGTT
CGTCACATGGAGAACTCTTCTTTCTTCTTTCTTCGAGTTTTATTTTATTTTATTTTATTTTGCTGTGTGATTAAGATGGGTTCAACCCAGAATGCATCCAAGCATCTCACGTAAATAAATTGTCAAACAAAATGTTCCCATCTAACCAGAGAAAACCCTGTCTTTCTTCTTTTTCTTTTGGTGGGGGAATGAGAAAGAAAAAGGAAAAGAAGAAAATGATACGTTGAACTTTGGAGTTGTTTATTTTGAGGATTTGCCACTCTTTTTGTGTTTGTAACTCAACTTTTGAAAGGCTAGTTTGCATTTGGACCATTATTGATGCTCTTTTACAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTCAGCAATGGGTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAATAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTGTAAGAATATCCAACCGCCCGCCCCGCCCCCTCCCTCCAAATTAGTCAATCATTTTACATGGTTATTGATGAGAAGATACATTTTTTCTCTTATTGCCCATTGATTTTAAGATGAAATCCCATGATTATCTAATACTTATCAAACAGTTTAACCAACATATTCATCTTCCATCTTTTTGTTTTCAGTGAATCATGGGAAGGGATGTCCAAAAACTCATCAAAAGCCGAAAACAACCCGGTTTTGGAAATCTTCAAGGCATTTATCGCTGCCTTCGTGAAGTTTTGGTCTGAGTAAATAATATGATTGTCGAGTATAAACGAATAGAGAGTAGTAGTAGTTAAATTTTCTGCCACAGAACCTGTCTGTCTTTGTACCAAGTTGCAATCGGTTACCCCGTTCACTCGAGTCGGTCCCATCGTCGATAATTACAAGGACAAAACTGGATTCTGGATGTGGGTTGGCATTTCTGTACGCTGCAGTAAGAAAGAAAGTTTAGGAGTAGGCATTTCCCACCCCTCTCAGTAGCTTGTAGAAGGGGTATTTTTCTTTTTAATCTTTTTACTGTAACCGTGAGATATGTCCCACCCTTATGATTTTATTCCAGATATGAATGGTTTCCTATTCATTTTTCATTACCATATGATAAAAAAGAGGAAGAAGAAAAAGTTGGAGTAACGTGAGAAGGAGCCTTTGATGGTAAGGTGTTTGTGTGTTAGGAGGAGGACAGGGAAACAAGAGAGAAAGGCATATACAGATTGGAGTTTTAATAGGCATCTTCTTTCCCCCTTTTTGCTCTGACCCGACATCTTATATCATATGGATCATGTAGTCACATGATTTTCTGGATTCCCCAATTTTCTAATACTACTTTTCTACCCTTTTTTTTTTTTTTTTTTTTAAATAAAAAAAATTACATTCCAAATAATGCAC

mRNA sequence

ATTCATTCTCCCACACTCCGCCATTCCCATCCCCCACTGCCCATTTCTGTTGGAATTTTACAGAATGTATCACTGCTTCGCTCTCTCTAATCCATCGGTAGGGTTTTAGGCTTCTTTATCTTCAATTCTCCTTCCACCCTTTCATTTCGCGCTCTCTCTGCTTCTAGGCTTGTACTCTTGGAAAAGCTGAACTTTGTGGATTTCATGCATGCTATAAAGGGTGGGTGGACGGGGCATCCTCTTGCCCTAGCCCAGAACAATGAGGCTGAAGGGAGGAAGACCAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTACCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTGCGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTATTAGAAGAGGAAGAACACAGAATTGATCATTCACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACGTTATCGGCTAAGGAAGTCCATTTTCCAATCAACTACGACCAAGATATAAATGAAGAACCAATCTTTGTTTCAGATGAGCAATGCACTACAACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGTAGCCTGGTGGACATAAACGACAAGGAACCTGGGGAATTTATCGAGTCAGAGTTGCTAGTAAATGAACACAAGAAAGTAGAGGAAGTGGTAAAAGAGGAATCAGGAATGCCAATTAATCATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGGTTCTTGGGGTGTTAATGGTTCAGATGTAAGATCTGAGATATTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGGAAAACTTTGCAGGTCCATTATCAGAATCTGATATGGTGGAGGCAGCACAAATTGTTGAAACATCTAATGGATCTACTGTGAAAGAAGGTATCATTTATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTGAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTCAGCAATGGGTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAATAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCATCAAAAGCCGAAAACAACCCGGTTTTGGAAATCTTCAAGGCATTTATCGCTGCCTTCGTGAAGTTTTGGTCTGAGTAAATAATATGATTGTCGAGTATAAACGAATAGAGAGTAGTAGTAGTTAAATTTTCTGCCACAGAACCTGTCTGTCTTTGTACCAAGTTGCAATCGGTTACCCCGTTCACTCGAGTCGGTCCCATCGTCGATAATTACAAGGACAAAACTGGATTCTGGATGTGGGTTGGCATTTCTGTACGCTGCAGTAAGAAAGAAAGTTTAGGAGTAGGCATTTCCCACCCCTCTCAGTAGCTTGTAGAAGGGGTATTTTTCTTTTTAATCTTTTTACTGTAACCGTGAGATATGTCCCACCCTTATGATTTTATTCCAGATATGAATGGTTTCCTATTCATTTTTCATTACCATATGATAAAAAAGAGGAAGAAGAAAAAGTTGGAGTAACGTGAGAAGGAGCCTTTGATGGTAAGGTGTTTGTGTGTTAGGAGGAGGACAGGGAAACAAGAGAGAAAGGCATATACAGATTGGAGTTTTAATAGGCATCTTCTTTCCCCCTTTTTGCTCTGACCCGACATCTTATA
TCATATGGATCATGTAGTCACATGATTTTCTGGATTCCCCAATTTTCTAATACTACTTTTCTACCCTTTTTTTTTTTTTTTTTTTTAAATAAAAAAAATTACATTCCAAATAATGCAC

Coding sequence (CDS)

ATGCATGCTATAAAGGGTGGGTGGACGGGGCATCCTCTTGCCCTAGCCCAGAACAATGAGGCTGAAGGGAGGAAGACCAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTACCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTGCGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTATTAGAAGAGGAAGAACACAGAATTGATCATTCACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACGTTATCGGCTAAGGAAGTCCATTTTCCAATCAACTACGACCAAGATATAAATGAAGAACCAATCTTTGTTTCAGATGAGCAATGCACTACAACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGTAGCCTGGTGGACATAAACGACAAGGAACCTGGGGAATTTATCGAGTCAGAGTTGCTAGTAAATGAACACAAGAAAGTAGAGGAAGTGGTAAAAGAGGAATCAGGAATGCCAATTAATCATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGGTTCTTGGGGTGTTAATGGTTCAGATGTAAGATCTGAGATATTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGGAAAACTTTGCAGGTCCATTATCAGAATCTGATATGGTGGAGGCAGCACAAATTGTTGAAACATCTAATGGATCTACTGTGAAAGAAGGTATCATTTATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTGAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTCAGCAATGGGTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAATAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCATCAAAAGCCGAAAACAACCCGGTTTTGGAAATCTTCAAGGCATTTATCGCTGCCTTCGTGAAGTTTTGGTCTGAGTAA

Protein sequence

MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDHSLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHKKVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEENFAGPLSESDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
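
The protein sequence above should be the translation of the coding sequence. A minimal sketch of that check follows, assuming the standard nuclear genetic code (the page does not say which table was used). The translate function is a hypothetical helper, and the two test strings are the first 36 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (an assumption: the record does not name the translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 12 codons and last 5 codons of the CDS shown above.
print(translate("ATGCATGCTATAAAGGGTGGGTGGACGGGGCATCCT"))
print(translate("TTTTGGTCTGAGTAA"))
```

The two fragments translate to the first twelve and last four residues of the listed protein. Running translate on the full CDS and comparing the result with the protein string is a quick consistency check for the record.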
Homology
BLAST of Bhi04G001041 vs. TAIR 10
Match: AT3G52170.1 (DNA binding)

HSP 1 Score: 172.9 bits (437), Expect = 5.4e-43
Identity = 167/513 (32.55%), Positives = 243/513 (47.37%), Query Frame = 0

Query: 1   MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MH++K    G   ALA+ +++ G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LT
Sbjct: 1   MHSLKTTCVGQIFALAKPHDSVGKRTRNRIPKEERKTLVESFIKKHQKLNNGSFPSLSLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRI-DHSLEENPLHSIAIEPQSPLT 120
           HKEVGGSFYT+REIVR+IIQENRVLGPG LL E    + D SL      SI ++P  PL+
Sbjct: 61  HKEVGGSFYTIREIVREIIQENRVLGPGDLLLEGNGSVQDQSLSS----SILMDPVPPLS 120

Query: 121 LSAKEVHFPINYDQDIN---------------------------EEPIFVSDEQCTTTNI 180
           LS    H       D +                           +E I +  +   +T+I
Sbjct: 121 LSPNGFHSGSYQSLDFSSESPEGNVNGSQVCLDNCREVSGSQLLKEDIGLVHQSMDSTDI 180

Query: 181 QGSQ--------NGPIINGSL-------------------VDINDKEPG----EFIESE- 240
             +Q        N    N  L                   +D+++K+ G     F+ES+ 
Sbjct: 181 SMTQLATSCSEDNDIKSNAGLQNRMETVCDSVDTKPQDKRLDVDNKDEGFEELRFMESDG 240

Query: 241 -LLVNEHKKVEE----VVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEIL 300
              VN   +V +    + + ++G+       ++ + VVETFPL S +  ++  D +   L
Sbjct: 241 TKPVNNDDRVNDAGAAMTEIKNGL---GTIDMSAETVVETFPLKSVTSTMDSPDAQPTEL 300

Query: 301 ISTSASEKQVSQTIELES------DVGLFNIKASGCVVEK-AEENFAGPLSESDMVEAAQ 360
                  K     +E +       D+G  +   S  V+E    E   G +     V   +
Sbjct: 301 NKVCEGGKGTETEVEADRSTVNHVDLGEISSSTSSAVLEDIGTEVIVGQIPNHISVPMEK 360

Query: 361 IV--ETSNGSTV-------KEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASP 420
            V  E  N ++V       KE ++  V G    V  +   +  F  G  ++E K P +S 
Sbjct: 361 KVGEEIVNSASVDVECADAKETVV--VNG----VIGNVHETKEFSNGTLTAEQKMPTSST 420

Query: 421 STIENLN-----KTFSN--GFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLES 426
            +    N      T S+  G + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ES
Sbjct: 421 ESGSRKNDRAKVDTVSSYAGNEVASVEKKATMEKGKIDAPDSSSSQKENNATLNRIKPES 480

BLAST of Bhi04G001041 vs. TAIR 10
Match: AT3G52170.2 (DNA binding)

HSP 1 Score: 172.9 bits (437), Expect = 5.4e-43
Identity = 167/513 (32.55%), Positives = 243/513 (47.37%), Query Frame = 0

Query: 1   MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MH++K    G   ALA+ +++ G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LT
Sbjct: 1   MHSLKTTCVGQIFALAKPHDSVGKRTRNRIPKEERKTLVESFIKKHQKLNNGSFPSLSLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRI-DHSLEENPLHSIAIEPQSPLT 120
           HKEVGGSFYT+REIVR+IIQENRVLGPG LL E    + D SL      SI ++P  PL+
Sbjct: 61  HKEVGGSFYTIREIVREIIQENRVLGPGDLLLEGNGSVQDQSLSS----SILMDPVPPLS 120

Query: 121 LSAKEVHFPINYDQDIN---------------------------EEPIFVSDEQCTTTNI 180
           LS    H       D +                           +E I +  +   +T+I
Sbjct: 121 LSPNGFHSGSYQSLDFSSESPEGNVNGSQVCLDNCREVSGSQLLKEDIGLVHQSMDSTDI 180

Query: 181 QGSQ--------NGPIINGSL-------------------VDINDKEPG----EFIESE- 240
             +Q        N    N  L                   +D+++K+ G     F+ES+ 
Sbjct: 181 SMTQLATSCSEDNDIKSNAGLQNRMETVCDSVDTKPQDKRLDVDNKDEGFEELRFMESDG 240

Query: 241 -LLVNEHKKVEE----VVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEIL 300
              VN   +V +    + + ++G+       ++ + VVETFPL S +  ++  D +   L
Sbjct: 241 TKPVNNDDRVNDAGAAMTEIKNGL---GTIDMSAETVVETFPLKSVTSTMDSPDAQPTEL 300

Query: 301 ISTSASEKQVSQTIELES------DVGLFNIKASGCVVEK-AEENFAGPLSESDMVEAAQ 360
                  K     +E +       D+G  +   S  V+E    E   G +     V   +
Sbjct: 301 NKVCEGGKGTETEVEADRSTVNHVDLGEISSSTSSAVLEDIGTEVIVGQIPNHISVPMEK 360

Query: 361 IV--ETSNGSTV-------KEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASP 420
            V  E  N ++V       KE ++  V G    V  +   +  F  G  ++E K P +S 
Sbjct: 361 KVGEEIVNSASVDVECADAKETVV--VNG----VIGNVHETKEFSNGTLTAEQKMPTSST 420

Query: 421 STIENLN-----KTFSN--GFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLES 426
            +    N      T S+  G + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ES
Sbjct: 421 ESGSRKNDRAKVDTVSSYAGNEVASVEKKATMEKGKIDAPDSSSSQKENNATLNRIKPES 480

BLAST of Bhi04G001041 vs. TAIR 10
Match: AT5G58210.3 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 52.8 bits (125), Expect = 8.1e-07
Identity = 81/352 (23.01%), Positives = 151/352 (42.90%), Query Frame = 0

Query: 29  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP- 88
           R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P 
Sbjct: 48  RLSKDDRRALVESFVNEYRATNAGRFPSLDATHKQVGGSYYIVRDIFQELKLKPKAHMPI 107

Query: 89  -GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSD 148
             K L E    +  D S   +P     +E ++   LS      P +    ++  P+ + +
Sbjct: 108 VAKALSEVSSSVPGDASSHSSPAPVPTVEAKA---LSEVSPSVPADASSHLSPSPVPIVE 167

Query: 149 EQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKV 208
            +            T++       PI+   +L  ++   P +    F  + + + E K +
Sbjct: 168 AKALSEVSPSVPADTSSHFSPAPVPIVEAEALSVVSPSVPADTSSHFSPAPVPIVEAKTL 227

Query: 209 EEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQT 268
            EV         +H +P    VVVE     +GS  +  GS+ R +I+ ++ ++    S  
Sbjct: 228 SEVSPSVPDDASSHFSP----VVVEP-EFQAGSVDIEIGSEQRPDIIDTSHSNSDDES-- 287

Query: 269 IELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE---- 328
                     N + +  V+    +K E      +  S+  E   +    N  +  +    
Sbjct: 288 ----------NFQGNNLVITAKDDKRETEATQGIENSETDEERNLTHLGNQESKADHLEG 347

Query: 329 -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN 352
             +  +V   E    S+T      E   ++ E+K  ++S S I +  K F+N
Sbjct: 348 ADVSADVVPTETRQISETGAGEVKE--TETGEVKERSSSWSNIMSFAKEFAN 377

BLAST of Bhi04G001041 vs. TAIR 10
Match: AT5G58210.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 52.8 bits (125), Expect = 8.1e-07
Identity = 81/352 (23.01%), Positives = 151/352 (42.90%), Query Frame = 0

Query: 29  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP- 88
           R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P 
Sbjct: 48  RLSKDDRRALVESFVNEYRATNAGRFPSLDATHKQVGGSYYIVRDIFQELKLKPKAHMPI 107

Query: 89  -GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSD 148
             K L E    +  D S   +P     +E ++   LS      P +    ++  P+ + +
Sbjct: 108 VAKALSEVSSSVPGDASSHSSPAPVPTVEAKA---LSEVSPSVPADASSHLSPSPVPIVE 167

Query: 149 EQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKV 208
            +            T++       PI+   +L  ++   P +    F  + + + E K +
Sbjct: 168 AKALSEVSPSVPADTSSHFSPAPVPIVEAEALSVVSPSVPADTSSHFSPAPVPIVEAKTL 227

Query: 209 EEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQT 268
            EV         +H +P    VVVE     +GS  +  GS+ R +I+ ++ ++    S  
Sbjct: 228 SEVSPSVPDDASSHFSP----VVVEP-EFQAGSVDIEIGSEQRPDIIDTSHSNSDDES-- 287

Query: 269 IELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE---- 328
                     N + +  V+    +K E      +  S+  E   +    N  +  +    
Sbjct: 288 ----------NFQGNNLVITAKDDKRETEATQGIENSETDEERNLTHLGNQESKADHLEG 347

Query: 329 -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN 352
             +  +V   E    S+T      E   ++ E+K  ++S S I +  K F+N
Sbjct: 348 ADVSADVVPTETRQISETGAGEVKE--TETGEVKERSSSWSNIMSFAKEFAN 377

BLAST of Bhi04G001041 vs. TAIR 10
Match: AT5G58210.2 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 52.8 bits (125), Expect = 8.1e-07
Identity = 81/352 (23.01%), Positives = 151/352 (42.90%), Query Frame = 0

Query: 29  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP- 88
           R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P 
Sbjct: 48  RLSKDDRRALVESFVNEYRATNAGRFPSLDATHKQVGGSYYIVRDIFQELKLKPKAHMPI 107

Query: 89  -GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSD 148
             K L E    +  D S   +P     +E ++   LS      P +    ++  P+ + +
Sbjct: 108 VAKALSEVSSSVPGDASSHSSPAPVPTVEAKA---LSEVSPSVPADASSHLSPSPVPIVE 167

Query: 149 EQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKV 208
            +            T++       PI+   +L  ++   P +    F  + + + E K +
Sbjct: 168 AKALSEVSPSVPADTSSHFSPAPVPIVEAEALSVVSPSVPADTSSHFSPAPVPIVEAKTL 227

Query: 209 EEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQT 268
            EV         +H +P    VVVE     +GS  +  GS+ R +I+ ++ ++    S  
Sbjct: 228 SEVSPSVPDDASSHFSP----VVVEP-EFQAGSVDIEIGSEQRPDIIDTSHSNSDDES-- 287

Query: 269 IELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE---- 328
                     N + +  V+    +K E      +  S+  E   +    N  +  +    
Sbjct: 288 ----------NFQGNNLVITAKDDKRETEATQGIENSETDEERNLTHLGNQESKADHLEG 347

Query: 329 -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN 352
             +  +V   E    S+T      E   ++ E+K  ++S S I +  K F+N
Sbjct: 348 ADVSADVVPTETRQISETGAGEVKE--TETGEVKERSSSWSNIMSFAKEFAN 377

BLAST of Bhi04G001041 vs. ExPASy TrEMBL
Match: A0A1S3C473 (uncharacterized protein LOC103496473 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496473 PE=4 SV=1)

HSP 1 Score: 675.6 bits (1742), Expect = 1.3e-190
Identity = 364/457 (79.65%), Positives = 389/457 (85.12%), Query Frame = 0

Query: 1   MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDHSLEENPLHSIAIEPQSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DHSL++NPLHSIAIEPQSPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESE 180
           S+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SE
Sbjct: 121 SSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LLVNEHK------------------------------KVEEVVKEESGMPINHVTPLATD 240
           LLVNEHK                              KVEEVVKEESGMPINHVTPLATD
Sbjct: 181 LLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATD 240

Query: 241 VVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA 300
           VVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Sbjct: 241 VVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA 300

Query: 301 EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQK 360
            ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQK
Sbjct: 301 GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRI 420
           SS+MK+P AS    ENLNKTFSN FDQASKI    E+ENKVD GQTGGSQKES+PTLNRI
Sbjct: 361 SSKMKSPIAS----ENLNKTFSNDFDQASKI----EIENKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE 426
           NLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWSE
Sbjct: 421 NLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE 449

BLAST of Bhi04G001041 vs. ExPASy TrEMBL
Match: A0A5A7UUF2 (Plastid envelope DNA binding protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold153G00240 PE=4 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 2.9e-190
Identity = 363/457 (79.43%), Positives = 389/457 (85.12%), Query Frame = 0

Query: 1   MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDHSLEENPLHSIAIEPQSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DHSL++NPLHSIAIEPQSPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESE 180
           S+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SE
Sbjct: 121 SSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LLVNEHK------------------------------KVEEVVKEESGMPINHVTPLATD 240
           LLVNEHK                              KVEEVVKEESGMPINHVTPLATD
Sbjct: 181 LLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATD 240

Query: 241 VVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA 300
           VVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Sbjct: 241 VVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA 300

Query: 301 EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQK 360
            ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQK
Sbjct: 301 GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRI 420
           SS+MK+P AS    ENLNKTFSN FDQASKI    E+ENKVD GQTGGSQKES+PTLNRI
Sbjct: 361 SSKMKSPIAS----ENLNKTFSNDFDQASKI----EIENKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE 426
           NLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWS+
Sbjct: 421 NLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSD 449

BLAST of Bhi04G001041 vs. ExPASy TrEMBL
Match: A0A0A0LML9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G384970 PE=4 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 1.7e-182
Identity = 350/458 (76.42%), Positives = 382/458 (83.41%), Query Frame = 0

Query: 1   MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTG PLALA+NNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDHSLEENPLHSIAIEPQSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPG LL  EEH  DHSLE+NPLHSIAIEP SPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGNLL-LEEHNPDHSLEQNPLHSIAIEPHSPLTL 120

Query: 121 SAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESE 180
           S+ EVHFP+NY++ I+EEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SE
Sbjct: 121 SSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 L-------------------------------LVNEHKKVEEVVKEESGMPINHVTPLAT 240
           L                               LVNEH KVEEVVKEESGMPIN+VTPLAT
Sbjct: 181 LLVNGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLAT 240

Query: 241 DVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEK 300
           DVVVETFPLDS  W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK
Sbjct: 241 DVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK 300

Query: 301 AEENFAGPL--SESDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQ 360
           AEEN   PL  ++SD+V+ AQIVE SNGSTVKEG I+EVGGPELEVCSDTP+SV+FEQGQ
Sbjct: 301 AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQ 360

Query: 361 KSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNR 420
           KSS+MK+P AS    ENLNKTFSN FDQASKI    E++NKVD GQTGGSQKES+PTLNR
Sbjct: 361 KSSKMKSPIAS----ENLNKTFSNDFDQASKI----EIKNKVDPGQTGGSQKESVPTLNR 420

Query: 421 INLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE 426
           INL+SWEGMSKNSSK  NNP+LEI K+FI AFVKFWSE
Sbjct: 421 INLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE 449

BLAST of Bhi04G001041 vs. ExPASy TrEMBL
Match: A0A1S3C344 (uncharacterized protein LOC103496473 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496473 PE=4 SV=1)

HSP 1 Score: 641.3 bits (1653), Expect = 2.8e-180
Identity = 349/457 (76.37%), Positives = 373/457 (81.62%), Query Frame = 0

Query: 1   MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDHSLEENPLHSIAIEPQSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DHSL++NPLHSIAIEPQSPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESE 180
           S+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SE
Sbjct: 121 SSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LLVNEHK------------------------------KVEEVVKEESGMPINHVTPLATD 240
           LLVNEHK                              KVEEVVKEESGMPINHVTPLATD
Sbjct: 181 LLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATD 240

Query: 241 VVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA 300
           VVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Sbjct: 241 VVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA 300

Query: 301 EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQK 360
            ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQK
Sbjct: 301 GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRI 420
           SS+MK                      ASKI    E+ENKVD GQTGGSQKES+PTLNRI
Sbjct: 361 SSKMK----------------------ASKI----EIENKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE 426
           NLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWSE
Sbjct: 421 NLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE 431

BLAST of Bhi04G001041 vs. ExPASy TrEMBL
Match: A0A5D3BB97 (Plastid envelope DNA binding protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold371G00180 PE=4 SV=1)

HSP 1 Score: 640.2 bits (1650), Expect = 6.2e-180
Identity = 348/457 (76.15%), Positives = 373/457 (81.62%), Query Frame = 0

Query: 1   MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60
           MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDHSLEENPLHSIAIEPQSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DHSL++NPLHSIAIEPQSPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESE 180
           S+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SE
Sbjct: 121 SSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LLVNEHK------------------------------KVEEVVKEESGMPINHVTPLATD 240
           LLVNEHK                              KVEEVVKEESGMPINHVTPLATD
Sbjct: 181 LLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATD 240

Query: 241 VVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA 300
           VVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Sbjct: 241 VVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA 300

Query: 301 EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQK 360
            ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQK
Sbjct: 301 GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRI 420
           SS+MK                      ASKI    E+ENKVD GQTGGSQKES+PTLNRI
Sbjct: 361 SSKMK----------------------ASKI----EIENKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE 426
           NLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWS+
Sbjct: 421 NLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSD 431

The following BLAST results are available for this feature:
Match Name    E-value     Identity (%)  Description
AT3G52170.1   5.4e-43     32.55         DNA binding
AT3G52170.2   5.4e-43     32.55         DNA binding
AT5G58210.3   8.1e-07     23.01         hydroxyproline-rich glycoprotein family protein
AT5G58210.1   8.1e-07     23.01         hydroxyproline-rich glycoprotein family protein
AT5G58210.2   8.1e-07     23.01         hydroxyproline-rich glycoprotein family protein

Match Name    E-value     Identity (%)  Description
A0A1S3C473    1.3e-190    79.65         uncharacterized protein LOC103496473 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7UUF2    2.9e-190    79.43         Plastid envelope DNA binding protein OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A0A0LML9    1.7e-182    76.42         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G384970 PE=4 SV=1
A0A1S3C344    2.8e-180    76.37         uncharacterized protein LOC103496473 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3BB97    6.2e-180    76.15         Plastid envelope DNA binding protein OS=Cucumis melo var. makuwa OX=1194695 GN=E...
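
The Identity column in the summary is the matched-residue fraction from each HSP header, expressed as a percentage. As an illustrative sketch, the Python below recomputes those percentages from the raw counts in a header line. The function name hsp_percentages is hypothetical, and the sample line copies the counts reported for the AT3G52170.1 hit.

```python
import re

def hsp_percentages(header_line):
    """Recompute 'name = matched/aligned' fractions as percentages (2 decimals)."""
    counts = re.findall(r"(\w+) = (\d+)/(\d+)", header_line)
    return {
        name: round(100 * int(matched) / int(aligned), 2)
        for name, matched, aligned in counts
    }

line = "Identity = 167/513 (32.55%), Positives = 243/513 (47.37%), Query Frame = 0"
print(hsp_percentages(line))
```

The denominator is the alignment length including gaps (513 columns here), not the 425-residue query length, which is why a hit can report more aligned positions than the query has residues.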
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term  IPR Description   Source        Source Term    Source Description   Alignment
None      No IPR available  MOBIDB_LITE   mobidb-lite    disorder_prediction  coord: 324..346
None      No IPR available  PANTHER       PTHR34568      FAMILY NOT NAMED     coord: 1..425
None      No IPR available  PANTHER       PTHR34568:SF1  DNA BINDING PROTEIN  coord: 1..425

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi04M001041  Bhi04M001041  mRNA