Cla97C08G150710 (gene) Watermelon (97103) v2

NameCla97C08G150710
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionDNA binding protein
LocationCla97Chr08 : 19133803 .. 19136836 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCATGCTGTAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTTTTCATAAAAAAGTAACCACACAATGGTTTTAAATTCAATTCATATTTTCTATAGAATGCATGGATTGGTTAGATTGTCGCGATTATTAATCATCTATGCATTATGACATTTGACCTGCAGCTATATTGAATTCCCAAGTGAATGATACCTTTCTCGAAAAACCTTGTCCTGCTATATGACGTTCTCACTCCATGATTTATCTGGCTGTTCATGTTTAAACATGGATCATTGTTGTTTGGCGCTCATTCTTGGATTTGTTTACGTCACTGTTTTCTAGTTGAGCTTTTGTGTAAGCTTTTGGATAGCAAGCATCTTGAAACTGTTAGGTTGATAAATTCTCTTGCAGCTTAATAGTCATGGTCGTTTCCTCTTAGTATGAAGTGGGGGTAAAAACATAATTCAGAGCTGTCAACAGGGGAATATTTATGAAAGTTCTAGATGGATAACATTTTGTTGATTGCACCTGCACGTGTATTGGATGTACATTTGTATTTTTGAACAGGAAATAATGGATTAGTGAAACAATATTGTATACAAGCAAAATAATAATGATGGTAGGCTTATTTGGAGGAGTGTAACCGTTGAGAGTTCAAGGAACTCAACTCTTGCCAAGACAATAATAATAATAATAAAAGAAAGAGAAAAAGGAGGTTGCATGATATTGAAATAGAAATAATGAAGTTATTCAAGAGAGTTTAGACCTCAATGGTTGGACTTTGGAGAATCTTAAACTTATCAAATGAAATTATCTGTACGTTGTTATCCTCTTGATTTAAATACTACAATTTATTGTTAGAATTCCTACAATTAAAGAAACTACTAGACCACGAAAACTTCCATCATAATAGGAAATTTATTTAACTGTTAATGACTGAATCTAGATAAGGATGTAATTTATTTAATTAGTAAAAGTATGAAATCGCATCCTAAGATCTCTTGCAGGTACCAGGAATCAAATAAGGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACAGTACGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTGTTATTAGAAGAGCACAGCATTGATCATTCACTTGAAGAGAATCCACTTCACTCAATTGCTATTGAACCTCCATCTCCTTTAACCTTATCGTCTAAGGAAGTCCATTTTCCAGTCAACTACAACCAATATATAAATGAAGAAGCAATCTTTGTTTCAGATGAGCGCTGCACTGCAACAAGTCTTCAGGGATCACAGAATGGGCCAATAGTTAATGGCAGCCTGGTGGACGCAAGCGAAAAGGATTCTGATGATTTTATCAAGTCAGAGTTGCCAGTAAATGAACACAAGAAAGTAGAGGAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATCATTTAACTCCTTTGGCAACAGATGTTGTGGTTGAGACATTCCCATTGGATTCAGTTTCTTGGTCTGTTAATGGTTCAGATGTAAGATCTGAGATATTAATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGAAGAGGAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACATCTAATGGATCTACTCTGAAAGAAGGTATCATATATGAAGTTGGGGGTTCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTCTGAACAAGGCCAGAAATCTAGTGAAATGAAGGTGAGTGTTCTATCTTTTTCTGGCCAAGCATTTAATGACTTTAATTTATTGTTTTTCTTTTCTTTTTAATCAATCGTCTTTTAAATTTCTGGGGCGTAGGTATAACTTCTTAAAGTTGACTTTTAGATTATGACAAAATAGTATGGGGACAATTACTTGAGTAGTTTCTGGGCCAGTTGAAACTTTAGATTGAAGAAAAAGAATAGAGAATTCCTTTTATATAAAAACTTTATGAATGACATTGGGACGTTCATTTGCATGTTGGTCTTGTTTCCATGTTTTTGAAGTACTTTTTCGTTTGAAAAGTGCATCCCAGACTGCACTCATCTTTTGACAGCCATGTAGCTGTCCAACCAAAGTTACAAACTCTTCTTTCTTCGTTCTTCGTTCTATTTTTATTTTTATTTTGCTGTGTGATTAAAGATGGGTGCAACCCATAATGCATCCAAACATCGCTCATAAATAAATTGTCAAACAATATGTTCTCATCCAACCAGAGAAAACCTTGTCTTTCTTCTTTTTCTTGGGAAGAGAGGGGGGGGGGGGGGGATTGAGAAAGAAAAAGGAGACGAAGAAAAGGATATGTTGAACTTTGGAGTTGTTTATTTGAGGATTTGCCACTTTTTTTGTGTTTGTAACTCAACTTTTGAAAGCCTAATCTGCATTTGGACCATTATTGATGCTCTTTTACAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTTAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGCGACAGAGATGGAAAAGAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAATGAAAGCATTCCAACTTTAAACAGAATTAATCTGTAAGAATATTCCCCGAAGATACATCTGTTGACTTATTATTGCCAATTGATTTTGAGATGAAATCTCATGATTATCCGATACTTATCAAACAGTTTAACCAACATATTCATCTTCCACCTTTTTTTCAGTGAATCATGGGAAGGGATGTCCAAAAACTCATTAAAACCCGAGAACAACCCGCTTTTGGAAATCTTGAAGGCATTCATCACTGCCTTTGTGAAGTTTTGGTCCGAGTAA

mRNA sequence

ATGCATGCTGTAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTTTTCATAAAAAAGTACCAGGAATCAAATAAGGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACAGTACGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTGTTATTAGAAGAGCACAGCATTGATCATTCACTTGAAGAGAATCCACTTCACTCAATTGCTATTGAACCTCCATCTCCTTTAACCTTATCGTCTAAGGAAGTCCATTTTCCAGTCAACTACAACCAATATATAAATGAAGAAGCAATCTTTGTTTCAGATGAGCGCTGCACTGCAACAAGTCTTCAGGGATCACAGAATGGGCCAATAGTTAATGGCAGCCTGGTGGACGCAAGCGAAAAGGATTCTGATGATTTTATCAAGTCAGAGTTGCCAGTAAATGAACACAAGAAAGTAGAGGAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATCATTTAACTCCTTTGGCAACAGATGTTGTGGTTGAGACATTCCCATTGGATTCAGTTTCTTGGTCTGTTAATGGTTCAGATGTAAGATCTGAGATATTAATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGAAGAGGAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACATCTAATGGATCTACTCTGAAAGAAGGTATCATATATGAAGTTGGGGGTTCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTCTGAACAAGGCCAGAAATCTAGTGAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTTAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGCGACAGAGATGGAAAAGAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAATGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCATTAAAACCCGAGAACAACCCGCTTTTGGAAATCTTGAAGGCATTCATCACTGCCTTTGTGAAGTTTTGGTCCGAGTAA

Coding sequence (CDS)

ATGCATGCTGTAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTTTTCATAAAAAAGTACCAGGAATCAAATAAGGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACAGTACGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTGTTATTAGAAGAGCACAGCATTGATCATTCACTTGAAGAGAATCCACTTCACTCAATTGCTATTGAACCTCCATCTCCTTTAACCTTATCGTCTAAGGAAGTCCATTTTCCAGTCAACTACAACCAATATATAAATGAAGAAGCAATCTTTGTTTCAGATGAGCGCTGCACTGCAACAAGTCTTCAGGGATCACAGAATGGGCCAATAGTTAATGGCAGCCTGGTGGACGCAAGCGAAAAGGATTCTGATGATTTTATCAAGTCAGAGTTGCCAGTAAATGAACACAAGAAAGTAGAGGAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATCATTTAACTCCTTTGGCAACAGATGTTGTGGTTGAGACATTCCCATTGGATTCAGTTTCTTGGTCTGTTAATGGTTCAGATGTAAGATCTGAGATATTAATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGAAGAGGAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACATCTAATGGATCTACTCTGAAAGAAGGTATCATATATGAAGTTGGGGGTTCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTCTGAACAAGGCCAGAAATCTAGTGAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTTAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGCGACAGAGATGGAAAAGAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAATGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCATTAAAACCCGAGAACAACCCGCTTTTGGAAATCTTGAAGGCATTCATCACTGCCTTTGTGAAGTTTTGGTCCGAGTAA

Protein sequence

MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
BLAST of Cla97C08G150710 vs. NCBI nr
Match: XP_008456554.1 (PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo] >XP_008456555.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo] >XP_008456556.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo])

HSP 1 Score: 647.5 bits (1669), Expect = 3.1e-182
Identity = 350/457 (76.59%), Postives = 378/457 (82.71%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDHSLEENPLHSIAIEPPSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DHSL++NPLHSIAIEP SPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSE 180
           SSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SE
Sbjct: 121 SSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LPVNEHKKVEEVVKEESGMPINHLTPL------------------------------ATD 240
           L VNEHK+VEEVV++ESGMP NH TPL                                 
Sbjct: 181 LLVNEHKEVEEVVEKESGMPKNHATPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 VVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE 300
               TFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Sbjct: 241 XXXXTFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA 300

Query: 301 EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQK 360
            ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQK
Sbjct: 301 GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRI 420
           SS+MK+P AS    ENLNKTFSN FDQASKI    E+E KVD GQTGGSQ ES+PTLNRI
Sbjct: 361 SSKMKSPIAS----ENLNKTFSNDFDQASKI----EIENKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE 427
           NLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWSE
Sbjct: 421 NLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE 449

BLAST of Cla97C08G150710 vs. NCBI nr
Match: XP_004138835.1 (PREDICTED: uncharacterized protein LOC101202832 isoform X1 [Cucumis sativus] >KGN63028.1 hypothetical protein Csa_2G384970 [Cucumis sativus])

HSP 1 Score: 642.5 bits (1656), Expect = 1.0e-180
Identity = 343/457 (75.05%), Postives = 378/457 (82.71%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPLTLS 120
           HKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DHSLE+NPLHSIAIEP SPLTLS
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLS 120

Query: 121 SKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSEL 180
           S EVHFPVNYN+YI+EE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL
Sbjct: 121 SNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSEL 180

Query: 181 PVNEHKKVEEVVKEESGMPINHLTPLATD------------------------------- 240
            VN HK+VEE+V++ESGMP NH+T LATD                               
Sbjct: 181 LVNGHKEVEEMVEKESGMPKNHVTSLATDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 VVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE 300
              ETFPLDSV W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK 
Sbjct: 241 XXXETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA 300

Query: 301 EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQK 360
           EEN   PL++TKSDLV+ AQIVE SNGST+KEG I+EVGG ELEVCSDTP+SV+ EQGQK
Sbjct: 301 EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRI 420
           SS+MK+P AS    ENLNKTFSN FDQASKI    E++ KVD GQTGGSQ ES+PTLNRI
Sbjct: 361 SSKMKSPIAS----ENLNKTFSNDFDQASKI----EIKNKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE 427
           NL+SWEGMSKNS KP NNPLLEI+K+FITAFVKFWSE
Sbjct: 421 NLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE 449

BLAST of Cla97C08G150710 vs. NCBI nr
Match: XP_008456557.1 (PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo])

HSP 1 Score: 613.2 bits (1580), Expect = 6.6e-172
Identity = 335/457 (73.30%), Postives = 362/457 (79.21%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDHSLEENPLHSIAIEPPSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DHSL++NPLHSIAIEP SPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSE 180
           SSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SE
Sbjct: 121 SSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LPVNEHKKVEEVVKEESGMPINHLTPL------------------------------ATD 240
           L VNEHK+VEEVV++ESGMP NH TPL                                 
Sbjct: 181 LLVNEHKEVEEVVEKESGMPKNHATPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 VVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE 300
               TFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Sbjct: 241 XXXXTFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA 300

Query: 301 EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQK 360
            ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQK
Sbjct: 301 GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRI 420
           SS+MK                      ASKI    E+E KVD GQTGGSQ ES+PTLNRI
Sbjct: 361 SSKMK----------------------ASKI----EIENKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE 427
           NLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWSE
Sbjct: 421 NLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE 431

BLAST of Cla97C08G150710 vs. NCBI nr
Match: XP_023534676.1 (uncharacterized protein LOC111796177 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023534686.1 uncharacterized protein LOC111796177 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023534695.1 uncharacterized protein LOC111796177 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 609.4 bits (1570), Expect = 9.5e-171
Identity = 327/433 (75.52%), Postives = 364/433 (84.06%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGW G PLALAK NE EGRKTRIRRSKEERKAMVEVFIKKYQESN+GSFPSLNLT
Sbjct: 1   MHAIKGGWPGCPLALAKYNESEGRKTRIRRSKEERKAMVEVFIKKYQESNEGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPLTLS 120
           HKEVGGSFYTVREIVRDIIQENRVLGPGKL LEEHS DH LEENPLHSIAIEP SPLT  
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLSLEEHSTDHLLEENPLHSIAIEPQSPLTSP 120

Query: 121 SKEVHFPVNYNQYINEEAIFVSD-ERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSE 180
           S+E  FP+N+N  INEE I VSD E+ T+ ++QGSQNG I+NGSLVDAS+KDSD+FI++E
Sbjct: 121 SEEFDFPINHNHCINEEPIIVSDEEQYTSKNIQGSQNGVIINGSLVDASDKDSDEFIQTE 180

Query: 181 LPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSA 240
           LPVNEHKK+EEV+KEESGMPINH+TPLA DV V TFPLDS SW+ NGSDV SE LIST A
Sbjct: 181 LPVNEHKKIEEVLKEESGMPINHVTPLAVDVKVGTFPLDSFSWAANGSDVISETLISTDA 240

Query: 241 SEKQVSQTIELESDVGLFN------IKASGCVVEKEEENFAGPLSETKSDLVEVAQIVET 300
           SEK+VSQTIELESDV LFN       KASG   EK        LSET SDLVEVAQIVE 
Sbjct: 241 SEKKVSQTIELESDVSLFNSEDNNSTKASGRADEK-------ALSETMSDLVEVAQIVED 300

Query: 301 SNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNG 360
           ++G+ +K+G I+EV G  LE+C+DTPISVT EQGQKSSE+KAPNASPS  +NLN + +NG
Sbjct: 301 TDGTIMKDGRIHEVDGPGLEICTDTPISVTFEQGQKSSEIKAPNASPSGTKNLNDSRNNG 360

Query: 361 FDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEIL 420
            DQASKIKE TE++ KV+A QTGGSQ ESIPTLNR+NL+SW G SK+S KPENNPLLEIL
Sbjct: 361 IDQASKIKEETEVKNKVEAEQTGGSQKESIPTLNRLNLKSWHGTSKDSSKPENNPLLEIL 420

Query: 421 KAFITAFVKFWSE 427
            AFI AFVKFWSE
Sbjct: 421 NAFIAAFVKFWSE 426

BLAST of Cla97C08G150710 vs. NCBI nr
Match: XP_011649859.1 (PREDICTED: uncharacterized protein LOC101202832 isoform X2 [Cucumis sativus])

HSP 1 Score: 608.2 bits (1567), Expect = 2.1e-170
Identity = 328/457 (71.77%), Postives = 362/457 (79.21%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPLTLS 120
           HKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DHSLE+NPLHSIAIEP SPLTLS
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLS 120

Query: 121 SKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSEL 180
           S EVHFPVNYN+YI+EE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL
Sbjct: 121 SNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSEL 180

Query: 181 PVNEHKKVEEVVKEESGMPINHLTPLATD------------------------------- 240
            VN HK+VEE+V++ESGMP NH+T LATD                               
Sbjct: 181 LVNGHKEVEEMVEKESGMPKNHVTSLATDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 VVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE 300
              ETFPLDSV W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK 
Sbjct: 241 XXXETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA 300

Query: 301 EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQK 360
           EEN   PL++TKSDLV+ AQIVE SNGST+KEG I+EVGG ELEVCSDTP+SV+ EQGQK
Sbjct: 301 EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRI 420
           SS+MK                      ASKI    E++ KVD GQTGGSQ ES+PTLNRI
Sbjct: 361 SSKMK----------------------ASKI----EIKNKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE 427
           NL+SWEGMSKNS KP NNPLLEI+K+FITAFVKFWSE
Sbjct: 421 NLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE 431

BLAST of Cla97C08G150710 vs. TrEMBL
Match: tr|A0A1S3C473|A0A1S3C473_CUCME (uncharacterized protein LOC103496473 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496473 PE=4 SV=1)

HSP 1 Score: 647.5 bits (1669), Expect = 2.1e-182
Identity = 350/457 (76.59%), Postives = 378/457 (82.71%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDHSLEENPLHSIAIEPPSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DHSL++NPLHSIAIEP SPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSE 180
           SSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SE
Sbjct: 121 SSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LPVNEHKKVEEVVKEESGMPINHLTPL------------------------------ATD 240
           L VNEHK+VEEVV++ESGMP NH TPL                                 
Sbjct: 181 LLVNEHKEVEEVVEKESGMPKNHATPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 VVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE 300
               TFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Sbjct: 241 XXXXTFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA 300

Query: 301 EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQK 360
            ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQK
Sbjct: 301 GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRI 420
           SS+MK+P AS    ENLNKTFSN FDQASKI    E+E KVD GQTGGSQ ES+PTLNRI
Sbjct: 361 SSKMKSPIAS----ENLNKTFSNDFDQASKI----EIENKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE 427
           NLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWSE
Sbjct: 421 NLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE 449

BLAST of Cla97C08G150710 vs. TrEMBL
Match: tr|A0A0A0LML9|A0A0A0LML9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G384970 PE=4 SV=1)

HSP 1 Score: 642.5 bits (1656), Expect = 6.7e-181
Identity = 343/457 (75.05%), Postives = 378/457 (82.71%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPLTLS 120
           HKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DHSLE+NPLHSIAIEP SPLTLS
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLS 120

Query: 121 SKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSEL 180
           S EVHFPVNYN+YI+EE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL
Sbjct: 121 SNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSEL 180

Query: 181 PVNEHKKVEEVVKEESGMPINHLTPLATD------------------------------- 240
            VN HK+VEE+V++ESGMP NH+T LATD                               
Sbjct: 181 LVNGHKEVEEMVEKESGMPKNHVTSLATDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 VVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE 300
              ETFPLDSV W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK 
Sbjct: 241 XXXETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA 300

Query: 301 EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQK 360
           EEN   PL++TKSDLV+ AQIVE SNGST+KEG I+EVGG ELEVCSDTP+SV+ EQGQK
Sbjct: 301 EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRI 420
           SS+MK+P AS    ENLNKTFSN FDQASKI    E++ KVD GQTGGSQ ES+PTLNRI
Sbjct: 361 SSKMKSPIAS----ENLNKTFSNDFDQASKI----EIKNKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE 427
           NL+SWEGMSKNS KP NNPLLEI+K+FITAFVKFWSE
Sbjct: 421 NLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE 449

BLAST of Cla97C08G150710 vs. TrEMBL
Match: tr|A0A1S3C344|A0A1S3C344_CUCME (uncharacterized protein LOC103496473 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496473 PE=4 SV=1)

HSP 1 Score: 613.2 bits (1580), Expect = 4.3e-172
Identity = 335/457 (73.30%), Postives = 362/457 (79.21%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLT
Sbjct: 1   MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDHSLEENPLHSIAIEPPSPLTL 120
           HKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DHSL++NPLHSIAIEP SPLTL
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSIAIEPQSPLTL 120

Query: 121 SSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSE 180
           SSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SE
Sbjct: 121 SSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSE 180

Query: 181 LPVNEHKKVEEVVKEESGMPINHLTPL------------------------------ATD 240
           L VNEHK+VEEVV++ESGMP NH TPL                                 
Sbjct: 181 LLVNEHKEVEEVVEKESGMPKNHATPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 VVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE 300
               TFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Sbjct: 241 XXXXTFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA 300

Query: 301 EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQK 360
            ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQK
Sbjct: 301 GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQK 360

Query: 361 SSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRI 420
           SS+MK                      ASKI    E+E KVD GQTGGSQ ES+PTLNRI
Sbjct: 361 SSKMK----------------------ASKI----EIENKVDPGQTGGSQKESVPTLNRI 420

Query: 421 NLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE 427
           NLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWSE
Sbjct: 421 NLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE 431

BLAST of Cla97C08G150710 vs. TrEMBL
Match: tr|Q5DW97|Q5DW97_CUCSA (Plastid envelope DNA binding protein (Fragment) OS=Cucumis sativus OX=3659 GN=PEND PE=4 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 6.2e-102
Identity = 241/433 (55.66%), Postives = 276/433 (63.74%), Query Frame = 0

Query: 64  VGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPLTLSSKE 123
           VGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DHSLE+NPLHSIAIEP SPLTLSS E
Sbjct: 1   VGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIAIEPHSPLTLSSNE 60

Query: 124 VHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVN 183
           VHFPVNYN+YI+EE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL VN
Sbjct: 61  VHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN 120

Query: 184 EHKKVEEVVKEESGMPINHLTPLATD-------------------------------VVV 243
            HK+VEE+V++ESGMP NH+T LATD                                  
Sbjct: 121 GHKEVEEMVEKESGMPKNHVTSLATDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 244 ETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEEN 303
           ETFPLDSV W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK EEN
Sbjct: 181 ETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKAEEN 240

Query: 304 FAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELE------------------- 363
              PL++TKSDLV+ AQIVE SNGST+KEG I+EVGG   +                   
Sbjct: 241 LTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPSWKFAVILQYLIPFILNFMDGI 300

Query: 364 ---VCSDTPISVTS------EQGQKSSEMKA--PNASPSTIENLNKTFSNGFDQASKIKE 423
              +C      V        E+    + + A  P   P             F    K+  
Sbjct: 301 WTFICMVVLFCVFEIPTFLFEKCIDCTNLLAVHPVIHPKFTMCQTLFLFLVFFAVMKVS- 360

Query: 424 ATEMEKKVDAGQ--TGGSQNESIPT-LNRINL------ESWEGMSKNSLKPENNPLLEIL 427
                KK+DA    T      +I T + R NL      +SWEGMSKNS KP NNPLLEI+
Sbjct: 361 ----CKKMDANPKCTQVLLINTISTHVTRENLVFLFICDSWEGMSKNSSKPGNNPLLEII 420

BLAST of Cla97C08G150710 vs. TrEMBL
Match: tr|A0A251QEP0|A0A251QEP0_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G095200 PE=4 SV=1)

HSP 1 Score: 295.8 bits (756), Expect = 1.5e-76
Identity = 200/474 (42.19%), Postives = 259/474 (54.64%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MHA+KGGW GR  ALAKNNE EGRK+RIRRSKEERKAMVE FIK YQ+ N GSFPSLNLT
Sbjct: 1   MHAIKGGWAGRTFALAKNNESEGRKSRIRRSKEERKAMVESFIKTYQKLNNGSFPSLNLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPLTLS 120
           HKEVGGSFYTVREIVRDIIQENRVLGP K   EE +IDH LE+NPL SIA EPP+ L++S
Sbjct: 61  HKEVGGSFYTVREIVRDIIQENRVLGPAKFTAEEQTIDHFLEQNPLGSIATEPPNTLSIS 120

Query: 121 SKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSEL 180
             +  F  N NQ   EE +F SD        Q   NG I+NG+ V+ + K+  +   +EL
Sbjct: 121 LNQSQFISNQNQGRIEELVFTSDGHLATLERQNFDNGKIINGTQVEVNNKEF-ELKCTEL 180

Query: 181 PVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSAS 240
            V    + E+ V EES      +      +  E   +D+   + N  D++ +       +
Sbjct: 181 RVKGPVETEKNVAEES------VVHNGDCIGPEYQMVDNGLINGNQVDLKDKKTEELPCT 240

Query: 241 EKQVSQTIEL------------------------------------ESDVGLFNIKASGC 300
           E Q  + +E                                     ES  G         
Sbjct: 241 ELQTIEPLEAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKPANETSESLDGRLQEVTDLA 300

Query: 301 V--VEKEEENFAGPLSETKSDLVEVAQI-------VETSNGSTLKEGIIYEVGGSELEVC 360
           +   ++ EEN + PL E  S  ++   +       +E+SN ST  +G++ E G ++L V 
Sbjct: 301 ISTEDRVEENLSSPLLENNSGSLDEEALGNARDPSLESSNCSTFNDGVVREKGSTDLNVK 360

Query: 361 S---DTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDA 420
           +   D P S    Q Q ++  KA  A P ++   +   + G  + SK KE   +E +VD 
Sbjct: 361 APHKDVPTSEILVQSQLTAGPKAIKA-PDSLHTNHINSTGGSSELSKTKEVLVIEDEVDV 420

Query: 421 GQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE 427
             +G SQ  S PTL+RINLESWEG S+ S KPE NPL ++ KAFI AFVKFWSE
Sbjct: 421 QSSGSSQKGSSPTLDRINLESWEGRSQKSAKPEGNPLWDVFKAFIDAFVKFWSE 466

BLAST of Cla97C08G150710 vs. TAIR10
Match: AT3G52170.1 (DNA binding)

HSP 1 Score: 177.6 bits (449), Expect = 1.7e-44
Identity = 155/502 (30.88%), Postives = 241/502 (48.01%), Query Frame = 0

Query: 1   MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLT 60
           MH++K    G+  ALAK ++  G++TR R  KEERK +VE FIKK+Q+ N GSFPSL+LT
Sbjct: 1   MHSLKTTCVGQIFALAKPHDSVGKRTRNRIPKEERKTLVESFIKKHQKLNNGSFPSLSLT 60

Query: 61  HKEVGGSFYTVREIVRDIIQENRVLGPGKLLLE------EHSIDHSLEENPLHSIAIEPP 120
           HKEVGGSFYT+REIVR+IIQENRVLGPG LLLE      + S+  S+  +P+  +++ P 
Sbjct: 61  HKEVGGSFYTIREIVREIIQENRVLGPGDLLLEGNGSVQDQSLSSSILMDPVPPLSLSPN 120

Query: 121 -------SPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSL----QGSQNGPIVNGS 180
                    L  SS+     VN +Q   +    VS  +     +    Q   +  I    
Sbjct: 121 GFHSGSYQSLDFSSESPEGNVNGSQVCLDNCREVSGSQLLKEDIGLVHQSMDSTDISMTQ 180

Query: 181 LVDASEKDSDDFIKSELPV-----------------------NEHKKVEEV--VKEESGM 240
           L  +  +D+D  IKS   +                       N+ +  EE+  ++ +   
Sbjct: 181 LATSCSEDND--IKSNAGLQNRMETVCDSVDTKPQDKRLDVDNKDEGFEELRFMESDGTK 240

Query: 241 PINH-------------------LTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSA 300
           P+N+                      ++ + VVETFPL SV+ +++  D +   L     
Sbjct: 241 PVNNDDRVNDAGAAMTEIKNGLGTIDMSAETVVETFPLKSVTSTMDSPDAQPTELNKVCE 300

Query: 301 SEKQVSQTIELES------DVGLFNIKASGCVVE---------KEEENFAGPLSETKSDL 360
             K     +E +       D+G  +   S  V+E         +   + + P+ +   + 
Sbjct: 301 GGKGTETEVEADRSTVNHVDLGEISSSTSSAVLEDIGTEVIVGQIPNHISVPMEKKVGEE 360

Query: 361 VEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIE 420
           +  +  V+       +  ++  V G+  E    +  ++T+EQ   +S  ++ +      +
Sbjct: 361 IVNSASVDVECADAKETVVVNGVIGNVHETKEFSNGTLTAEQKMPTSSTESGSRKNDRAK 420

Query: 421 NLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKP 427
               +   G + AS  K+AT  + K+DA  +  SQ E+  TLNRI  ESW+G S N  + 
Sbjct: 421 VDTVSSYAGNEVASVEKKATMEKGKIDAPDSSSSQKENNATLNRIKPESWKGES-NMGRQ 480

BLAST of Cla97C08G150710 vs. TAIR10
Match: AT5G58210.3 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 61.6 bits (148), Expect = 1.3e-09
Identity = 33/89 (37.08%), Postives = 54/89 (60.67%), Query Frame = 0

Query: 29  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPG 88
           R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++    
Sbjct: 48  RLSKDDRRALVESFVNEYRATNAGRFPSLDATHKQVGGSYY----IVRDIFQELKLKPKA 107

Query: 89  KLLLEEHSIDHSLEENPLHSIAIEPPSPL 118
            + +   ++       P  + +   P+P+
Sbjct: 108 HMPIVAKALSEVSSSVPGDASSHSSPAPV 132

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008456554.13.1e-18276.59PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo] >XP_00... [more]
XP_004138835.11.0e-18075.05PREDICTED: uncharacterized protein LOC101202832 isoform X1 [Cucumis sativus] >KG... [more]
XP_008456557.16.6e-17273.30PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo][more]
XP_023534676.19.5e-17175.52uncharacterized protein LOC111796177 isoform X1 [Cucurbita pepo subsp. pepo] >XP... [more]
XP_011649859.12.1e-17071.77PREDICTED: uncharacterized protein LOC101202832 isoform X2 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
tr|A0A1S3C473|A0A1S3C473_CUCME2.1e-18276.59uncharacterized protein LOC103496473 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|A0A0A0LML9|A0A0A0LML9_CUCSA6.7e-18175.05Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G384970 PE=4 SV=1[more]
tr|A0A1S3C344|A0A1S3C344_CUCME4.3e-17273.30uncharacterized protein LOC103496473 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|Q5DW97|Q5DW97_CUCSA6.2e-10255.66Plastid envelope DNA binding protein (Fragment) OS=Cucumis sativus OX=3659 GN=PE... [more]
tr|A0A251QEP0|A0A251QEP0_PRUPE1.5e-7642.19Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G095200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT3G52170.11.7e-4430.88DNA binding[more]
AT5G58210.31.3e-0937.08hydroxyproline-rich glycoprotein family protein[more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016020 membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C08G150710.1Cla97C08G150710.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 323..342
NoneNo IPR availablePANTHERPTHR34568FAMILY NOT NAMEDcoord: 1..426
NoneNo IPR availablePANTHERPTHR34568:SF2DNA BINDING PROTEINcoord: 1..426

The following gene(s) are paralogous to this gene:

None