Cla97C05G092090 (gene) Watermelon (97103) v2

NameCla97C05G092090
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionHydroxyproline-rich glycoprotein family protein, putative
LocationCla97Chr05 : 10196980 .. 10199609 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAAAGGGGAAGAGCAAAATCTGCCGTTGCAGCAGCGTCGTGAGGTGGCTCCAAGTGGGGATTCTTCTGGGTTTCTTTGTGGTCAATGTTCGACTGCTTTTCATAGAGTTTGTAAGGAGTTGAATTTCAAGTGTTTCTTCGTTTTGATTTTGGGGTTTGTGGTGTTTGTCCCTGGATTCTTTTGGCTTCTTCCTCTTCATGAAAGAAATTCTGGGTTTGAGGCAAAAGACACCATTAAACTCAGTGGTATGTATATCTAGAGTTTCCTATTTCTCTTTTCTTCTTTCTTCTTCAATTCTGCCTTTGGGAATTTCTGCTTTGATTCTTGATTGCTCTGATTTTTCTTGTAATTTCTTGATGGGGTTTTATACTTTTCTGATGCTTTTGAATGATTTGAATGATCTTTCTGTTTTCTGTAATGTGTTTTCTCTGTCTCTTGAAATTTGAAATTTGGAAGTGTTATGATGATTGTGCTTCAGATCAACTCATCTGATCTACCATTTTTATTCTCCCACAATTTTCTGATAGTTTATTTTGATGGTCATTTTAATATGAAGATGAGAGGCCATTTCTCAAGGTTCAATCCTGGTTGAAGAAGGTTTTTTTTGTTCCTTTTATCTTGTTTTGGTTTTCCTTGAGGGGTTTTATTTTGTTTGATGTTTTGATAGAAATGTTTGAAAATATGGTGCCTTAGTCCTAATTCTGCAGAGTTTGTTATGTGAATTTAGTGTAAGTTGCAACCAACACTCACTTGATGCGGGTTTCATTTTTCCATGTTTGGAATATAATTTTCACATGGGAAGAAAGTTCGAACCAAAATTTATACTATTTAGAAAATAGTAAAGGCATCATATTCCTTCCTTACCATTTTGCTTAGGGAATCATATTCCCAGAAGAATTGGATTCAAGACAATTGGGTGGATCGTCTGAACACTCCTCCTAGCATCAACTATGAAACACTTACTCTATAGTTTATGTGATAGCAATAAAACCGGACTTTGCTTTTGCTTCAATTTTAGATTTTCTGTGAACTATTTGGTTAGAATCCGATTCGTGAGGGATCGGGAACGCTTCAAGAAATTAGACTTTGCTTTTGCTTCAACTTAGATCTTCTGTGAGTATTTGTTACTTAGAAAGTTTTTGTTCTCTTCTCCAGCTACAGTTCAGGTGTATTTCGTTCTTGAAAAGCCCGTGAATGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATTCCAAACTTGAAGGTTTTGTGCTTTGTTATTGTTTTTGGATTTAGTTTTACTAGTAAGGTCTTTTTATGTCTAAGGGTTGTCGTATATGTTACCAGGTTTCCATTCTATCCATGCATGGTATAGGTGAGTCGAACAGGACTTACGTGGTTTTTGGTCTTCTTTCTGAATACATAACTGCTCCAATAAATCCAGTGTCCTTAAGTCTGCTGAGATCGTCTTTATATGACTTTTTCCTTTCCGAATCCAACCTTACTTTGACGACATCGATTTTTGGACAGCCATCGACATTTCAAATTCTCAAGTTTCCAGGGGGAATTTCTATAATCCCATTTCAACATGCTTCAATTTGGCAGTTTCCCCAGATTGTATTTAACTTCACTCTTACTAACTCCATTTCTGAAATACTCGACAACTTTGCCAAGTTCAAGAGCCAACTAAAGTTTGGATTGAGTCTGAGGACTTATGAGGTATGAAGACTTATATAAGTTAAACTTATCTTTTCTGTTATGGATCGAATTTATCAAATATTAGCTCTAGCATTTGAGATCATAAGTTGAGATGAAATTTTTAATTGTTGTTAGTTTTGTGTATCTGGGAGTGGATTTGGTTAATGGATGTTGGATTCCATTCTTTCTTTTCTTGTTATACAGAATGTGTATTTGCAAATAACAAACAAGATTGGCTCGACGATGCAACCACTTGTAATTGTTCAGGCTTCTATTACGTCGGAATTGGGACGCATAACGTCACAGAGATTACAGCAGTTGGCTGCAATCATCAACACCTCTCCTGAAAGAAATCTTGGCCTTGATTATTCTGTTTTTGGAGAAGTCAAGAGTGTCAGTTTGTCTTCTTATTCGAAGAGACCTGCCAAGGCAATGCCTCCGAGTTTTTCTCCCGCTCCTGCCCCAGTGCCTGGTGACCATGTAAAACTACCGAGTAGCCCACATCCATCGAGATCCGCGCAATCACCTGCAACTCATTCCCCACCTCATGCAAATTGTGAAACCTCGTCTCCAACCCCTTCAATGGTTCCTCCACATTCCCCTCGTGAACATTCAATACCCCCAACCTCCTATCCAAAGTCTACAAGACTGATCGTTCCTCCGGCTGAACAACCTCGAGTTTCTTCTCCACGTGCATCTCCGGTAGAGTTTTCACCGCTTTTGCCCCCCGATCTGTTACCTAAACCAAAGCCTTCTTTTCACTCCAAACCAGGGCAGACAAAGGAAGATACGTCACATCCAGATCATGTAAGCTTTGACATATTTTGTTAGATTATAGAGAGTTGAAAGCATGTGGTATACACATTTCTTAGCACACATCAAGTCACTTATATCTATCATTCCTTTTCTCATCTTTGTAGGATTAA

mRNA sequence

ATGGGGAAAGGGGAAGAGCAAAATCTGCCGTTGCAGCAGCGTCGTGAGGTGGCTCCAAGTGGGGATTCTTCTGGGTTTCTTTGTGGTCAATGTTCGACTGCTTTTCATAGAGTTTGTAAGGAGTTGAATTTCAAGTGTTTCTTCGTTTTGATTTTGGGGTTTGTGGTGTTTGTCCCTGGATTCTTTTGGCTTCTTCCTCTTCATGAAAGAAATTCTGGGTTTGAGGCAAAAGACACCATTAAACTCAGTGCTACAGTTCAGGTGTATTTCGTTCTTGAAAAGCCCGTGAATGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATTCCAAACTTGAAGGTTTCCATTCTATCCATGCATGGTATAGGTGAGTCGAACAGGACTTACGTGGTTTTTGGTCTTCTTTCTGAATACATAACTGCTCCAATAAATCCAGTGTCCTTAAGTCTGCTGAGATCGTCTTTATATGACTTTTTCCTTTCCGAATCCAACCTTACTTTGACGACATCGATTTTTGGACAGCCATCGACATTTCAAATTCTCAAGTTTCCAGGGGGAATTTCTATAATCCCATTTCAACATGCTTCAATTTGGCAGTTTCCCCAGATTGTATTTAACTTCACTCTTACTAACTCCATTTCTGAAATACTCGACAACTTTGCCAAGTTCAAGAGCCAACTAAAGTTTGGATTGAGTCTGAGGACTTATGAGAATGTGTATTTGCAAATAACAAACAAGATTGGCTCGACGATGCAACCACTTGTAATTGTTCAGGCTTCTATTACGTCGGAATTGGGACGCATAACGTCACAGAGATTACAGCAGTTGGCTGCAATCATCAACACCTCTCCTGAAAGAAATCTTGGCCTTGATTATTCTGTTTTTGGAGAAGTCAAGAGTGTCAGTTTGTCTTCTTATTCGAAGAGACCTGCCAAGGCAATGCCTCCGAGTTTTTCTCCCGCTCCTGCCCCAGTGCCTGGTGACCATGTAAAACTACCGAGTAGCCCACATCCATCGAGATCCGCGCAATCACCTGCAACTCATTCCCCACCTCATGCAAATTGTGAAACCTCGTCTCCAACCCCTTCAATGGTTCCTCCACATTCCCCTCGTGAACATTCAATACCCCCAACCTCCTATCCAAAGTCTACAAGACTGATCGTTCCTCCGGCTGAACAACCTCGAGTTTCTTCTCCACGTGCATCTCCGGTAGAGTTTTCACCGCTTTTGCCCCCCGATCTGTTACCTAAACCAAAGCCTTCTTTTCACTCCAAACCAGGGCAGACAAAGGAAGATACGTCACATCCAGATCATGATTAA

Coding sequence (CDS)

ATGGGGAAAGGGGAAGAGCAAAATCTGCCGTTGCAGCAGCGTCGTGAGGTGGCTCCAAGTGGGGATTCTTCTGGGTTTCTTTGTGGTCAATGTTCGACTGCTTTTCATAGAGTTTGTAAGGAGTTGAATTTCAAGTGTTTCTTCGTTTTGATTTTGGGGTTTGTGGTGTTTGTCCCTGGATTCTTTTGGCTTCTTCCTCTTCATGAAAGAAATTCTGGGTTTGAGGCAAAAGACACCATTAAACTCAGTGCTACAGTTCAGGTGTATTTCGTTCTTGAAAAGCCCGTGAATGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATTCCAAACTTGAAGGTTTCCATTCTATCCATGCATGGTATAGGTGAGTCGAACAGGACTTACGTGGTTTTTGGTCTTCTTTCTGAATACATAACTGCTCCAATAAATCCAGTGTCCTTAAGTCTGCTGAGATCGTCTTTATATGACTTTTTCCTTTCCGAATCCAACCTTACTTTGACGACATCGATTTTTGGACAGCCATCGACATTTCAAATTCTCAAGTTTCCAGGGGGAATTTCTATAATCCCATTTCAACATGCTTCAATTTGGCAGTTTCCCCAGATTGTATTTAACTTCACTCTTACTAACTCCATTTCTGAAATACTCGACAACTTTGCCAAGTTCAAGAGCCAACTAAAGTTTGGATTGAGTCTGAGGACTTATGAGAATGTGTATTTGCAAATAACAAACAAGATTGGCTCGACGATGCAACCACTTGTAATTGTTCAGGCTTCTATTACGTCGGAATTGGGACGCATAACGTCACAGAGATTACAGCAGTTGGCTGCAATCATCAACACCTCTCCTGAAAGAAATCTTGGCCTTGATTATTCTGTTTTTGGAGAAGTCAAGAGTGTCAGTTTGTCTTCTTATTCGAAGAGACCTGCCAAGGCAATGCCTCCGAGTTTTTCTCCCGCTCCTGCCCCAGTGCCTGGTGACCATGTAAAACTACCGAGTAGCCCACATCCATCGAGATCCGCGCAATCACCTGCAACTCATTCCCCACCTCATGCAAATTGTGAAACCTCGTCTCCAACCCCTTCAATGGTTCCTCCACATTCCCCTCGTGAACATTCAATACCCCCAACCTCCTATCCAAAGTCTACAAGACTGATCGTTCCTCCGGCTGAACAACCTCGAGTTTCTTCTCCACGTGCATCTCCGGTAGAGTTTTCACCGCTTTTGCCCCCCGATCTGTTACCTAAACCAAAGCCTTCTTTTCACTCCAAACCAGGGCAGACAAAGGAAGATACGTCACATCCAGATCATGATTAA

Protein sequence

MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYSVFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHSPREHSIPPTSYPKSTRLIVPPAEQPRVSSPRASPVEFSPLLPPDLLPKPKPSFHSKPGQTKEDTSHPDHD
BLAST of Cla97C05G092090 vs. NCBI nr
Match: XP_004149972.2 (PREDICTED: uncharacterized protein LOC101222031 isoform X2 [Cucumis sativus] >KGN57565.1 hypothetical protein Csa_3G214050 [Cucumis sativus])

HSP 1 Score: 733.8 bits (1893), Expect = 3.5e-208
Identity = 386/446 (86.55%), Postives = 400/446 (89.69%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGKGEEQNLPLQQRREVA +GDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
           FFWLLPLHERNSGFEAKD IKLSATVQVYFVLEKPV ELLPHIKRLEFDINGELDIPN+K
Sbjct: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           QPST QILKFPGGISIIPFQHASIW+FPQIVFNFTLTNSISEILDNFAKFKSQLKFGL L
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHA 360
           VFGEVKSVSLSSY KR +KAMPPSFSPA     G+HV++PS PHP RS + PA HSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAXXXXXGNHVEVPSGPHPLRSMRPPANHSPPHA 360

Query: 361 NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLIVPPAEQPRVSSXXXXPVEFSPLLPPD 420
           NC++SSP PSMVP + P EHSIPP SYPKSTRLIVPPA QPRV S XXX           
Sbjct: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPXXXXXXXXXXXXXX 420

Query: 421 LLPKPKPSFHSKPGQTKEDTSHPDHD 447
              KPKPSF SK GQT ED SHP HD
Sbjct: 421 XXXKPKPSFRSKSGQTNEDPSHPVHD 446

BLAST of Cla97C05G092090 vs. NCBI nr
Match: XP_011651267.1 (PREDICTED: uncharacterized protein LOC101222031 isoform X1 [Cucumis sativus])

HSP 1 Score: 731.9 bits (1888), Expect = 1.3e-207
Identity = 385/445 (86.52%), Postives = 399/445 (89.66%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGKGEEQNLPLQQRREVA +GDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
           FFWLLPLHERNSGFEAKD IKLSATVQVYFVLEKPV ELLPHIKRLEFDINGELDIPN+K
Sbjct: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           QPST QILKFPGGISIIPFQHASIW+FPQIVFNFTLTNSISEILDNFAKFKSQLKFGL L
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHA 360
           VFGEVKSVSLSSY KR +KAMPPSFSPA     G+HV++PS PHP RS + PA HSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAXXXXXGNHVEVPSGPHPLRSMRPPANHSPPHA 360

Query: 361 NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLIVPPAEQPRVSSXXXXPVEFSPLLPPD 420
           NC++SSP PSMVP + P EHSIPP SYPKSTRLIVPPA QPRV S XXX           
Sbjct: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPXXXXXXXXXXXXXX 420

Query: 421 LLPKPKPSFHSKPGQTKEDTSHPDH 446
              KPKPSF SK GQT ED SHP H
Sbjct: 421 XXXKPKPSFRSKSGQTNEDPSHPVH 445

BLAST of Cla97C05G092090 vs. NCBI nr
Match: XP_008456084.1 (PREDICTED: uncharacterized protein LOC103496125 isoform X1 [Cucumis melo])

HSP 1 Score: 726.1 bits (1873), Expect = 7.3e-206
Identity = 381/446 (85.43%), Postives = 397/446 (89.01%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGKGEEQNLPLQQRREVA SGDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALSGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
            FWLLPLHERNSGFEAK+ +KLSATVQVYFVLEKPV ELLPHIKRLEFDINGELDIP++K
Sbjct: 61  LFWLLPLHERNSGFEAKENVKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPDVK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           QPST QILKFPGGISIIPFQHASIW+FPQIVFNFTLTNSISEILDNFAKFKS+LKFGL L
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSELKFGLRL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHA 360
           VFGEVKSVSLSSY KR +KAMPPSFSPA     GDHV++PS PH  RS + PA HSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAXXXXXGDHVEVPSDPHRLRSTRPPANHSPPHA 360

Query: 361 NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLIVPPAEQPRVSSXXXXPVEFSPLLPPD 420
           NC++ SP PSMVP H P EHSIPP SYPKSTRL+VPPA QPR   XXXX           
Sbjct: 361 NCKSLSPNPSMVPAHSPHEHSIPPISYPKSTRLVVPPANQPRXXXXXXXXXXXXXXXXXX 420

Query: 421 LLPKPKPSFHSKPGQTKEDTSHPDHD 447
              KPKPSFHSK GQT ED SHP HD
Sbjct: 421 XXXKPKPSFHSKSGQTNEDLSHPVHD 446

BLAST of Cla97C05G092090 vs. NCBI nr
Match: XP_022922926.1 (uncharacterized protein LOC111430758 isoform X1 [Cucurbita moschata])

HSP 1 Score: 634.4 bits (1635), Expect = 2.9e-178
Identity = 346/456 (75.88%), Postives = 369/456 (80.92%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGKGE+QNLP Q RRE     DSSGF+C +CS +F R   ELNFKC FVLILGF VF+PG
Sbjct: 1   MGKGEDQNLPQQHRRE-----DSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
           FFWLLPLHERN GFEAKD IKLSATVQVYFVLEKPV ELLPHIKRLEFDINGELDIPN+K
Sbjct: 61  FFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VS+LSMH +GESNRTYVVFGLLSEYIT PINPVSLSLLRSSLYD FL +SNLTLTTSIFG
Sbjct: 121 VSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           QPS FQILKFPGGISIIPFQ ASIWQFPQIVFNFTLTNSISEIL+ FAKF SQLK  L L
Sbjct: 181 QPSAFQILKFPGGISIIPFQRASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R YENVYLQITNKIGSTMQP+V+VQASI+SELGRIT+QRLQQLAAIINTS ERNLGLDYS
Sbjct: 241 RPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHA 360
           VFGEVK +SLSSY K  + AMPPSFSP      GDHV+LPS+PHPSRSA+SPA  SPP A
Sbjct: 301 VFGEVKGISLSSYPKGTSMAMPPSFSPXXXXXXGDHVELPSAPHPSRSARSPANCSPPRA 360

Query: 361 NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLI-VPPAEQPRVSSXXXXPVEFSPLLPP 420
           NCETSSP  SMVP     EHS+PP  YPKSTRLI VPPA+QPRVSS    PV        
Sbjct: 361 NCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVSSPRASPV-------- 420

Query: 421 DLLPKPKPSFHSKPGQTKED---------TSHPDHD 447
                    FH KPG+TKED         +SH DHD
Sbjct: 421 --------LFHYKPGKTKEDSHRVWQPTHSSHRDHD 435

BLAST of Cla97C05G092090 vs. NCBI nr
Match: XP_022984786.1 (uncharacterized protein LOC111482968 isoform X1 [Cucurbita maxima])

HSP 1 Score: 632.9 bits (1631), Expect = 8.4e-178
Identity = 347/459 (75.60%), Postives = 369/459 (80.39%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGKGE+QNLP Q RRE     DSSGFLC +CS AF RV  ELNFKC FVLILGF VF+PG
Sbjct: 1   MGKGEDQNLPQQHRRE-----DSSGFLCRECSIAFRRVSTELNFKCLFVLILGFAVFLPG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
           FFWLLPLHERN GFEAKD IKLSATVQVYFVLEKPV ELLPHIKRLEFDINGELDIPN+K
Sbjct: 61  FFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VS+LSMH +GESNRTYVVFGLLSEYIT PINPVSLSLLRSSLYD FL +SNLTLTTSIFG
Sbjct: 121 VSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEIL+ FAKF SQ K  L L
Sbjct: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILNKFAKFMSQFKLELCL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R YENVYLQITNKIGSTMQP+V+VQASI+SELGRIT+QRLQQLAAIINTS ERNLGLDYS
Sbjct: 241 RPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHA 360
           VFGEVK +SL SY K  + AMPPSFSP      GDHV+L S+P PSRSA+ PA  SPP A
Sbjct: 301 VFGEVKGISLPSYPKGTSMAMPPSFSPXXXXXXGDHVELLSAPQPSRSARPPANRSPPQA 360

Query: 361 NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLI-VPPAEQPRVSSXXXXPVEFSPLLPP 420
           NCETSSP  SMVP   P EHS+PP  YPKSTRLI VPPA+QPRVSS              
Sbjct: 361 NCETSSPALSMVPAPSPHEHSMPPIFYPKSTRLIVVPPADQPRVSS-------------- 420

Query: 421 DLLPKPKPS---FHSKPGQTKED---------TSHPDHD 447
                P+ S   F  KPG+TKED         +SHPDHD
Sbjct: 421 -----PRASLLLFRYKPGKTKEDSHRVRQPTHSSHPDHD 435

BLAST of Cla97C05G092090 vs. TrEMBL
Match: tr|A0A0A0L6J0|A0A0A0L6J0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G214050 PE=4 SV=1)

HSP 1 Score: 733.8 bits (1893), Expect = 2.3e-208
Identity = 386/446 (86.55%), Postives = 400/446 (89.69%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGKGEEQNLPLQQRREVA +GDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
           FFWLLPLHERNSGFEAKD IKLSATVQVYFVLEKPV ELLPHIKRLEFDINGELDIPN+K
Sbjct: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           QPST QILKFPGGISIIPFQHASIW+FPQIVFNFTLTNSISEILDNFAKFKSQLKFGL L
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHA 360
           VFGEVKSVSLSSY KR +KAMPPSFSPA     G+HV++PS PHP RS + PA HSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAXXXXXGNHVEVPSGPHPLRSMRPPANHSPPHA 360

Query: 361 NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLIVPPAEQPRVSSXXXXPVEFSPLLPPD 420
           NC++SSP PSMVP + P EHSIPP SYPKSTRLIVPPA QPRV S XXX           
Sbjct: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPXXXXXXXXXXXXXX 420

Query: 421 LLPKPKPSFHSKPGQTKEDTSHPDHD 447
              KPKPSF SK GQT ED SHP HD
Sbjct: 421 XXXKPKPSFRSKSGQTNEDPSHPVHD 446

BLAST of Cla97C05G092090 vs. TrEMBL
Match: tr|A0A1S3C2E0|A0A1S3C2E0_CUCME (uncharacterized protein LOC103496125 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496125 PE=4 SV=1)

HSP 1 Score: 726.1 bits (1873), Expect = 4.8e-206
Identity = 381/446 (85.43%), Postives = 397/446 (89.01%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGKGEEQNLPLQQRREVA SGDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALSGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
            FWLLPLHERNSGFEAK+ +KLSATVQVYFVLEKPV ELLPHIKRLEFDINGELDIP++K
Sbjct: 61  LFWLLPLHERNSGFEAKENVKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPDVK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMH IGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           QPST QILKFPGGISIIPFQHASIW+FPQIVFNFTLTNSISEILDNFAKFKS+LKFGL L
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSELKFGLRL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHA 360
           VFGEVKSVSLSSY KR +KAMPPSFSPA     GDHV++PS PH  RS + PA HSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAXXXXXGDHVEVPSDPHRLRSTRPPANHSPPHA 360

Query: 361 NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLIVPPAEQPRVSSXXXXPVEFSPLLPPD 420
           NC++ SP PSMVP H P EHSIPP SYPKSTRL+VPPA QPR   XXXX           
Sbjct: 361 NCKSLSPNPSMVPAHSPHEHSIPPISYPKSTRLVVPPANQPRXXXXXXXXXXXXXXXXXX 420

Query: 421 LLPKPKPSFHSKPGQTKEDTSHPDHD 447
              KPKPSFHSK GQT ED SHP HD
Sbjct: 421 XXXKPKPSFHSKSGQTNEDLSHPVHD 446

BLAST of Cla97C05G092090 vs. TrEMBL
Match: tr|A0A1S4E116|A0A1S4E116_CUCME (uncharacterized protein LOC103496125 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496125 PE=4 SV=1)

HSP 1 Score: 587.4 bits (1513), Expect = 2.7e-164
Identity = 321/446 (71.97%), Postives = 337/446 (75.56%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGKGEEQNLPLQQRREVA SGDSSGFLCGQCS AFHRVCKELNFKCFFVL+LGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALSGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
            FWLLPLHERNSGFEAK+ +KLSATVQVYFVLEKPV ELLPHIKRLEFDINGELDIP++K
Sbjct: 61  LFWLLPLHERNSGFEAKENVKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPDVK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
                                                                       
Sbjct: 121 ------------------------------------------------------------ 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
            PST QILKFPGGISIIPFQHASIW+FPQIVFNFTLTNSISEILDNFAKFKS+LKFGL L
Sbjct: 181 -PSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSELKFGLRL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R+YENVYLQITNKIGST+QPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPSSPHPSRSAQSPATHSPPHA 360
           VFGEVKSVSLSSY KR +KAMPPSFSPA     GDHV++PS PH  RS + PA HSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAXXXXXGDHVEVPSDPHRLRSTRPPANHSPPHA 360

Query: 361 NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLIVPPAEQPRVSSXXXXPVEFSPLLPPD 420
           NC++ SP PSMVP H P EHSIPP SYPKSTRL+VPPA QPR   XXXX           
Sbjct: 361 NCKSLSPNPSMVPAHSPHEHSIPPISYPKSTRLVVPPANQPRXXXXXXXXXXXXXXXXXX 385

Query: 421 LLPKPKPSFHSKPGQTKEDTSHPDHD 447
              KPKPSFHSK GQT ED SHP HD
Sbjct: 421 XXXKPKPSFHSKSGQTNEDLSHPVHD 385

BLAST of Cla97C05G092090 vs. TrEMBL
Match: tr|A0A2I4GDV1|A0A2I4GDV1_9ROSI (uncharacterized protein LOC109007039 isoform X2 OS=Juglans regia OX=51240 GN=LOC109007039 PE=4 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 2.7e-100
Identity = 232/452 (51.33%), Postives = 281/452 (62.17%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGK E QNL  QQ  E   + DSSG  C  CS A + V K  +FKCFFVLIL   V V G
Sbjct: 1   MGKSELQNLFRQQNPEADNNRDSSGLFCDGCSVALNGVVKAFSFKCFFVLILTLSVLVSG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
            FW+LP H    GFEAKD IKLSATVQ YF L++PV+ L+PHI RLE+DI GE+ +P +K
Sbjct: 61  VFWILPCHSTKFGFEAKDEIKLSATVQAYFRLDRPVSHLVPHIGRLEYDIYGEIGVPGMK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           V ILSM+  G SN T VVFG+LS+ I  PI PVSLS+LRSS+ D FL +SNLTLT+SIFG
Sbjct: 121 VVILSMNQSGASNWTDVVFGVLSDPINVPITPVSLSVLRSSVIDLFLQQSNLTLTSSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           + + F+ILKFPGG+++IP Q ASIWQ PQI+FNFTL NSIS+IL+NF + K QLK GL L
Sbjct: 181 KATMFEILKFPGGLTLIPVQSASIWQTPQILFNFTLNNSISDILENFIELKDQLKMGLYL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R++E +Y+QITNK+GST+ P V VQAS+ S+LG +  QRL+QLA II  SP +NLGL+ +
Sbjct: 241 RSHETLYIQITNKVGSTIAPPVTVQASVMSDLGSLLPQRLKQLAEIITGSPAKNLGLNNT 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPS------------FSPAPAPVPGDHVKLPSSPHPSRS 360
           VFG+VKS+SLSSY K      PPS                                P+  
Sbjct: 301 VFGKVKSISLSSYLKGTLNVTPPSPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPNIK 360

Query: 361 AQSPATHSPPHA---NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLIVPPAEQPRVSS 420
             SPA   PPH       T+SP+PS+  P  P                 VPPA       
Sbjct: 361 HHSPA--HPPHPCLYRGFTNSPSPSLASPDDP----------------AVPPAFXXXXXX 420

Query: 421 XXXXPVEFSPLLPPDLLPKPKPSFHSKPGQTK 438
           XXXX       LPP L PKPK S+   PGQ K
Sbjct: 421 XXXXXXXXXXKLPPVLSPKPKVSYAPSPGQDK 434

BLAST of Cla97C05G092090 vs. TrEMBL
Match: tr|A0A2I4GDX6|A0A2I4GDX6_9ROSI (uncharacterized protein LOC109007039 isoform X4 OS=Juglans regia OX=51240 GN=LOC109007039 PE=4 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 2.7e-100
Identity = 232/452 (51.33%), Postives = 281/452 (62.17%), Query Frame = 0

Query: 1   MGKGEEQNLPLQQRREVAPSGDSSGFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPG 60
           MGK E QNL  QQ  E   + DSSG  C  CS A + V K  +FKCFFVLIL   V V G
Sbjct: 1   MGKSELQNLFRQQNPEADNNRDSSGLFCDGCSVALNGVVKAFSFKCFFVLILTLSVLVSG 60

Query: 61  FFWLLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLK 120
            FW+LP H    GFEAKD IKLSATVQ YF L++PV+ L+PHI RLE+DI GE+ +P +K
Sbjct: 61  VFWILPCHSTKFGFEAKDEIKLSATVQAYFRLDRPVSHLVPHIGRLEYDIYGEIGVPGMK 120

Query: 121 VSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           V ILSM+  G SN T VVFG+LS+ I  PI PVSLS+LRSS+ D FL +SNLTLT+SIFG
Sbjct: 121 VVILSMNQSGASNWTDVVFGVLSDPINVPITPVSLSVLRSSVIDLFLQQSNLTLTSSIFG 180

Query: 181 QPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSL 240
           + + F+ILKFPGG+++IP Q ASIWQ PQI+FNFTL NSIS+IL+NF + K QLK GL L
Sbjct: 181 KATMFEILKFPGGLTLIPVQSASIWQTPQILFNFTLNNSISDILENFIELKDQLKMGLYL 240

Query: 241 RTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R++E +Y+QITNK+GST+ P V VQAS+ S+LG +  QRL+QLA II  SP +NLGL+ +
Sbjct: 241 RSHETLYIQITNKVGSTIAPPVTVQASVMSDLGSLLPQRLKQLAEIITGSPAKNLGLNNT 300

Query: 301 VFGEVKSVSLSSYSKRPAKAMPPS------------FSPAPAPVPGDHVKLPSSPHPSRS 360
           VFG+VKS+SLSSY K      PPS                                P+  
Sbjct: 301 VFGKVKSISLSSYLKGTLNVTPPSPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPNIK 360

Query: 361 AQSPATHSPPHA---NCETSSPTPSMVPPHXPREHSIPPTSYPKSTRLIVPPAEQPRVSS 420
             SPA   PPH       T+SP+PS+  P  P                 VPPA       
Sbjct: 361 HHSPA--HPPHPCLYRGFTNSPSPSLASPDDP----------------AVPPAFXXXXXX 420

Query: 421 XXXXPVEFSPLLPPDLLPKPKPSFHSKPGQTK 438
           XXXX       LPP L PKPK S+   PGQ K
Sbjct: 421 XXXXXXXXXXKLPPVLSPKPKVSYAPSPGQDK 434

BLAST of Cla97C05G092090 vs. TAIR10
Match: AT1G10790.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2))

HSP 1 Score: 198.7 bits (504), Expect = 7.3e-51
Identity = 127/329 (38.60%), Postives = 184/329 (55.93%), Query Frame = 0

Query: 5   EEQNLPLQQRREVAPSGDSSGFLCGQ-CSTAFHRVCKELNFKCFFVLILGFVVFVPGFFW 64
           +E  L LQQ      + +SS    G+ CS+AF R+   +  +C  VL+L   + +   FW
Sbjct: 6   KENALALQQETLDLENPESSPRSSGRSCSSAFSRL---VGLRCLIVLVLSCAILLSAIFW 65

Query: 65  LLPLHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIP-NLKVS 124
           L P     S F+A  T+KL+A+VQ  F L+KPV+E++ H  ++E DI   + +  N KV+
Sbjct: 66  LFP-RRSVSEFKADGTVKLNASVQASFRLQKPVSEVVRHKGKIEHDILRSIGLSNNSKVT 125

Query: 125 ILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQP 184
           +LS++  G SN T V F +L       I+  SLSLLRSS    F   S L LTTS FG+P
Sbjct: 126 VLSLNQSGASNYTDVEFAVLPVPPDHEISKHSLSLLRSSFVKLFAKRSKLKLTTSGFGKP 185

Query: 185 STFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRT 244
           ++FQ+LKFPGGI++ P + A +     ++F+ T+  SIS + D         +  LSL  
Sbjct: 186 TSFQVLKFPGGITVDPLEPAPVSGVALVLFSVTIKTSISTVQDRLDLLNGLFEHMLSLEP 245

Query: 245 YENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYSVF 304
           YE+V+ Q+TNK GST+ P +  Q  +   + +   QRL     II TS  +NLGLD +VF
Sbjct: 246 YESVHFQLTNKQGSTISPPLTFQVYVAFTMRKYLHQRLNHFTQIIQTSRAKNLGLDEAVF 305

Query: 305 GEVKSVSLSSYSKRPAKAMPPSFSPAPAP 332
           GEVK ++ S+Y            +PAP P
Sbjct: 306 GEVKDITFSTYLDGKVPDSDLELAPAPTP 330

BLAST of Cla97C05G092090 vs. TAIR10
Match: AT3G10810.1 (zinc finger (C3HC4-type RING finger) family protein)

HSP 1 Score: 166.8 bits (421), Expect = 3.1e-41
Identity = 112/297 (37.71%), Postives = 165/297 (55.56%), Query Frame = 0

Query: 20  SGDSS--GFLCGQCSTAFHRVCKELNFKCFFVLILGFVVFVPGFFWLLPLHERNSGFEAK 79
           +GDS+     CG C      +   + FKC FVL+L   +F+   F LLP           
Sbjct: 18  TGDSTVRNARCGCCKW----ISSFVGFKCLFVLLLSVALFLSALFLLLPFPMDREDSNLD 77

Query: 80  DTIKLSATVQVYFVLEKPVNELLPHIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYV 139
              +  A V   F + +  + L  +  +L+ DI  E+   ++KV+IL++    E N T V
Sbjct: 78  PRFRGHAIV-ASFSINRSASFLNENTLQLQNDIFQEMSYISIKVTILAVEPSDELNITKV 137

Query: 140 VFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISII 199
           VFG+  +     I P+SLS ++       +++S L LT S+FG+   F++LKFPGGI++I
Sbjct: 138 VFGIDPDTGYREILPLSLSSIKEMFESVLINQSTLQLTKSLFGETFLFEVLKFPGGITVI 197

Query: 200 PFQHASIWQFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGST 259
           P Q A   Q  +IVFNFTL  SI +I  NF    SQLK GL+L  YEN+Y+ ++N  GST
Sbjct: 198 PPQSAFPLQKFKIVFNFTLNYSIHQIQINFNTLASQLKNGLNLAPYENLYVSLSNSEGST 257

Query: 260 MQPLVIVQASITSELGRI-TSQRLQQLAAIINTSPERNLGLDYSVFGEVKSVSLSSY 314
           + P   V +S+   +G   +S RL+QL   I  S  +NLGL+ ++FG+VK V LSS+
Sbjct: 258 VSPPTTVHSSVLLRVGTSNSSPRLKQLTDTITGSRSKNLGLNNTIFGKVKQVRLSSF 309

BLAST of Cla97C05G092090 vs. TAIR10
Match: AT3G56590.2 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 154.8 bits (390), Expect = 1.2e-37
Identity = 116/348 (33.33%), Postives = 176/348 (50.57%), Query Frame = 0

Query: 43  NFKCFFVLILGFVVFVPGFFWLLP-LHERNSGFEAKDTIKLSATVQVYFVLEKPVNELLP 102
           + +C  +L     VF+   FWL P L   + G    D       +   F + KP++ +  
Sbjct: 42  SLRCVLILAFSAAVFLSALFWLPPFLGFADPGDLDLDPRFKDHRIVASFDVGKPISFMED 101

Query: 103 HIKRLEFDINGELDIPNLKVSILSMHGIGESNRTYVVFGLLSEYITAPINPVSLSLLRSS 162
           ++ +LE DI  E+  P  KV +L++  +G+ NRT V+F +  E   + I     SL++++
Sbjct: 102 NLMQLENDITDEISFPMTKVVVLALERLGDLNRTMVIFAIDPEKENSKIPAEIESLIKAA 161

Query: 163 LYDFFLSESNLTLTTSIFGQPSTFQILKFPGGISIIPFQHASIWQFPQIVFNFTLTNSIS 222
                  + +  LT S+FG+P  F++LKFPGGI++IP Q     Q  Q++FNFTL  SI 
Sbjct: 162 FETLVQKQLSFRLTESLFGEPFFFEVLKFPGGITVIPPQPIFPLQKAQLLFNFTLNFSIY 221

Query: 223 EILDNFAKFKSQLKFGLSLRTYENVYLQITNKIGSTMQPLVIVQASITSELGRITSQRLQ 282
           +I  NF +  SQLK G++L +YEN+Y+ ++N  GST+ P  IV +S+    G  +S RL+
Sbjct: 222 QIQSNFEELASQLKKGINLASYENLYITLSNSRGSTVAPPTIVHSSVLLTFG--SSSRLK 281

Query: 283 QLAAIINTSPERNLGLDYSVFGEVKSVSLSSYSKRPAKAMPPSFSPAPAPVPGDHVKLPS 342
           QLA  I +S  +NLGL+++VFG+VK V LSS        +P   SPA             
Sbjct: 282 QLAQTITSSHSKNLGLNHTVFGKVKQVRLSS-------ILP--HSPATXXXXXXXXXXXX 341

Query: 343 SPHPSRSAQSPATHSPPHANCETSSPTPSMVPPHXPREHS-IPPTSYP 389
                                  S PT    P   P +HS +PP + P
Sbjct: 342 XXXXXXXXXXXXXXXXXXXXXSLSPPTKGFAPASAPTKHSPLPPRNPP 378

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004149972.23.5e-20886.55PREDICTED: uncharacterized protein LOC101222031 isoform X2 [Cucumis sativus] >KG... [more]
XP_011651267.11.3e-20786.52PREDICTED: uncharacterized protein LOC101222031 isoform X1 [Cucumis sativus][more]
XP_008456084.17.3e-20685.43PREDICTED: uncharacterized protein LOC103496125 isoform X1 [Cucumis melo][more]
XP_022922926.12.9e-17875.88uncharacterized protein LOC111430758 isoform X1 [Cucurbita moschata][more]
XP_022984786.18.4e-17875.60uncharacterized protein LOC111482968 isoform X1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A0A0L6J0|A0A0A0L6J0_CUCSA2.3e-20886.55Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G214050 PE=4 SV=1[more]
tr|A0A1S3C2E0|A0A1S3C2E0_CUCME4.8e-20685.43uncharacterized protein LOC103496125 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|A0A1S4E116|A0A1S4E116_CUCME2.7e-16471.97uncharacterized protein LOC103496125 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|A0A2I4GDV1|A0A2I4GDV1_9ROSI2.7e-10051.33uncharacterized protein LOC109007039 isoform X2 OS=Juglans regia OX=51240 GN=LOC... [more]
tr|A0A2I4GDX6|A0A2I4GDX6_9ROSI2.7e-10051.33uncharacterized protein LOC109007039 isoform X4 OS=Juglans regia OX=51240 GN=LOC... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT1G10790.17.3e-5138.60BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... [more]
AT3G10810.13.1e-4137.71zinc finger (C3HC4-type RING finger) family protein[more]
AT3G56590.21.2e-3733.33hydroxyproline-rich glycoprotein family protein[more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C05G092090.1Cla97C05G092090.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 316..446
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 412..426
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 343..369
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 322..338
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 431..446
NoneNo IPR availablePANTHERPTHR33826FAMILY NOT NAMEDcoord: 1..442
NoneNo IPR availablePANTHERPTHR33826:SF4F20B24.21coord: 1..442