Clc03G05570 (gene) Watermelon (cordophanus) v2

Overview
NameClc03G05570
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionBEST Arabidopsis thaliana protein match is: glycine-rich protein .
LocationClcChr03: 5261057 .. 5262368 (-)
RNA-Seq ExpressionClc03G05570
SyntenyClc03G05570
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTGGTTTAGCTTGACTTCTTTAATCAAGTTGTTAATCTTAATTAACTTTCGTAATCATAAAGATAAAACCATGACACCCCAAAAGAACAAAAGGACAGTCCTATAACACCCCCACTCACCACTATTGCGGCAAAAAAGTTGCGTGGCATTCATTTTCGCTACCAAAATCCCAATTAGATAAGCGCTATGTGGTTACAAGAACTTGGGTTGCATTTGATAATGGCGTTGAAACAAATTTCCCAAATGAAGATACCCTAATTTTCTTCATCAGCTTAACATCAGATCTACAATGGCCTTCTACAATTCCTACTACGATTCTGCTCAAACAGAACCCCCAATTTCGCAATTCAGTAACGAACCCACCTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGAACAGTCTTATGATTCCTGCACATCCAATTTCTATGAATTTCCCCAGTTGATCGAACATGAATCCATTGACCATGGCGGTTATGGTTATCCAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGAGTTTCACTTTGCCAAAAGTAATCGAATACGACCCTGATTTGTACAGCGAGGTGCCAACTCAATTTGTGATCTCTTACTCTGTTTCCGAATTCAACGAGACAGAATTTGAAGAGTACGACCCAACCCCTTACGGTGGTGGTTATGACATTTCTGAAACCTACGGTAAGCCCCTTCCACCTTCAACTGAAATTTGTTACCCACCGTCCTCTTCTTCACAGCCGAGTACTGCCACCGCCATTCCCATCTCCACAATACCCAAGGTAGAGGAAGCACCAAAAGGAAAAATCGAAGAACAAACAAAGCCATCGAGTGAAATCAAGCCGACCCAGATCGAAAAAGTTAACGACAGCTCTTCGAGTGAGAGCGACATGGATTCTGAATCTGAAGAAATTGAGGAAATTAAAGCGATACAATTGGCAGATCCGGGAATTGGGTATGGAAATGGAAGGGAAGTGAATCAATTTCCAAGTGGGTACGGACTGGAAGCGATGGATCTTTGTGAAAGCTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAACAAACAGCTTGTAGGCAACCCAAGAACGGTTGTGGGCGTTGCCATGGCCATTGCTATTGCTATGGGAATTACGGCAACCAGTGGCAGACGGCGGCGGATTATCTATTTGGAAGCCATAATCCATATCCAGATGGAAATGCTATTTATGGCTATCAAAGACAGTTCCAAGGGGAGGCTGCTCATGGGTATGTTTGGTTGAATCAAAATGACTTCAATCGGTGTGAAGATGTTTGA

mRNA sequence

ATGGTGCTTAACATCAGATCTACAATGGCCTTCTACAATTCCTACTACGATTCTGCTCAAACAGAACCCCCAATTTCGCAATTCAGTAACGAACCCACCTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGAACAGTCTTATGATTCCTGCACATCCAATTTCTATGAATTTCCCCAGTTGATCGAACATGAATCCATTGACCATGGCGGTTATGGTTATCCAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGAGTTTCACTTTGCCAAAAGTAATCGAATACGACCCTGATTTGTACAGCGAGGTGCCAACTCAATTTGTGATCTCTTACTCTGTTTCCGAATTCAACGAGACAGAATTTGAAGAGTACGACCCAACCCCTTACGGTGGTGGTTATGACATTTCTGAAACCTACGGTAAGCCCCTTCCACCTTCAACTGAAATTTGTTACCCACCGTCCTCTTCTTCACAGCCGAGTACTGCCACCGCCATTCCCATCTCCACAATACCCAAGGTAGAGGAAGCACCAAAAGGAAAAATCGAAGAACAAACAAAGCCATCGAGTGAAATCAAGCCGACCCAGATCGAAAAAGTTAACGACAGCTCTTCGAGTGAGAGCGACATGGATTCTGAATCTGAAGAAATTGAGGAAATTAAAGCGATACAATTGGCAGATCCGGGAATTGGGTATGGAAATGGAAGGGAAGTGAATCAATTTCCAAGTGGGTACGGACTGGAAGCGATGGATCTTTGTGAAAGCTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAACAAACAGCTTGTAGGCAACCCAAGAACGGTTGTGGGCGTTGCCATGGCCATTGCTATTGCTATGGGAATTACGGCAACCAGTGGCAGACGGCGGCGGATTATCTATTTGGAAGCCATAATCCATATCCAGATGGAAATGCTATTTATGGCTATCAAAGACAGTTCCAAGGGGAGGCTGCTCATGGGTATGTTTGGTTGAATCAAAATGACTTCAATCGGTGTGAAGATGTTTGA

Coding sequence (CDS)

ATGGTGCTTAACATCAGATCTACAATGGCCTTCTACAATTCCTACTACGATTCTGCTCAAACAGAACCCCCAATTTCGCAATTCAGTAACGAACCCACCTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGAACAGTCTTATGATTCCTGCACATCCAATTTCTATGAATTTCCCCAGTTGATCGAACATGAATCCATTGACCATGGCGGTTATGGTTATCCAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGAGTTTCACTTTGCCAAAAGTAATCGAATACGACCCTGATTTGTACAGCGAGGTGCCAACTCAATTTGTGATCTCTTACTCTGTTTCCGAATTCAACGAGACAGAATTTGAAGAGTACGACCCAACCCCTTACGGTGGTGGTTATGACATTTCTGAAACCTACGGTAAGCCCCTTCCACCTTCAACTGAAATTTGTTACCCACCGTCCTCTTCTTCACAGCCGAGTACTGCCACCGCCATTCCCATCTCCACAATACCCAAGGTAGAGGAAGCACCAAAAGGAAAAATCGAAGAACAAACAAAGCCATCGAGTGAAATCAAGCCGACCCAGATCGAAAAAGTTAACGACAGCTCTTCGAGTGAGAGCGACATGGATTCTGAATCTGAAGAAATTGAGGAAATTAAAGCGATACAATTGGCAGATCCGGGAATTGGGTATGGAAATGGAAGGGAAGTGAATCAATTTCCAAGTGGGTACGGACTGGAAGCGATGGATCTTTGTGAAAGCTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAACAAACAGCTTGTAGGCAACCCAAGAACGGTTGTGGGCGTTGCCATGGCCATTGCTATTGCTATGGGAATTACGGCAACCAGTGGCAGACGGCGGCGGATTATCTATTTGGAAGCCATAATCCATATCCAGATGGAAATGCTATTTATGGCTATCAAAGACAGTTCCAAGGGGAGGCTGCTCATGGGTATGTTTGGTTGAATCAAAATGACTTCAATCGGTGTGAAGATGTTTGA

Protein sequence

MVLNIRSTMAFYNSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDSCTSNFYEFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSEVPTQFVISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSQPSTATAIPISTIPKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEIKAIQLADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGNAIYGYQRQFQGEAAHGYVWLNQNDFNRCEDV
Homology
BLAST of Clc03G05570 vs. NCBI nr
Match: XP_038895690.1 (uncharacterized protein LOC120083862 [Benincasa hispida])

HSP 1 Score: 597.8 bits (1540), Expect = 5.9e-167
Identity = 295/351 (84.05%), Postives = 308/351 (87.75%), Query Frame = 0

Query: 10  AFYNSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS---------CTSNFYEF 69
           ++ +SYY SAQ EPPISQ SNEPTFYNLFDYPPPCY EQ YDS           SNFYEF
Sbjct: 9   SYNDSYYHSAQIEPPISQSSNEPTFYNLFDYPPPCYLEQVYDSEVGYFANAPYRSNFYEF 68

Query: 70  PQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSEVPTQFVISYSVSEFN 129
           PQLIE E+++HG YGY ISYSANACSA SFT+PKVIEYDPD YSEV TQFVISYSVSEFN
Sbjct: 69  PQLIERETVNHGAYGYAISYSANACSAPSFTVPKVIEYDPDFYSEVSTQFVISYSVSEFN 128

Query: 130 ETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSQPSTATAIPISTIPKVEEAP 189
           ETEFEEYDPTPYGGGYDISETYGKPL PSTEICYPPSSSS P TATAIPI TIPK EE P
Sbjct: 129 ETEFEEYDPTPYGGGYDISETYGKPLQPSTEICYPPSSSS-PPTATAIPIFTIPKEEEPP 188

Query: 190 KGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEIKAIQLADPGIGYGNGREV 249
           KGKIEEQTKPSSEIKPTQIEKVN SSSSESD  SESEEIEE+KAIQLADPGI YGNGRE 
Sbjct: 189 KGKIEEQTKPSSEIKPTQIEKVNHSSSSESDTASESEEIEEVKAIQLADPGIEYGNGREA 248

Query: 250 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQT 309
           NQFPSGYGLEAMDLCESLFGYWPCLSR+KKQT CRQPKNGCGRCHGHCYCYGNYGNQWQT
Sbjct: 249 NQFPSGYGLEAMDLCESLFGYWPCLSRVKKQTPCRQPKNGCGRCHGHCYCYGNYGNQWQT 308

Query: 310 AADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNRCEDV 348
           AA+YLFGSHNPYPD    G+A+YGYQRQ QGE  +GYVWLNQNDFN CEDV
Sbjct: 309 AAEYLFGSHNPYPDGRGEGDAVYGYQRQIQGEPVYGYVWLNQNDFNGCEDV 358

BLAST of Clc03G05570 vs. NCBI nr
Match: XP_008441695.1 (PREDICTED: uncharacterized protein LOC103485767 [Cucumis melo])

HSP 1 Score: 589.0 bits (1517), Expect = 2.7e-164
Identity = 295/370 (79.73%), Postives = 311/370 (84.05%), Query Frame = 0

Query: 3   LNIRSTMAFY-------NSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS--- 62
           LNIRS MAFY       +SYY+SAQ EPPI Q SNEPTFYNLFDYPPPCYF Q+YDS   
Sbjct: 11  LNIRSPMAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVG 70

Query: 63  -------CTSNFYEFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSE 122
                    SNF EFPQLIEHE +DHG YGY I YSANACSASSFTLPKV  YDPDLYSE
Sbjct: 71  YFAINAAYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE 130

Query: 123 VPTQFVISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS----Q 182
           V TQFVISYSVSEFNET+FEEYDPTPY GGYDI ETYGKPL PSTEICYPPSSSS     
Sbjct: 131 VSTQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPP 190

Query: 183 PSTATAIPISTIPKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEE 242
           P TATAIPI+TIPK++EAPKGKIEEQTKPSSEIKP QIEK N+SSSS+SD  SES EIEE
Sbjct: 191 PPTATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSSSSDSDTTSESGEIEE 250

Query: 243 IKAIQLADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGC 302
           +KAIQL DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQPKNGC
Sbjct: 251 VKAIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGC 310

Query: 303 GRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLN 348
           GRCHGHCYCYGNYGNQWQTAA+YLFGSHNPY D    G+  YGYQRQFQ E  +GYVWLN
Sbjct: 311 GRCHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRQFQEEPVYGYVWLN 370

BLAST of Clc03G05570 vs. NCBI nr
Match: KAA0056916.1 (uncharacterized protein E6C27_scaffold96G00880 [Cucumis melo var. makuwa] >TYK26343.1 uncharacterized protein E5676_scaffold861G00010 [Cucumis melo var. makuwa])

HSP 1 Score: 576.2 bits (1484), Expect = 1.8e-160
Identity = 288/364 (79.12%), Postives = 305/364 (83.79%), Query Frame = 0

Query: 9   MAFY-------NSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS--------- 68
           MAFY       +SYY+SAQ EPPI Q SNEPTFYNLFDYPPPCYF Q+YDS         
Sbjct: 1   MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 60

Query: 69  -CTSNFYEFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSEVPTQFV 128
              SNF EFPQLIEHE +DHG YGY I YSANACSASSFTLPKV  YDPDLYSEV TQFV
Sbjct: 61  AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSEVSTQFV 120

Query: 129 ISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS----QPSTATA 188
           ISYSVSEFNET+FEEYDPTPY GGYDI ETYGKPL PSTEICYPPSSSS     P TATA
Sbjct: 121 ISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPPTATA 180

Query: 189 IPISTIPKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEIKAIQL 248
           IPI+TIPK++EAPKGKIEEQTKPSSEIKP QIEK N+S SS+SD  SES EIEE+KAIQL
Sbjct: 181 IPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSYSSDSDTTSESGEIEEVKAIQL 240

Query: 249 ADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGH 308
            DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQPKNGCGRCHGH
Sbjct: 241 GDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGRCHGH 300

Query: 309 CYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNR 348
           CYCYGNYGNQWQTAA+YLFGSHNPY D    G+  YGYQR+FQ E  +GYVWLNQNDFNR
Sbjct: 301 CYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRRFQEEPVYGYVWLNQNDFNR 360

BLAST of Clc03G05570 vs. NCBI nr
Match: XP_011652905.1 (uncharacterized protein At5g39570 [Cucumis sativus] >KGN64592.1 hypothetical protein Csa_013087 [Cucumis sativus])

HSP 1 Score: 555.8 bits (1431), Expect = 2.6e-154
Identity = 282/369 (76.42%), Postives = 301/369 (81.57%), Query Frame = 0

Query: 9   MAFYN-------SYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYD---------- 68
           MAFYN       SYY+ AQ EPPI Q SNEP FYNLFDYPPPCYF Q+YD          
Sbjct: 1   MAFYNSYDFYDDSYYNYAQIEPPIPQSSNEPNFYNLFDYPPPCYFGQAYDYEVGYSANDA 60

Query: 69  SCTSNFYEFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSEVPTQFV 128
              SNF E PQLI+HE +DHG YGY I YSANACSASSFTLPK+ EY+PDLYSEV TQFV
Sbjct: 61  PYRSNFNELPQLIDHEPVDHGDYGYAIRYSANACSASSFTLPKLCEYNPDLYSEVSTQFV 120

Query: 129 ISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS--------QPS 188
           ISYSVS+FNETEFEEYDPTPY GGYDISETYGKPL PS EICYPPSSSS         P 
Sbjct: 121 ISYSVSQFNETEFEEYDPTPYDGGYDISETYGKPLQPSIEICYPPSSSSPSKSPPPPPPP 180

Query: 189 TATAIP-ISTIPKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEI 248
           TATAIP I+TIPK++EAPKGKIEEQTKPSSEIKPTQIEK N+SSSS+SD  SES EIEE 
Sbjct: 181 TATAIPIITTIPKIDEAPKGKIEEQTKPSSEIKPTQIEKTNNSSSSDSDTTSESGEIEED 240

Query: 249 KAIQLADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCG 308
           KAIQL DPGIGYGN REVN+FPSG GLEAMDLCESLFGYWPCLSR K+QTA RQPKNGCG
Sbjct: 241 KAIQLGDPGIGYGNAREVNEFPSGCGLEAMDLCESLFGYWPCLSRAKRQTAYRQPKNGCG 300

Query: 309 RCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQ 348
           RCHGHCYCYGNYGN+WQTAA+YLFGSHNPY D    G+ +YGYQRQFQ E  +GYVWLNQ
Sbjct: 301 RCHGHCYCYGNYGNEWQTAAEYLFGSHNPYLDGRREGDVVYGYQRQFQEEPVYGYVWLNQ 360

BLAST of Clc03G05570 vs. NCBI nr
Match: XP_023001286.1 (uncharacterized protein LOC111495462 [Cucurbita maxima])

HSP 1 Score: 543.1 bits (1398), Expect = 1.7e-150
Identity = 274/351 (78.06%), Postives = 293/351 (83.48%), Query Frame = 0

Query: 9   MAF---YNSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDSCTSNFYEFPQLIE 68
           MAF   Y+SYYDSAQTEPPI Q S EPTFYNLFDYPPPCYF Q+Y   TSNF EFPQLIE
Sbjct: 1   MAFYDSYDSYYDSAQTEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60

Query: 69  HESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSE----VPTQFVISYSVSEFNE 128
           H+ +DHG YGY ISYSANACSAS+F++PKVIEYD DLYS+    V +QFVISYSVSEFNE
Sbjct: 61  HQPVDHGAYGYTISYSANACSASTFSVPKVIEYDSDLYSDGTQKVSSQFVISYSVSEFNE 120

Query: 129 TEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSQPS-TATAIPISTIPKVEEAP 188
           TEFEEYDPTPYGGGYDI ETYGKPL PST+ICY PSSSS P    TAIPIS I    EAP
Sbjct: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIPISAI---HEAP 180

Query: 189 KGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEIKAIQLADPGIGYGNGREV 248
           K KIEE+T+PSSEIKPTQIEK N +        SESEEIEE+KAI  ADPGIGYGNGREV
Sbjct: 181 KEKIEEKTEPSSEIKPTQIEKDNTA--------SESEEIEEVKAIPFADPGIGYGNGREV 240

Query: 249 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQT 308
           NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQP NGCGRCHGHCYCYGNYGNQWQT
Sbjct: 241 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQT 300

Query: 309 AADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNRCEDV 348
           AADYLFGSHNPYPD    G+ +YGYQRQ+Q E  + YVWLNQNDF R +DV
Sbjct: 301 AADYLFGSHNPYPDGRSEGDGVYGYQRQYQAEPVYRYVWLNQNDFVRSDDV 340

BLAST of Clc03G05570 vs. ExPASy Swiss-Prot
Match: Q9FKA5 (Uncharacterized protein At5g39570 OS=Arabidopsis thaliana OX=3702 GN=At5g39570 PE=1 SV=1)

HSP 1 Score: 49.7 bits (117), Expect = 7.9e-05
Identity = 50/159 (31.45%), Postives = 69/159 (43.40%), Query Frame = 0

Query: 114 YSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSQPSTATAIPISTI 173
           Y+  + +  +F+E+DPTPY GGYDI+  YG+P+PPS E CYP SS          P  T 
Sbjct: 4   YTRDDNDVDDFDEFDPTPYSGGYDITVIYGRPIPPSDETCYPLSSGVDDDFEYERPEFT- 63

Query: 174 PKVEEAPKGKIEE----------QTKPSSEIKP-------TQIEKVNDSSSSESDMDSES 233
            ++ E P    +E          + KP    +P        Q E+ N    SES    + 
Sbjct: 64  -QIHE-PSAYGDEALNTEYSSYSRPKPRPAFRPDSGGGGHVQGERPNPGYGSESGYGRKP 123

Query: 234 EEIEEIKAIQLADPGIGYGNGREV-------NQFPSGYG 249
           E          ++ G GYG   EV         + SGYG
Sbjct: 124 E----------SEYGSGYGGQTEVEYGRRPEQSYGSGYG 149

BLAST of Clc03G05570 vs. ExPASy TrEMBL
Match: A0A1S3B404 (uncharacterized protein LOC103485767 OS=Cucumis melo OX=3656 GN=LOC103485767 PE=4 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 1.3e-164
Identity = 295/370 (79.73%), Postives = 311/370 (84.05%), Query Frame = 0

Query: 3   LNIRSTMAFY-------NSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS--- 62
           LNIRS MAFY       +SYY+SAQ EPPI Q SNEPTFYNLFDYPPPCYF Q+YDS   
Sbjct: 11  LNIRSPMAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVG 70

Query: 63  -------CTSNFYEFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSE 122
                    SNF EFPQLIEHE +DHG YGY I YSANACSASSFTLPKV  YDPDLYSE
Sbjct: 71  YFAINAAYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE 130

Query: 123 VPTQFVISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS----Q 182
           V TQFVISYSVSEFNET+FEEYDPTPY GGYDI ETYGKPL PSTEICYPPSSSS     
Sbjct: 131 VSTQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPP 190

Query: 183 PSTATAIPISTIPKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEE 242
           P TATAIPI+TIPK++EAPKGKIEEQTKPSSEIKP QIEK N+SSSS+SD  SES EIEE
Sbjct: 191 PPTATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSSSSDSDTTSESGEIEE 250

Query: 243 IKAIQLADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGC 302
           +KAIQL DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQPKNGC
Sbjct: 251 VKAIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGC 310

Query: 303 GRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLN 348
           GRCHGHCYCYGNYGNQWQTAA+YLFGSHNPY D    G+  YGYQRQFQ E  +GYVWLN
Sbjct: 311 GRCHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRQFQEEPVYGYVWLN 370

BLAST of Clc03G05570 vs. ExPASy TrEMBL
Match: A0A5D3DRV2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G00010 PE=4 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 8.9e-161
Identity = 288/364 (79.12%), Postives = 305/364 (83.79%), Query Frame = 0

Query: 9   MAFY-------NSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS--------- 68
           MAFY       +SYY+SAQ EPPI Q SNEPTFYNLFDYPPPCYF Q+YDS         
Sbjct: 1   MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 60

Query: 69  -CTSNFYEFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSEVPTQFV 128
              SNF EFPQLIEHE +DHG YGY I YSANACSASSFTLPKV  YDPDLYSEV TQFV
Sbjct: 61  AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSEVSTQFV 120

Query: 129 ISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS----QPSTATA 188
           ISYSVSEFNET+FEEYDPTPY GGYDI ETYGKPL PSTEICYPPSSSS     P TATA
Sbjct: 121 ISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPPTATA 180

Query: 189 IPISTIPKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEIKAIQL 248
           IPI+TIPK++EAPKGKIEEQTKPSSEIKP QIEK N+S SS+SD  SES EIEE+KAIQL
Sbjct: 181 IPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSYSSDSDTTSESGEIEEVKAIQL 240

Query: 249 ADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGH 308
            DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQPKNGCGRCHGH
Sbjct: 241 GDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGRCHGH 300

Query: 309 CYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNR 348
           CYCYGNYGNQWQTAA+YLFGSHNPY D    G+  YGYQR+FQ E  +GYVWLNQNDFNR
Sbjct: 301 CYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRRFQEEPVYGYVWLNQNDFNR 360

BLAST of Clc03G05570 vs. ExPASy TrEMBL
Match: A0A0A0LUY1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G070580 PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 1.2e-154
Identity = 282/369 (76.42%), Postives = 301/369 (81.57%), Query Frame = 0

Query: 9   MAFYN-------SYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYD---------- 68
           MAFYN       SYY+ AQ EPPI Q SNEP FYNLFDYPPPCYF Q+YD          
Sbjct: 1   MAFYNSYDFYDDSYYNYAQIEPPIPQSSNEPNFYNLFDYPPPCYFGQAYDYEVGYSANDA 60

Query: 69  SCTSNFYEFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSEVPTQFV 128
              SNF E PQLI+HE +DHG YGY I YSANACSASSFTLPK+ EY+PDLYSEV TQFV
Sbjct: 61  PYRSNFNELPQLIDHEPVDHGDYGYAIRYSANACSASSFTLPKLCEYNPDLYSEVSTQFV 120

Query: 129 ISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS--------QPS 188
           ISYSVS+FNETEFEEYDPTPY GGYDISETYGKPL PS EICYPPSSSS         P 
Sbjct: 121 ISYSVSQFNETEFEEYDPTPYDGGYDISETYGKPLQPSIEICYPPSSSSPSKSPPPPPPP 180

Query: 189 TATAIP-ISTIPKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEI 248
           TATAIP I+TIPK++EAPKGKIEEQTKPSSEIKPTQIEK N+SSSS+SD  SES EIEE 
Sbjct: 181 TATAIPIITTIPKIDEAPKGKIEEQTKPSSEIKPTQIEKTNNSSSSDSDTTSESGEIEED 240

Query: 249 KAIQLADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCG 308
           KAIQL DPGIGYGN REVN+FPSG GLEAMDLCESLFGYWPCLSR K+QTA RQPKNGCG
Sbjct: 241 KAIQLGDPGIGYGNAREVNEFPSGCGLEAMDLCESLFGYWPCLSRAKRQTAYRQPKNGCG 300

Query: 309 RCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQ 348
           RCHGHCYCYGNYGN+WQTAA+YLFGSHNPY D    G+ +YGYQRQFQ E  +GYVWLNQ
Sbjct: 301 RCHGHCYCYGNYGNEWQTAAEYLFGSHNPYLDGRREGDVVYGYQRQFQEEPVYGYVWLNQ 360

BLAST of Clc03G05570 vs. ExPASy TrEMBL
Match: A0A6J1KI70 (uncharacterized protein LOC111495462 OS=Cucurbita maxima OX=3661 GN=LOC111495462 PE=4 SV=1)

HSP 1 Score: 543.1 bits (1398), Expect = 8.4e-151
Identity = 274/351 (78.06%), Postives = 293/351 (83.48%), Query Frame = 0

Query: 9   MAF---YNSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDSCTSNFYEFPQLIE 68
           MAF   Y+SYYDSAQTEPPI Q S EPTFYNLFDYPPPCYF Q+Y   TSNF EFPQLIE
Sbjct: 1   MAFYDSYDSYYDSAQTEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60

Query: 69  HESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSE----VPTQFVISYSVSEFNE 128
           H+ +DHG YGY ISYSANACSAS+F++PKVIEYD DLYS+    V +QFVISYSVSEFNE
Sbjct: 61  HQPVDHGAYGYTISYSANACSASTFSVPKVIEYDSDLYSDGTQKVSSQFVISYSVSEFNE 120

Query: 129 TEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSQPS-TATAIPISTIPKVEEAP 188
           TEFEEYDPTPYGGGYDI ETYGKPL PST+ICY PSSSS P    TAIPIS I    EAP
Sbjct: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIPISAI---HEAP 180

Query: 189 KGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEIKAIQLADPGIGYGNGREV 248
           K KIEE+T+PSSEIKPTQIEK N +        SESEEIEE+KAI  ADPGIGYGNGREV
Sbjct: 181 KEKIEEKTEPSSEIKPTQIEKDNTA--------SESEEIEEVKAIPFADPGIGYGNGREV 240

Query: 249 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQT 308
           NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQP NGCGRCHGHCYCYGNYGNQWQT
Sbjct: 241 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQT 300

Query: 309 AADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNRCEDV 348
           AADYLFGSHNPYPD    G+ +YGYQRQ+Q E  + YVWLNQNDF R +DV
Sbjct: 301 AADYLFGSHNPYPDGRSEGDGVYGYQRQYQAEPVYRYVWLNQNDFVRSDDV 340

BLAST of Clc03G05570 vs. ExPASy TrEMBL
Match: A0A6J1EHF5 (uncharacterized protein LOC111434325 OS=Cucurbita moschata OX=3662 GN=LOC111434325 PE=4 SV=1)

HSP 1 Score: 529.6 bits (1363), Expect = 9.6e-147
Identity = 264/346 (76.30%), Postives = 286/346 (82.66%), Query Frame = 0

Query: 9   MAFYNSYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDSCTSNFYEFPQLIEHES 68
           MAFY+SYYDSAQ EPPI Q S EPTFYNLFDYPPPCYF Q+Y   TS+  EFPQLIE++ 
Sbjct: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSSSNEFPQLIEYQP 60

Query: 69  IDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYSE----VPTQFVISYSVSEFNETEF 128
           +DHG YGY ISYSANACSAS+F++PKVIEYDPD YS+    V +QFVISYSVSEFNETEF
Sbjct: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDFYSDGYQKVSSQFVISYSVSEFNETEF 120

Query: 129 EEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSQPSTATAIPISTIPKVEEAPKGKI 188
           EEYDPTPYGGGYDI ETYGKPL PST+ICY PSSSS P      P  T   ++EAPK KI
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPK-----PPPT--AIQEAPKEKI 180

Query: 189 EEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEIKAIQLADPGIGYGNGREVNQFP 248
           EE+TKPSSEIKPTQIEK N +        SESEEIEE+KAI  ADPGIGYGNGREVNQFP
Sbjct: 181 EEKTKPSSEIKPTQIEKDNTA--------SESEEIEEVKAIPFADPGIGYGNGREVNQFP 240

Query: 249 SGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQTAADY 308
           SGYGLEAMDLCESLFGYWPCLSRIKKQTACRQP NGCGRCHGHCYCYGNYGNQWQTAADY
Sbjct: 241 SGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADY 300

Query: 309 LFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNRCED 347
           LFGSHNPYPD    G+ +YGYQ Q+Q E  +GYVWLNQND  R +D
Sbjct: 301 LFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQNDLVRSDD 331

BLAST of Clc03G05570 vs. TAIR 10
Match: AT1G11440.1 (BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1); Has 19337 Blast hits to 8589 proteins in 488 species: Archae - 26; Bacteria - 641; Metazoa - 7852; Fungi - 2167; Plants - 955; Viruses - 616; Other Eukaryotes - 7080 (source: NCBI BLink). )

HSP 1 Score: 143.7 bits (361), Expect = 2.8e-34
Identity = 117/351 (33.33%), Postives = 167/351 (47.58%), Query Frame = 0

Query: 14  SYYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQ----SYDSCTSNFYEFPQL-IEHES 73
           ++Y++ Q+    +Q +      NL+D     Y +Q     ++  + N+Y + +   E E 
Sbjct: 3   NFYENYQSPYDYNQVN------NLYDQNHYHYNQQQQQLGFEPMSYNYYNWNESESESEY 62

Query: 74  IDHGGYGYPISYSAN--------------ACSASSFTLPKVIEYDPDLYS--EVPTQFVI 133
           + + GY  P+SY+                A S S+ + PK + YDP+LY+  E P QF I
Sbjct: 63  VAYSGYDDPMSYNCYNWNGSESETTSAYVAYSVSTMSEPKHLFYDPNLYTTYESPPQFSI 122

Query: 134 SYSVS---EFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSS------SQPS 193
             SV+   +FNE EF+EYDPTPYGGGYD+  TYGKPLPPS E CYP S++      S P 
Sbjct: 123 YCSVASALDFNEPEFDEYDPTPYGGGYDVVATYGKPLPPSVETCYPCSTAPHAKAPSPPE 182

Query: 194 TATAIPISTIPKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSE------------SD 253
               +P+      ++    K     +P  E+KP +  K  +    E             D
Sbjct: 183 IIAPVPLGIYDGGQKNVVKKRVSFAEPVEEVKPIETIKEQEQEQDEDYDEESEDEDDGDD 242

Query: 254 MDSESEEIEEIKAIQLADPGIGYGNGR-------EVNQF--PSGYGLEAMDLCESLF-GY 313
            D E EE +E    +  D    YGN         EV     PSGYGLEA DLCE +F GY
Sbjct: 243 DDEEEEEGDEEAKEEEKDHSSSYGNEEYEVVDKGEVKALYVPSGYGLEATDLCEVIFGGY 302

BLAST of Clc03G05570 vs. TAIR 10
Match: AT5G39570.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cytosol, nucleus; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 49.7 bits (117), Expect = 5.6e-06
Identity = 50/159 (31.45%), Postives = 69/159 (43.40%), Query Frame = 0

Query: 114 YSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSQPSTATAIPISTI 173
           Y+  + +  +F+E+DPTPY GGYDI+  YG+P+PPS E CYP SS          P  T 
Sbjct: 4   YTRDDNDVDDFDEFDPTPYSGGYDITVIYGRPIPPSDETCYPLSSGVDDDFEYERPEFT- 63

Query: 174 PKVEEAPKGKIEE----------QTKPSSEIKP-------TQIEKVNDSSSSESDMDSES 233
            ++ E P    +E          + KP    +P        Q E+ N    SES    + 
Sbjct: 64  -QIHE-PSAYGDEALNTEYSSYSRPKPRPAFRPDSGGGGHVQGERPNPGYGSESGYGRKP 123

Query: 234 EEIEEIKAIQLADPGIGYGNGREV-------NQFPSGYG 249
           E          ++ G GYG   EV         + SGYG
Sbjct: 124 E----------SEYGSGYGGQTEVEYGRRPEQSYGSGYG 149

BLAST of Clc03G05570 vs. TAIR 10
Match: AT3G29075.1 (glycine-rich protein )

HSP 1 Score: 47.4 bits (111), Expect = 2.8e-05
Identity = 41/135 (30.37%), Postives = 64/135 (47.41%), Query Frame = 0

Query: 114 YSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSQPSTATAIPISTI 173
           Y+  + +  +F EYDP PY GGYDI+ TYG+ +PPS E CYP SS S  +     P +  
Sbjct: 4   YTNDDNDVDDFTEYDPMPYSGGYDITVTYGRSIPPSDETCYPLSSLSGDAFEYQRP-NFS 63

Query: 174 PKVEEAPKGKIEEQTKPSSEIKPTQIEKVNDSSSSESDMDSESEEIEEIKAIQLADPGIG 233
              + +       +T+ SS  +P  +   +D     +       E+E  +  + ++ G G
Sbjct: 64  SNHDSSAYDDQALKTEYSSYARPGPVGSGSDFGRKPNSGYGGRTEVEYGRKTE-SEHGSG 123

Query: 234 YGNGREVNQFPSGYG 249
           YG   E +     YG
Sbjct: 124 YGGRIESDYVKPSYG 136

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038895690.15.9e-16784.05uncharacterized protein LOC120083862 [Benincasa hispida][more]
XP_008441695.12.7e-16479.73PREDICTED: uncharacterized protein LOC103485767 [Cucumis melo][more]
KAA0056916.11.8e-16079.12uncharacterized protein E6C27_scaffold96G00880 [Cucumis melo var. makuwa] >TYK26... [more]
XP_011652905.12.6e-15476.42uncharacterized protein At5g39570 [Cucumis sativus] >KGN64592.1 hypothetical pro... [more]
XP_023001286.11.7e-15078.06uncharacterized protein LOC111495462 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Q9FKA57.9e-0531.45Uncharacterized protein At5g39570 OS=Arabidopsis thaliana OX=3702 GN=At5g39570 P... [more]
Match NameE-valueIdentityDescription
A0A1S3B4041.3e-16479.73uncharacterized protein LOC103485767 OS=Cucumis melo OX=3656 GN=LOC103485767 PE=... [more]
A0A5D3DRV28.9e-16179.12Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A0A0LUY11.2e-15476.42Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G070580 PE=4 SV=1[more]
A0A6J1KI708.4e-15178.06uncharacterized protein LOC111495462 OS=Cucurbita maxima OX=3661 GN=LOC111495462... [more]
A0A6J1EHF59.6e-14776.30uncharacterized protein LOC111434325 OS=Cucurbita moschata OX=3662 GN=LOC1114343... [more]
Match NameE-valueIdentityDescription
AT1G11440.12.8e-3433.33BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075... [more]
AT5G39570.15.6e-0631.45FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
AT3G29075.12.8e-0530.37glycine-rich protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 191..205
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 154..214
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 154..170
NoneNo IPR availablePANTHERPTHR33971:SF3OS02G0743600 PROTEINcoord: 9..345
IPR038943PLD-regulated protein1-likePANTHERPTHR33971OS06G0232000 PROTEINcoord: 9..345

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc03G05570.2Clc03G05570.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0070300 phosphatidic acid binding