CaUC09G164250 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC09G164250
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Glutamic acid-rich protein
Location: Ciama_Chr09: 5948748 .. 5949170 (+)
RNA-Seq Expression: CaUC09G164250
Synteny: CaUC09G164250
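
The Location field gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page), the feature length can be derived as follows. The function names `parse_location` and `span_length` are hypothetical and exist only for illustration.

```python
import re

def parse_location(text):
    """Split a 'SeqID: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Ciama_Chr09: 5948748 .. 5949170 (+)")
print(seqid, span_length(start, end), strand)  # Ciama_Chr09 423 +
```

Under that assumption the gene spans 423 nucleotides, which is 141 codons: 140 amino acids plus one stop codon.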
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCTCAACTCCTCTCTTCTTCAACTTCTATCCTCACCATTTTGCTCATAAGCAAGGCAACCACATTGAGGATATTTCCTCTTACTTTCTCTTCGAGGCGACCGGCGACTCGGAGGTTGACTCCTCGGTCGACCCCGGGGGCTCGGCCGCCTCGACGGAGTTCAACGACGCTGAATCTTGCACCGATGACACTCCTACTATATGTATTAATGAATTTGGGATATATGGAAGTTATGAATATGAAGAAGAAGAAGAAGAAGAAGAGATGGTTGAGAATGATGATGATCATGAAGTAGTTGAAAGCAAGCCAATTGGATTCACAACAAAGTCAAATGCTTCTATTGATTCAACTAAGGAGTTCAAAATGTTGAATGAGGTGGACAAAAACAGGTTGTTTTGGGAGGCTTGTTTGGCTTCATAG

mRNA sequence

ATGCTCTCAACTCCTCTCTTCTTCAACTTCTATCCTCACCATTTTGCTCATAAGCAAGGCAACCACATTGAGGATATTTCCTCTTACTTTCTCTTCGAGGCGACCGGCGACTCGGAGGTTGACTCCTCGGTCGACCCCGGGGGCTCGGCCGCCTCGACGGAGTTCAACGACGCTGAATCTTGCACCGATGACACTCCTACTATATGTATTAATGAATTTGGGATATATGGAAGTTATGAATATGAAGAAGAAGAAGAAGAAGAAGAGATGGTTGAGAATGATGATGATCATGAAGTAGTTGAAAGCAAGCCAATTGGATTCACAACAAAGTCAAATGCTTCTATTGATTCAACTAAGGAGTTCAAAATGTTGAATGAGGTGGACAAAAACAGGTTGTTTTGGGAGGCTTGTTTGGCTTCATAG

Coding sequence (CDS)

ATGCTCTCAACTCCTCTCTTCTTCAACTTCTATCCTCACCATTTTGCTCATAAGCAAGGCAACCACATTGAGGATATTTCCTCTTACTTTCTCTTCGAGGCGACCGGCGACTCGGAGGTTGACTCCTCGGTCGACCCCGGGGGCTCGGCCGCCTCGACGGAGTTCAACGACGCTGAATCTTGCACCGATGACACTCCTACTATATGTATTAATGAATTTGGGATATATGGAAGTTATGAATATGAAGAAGAAGAAGAAGAAGAAGAGATGGTTGAGAATGATGATGATCATGAAGTAGTTGAAAGCAAGCCAATTGGATTCACAACAAAGTCAAATGCTTCTATTGATTCAACTAAGGAGTTCAAAATGTTGAATGAGGTGGACAAAAACAGGTTGTTTTGGGAGGCTTGTTTGGCTTCATAG

Protein sequence

MLSTPLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVDPGGSAASTEFNDAESCTDDTPTICINEFGIYGSYEYEEEEEEEEMVENDDDHEVVESKPIGFTTKSNASIDSTKEFKMLNEVDKNRLFWEACLAS
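
The protein sequence can be reproduced from the coding sequence by codon-wise translation. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the page does not say which table the annotation pipeline used. The `translate` helper is hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCTCTCAACTCCTCTCTTCTTCAACTTC"  # first 30 nt of the CDS above
cds_end = "GCTTGTTTGGCTTCATAG"                # last 18 nt, including the TAG stop

print(translate(cds_start))  # MLSTPLFFNF
print(translate(cds_end))    # ACLAS
```

Both fragments agree with the start and end of the protein sequence listed above.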
Homology
BLAST of CaUC09G164250 vs. NCBI nr
Match: XP_038896353.1 (uncharacterized protein LOC120084618 [Benincasa hispida])

HSP 1 Score: 230.3 bits (586), Expect = 1.0e-56
Identity = 123/140 (87.86%), Positives = 126/140 (90.00%), Query Frame = 0

Query: 1   MLSTPLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVDPGGSAASTEFNDAES 60
           MLSTPLFFNFYPH FAHKQGNHIEDISSYFL EATGDSE+DSSVD GGS ASTEFNDAES
Sbjct: 1   MLSTPLFFNFYPHPFAHKQGNHIEDISSYFLLEATGDSEIDSSVDLGGSVASTEFNDAES 60

Query: 61  CTDDTPTICINEFGIYGSYEYEEEEEEEEMVENDDDHEVVESKPIGFTTKSNASIDSTKE 120
           CTDDTPTIC NEF   G YEY E+E+EEEMVEND D EVVESK IGFTTKSNASIDSTKE
Sbjct: 61  CTDDTPTICTNEF---GKYEY-EDEDEEEMVENDYD-EVVESKAIGFTTKSNASIDSTKE 120

Query: 121 FKMLNEVDKNRLFWEACLAS 141
           FKMLNEVDKNRLFWE CLAS
Sbjct: 121 FKMLNEVDKNRLFWETCLAS 135
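
The Identity and Positives figures in each HSP header can be recovered from the alignment's middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of that bookkeeping, not the BLAST implementation; the helper names `midline_counts` and `percent` are hypothetical.

```python
def midline_counts(midline):
    """Count identities (letters) and positives (letters plus '+') in a BLAST midline."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

def percent(part, total):
    """Format part/total as a percentage with two decimals."""
    return f"{100 * part / total:.2f}"

# Last block of the alignment above: 20 columns, one mismatch (A vs. T).
print(midline_counts("FKMLNEVDKNRLFWE CLAS"))  # (19, 19)

# Totals reported in the HSP header above.
print(percent(123, 140))  # 87.86
print(percent(126, 140))  # 90.00
```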

BLAST of CaUC09G164250 vs. NCBI nr
Match: KAA0057545.1 (glutamic acid-rich protein [Cucumis melo var. makuwa])

HSP 1 Score: 187.2 bits (474), Expect = 9.7e-44
Identity = 109/152 (71.71%), Positives = 120/152 (78.95%), Query Frame = 0

Query: 1   MLST-PLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVD--PGGSAASTEFND 60
           MLST PLFFNFYPHHF H QGNHIEDISSYFLFEATGDSE DSSVD     SA++TEFND
Sbjct: 1   MLSTSPLFFNFYPHHFVHNQGNHIEDISSYFLFEATGDSEADSSVDLESSSSASTTEFND 60

Query: 61  AESCTDDTPT-ICINEFGIYGSYEYEEEEEEEEMVENDDD-------HEVVESKPIGFTT 120
           AESCTDDTPT +C NE     + + +++ + EEMVENDDD        EVVESK IGF+ 
Sbjct: 61  AESCTDDTPTMLCNNE-----NEDDDDDGDGEEMVENDDDDHDNEEEEEVVESKAIGFSI 120

Query: 121 KSNASIDSTK-EFKMLNEVDKNRLFWEACLAS 141
           KSNASIDSTK +FKMLNEVDKNRLFWE CLAS
Sbjct: 121 KSNASIDSTKDDFKMLNEVDKNRLFWETCLAS 147

BLAST of CaUC09G164250 vs. NCBI nr
Match: XP_008451422.1 (PREDICTED: glutamic acid-rich protein [Cucumis melo])

HSP 1 Score: 186.4 bits (472), Expect = 1.7e-43
Identity = 109/154 (70.78%), Positives = 119/154 (77.27%), Query Frame = 0

Query: 1   MLST-PLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVD--PGGSAASTEFND 60
           MLST PLFFNFYPHHF H QGNHIEDISSYFLFEATGDSE DSSVD     SA++TEFND
Sbjct: 1   MLSTSPLFFNFYPHHFVHNQGNHIEDISSYFLFEATGDSEADSSVDLESSSSASTTEFND 60

Query: 61  AESCTDDTPT-ICINEFGIYGSYEYEEEEEEEEMVENDDD---------HEVVESKPIGF 120
           AESCTDDTPT +C NE       + +++ + EEMVENDDD          EVVESK IGF
Sbjct: 61  AESCTDDTPTMLCNNE----NEDDDDDDGDGEEMVENDDDDHDDEDEEEEEVVESKAIGF 120

Query: 121 TTKSNASIDSTK-EFKMLNEVDKNRLFWEACLAS 141
           + KSNASIDSTK +FKMLNEVDKNRLFWE CLAS
Sbjct: 121 SIKSNASIDSTKDDFKMLNEVDKNRLFWETCLAS 150

BLAST of CaUC09G164250 vs. NCBI nr
Match: XP_011659319.1 (nuclear polyadenylated RNA-binding protein 3 [Cucumis sativus] >KGN44840.1 hypothetical protein Csa_016864 [Cucumis sativus])

HSP 1 Score: 179.5 bits (454), Expect = 2.0e-41
Identity = 106/151 (70.20%), Positives = 111/151 (73.51%), Query Frame = 0

Query: 1   MLSTPLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVDPGGSAASTEFNDAES 60
           MLS+PLFFNFYPHHF H QGNHIEDISSYFLFEATGDSEVD       S  STEFNDAES
Sbjct: 1   MLSSPLFFNFYPHHFVHNQGNHIEDISSYFLFEATGDSEVDLQ---SSSPVSTEFNDAES 60

Query: 61  CTDDTPTI--CINEFGIYGSYEYEEEEEEEEMVENDDD--------HEVVESKPIGFTTK 120
           CTDDT  I  C NE         ++E+EEE  VENDDD         EVVESK IGF+ K
Sbjct: 61  CTDDTDRIMLCNNE---------DDEDEEEMGVENDDDDGDEEEEEEEVVESKAIGFSIK 120

Query: 121 SNASIDSTK-EFKMLNEVDKNRLFWEACLAS 141
           SNASIDSTK EFKMLNEVDKNRLFWE CLAS
Sbjct: 121 SNASIDSTKDEFKMLNEVDKNRLFWETCLAS 139

BLAST of CaUC09G164250 vs. NCBI nr
Match: XP_022991734.1 (uncharacterized protein LOC111488264 [Cucurbita maxima])

HSP 1 Score: 166.4 bits (420), Expect = 1.8e-37
Identity = 92/140 (65.71%), Positives = 107/140 (76.43%), Query Frame = 0

Query: 1   MLSTPLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVDPGGSAASTEFNDAES 60
           MLSTP FFNF+PH F    GNHIED SS+ LFEATGDSE DS V  G S  S EFNDAES
Sbjct: 1   MLSTPFFFNFHPHQF----GNHIEDFSSHLLFEATGDSEADSLVGHGSSTVSVEFNDAES 60

Query: 61  CTDDTPTICINEFGIYGSYEYEEEEEEEEMVENDDDHEVVESKPIGFTTKSNASIDSTKE 120
           CTDDTP +C NEF  Y + + +++++E    END++   VESKPIGF T+S+ASIDS +E
Sbjct: 61  CTDDTPAMC-NEFKTYENDDDDDDDDE----ENDEE---VESKPIGFMTESSASIDSAEE 120

Query: 121 FKMLNEVDKNRLFWEACLAS 141
           FKMLNEVDKNRLFWE CLAS
Sbjct: 121 FKMLNEVDKNRLFWETCLAS 128

BLAST of CaUC09G164250 vs. ExPASy TrEMBL
Match: A0A5A7UVG0 (Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold497G00160 PE=4 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 4.7e-44
Identity = 109/152 (71.71%), Positives = 120/152 (78.95%), Query Frame = 0

Query: 1   MLST-PLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVD--PGGSAASTEFND 60
           MLST PLFFNFYPHHF H QGNHIEDISSYFLFEATGDSE DSSVD     SA++TEFND
Sbjct: 1   MLSTSPLFFNFYPHHFVHNQGNHIEDISSYFLFEATGDSEADSSVDLESSSSASTTEFND 60

Query: 61  AESCTDDTPT-ICINEFGIYGSYEYEEEEEEEEMVENDDD-------HEVVESKPIGFTT 120
           AESCTDDTPT +C NE     + + +++ + EEMVENDDD        EVVESK IGF+ 
Sbjct: 61  AESCTDDTPTMLCNNE-----NEDDDDDGDGEEMVENDDDDHDNEEEEEVVESKAIGFSI 120

Query: 121 KSNASIDSTK-EFKMLNEVDKNRLFWEACLAS 141
           KSNASIDSTK +FKMLNEVDKNRLFWE CLAS
Sbjct: 121 KSNASIDSTKDDFKMLNEVDKNRLFWETCLAS 147

BLAST of CaUC09G164250 vs. ExPASy TrEMBL
Match: A0A1S3BRI2 (glutamic acid-rich protein OS=Cucumis melo OX=3656 GN=LOC103492721 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 8.0e-44
Identity = 109/154 (70.78%), Positives = 119/154 (77.27%), Query Frame = 0

Query: 1   MLST-PLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVD--PGGSAASTEFND 60
           MLST PLFFNFYPHHF H QGNHIEDISSYFLFEATGDSE DSSVD     SA++TEFND
Sbjct: 1   MLSTSPLFFNFYPHHFVHNQGNHIEDISSYFLFEATGDSEADSSVDLESSSSASTTEFND 60

Query: 61  AESCTDDTPT-ICINEFGIYGSYEYEEEEEEEEMVENDDD---------HEVVESKPIGF 120
           AESCTDDTPT +C NE       + +++ + EEMVENDDD          EVVESK IGF
Sbjct: 61  AESCTDDTPTMLCNNE----NEDDDDDDGDGEEMVENDDDDHDDEDEEEEEVVESKAIGF 120

Query: 121 TTKSNASIDSTK-EFKMLNEVDKNRLFWEACLAS 141
           + KSNASIDSTK +FKMLNEVDKNRLFWE CLAS
Sbjct: 121 SIKSNASIDSTKDDFKMLNEVDKNRLFWETCLAS 150

BLAST of CaUC09G164250 vs. ExPASy TrEMBL
Match: A0A0A0K519 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G390190 PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 9.8e-42
Identity = 106/151 (70.20%), Positives = 111/151 (73.51%), Query Frame = 0

Query: 1   MLSTPLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVDPGGSAASTEFNDAES 60
           MLS+PLFFNFYPHHF H QGNHIEDISSYFLFEATGDSEVD       S  STEFNDAES
Sbjct: 1   MLSSPLFFNFYPHHFVHNQGNHIEDISSYFLFEATGDSEVDLQ---SSSPVSTEFNDAES 60

Query: 61  CTDDTPTI--CINEFGIYGSYEYEEEEEEEEMVENDDD--------HEVVESKPIGFTTK 120
           CTDDT  I  C NE         ++E+EEE  VENDDD         EVVESK IGF+ K
Sbjct: 61  CTDDTDRIMLCNNE---------DDEDEEEMGVENDDDDGDEEEEEEEVVESKAIGFSIK 120

Query: 121 SNASIDSTK-EFKMLNEVDKNRLFWEACLAS 141
           SNASIDSTK EFKMLNEVDKNRLFWE CLAS
Sbjct: 121 SNASIDSTKDEFKMLNEVDKNRLFWETCLAS 139

BLAST of CaUC09G164250 vs. ExPASy TrEMBL
Match: A0A6J1JVN0 (uncharacterized protein LOC111488264 OS=Cucurbita maxima OX=3661 GN=LOC111488264 PE=4 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 8.6e-38
Identity = 92/140 (65.71%), Positives = 107/140 (76.43%), Query Frame = 0

Query: 1   MLSTPLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVDSSVDPGGSAASTEFNDAES 60
           MLSTP FFNF+PH F    GNHIED SS+ LFEATGDSE DS V  G S  S EFNDAES
Sbjct: 1   MLSTPFFFNFHPHQF----GNHIEDFSSHLLFEATGDSEADSLVGHGSSTVSVEFNDAES 60

Query: 61  CTDDTPTICINEFGIYGSYEYEEEEEEEEMVENDDDHEVVESKPIGFTTKSNASIDSTKE 120
           CTDDTP +C NEF  Y + + +++++E    END++   VESKPIGF T+S+ASIDS +E
Sbjct: 61  CTDDTPAMC-NEFKTYENDDDDDDDDE----ENDEE---VESKPIGFMTESSASIDSAEE 120

Query: 121 FKMLNEVDKNRLFWEACLAS 141
           FKMLNEVDKNRLFWE CLAS
Sbjct: 121 FKMLNEVDKNRLFWETCLAS 128

BLAST of CaUC09G164250 vs. ExPASy TrEMBL
Match: A0A6J1DA36 (uncharacterized protein LOC111018429 OS=Momordica charantia OX=3673 GN=LOC111018429 PE=4 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 3.4e-34
Identity = 95/144 (65.97%), Positives = 106/144 (73.61%), Query Frame = 0

Query: 1   MLSTPLFFNFYPHHFAHKQGNHIEDISSYFLFEATGDSEVD-SSVDPGGSAA---STEFN 60
           M  TPLFFN + HHFA K G  ++D SSY LFEATGDSE D SSVDPGGS+A   S E N
Sbjct: 1   MFKTPLFFNLHHHHFAPKDG-CLQDFSSYLLFEATGDSEADASSVDPGGSSAAPGSPECN 60

Query: 61  DAESCTDDTPTICINEFGIYGSYEYEEEEEEEEMVENDDDHEVVESKPIGFTTKSNASID 120
           DAESCTDDTP    +EF +    ++E+ EE E   +NDD  EVVESK     TKS+ASID
Sbjct: 61  DAESCTDDTPD--CDEFDMDEDEDHEDGEEGENYEKNDD--EVVESKATKGFTKSSASID 120

Query: 121 STKEFKMLNEVDKNRLFWEACLAS 141
           STKEFKMLNEVDKNRLFWEACLAS
Sbjct: 121 STKEFKMLNEVDKNRLFWEACLAS 139

BLAST of CaUC09G164250 vs. TAIR 10
Match: AT2G47950.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: root, flower; EXPRESSED DURING: petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G62990.1); Has 22 Blast hits to 22 proteins in 5 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 22; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 49.3 bits (116), Expect = 2.9e-06
Identity = 37/127 (29.13%), Positives = 61/127 (48.03%), Query Frame = 0

Query: 21  NHIEDISSYFLFEATGDSEV-DSSVDPGGSAASTEFNDAESCTDDTPTICINE------F 80
           N++ D+S + L EA+ DSE    SVD        +     S      T C+++      F
Sbjct: 5   NNVFDVSPFLLLEASADSEAGHDSVDDDKCVKDYDLGHESSSASSCETSCVSQRTSLLGF 64

Query: 81  GIYGSYEYEEEEEEEEMVENDDDHEVVESKPIGFTTKSNASIDSTKEFKMLNEVDKNRLF 140
            +       EE +  +  + D + EV      G + + N ++DS     +++E+D+NR+F
Sbjct: 65  DLQDDTVNHEERDAGDEEDEDGEGEVNSYIRCGRSQRENLAVDSA---AVVSEMDQNRMF 124

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038896353.1  1.0e-56  87.86         uncharacterized protein LOC120084618 [Benincasa hispida]
KAA0057545.1    9.7e-44  71.71         glutamic acid-rich protein [Cucumis melo var. makuwa]
XP_008451422.1  1.7e-43  70.78         PREDICTED: glutamic acid-rich protein [Cucumis melo]
XP_011659319.1  2.0e-41  70.20         nuclear polyadenylated RNA-binding protein 3 [Cucumis sativus] >KGN44840.1 hypot...
XP_022991734.1  1.8e-37  65.71         uncharacterized protein LOC111488264 [Cucurbita maxima]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5A7UVG0      4.7e-44  71.71         Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff...
A0A1S3BRI2      8.0e-44  70.78         glutamic acid-rich protein OS=Cucumis melo OX=3656 GN=LOC103492721 PE=4 SV=1
A0A0A0K519      9.8e-42  70.20         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G390190 PE=4 SV=1
A0A6J1JVN0      8.6e-38  65.71         uncharacterized protein LOC111488264 OS=Cucurbita maxima OX=3661 GN=LOC111488264...
A0A6J1DA36      3.4e-34  65.97         uncharacterized protein LOC111018429 OS=Momordica charantia OX=3673 GN=LOC111018...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT2G47950.1     2.9e-06  29.13         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term    Source Description               Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 45..63
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 40..63
None      No IPR available  PANTHER      PTHR35726      GLUTAMIC ACID-RICH PROTEIN-LIKE  coord: 19..140
None      No IPR available  PANTHER      PTHR35726:SF4  GLUTAMIC ACID-RICH PROTEIN-LIKE  coord: 19..140

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CaUC09G164250.1  CaUC09G164250.1  mRNA