CaUC03G054820 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC03G054820
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionUnknown protein
LocationCiama_Chr03: 5345064 .. 5345405 (-)
RNA-Seq ExpressionCaUC03G054820
SyntenyCaUC03G054820
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTGTTTTAGTGACAATGAGGAAAAATGGTGGTTATTCAAGGATGGAGAAGGAAGACCCAGATGAGAAAAAGCATAGACAAGCACAGTTTTTAATTTACAAGATAATGGAGCAGGCAAATAGTAGCAGTAGGAGAAGACCCTCATATCTCAGAATCAGAATAAGGAAGTTTAAGTTAAGGATTGGAAAGAGGTTGAAGAAGCTGAAGAAGACAATGTCAATGAGTTTGGCCACAGCAAGAATTGGGATTTGCAAGCAATTTGAACAACTTAGAACTTGTAAGAGCTTGTTTGGAGGAGCCAAGGTGGAAACTTTGGCCTTTCCTACTTTGGTTACTTGA

mRNA sequence

ATGAGTGTTTTAGTGACAATGAGGAAAAATGGTGGTTATTCAAGGATGGAGAAGGAAGACCCAGATGAGAAAAAGCATAGACAAGCACAGTTTTTAATTTACAAGATAATGGAGCAGGCAAATAGTAGCAGTAGGAGAAGACCCTCATATCTCAGAATCAGAATAAGGAAGTTTAAGTTAAGGATTGGAAAGAGGTTGAAGAAGCTGAAGAAGACAATGTCAATGAGTTTGGCCACAGCAAGAATTGGGATTTGCAAGCAATTTGAACAACTTAGAACTTGTAAGAGCTTGTTTGGAGGAGCCAAGGTGGAAACTTTGGCCTTTCCTACTTTGGTTACTTGA

Coding sequence (CDS)

ATGAGTGTTTTAGTGACAATGAGGAAAAATGGTGGTTATTCAAGGATGGAGAAGGAAGACCCAGATGAGAAAAAGCATAGACAAGCACAGTTTTTAATTTACAAGATAATGGAGCAGGCAAATAGTAGCAGTAGGAGAAGACCCTCATATCTCAGAATCAGAATAAGGAAGTTTAAGTTAAGGATTGGAAAGAGGTTGAAGAAGCTGAAGAAGACAATGTCAATGAGTTTGGCCACAGCAAGAATTGGGATTTGCAAGCAATTTGAACAACTTAGAACTTGTAAGAGCTTGTTTGGAGGAGCCAAGGTGGAAACTTTGGCCTTTCCTACTTTGGTTACTTGA

Protein sequence

MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKLRIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFGGAKVETLAFPTLVT
Homology
BLAST of CaUC03G054820 vs. NCBI nr
Match: KAG6584084.1 (hypothetical protein SDJN03_20016, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 174.1 bits (440), Expect = 6.9e-40
Identity = 95/113 (84.07%), Postives = 101/113 (89.38%), Query Frame = 0

Query: 1   MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKL 60
           MS LVT R+N GYSRMEKEDP+EKKHRQAQFLIYKIMEQANS SRRR S LRIRIRKFKL
Sbjct: 1   MSGLVTRRQN-GYSRMEKEDPEEKKHRQAQFLIYKIMEQANSGSRRRSSCLRIRIRKFKL 60

Query: 61  RIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFGGAKVETLAFPTLVT 114
           RIG+RLKKLKKTMSM + TARIGICKQF QLRTCKSLFG AK E LAFP+LV+
Sbjct: 61  RIGRRLKKLKKTMSMGICTARIGICKQFGQLRTCKSLFGRAKEEALAFPSLVS 112

BLAST of CaUC03G054820 vs. NCBI nr
Match: KGN64527.1 (hypothetical protein Csa_013452 [Cucumis sativus])

HSP 1 Score: 174.1 bits (440), Expect = 6.9e-40
Identity = 88/110 (80.00%), Postives = 101/110 (91.82%), Query Frame = 0

Query: 1   MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKL 60
           MSVL+T+R+NGGYS+MEKEDPDEKKHRQAQFLIYKIMEQA++ S+RRPS LRIRIRKFKL
Sbjct: 1   MSVLMTVRRNGGYSKMEKEDPDEKKHRQAQFLIYKIMEQASNGSKRRPSCLRIRIRKFKL 60

Query: 61  RIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFGGAKVETLAFPT 111
           R+G+R KK+KKTM+MS +TARIGIC QF QLR+CKSLFG  KVETL FPT
Sbjct: 61  RMGRRWKKMKKTMAMSFSTARIGICNQFGQLRSCKSLFGKTKVETLNFPT 110

BLAST of CaUC03G054820 vs. NCBI nr
Match: KAG7019685.1 (hypothetical protein SDJN02_18648, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 171.4 bits (433), Expect = 4.4e-39
Identity = 94/113 (83.19%), Postives = 100/113 (88.50%), Query Frame = 0

Query: 1   MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKL 60
           MS LVT R+N GYSRMEKEDP+EKKHRQAQFLIYKIMEQANS SRRR S LRIRIRKFKL
Sbjct: 1   MSGLVTRRQN-GYSRMEKEDPEEKKHRQAQFLIYKIMEQANSGSRRRSSCLRIRIRKFKL 60

Query: 61  RIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFGGAKVETLAFPTLVT 114
           RIG+RLKKLKKTMSM + TAR GICKQF QLRTCKSLFG AK E LAFP+LV+
Sbjct: 61  RIGRRLKKLKKTMSMGICTARNGICKQFGQLRTCKSLFGRAKEEALAFPSLVS 112

BLAST of CaUC03G054820 vs. NCBI nr
Match: XP_022140046.1 (uncharacterized protein LOC111010796 [Momordica charantia])

HSP 1 Score: 171.4 bits (433), Expect = 4.4e-39
Identity = 93/114 (81.58%), Postives = 104/114 (91.23%), Query Frame = 0

Query: 1   MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKL 60
           MSV++T RK  GYSRMEKEDP+EKKHR+AQFLIYKIMEQANS SRRRP+ LRIRIRKFKL
Sbjct: 1   MSVILT-RKQSGYSRMEKEDPEEKKHREAQFLIYKIMEQANSGSRRRPTCLRIRIRKFKL 60

Query: 61  RIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFG-GAKVETLAFPTLVT 114
           RIG+RLKKL+K+MS SL+TARIGICKQ  QLRTCKSLFG G KVETLAFP+LV+
Sbjct: 61  RIGRRLKKLRKSMSASLSTARIGICKQIGQLRTCKSLFGRGRKVETLAFPSLVS 113

BLAST of CaUC03G054820 vs. NCBI nr
Match: XP_023001322.1 (uncharacterized protein LOC111495486 [Cucurbita maxima])

HSP 1 Score: 167.5 bits (423), Expect = 6.4e-38
Identity = 92/113 (81.42%), Postives = 98/113 (86.73%), Query Frame = 0

Query: 1   MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKL 60
           MS LVT R+N GYSRMEKEDP+EKKHRQAQFLIYKIMEQANS +RRR S LRIRIRKFKL
Sbjct: 1   MSGLVTRRQN-GYSRMEKEDPEEKKHRQAQFLIYKIMEQANSGTRRRSSCLRIRIRKFKL 60

Query: 61  RIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFGGAKVETLAFPTLVT 114
           RIG+RLKKLKKTMSM   TAR GICKQF QLRTCKSLFG  K E LAFP+LV+
Sbjct: 61  RIGRRLKKLKKTMSMGFCTARNGICKQFGQLRTCKSLFGRPKEEALAFPSLVS 112

BLAST of CaUC03G054820 vs. ExPASy TrEMBL
Match: A0A0A0LUT4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G063510 PE=4 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 3.3e-40
Identity = 88/110 (80.00%), Postives = 101/110 (91.82%), Query Frame = 0

Query: 1   MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKL 60
           MSVL+T+R+NGGYS+MEKEDPDEKKHRQAQFLIYKIMEQA++ S+RRPS LRIRIRKFKL
Sbjct: 1   MSVLMTVRRNGGYSKMEKEDPDEKKHRQAQFLIYKIMEQASNGSKRRPSCLRIRIRKFKL 60

Query: 61  RIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFGGAKVETLAFPT 111
           R+G+R KK+KKTM+MS +TARIGIC QF QLR+CKSLFG  KVETL FPT
Sbjct: 61  RMGRRWKKMKKTMAMSFSTARIGICNQFGQLRSCKSLFGKTKVETLNFPT 110

BLAST of CaUC03G054820 vs. ExPASy TrEMBL
Match: A0A6J1CEK3 (uncharacterized protein LOC111010796 OS=Momordica charantia OX=3673 GN=LOC111010796 PE=4 SV=1)

HSP 1 Score: 171.4 bits (433), Expect = 2.2e-39
Identity = 93/114 (81.58%), Postives = 104/114 (91.23%), Query Frame = 0

Query: 1   MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKL 60
           MSV++T RK  GYSRMEKEDP+EKKHR+AQFLIYKIMEQANS SRRRP+ LRIRIRKFKL
Sbjct: 1   MSVILT-RKQSGYSRMEKEDPEEKKHREAQFLIYKIMEQANSGSRRRPTCLRIRIRKFKL 60

Query: 61  RIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFG-GAKVETLAFPTLVT 114
           RIG+RLKKL+K+MS SL+TARIGICKQ  QLRTCKSLFG G KVETLAFP+LV+
Sbjct: 61  RIGRRLKKLRKSMSASLSTARIGICKQIGQLRTCKSLFGRGRKVETLAFPSLVS 113

BLAST of CaUC03G054820 vs. ExPASy TrEMBL
Match: A0A6J1KG77 (uncharacterized protein LOC111495486 OS=Cucurbita maxima OX=3661 GN=LOC111495486 PE=4 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 3.1e-38
Identity = 92/113 (81.42%), Postives = 98/113 (86.73%), Query Frame = 0

Query: 1   MSVLVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKL 60
           MS LVT R+N GYSRMEKEDP+EKKHRQAQFLIYKIMEQANS +RRR S LRIRIRKFKL
Sbjct: 1   MSGLVTRRQN-GYSRMEKEDPEEKKHRQAQFLIYKIMEQANSGTRRRSSCLRIRIRKFKL 60

Query: 61  RIGKRLKKLKKTMSMSLATARIGICKQFEQLRTCKSLFGGAKVETLAFPTLVT 114
           RIG+RLKKLKKTMSM   TAR GICKQF QLRTCKSLFG  K E LAFP+LV+
Sbjct: 61  RIGRRLKKLKKTMSMGFCTARNGICKQFGQLRTCKSLFGRPKEEALAFPSLVS 112

BLAST of CaUC03G054820 vs. ExPASy TrEMBL
Match: W9RFE3 (Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_006294 PE=4 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 9.7e-24
Identity = 63/116 (54.31%), Postives = 85/116 (73.28%), Query Frame = 0

Query: 5   VTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKLRIGK 64
           + +RK   YS+++KEDP+E+  R+AQFLIYK+MEQA+S SRRRPSYLRIRIRK K++IGK
Sbjct: 3   LVLRKPNAYSKIDKEDPEERNRRRAQFLIYKVMEQADSMSRRRPSYLRIRIRKLKVKIGK 62

Query: 65  RLKKLKKTMSMSLATARIGICKQF-EQLRTCKSLFGGAK-------VETLAFPTLV 113
           RL KL+K+M +S++ A++ +CKQ   Q +TC+ LFGG         V TL  P  V
Sbjct: 63  RLTKLRKSMLLSISAAKVSVCKQVCSQFKTCRRLFGGGDNHSSANLVNTLPSPLFV 118

BLAST of CaUC03G054820 vs. ExPASy TrEMBL
Match: A0A7J6HWJ0 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_027202 PE=4 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 2.6e-21
Identity = 62/107 (57.94%), Postives = 79/107 (73.83%), Query Frame = 0

Query: 4   LVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKFKLRIG 63
           +V  + + GYS+MEKEDP+EK HR+AQFLIYK+MEQA   SR+R SYLR+RI+K K++IG
Sbjct: 3   IVLKKSSNGYSKMEKEDPEEKNHRRAQFLIYKVMEQA--ESRKRTSYLRLRIKKLKVKIG 62

Query: 64  KRLKKLKKTMSMSLATARIGICKQFE-QLRTCKSLFGGAKVETLAFP 110
           KRL KL+K    SL TARI + +Q   QL+TCK LFGG   +T   P
Sbjct: 63  KRLTKLRK----SLNTARINVYRQVSTQLKTCKRLFGGTSAQTTLVP 103

BLAST of CaUC03G054820 vs. TAIR 10
Match: AT1G11655.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G21902.1); Has 22 Blast hits to 22 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 22; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 60.8 bits (146), Expect = 7.9e-10
Identity = 38/101 (37.62%), Postives = 64/101 (63.37%), Query Frame = 0

Query: 13  YSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRP--SYLRIRIRKFKLRIGKRLKKLK 72
           YS+M+KEDP+E   R+A+FLIYK +++A+  SRR P  S++R+++   K++IGKRL KL+
Sbjct: 10  YSKMDKEDPEEVLSRRAKFLIYKTLQEADLISRRDPHSSFIRLKLYLLKVKIGKRLAKLR 69

Query: 73  KTMSMSLATARIGICKQFEQ--LRTCKSLFGGAKVETLAFP 110
           +++   ++  R G  ++     +R  K +F G     L  P
Sbjct: 70  RSV---VSAVRFGGIRKHSHNGVRALKKMFQGGATTGLPRP 107

BLAST of CaUC03G054820 vs. TAIR 10
Match: AT4G04745.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G21902.1); Has 32 Blast hits to 32 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 32; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 56.6 bits (135), Expect = 1.5e-08
Identity = 34/67 (50.75%), Postives = 49/67 (73.13%), Query Frame = 0

Query: 13  YSRMEKEDPDEKKHRQAQFLIYKIMEQANSSS-----RRRPS--YLRIRIRKFKLRIGKR 72
           Y++MEKEDP E  HR+AQFLI K++E+A+S +     RRR S   + IR+   ++RIGK+
Sbjct: 37  YTKMEKEDPQELIHRRAQFLIQKVLERADSKTRQHQQRRRSSGPLIMIRVVGIRMRIGKK 96

BLAST of CaUC03G054820 vs. TAIR 10
Match: AT4G21902.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G04745.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 46.2 bits (108), Expect = 2.0e-05
Identity = 28/79 (35.44%), Postives = 50/79 (63.29%), Query Frame = 0

Query: 4  LVTMRKNGGYSRMEKEDPDEKKHRQAQFLIYKIMEQANSSSRRRPSYLRIRIRKF----- 63
          +  M+    Y+++EKED +E  HR+AQFLI+KI+++A+  + R+       I+ F     
Sbjct: 1  MAMMKNPNQYTKIEKEDLNEIIHRRAQFLIHKILQRADIETLRQQQKRNTTIKLFSFRVV 60

Query: 64 --KLRIGKRLKKLKKTMSM 76
            +++IGK+L+KL+K+  M
Sbjct: 61 GIRMKIGKKLRKLRKSCVM 79

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6584084.16.9e-4084.07hypothetical protein SDJN03_20016, partial [Cucurbita argyrosperma subsp. sorori... [more]
KGN64527.16.9e-4080.00hypothetical protein Csa_013452 [Cucumis sativus][more]
KAG7019685.14.4e-3983.19hypothetical protein SDJN02_18648, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022140046.14.4e-3981.58uncharacterized protein LOC111010796 [Momordica charantia][more]
XP_023001322.16.4e-3881.42uncharacterized protein LOC111495486 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LUT43.3e-4080.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G063510 PE=4 SV=1[more]
A0A6J1CEK32.2e-3981.58uncharacterized protein LOC111010796 OS=Momordica charantia OX=3673 GN=LOC111010... [more]
A0A6J1KG773.1e-3881.42uncharacterized protein LOC111495486 OS=Cucurbita maxima OX=3661 GN=LOC111495486... [more]
W9RFE39.7e-2454.31Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_006294 PE=4 SV=1[more]
A0A7J6HWJ02.6e-2157.94Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_027202 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G11655.17.9e-1037.62unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G04745.11.5e-0850.75unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G21902.12.0e-0535.44unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..24
NoneNo IPR availablePANTHERPTHR35687OS07G0516700 PROTEINcoord: 5..103
NoneNo IPR availablePANTHERPTHR35687:SF1OS07G0516700 PROTEINcoord: 5..103

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC03G054820.1CaUC03G054820.1mRNA