Cla97C11G207150 (gene) Watermelon (97103) v2

NameCla97C11G207150
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionUnknown protein
LocationCla97Chr11 : 1037771 .. 1038394 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTATGAACCAAGTGAAAGCATCCAAATGAAATTTACTTGTAACTTTTCTATGGTTTATTGTTTGCAGATGGATGTGGAGAAATGGCTTCAGCATTTAACTGCTGCAAGAAGCAAAGCAGAGCAGGAGACTACTTTTGAAGAAGTGGATGGCTTGATCAATCAATTAAATTTGCTCTCTAATGCTTTAAGCATGAGGTAGCCAACTTTGTACATTCTCTTCATAATAAGCCTTAACTTAATATCCATCCGCTTCTATCATCGCTTATCGATGCTAACGTTCAGCTGTCAAATGAATTTTACATCAGTGAGACTGAGGAAAATGTAACTCAAGCGATATCGATTTCAAAGAGTTTATATTCGAGGAGGATGGAGTTGGAGCCTATGTTTACTAAACTACTGAATGATGATCCTAAAACGGAAGTTGGTCAGATGTCGGGTATCAAGAATACCGAAGATGACGAGAATGTGAATCAAGATTGTAACGACAGCAGCCCTGAAGAATGTAGAGGAGTGGAGGCTGTGAAAGCTGAACCTGTTCTGTCCCAAGCAATGGATAAGAAAGGGAAAGCGAAAGGGAAAAATAAGCCCAGGAAAAACAGGAAGGGTCGTAAAATTAATTGA

mRNA sequence

ATGATGGATGTGGAGAAATGGCTTCAGCATTTAACTGCTGCAAGAAGCAAAGCAGAGCAGGAGACTACTTTTGAAGAAGTGGATGGCTTGATCAATCAATTAAATTTGCTCTCTAATGCTTTAAGCATGAGTGAGACTGAGGAAAATGTAACTCAAGCGATATCGATTTCAAAGAGTTTATATTCGAGGAGGATGGAGTTGGAGCCTATGTTTACTAAACTACTGAATGATGATCCTAAAACGGAAGTTGGTCAGATGTCGGGTATCAAGAATACCGAAGATGACGAGAATGTGAATCAAGATTGTAACGACAGCAGCCCTGAAGAATGTAGAGGAGTGGAGGCTGTGAAAGCTGAACCTGTTCTGTCCCAAGCAATGGATAAGAAAGGGAAAGCGAAAGGGAAAAATAAGCCCAGGAAAAACAGGAAGGGTCGTAAAATTAATTGA

Coding sequence (CDS)

ATGATGGATGTGGAGAAATGGCTTCAGCATTTAACTGCTGCAAGAAGCAAAGCAGAGCAGGAGACTACTTTTGAAGAAGTGGATGGCTTGATCAATCAATTAAATTTGCTCTCTAATGCTTTAAGCATGAGTGAGACTGAGGAAAATGTAACTCAAGCGATATCGATTTCAAAGAGTTTATATTCGAGGAGGATGGAGTTGGAGCCTATGTTTACTAAACTACTGAATGATGATCCTAAAACGGAAGTTGGTCAGATGTCGGGTATCAAGAATACCGAAGATGACGAGAATGTGAATCAAGATTGTAACGACAGCAGCCCTGAAGAATGTAGAGGAGTGGAGGCTGTGAAAGCTGAACCTGTTCTGTCCCAAGCAATGGATAAGAAAGGGAAAGCGAAAGGGAAAAATAAGCCCAGGAAAAACAGGAAGGGTCGTAAAATTAATTGA

Protein sequence

MMDVEKWLQHLTAARSKAEQETTFEEVDGLINQLNLLSNALSMSETEENVTQAISISKSLYSRRMELEPMFTKLLNDDPKTEVGQMSGIKNTEDDENVNQDCNDSSPEECRGVEAVKAEPVLSQAMDKKGKAKGKNKPRKNRKGRKIN
BLAST of Cla97C11G207150 vs. NCBI nr
Match: XP_008460225.1 (PREDICTED: uncharacterized protein LOC103499108 [Cucumis melo] >XP_016902516.1 PREDICTED: uncharacterized protein LOC103499108 [Cucumis melo])

HSP 1 Score: 189.1 bits (479), Expect = 1.1e-44
Identity = 103/142 (72.54%), Postives = 116/142 (81.69%), Query Frame = 0

Query: 2    MDVEKWLQHLTAARSKAEQETTFEEVDGLINQLNLLSNALSMSETEENVTQAISISKSLY 61
            MDV+KWLQHLTAA+S  E+E   E+VDGL+N+L+LLS ALSMS+ EENVTQ ISISKSLY
Sbjct: 2660 MDVDKWLQHLTAAKSMGEKEIPLEKVDGLLNELDLLSTALSMSKPEENVTQVISISKSLY 2719

Query: 62   SRRMELEPMFTKLLNDDPKTEVGQMSGIKNTEDDENVNQDCNDSSPEECRGVEAVKAEPV 121
            SRR ELE +FTKLLNDDP+ EVGQMSGIKN E DE VN DCND SPEECRGVEAVK EPV
Sbjct: 2720 SRRTELESIFTKLLNDDPEMEVGQMSGIKNAEGDEIVNPDCNDKSPEECRGVEAVKVEPV 2779

Query: 122  LSQAMDKKGKAKGKNKPRKNRK 144
            L QAM++KGK        K+RK
Sbjct: 2780 LPQAMNQKGKXXXXXXXXKSRK 2801

BLAST of Cla97C11G207150 vs. NCBI nr
Match: KGN50803.1 (hypothetical protein Csa_5G266320 [Cucumis sativus])

HSP 1 Score: 163.3 bits (412), Expect = 6.2e-37
Identity = 90/127 (70.87%), Postives = 103/127 (81.10%), Query Frame = 0

Query: 2    MDVEKWLQHLTAARSK-AEQETTFEEVDGLINQLNLLSNALSMSETEENVTQAISISKSL 61
            MDV+KW+QHLTAA+SK AE+E   E+VDGL+N+L LLS ALSMS+ EEN T+ ISISKSL
Sbjct: 1460 MDVDKWVQHLTAAKSKAAEKEVPLEKVDGLLNELCLLSTALSMSKPEENATEVISISKSL 1519

Query: 62   YSRRMELEPMFTKLLNDDPKTEVGQMSGIKNTEDDENVNQDCNDSSPEECRGVEAVKA-- 121
            Y RR EL  +F+ LL+DDP+ EVGQMSGIKN E DENVN DCND SPEECR VEAVKA  
Sbjct: 1520 YGRRTELGSIFSNLLSDDPEMEVGQMSGIKNAEGDENVNPDCNDESPEECREVEAVKALK 1579

Query: 122  -EPVLSQ 125
             EPVL Q
Sbjct: 1580 VEPVLPQ 1586

BLAST of Cla97C11G207150 vs. NCBI nr
Match: XP_011655089.1 (PREDICTED: uncharacterized protein LOC105435477 [Cucumis sativus])

HSP 1 Score: 163.3 bits (412), Expect = 6.2e-37
Identity = 90/127 (70.87%), Postives = 103/127 (81.10%), Query Frame = 0

Query: 2    MDVEKWLQHLTAARSK-AEQETTFEEVDGLINQLNLLSNALSMSETEENVTQAISISKSL 61
            MDV+KW+QHLTAA+SK AE+E   E+VDGL+N+L LLS ALSMS+ EEN T+ ISISKSL
Sbjct: 1463 MDVDKWVQHLTAAKSKAAEKEVPLEKVDGLLNELCLLSTALSMSKPEENATEVISISKSL 1522

Query: 62   YSRRMELEPMFTKLLNDDPKTEVGQMSGIKNTEDDENVNQDCNDSSPEECRGVEAVKA-- 121
            Y RR EL  +F+ LL+DDP+ EVGQMSGIKN E DENVN DCND SPEECR VEAVKA  
Sbjct: 1523 YGRRTELGSIFSNLLSDDPEMEVGQMSGIKNAEGDENVNPDCNDESPEECREVEAVKALK 1582

Query: 122  -EPVLSQ 125
             EPVL Q
Sbjct: 1583 VEPVLPQ 1589

BLAST of Cla97C11G207150 vs. NCBI nr
Match: XP_022144470.1 (uncharacterized protein LOC111014151 [Momordica charantia])

HSP 1 Score: 117.1 bits (292), Expect = 5.1e-23
Identity = 76/177 (42.94%), Postives = 101/177 (57.06%), Query Frame = 0

Query: 2    MDVEKWLQHLTAARSKAEQETTFEEVDGLINQLNLLSNALSMSETEENVTQAISISKSLY 61
            ++VEKW  HL+AARS AE+    + VD L+N+LNLLS ALSMSE ++N+++  SISKSLY
Sbjct: 2483 INVEKWHHHLSAARSNAEEGIPLDVVDRLLNELNLLSTALSMSEPKQNISRVASISKSLY 2542

Query: 62   SRRMELEPMFTKLLNDDPKTEVGQMSGIKNTEDDENVNQDCNDSSPEECRG--------- 121
            SRR+ELEP+  KL+ DDP TE+G MSG +N ED ++  +    SSP E  G         
Sbjct: 2543 SRRIELEPILAKLVRDDPVTELGDMSGFENAEDSKHGEEVSKGSSPVEGGGLEEPVEPXX 2602

Query: 122  -----------------------------VEAVKAEPVLSQAMDKKGKAKGKNKPRK 141
                                         V+ V+ E V+SQA DKKGK K K K ++
Sbjct: 2603 XXXXXXXXXXXXXXXXXXXXXXXXXGLEPVQPVEVETVVSQATDKKGKVKKKAKEKQ 2659

BLAST of Cla97C11G207150 vs. NCBI nr
Match: XP_022942070.1 (uncharacterized protein LOC111447259 isoform X1 [Cucurbita moschata] >XP_022942071.1 uncharacterized protein LOC111447259 isoform X2 [Cucurbita moschata])

HSP 1 Score: 107.5 bits (267), Expect = 4.1e-20
Identity = 63/111 (56.76%), Postives = 81/111 (72.97%), Query Frame = 0

Query: 2    MDVEKWLQHLTAARSKAEQETTFEEVDGLINQLNLLSNALSMSETEENVTQAISISKSLY 61
            MDVE+W++HL+AARSKA++E  FE VDGL+ +LNLLS ALSMS+ +ENV+Q +SISKS+Y
Sbjct: 2637 MDVERWVKHLSAARSKADEEIRFEVVDGLVVELNLLSTALSMSDPKENVSQVVSISKSVY 2696

Query: 62   SRRMELEPMFTK----LLNDDPKTEVGQMSGIKNTEDDENVNQDCNDSSPE 109
            SRRMELEP+ ++    LL+DDP+ EV Q S      DD    QDC     E
Sbjct: 2697 SRRMELEPILSELLLLLLHDDPEVEVDQRS-----IDD----QDCEGGKAE 2738

BLAST of Cla97C11G207150 vs. TrEMBL
Match: tr|A0A1S3CD94|A0A1S3CD94_CUCME (uncharacterized protein LOC103499108 OS=Cucumis melo OX=3656 GN=LOC103499108 PE=4 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 7.0e-45
Identity = 103/142 (72.54%), Postives = 116/142 (81.69%), Query Frame = 0

Query: 2    MDVEKWLQHLTAARSKAEQETTFEEVDGLINQLNLLSNALSMSETEENVTQAISISKSLY 61
            MDV+KWLQHLTAA+S  E+E   E+VDGL+N+L+LLS ALSMS+ EENVTQ ISISKSLY
Sbjct: 2660 MDVDKWLQHLTAAKSMGEKEIPLEKVDGLLNELDLLSTALSMSKPEENVTQVISISKSLY 2719

Query: 62   SRRMELEPMFTKLLNDDPKTEVGQMSGIKNTEDDENVNQDCNDSSPEECRGVEAVKAEPV 121
            SRR ELE +FTKLLNDDP+ EVGQMSGIKN E DE VN DCND SPEECRGVEAVK EPV
Sbjct: 2720 SRRTELESIFTKLLNDDPEMEVGQMSGIKNAEGDEIVNPDCNDKSPEECRGVEAVKVEPV 2779

Query: 122  LSQAMDKKGKAKGKNKPRKNRK 144
            L QAM++KGK        K+RK
Sbjct: 2780 LPQAMNQKGKXXXXXXXXKSRK 2801

BLAST of Cla97C11G207150 vs. TrEMBL
Match: tr|A0A0A0KMH5|A0A0A0KMH5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G266320 PE=4 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 4.1e-37
Identity = 90/127 (70.87%), Postives = 103/127 (81.10%), Query Frame = 0

Query: 2    MDVEKWLQHLTAARSK-AEQETTFEEVDGLINQLNLLSNALSMSETEENVTQAISISKSL 61
            MDV+KW+QHLTAA+SK AE+E   E+VDGL+N+L LLS ALSMS+ EEN T+ ISISKSL
Sbjct: 1460 MDVDKWVQHLTAAKSKAAEKEVPLEKVDGLLNELCLLSTALSMSKPEENATEVISISKSL 1519

Query: 62   YSRRMELEPMFTKLLNDDPKTEVGQMSGIKNTEDDENVNQDCNDSSPEECRGVEAVKA-- 121
            Y RR EL  +F+ LL+DDP+ EVGQMSGIKN E DENVN DCND SPEECR VEAVKA  
Sbjct: 1520 YGRRTELGSIFSNLLSDDPEMEVGQMSGIKNAEGDENVNPDCNDESPEECREVEAVKALK 1579

Query: 122  -EPVLSQ 125
             EPVL Q
Sbjct: 1580 VEPVLPQ 1586

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008460225.11.1e-4472.54PREDICTED: uncharacterized protein LOC103499108 [Cucumis melo] >XP_016902516.1 P... [more]
KGN50803.16.2e-3770.87hypothetical protein Csa_5G266320 [Cucumis sativus][more]
XP_011655089.16.2e-3770.87PREDICTED: uncharacterized protein LOC105435477 [Cucumis sativus][more]
XP_022144470.15.1e-2342.94uncharacterized protein LOC111014151 [Momordica charantia][more]
XP_022942070.14.1e-2056.76uncharacterized protein LOC111447259 isoform X1 [Cucurbita moschata] >XP_0229420... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3CD94|A0A1S3CD94_CUCME7.0e-4572.54uncharacterized protein LOC103499108 OS=Cucumis melo OX=3656 GN=LOC103499108 PE=... [more]
tr|A0A0A0KMH5|A0A0A0KMH5_CUCSA4.1e-3770.87Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G266320 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C11G207150.1Cla97C11G207150.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 76..148
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 132..148

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C11G207150CmoCh04G000120Cucurbita moschata (Rifu)cmowmbB703
The following gene(s) are paralogous to this gene:

None