Cla97C08G147810 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C08G147810
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Copper-binding periplasmic protein
Location: Cla97Chr08: 14436328 .. 14437143 (-)
RNA-Seq Expression: Cla97C08G147810
Synteny: Cla97C08G147810
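
As a quick consistency check on the Location field, the feature length can be derived from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this), and the variable names are illustrative only.

```python
# Coordinates copied from the Location field of this record.
# Assumption: 1-based, inclusive interval (not stated by the record).
start, end = 14436328, 14437143

span_bp = end - start + 1               # inclusive interval length in bp
codons, remainder = divmod(span_bp, 3)  # whole codons and leftover bases

print(span_bp, codons, remainder)       # prints: 816 272 0
```

Under that assumption the span is 816 bp, which is 272 codons with no remainder. If the whole span is coding, that corresponds to 271 amino acids plus a stop codon, which agrees with the PANTHER alignment coordinates ending at 271 in the InterPro section.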
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTAATGGCGATCGAAGTTTGCTCTGAGATTTCCACTGTAGGAATCAGTCCTAGAATCTCGTTTTCACATGATCTGAACCAAACCGATTTGTTACCTTCTTCAAATTGCGATCGCGATCGGTTGGATTTGAGTCTTCTCGAATCCGATTTCGATTTCTGCATTGGTAATCTTCTTTTACAAGATCTCTCCTCTGCGGATGAACTTTTCTCCAATGGCAAAATTCTTCCCAAATCGATCGAACCAAACAGAGAAATTCTTAAACCTAACAAATCTCCTCTTCTAATTCCTCCAATTCCCCCCGATCCTTCCAGAAATTCAGTCTCCTCCGAGAAGAAAAGCCTTAAAGAGCTTCTATCCGCCAGTTTCGATGCCGATGAGAAACCGCAATCGAAATCCTTCTGGCAGTTCAAACGAAGTAGTAGTCTGAATTGCGAAAGCTCCAAGAGTAGAAGTTTGATTCGATCCTTGCATTTTCTGTCCAGAAGCAATTCAACTGGTTCGGCTTTGAATCCGAAACATCAATCGAATTCGAAGGATTCTCAGAGACCAAATTTGCAGAAACAGCCATCGATTTCGAGTAGTAGAAGATCGTCTTCATCATCATCTTCATCTTCTTTCTCGAACTCGTATTTCGCAAATTTGTCTACTCAAAAGCCTTCAATGAGGAAGAATTCGGGGCCGAACAATGGGAATGGAGTTCGAATTAGTCCTCTTCTCAATCTTCCTCCGCCATACATTTCTAAAGTAACTGTGAGTTTTTTTGGATTCGGTTCTCTGTTTTGTAATGGTAAAATTAAAAAGAAAAAGAAATGA

mRNA sequence

ATGCTAATGGCGATCGAAGTTTGCTCTGAGATTTCCACTGTAGGAATCAGTCCTAGAATCTCGTTTTCACATGATCTGAACCAAACCGATTTGTTACCTTCTTCAAATTGCGATCGCGATCGGTTGGATTTGAGTCTTCTCGAATCCGATTTCGATTTCTGCATTGGTAATCTTCTTTTACAAGATCTCTCCTCTGCGGATGAACTTTTCTCCAATGGCAAAATTCTTCCCAAATCGATCGAACCAAACAGAGAAATTCTTAAACCTAACAAATCTCCTCTTCTAATTCCTCCAATTCCCCCCGATCCTTCCAGAAATTCAGTCTCCTCCGAGAAGAAAAGCCTTAAAGAGCTTCTATCCGCCAGTTTCGATGCCGATGAGAAACCGCAATCGAAATCCTTCTGGCAGTTCAAACGAAGTAGTAGTCTGAATTGCGAAAGCTCCAAGAGTAGAAGTTTGATTCGATCCTTGCATTTTCTGTCCAGAAGCAATTCAACTGGTTCGGCTTTGAATCCGAAACATCAATCGAATTCGAAGGATTCTCAGAGACCAAATTTGCAGAAACAGCCATCGATTTCGAGTAGTAGAAGATCGTCTTCATCATCATCTTCATCTTCTTTCTCGAACTCGTATTTCGCAAATTTGTCTACTCAAAAGCCTTCAATGAGGAAGAATTCGGGGCCGAACAATGGGAATGGAGTTCGAATTAGTCCTCTTCTCAATCTTCCTCCGCCATACATTTCTAAAGTAACTGTGAGTTTTTTTGGATTCGGTTCTCTGTTTTGTAATGGTAAAATTAAAAAGAAAAAGAAATGA

Coding sequence (CDS)

ATGCTAATGGCGATCGAAGTTTGCTCTGAGATTTCCACTGTAGGAATCAGTCCTAGAATCTCGTTTTCACATGATCTGAACCAAACCGATTTGTTACCTTCTTCAAATTGCGATCGCGATCGGTTGGATTTGAGTCTTCTCGAATCCGATTTCGATTTCTGCATTGGTAATCTTCTTTTACAAGATCTCTCCTCTGCGGATGAACTTTTCTCCAATGGCAAAATTCTTCCCAAATCGATCGAACCAAACAGAGAAATTCTTAAACCTAACAAATCTCCTCTTCTAATTCCTCCAATTCCCCCCGATCCTTCCAGAAATTCAGTCTCCTCCGAGAAGAAAAGCCTTAAAGAGCTTCTATCCGCCAGTTTCGATGCCGATGAGAAACCGCAATCGAAATCCTTCTGGCAGTTCAAACGAAGTAGTAGTCTGAATTGCGAAAGCTCCAAGAGTAGAAGTTTGATTCGATCCTTGCATTTTCTGTCCAGAAGCAATTCAACTGGTTCGGCTTTGAATCCGAAACATCAATCGAATTCGAAGGATTCTCAGAGACCAAATTTGCAGAAACAGCCATCGATTTCGAGTAGTAGAAGATCGTCTTCATCATCATCTTCATCTTCTTTCTCGAACTCGTATTTCGCAAATTTGTCTACTCAAAAGCCTTCAATGAGGAAGAATTCGGGGCCGAACAATGGGAATGGAGTTCGAATTAGTCCTCTTCTCAATCTTCCTCCGCCATACATTTCTAAAGTAACTGTGAGTTTTTTTGGATTCGGTTCTCTGTTTTGTAATGGTAAAATTAAAAAGAAAAAGAAATGA

Protein sequence

MLMAIEVCSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRLDLSLLESDFDFCIGNLLLQDLSSADELFSNGKILPKSIEPNREILKPNKSPLLIPPIPPDPSRNSVSSEKKSLKELLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSNSKDSQRPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGVRISPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK
Homology
BLAST of Cla97C08G147810 vs. NCBI nr
Match: XP_038886056.1 (uncharacterized protein LOC120076333 [Benincasa hispida])

HSP 1 Score: 452.2 bits (1162), Expect = 3.1e-123
Identity = 248/269 (92.19%), Positives = 254/269 (94.42%), Query Frame = 0

Query: 3   MAIEVCSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRLDLSLLESDFDFCIGNLLLQD 62
           MAI+VCSEISTVGISPRISFSHDLNQTD LPSSN DRDRLDLSLLESDFDFCIGNLL QD
Sbjct: 1   MAIDVCSEISTVGISPRISFSHDLNQTDSLPSSNFDRDRLDLSLLESDFDFCIGNLLRQD 60

Query: 63  LSSADELFSNGKILPKSIEPNREILKPNKSPLLIPPIPPDPSRNSVSSEKKSLKELLSAS 122
           LSSADELFSNGKILPKSIEPNREILKPN+SP LIPPIPPDPSRNSVSSEKKSLKELLSAS
Sbjct: 61  LSSADELFSNGKILPKSIEPNREILKPNQSPPLIPPIPPDPSRNSVSSEKKSLKELLSAS 120

Query: 123 FDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSNSKDSQ 182
           FD DEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGS LNPK QS SK  +
Sbjct: 121 FDGDEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSTLNPKQQSVSK--E 180

Query: 183 RPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGVRISPLLNL 242
           RPNLQKQ SISSSRRS+SSSSSSSFSNSYF N  +QKPS+RKN GPNNGNGVRISPLLNL
Sbjct: 181 RPNLQKQASISSSRRSTSSSSSSSFSNSYFVNSCSQKPSIRKNLGPNNGNGVRISPLLNL 240

Query: 243 PPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
           PPPYISKVTVSFFGFGSLFCNGKIKKKKK
Sbjct: 241 PPPYISKVTVSFFGFGSLFCNGKIKKKKK 267
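
The percentages on each HSP summary line are ratios over the alignment length. A minimal sketch of parsing such a line and recomputing them follows; `parse_hsp_stats` is a hypothetical helper written for this illustration, not part of any BLAST tool, and the regular expression accepts whatever label precedes each ratio.

```python
import re

def parse_hsp_stats(line):
    """Map each 'Label = n/m (p%)' field to (n, m, p). Hypothetical helper."""
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in re.findall(pattern, line)
    }

# Summary line of the first HSP (Benincasa hispida hit).
line = "Identity = 248/269 (92.19%), Positives = 254/269 (94.42%), Query Frame = 0"

stats = parse_hsp_stats(line)
for label, (num, den, pct) in stats.items():
    recomputed = round(100 * num / den, 2)  # percentage from the raw counts
    print(label, num, den, pct, recomputed)
```

The "Query Frame = 0" field carries no ratio, so the pattern skips it. For this HSP the recomputed values match the reported 92.19% and 94.42%.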

BLAST of Cla97C08G147810 vs. NCBI nr
Match: XP_004151494.1 (uncharacterized protein LOC101215559 [Cucumis sativus])

HSP 1 Score: 445.7 bits (1145), Expect = 2.9e-121
Identity = 250/275 (90.91%), Positives = 255/275 (92.73%), Query Frame = 0

Query: 1   MLMAIEVCSEISTVGISPRISFSHDLNQTDLLPSSNC--DRDRLDLSLLESDFDFCIGNL 60
           MLMAI+VCSEISTVGISPRISFSHDLNQTDLLPSSNC  DRDRLDLSLLESDFDFCIGNL
Sbjct: 1   MLMAIDVCSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRDRLDLSLLESDFDFCIGNL 60

Query: 61  LLQDLSSADELFSNGKILPKSIEPNREIL-KPNKSPLLIPPIPPDPSRNSVSSEKKSLKE 120
           LLQDLSSADELFSNGKILPKSI+PNR++L KPNKS  LIPPIPPDPSRNSVSSEKKSLKE
Sbjct: 61  LLQDLSSADELFSNGKILPKSIQPNRQLLSKPNKSHRLIPPIPPDPSRNSVSSEKKSLKE 120

Query: 121 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSN 180
           LLSASFD DEKPQSKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGS LNPK QSN
Sbjct: 121 LLSASFDGDEKPQSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSVLNPKQQSN 180

Query: 181 SKDSQRPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGV-RI 240
           SKD QRPNLQKQ S SSSRRSSSSSSSSSFSNSYFAN  +QKPSMRKN G NNGNGV   
Sbjct: 181 SKDCQRPNLQKQGSSSSSRRSSSSSSSSSFSNSYFANTCSQKPSMRKNFGWNNGNGVGSS 240

Query: 241 SPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
           SPLLNLPPPYISKVTVSFFGFGSLFCNGK KKKKK
Sbjct: 241 SPLLNLPPPYISKVTVSFFGFGSLFCNGKTKKKKK 275

BLAST of Cla97C08G147810 vs. NCBI nr
Match: XP_008445637.1 (PREDICTED: uncharacterized protein LOC103488594 [Cucumis melo])

HSP 1 Score: 438.7 bits (1127), Expect = 3.6e-119
Identity = 250/277 (90.25%), Positives = 256/277 (92.42%), Query Frame = 0

Query: 1   MLMAIEVCSEISTVGISPRISFSHDLNQTDLLPSSNC--DRDRLDLSLLESDFDFCIGNL 60
           MLMAI+VCSEISTVGISPRISFSHDLNQTDLLPSSNC  DRDRLDLSLLESDFDFCIGNL
Sbjct: 1   MLMAIDVCSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRDRLDLSLLESDFDFCIGNL 60

Query: 61  LLQDLSSADELFSNGKILPKSIEPNREIL-KPNKSPLLIPPIPPDPSRNSVSSEKKSLKE 120
           LLQDLSSADELFSNGKILPKSIEPNR++L KPN+S  LIPPIPPDPSRNSVSSEKKSLKE
Sbjct: 61  LLQDLSSADELFSNGKILPKSIEPNRQVLSKPNQSHRLIPPIPPDPSRNSVSSEKKSLKE 120

Query: 121 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSN 180
           LLSASFD +EK QSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGS LNPK QSN
Sbjct: 121 LLSASFDGEEKQQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSVLNPKQQSN 180

Query: 181 SKDSQRPNLQKQPSISSSRRSSSSSSSS-SFSNSYFANLSTQKPSMRKNSGPNNGNGVRI 240
           SKD QRPNLQKQ SISSSRRSSSSSSSS SFSNSYFAN  +QK SMRKN G NNGNGV I
Sbjct: 181 SKDCQRPNLQKQASISSSRRSSSSSSSSTSFSNSYFANTCSQKHSMRKNFGWNNGNGVGI 240

Query: 241 --SPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
             SPLLNLPPPYISKVTVSFFGFGSLFCNGK KKKKK
Sbjct: 241 SSSPLLNLPPPYISKVTVSFFGFGSLFCNGKTKKKKK 277

BLAST of Cla97C08G147810 vs. NCBI nr
Match: XP_022939173.1 (uncharacterized protein LOC111445167 [Cucurbita moschata])

HSP 1 Score: 436.0 bits (1120), Expect = 2.3e-118
Identity = 244/274 (89.05%), Positives = 256/274 (93.43%), Query Frame = 0

Query: 3   MAIEVCSEISTVGISPRISFSHDLNQTDLLP-SSNCDRDRLDLSLLESDFDFCIGNLLLQ 62
           MAIEVCSEISTVGISPRISFSHDLNQ DLLP SS+C+R+RLDL+LLESDFDFCIGNLL Q
Sbjct: 1   MAIEVCSEISTVGISPRISFSHDLNQADLLPSSSDCNRERLDLTLLESDFDFCIGNLLRQ 60

Query: 63  DLSSADELFSNGKILP----KSIEPNREILKPNKSPLLIPPIPPDPSRNSVSSEKKSLKE 122
           DLSSADELFSNGKILP    KSIEPNRE+LKP +SP   PP+PPDP+RNSVSSEKKSLKE
Sbjct: 61  DLSSADELFSNGKILPVEIKKSIEPNREVLKPIQSP-PPPPVPPDPARNSVSSEKKSLKE 120

Query: 123 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSN 182
           LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPK QS 
Sbjct: 121 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKSQST 180

Query: 183 SKDSQRPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGVRIS 242
           SKD QRPNLQKQPSISSSRRSS+S S+S  +NSYFANLS+QKPSMRKNSGPNNGNGVRIS
Sbjct: 181 SKDPQRPNLQKQPSISSSRRSSTSISTS--ANSYFANLSSQKPSMRKNSGPNNGNGVRIS 240

Query: 243 PLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
           P LNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK
Sbjct: 241 PFLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 271

BLAST of Cla97C08G147810 vs. NCBI nr
Match: KAG6579211.1 (hypothetical protein SDJN03_23659, partial [Cucurbita argyrosperma subsp. sororia] >KAG7016724.1 hypothetical protein SDJN02_21834, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 433.7 bits (1114), Expect = 1.2e-117
Identity = 244/276 (88.41%), Positives = 257/276 (93.12%), Query Frame = 0

Query: 1   MLMAIEVCSEISTVGISPRISFSHDLNQTDLLP-SSNCDRDRLDLSLLESDFDFCIGNLL 60
           MLMAIEV SEISTVGISPRISFSHDLNQ DLLP SS+C+R+RLDL+LLESDFDFCIGNLL
Sbjct: 1   MLMAIEVFSEISTVGISPRISFSHDLNQADLLPSSSDCNRERLDLTLLESDFDFCIGNLL 60

Query: 61  LQDLSSADELFSNGKILP----KSIEPNREILKPNKSPLLIPPIPPDPSRNSVSSEKKSL 120
            QDLSSADELFSNGKILP    KSIEPNRE+LKP +SP   PP+PPDP+RNSVSSEKKSL
Sbjct: 61  RQDLSSADELFSNGKILPVEIKKSIEPNREVLKPIQSP-PPPPVPPDPARNSVSSEKKSL 120

Query: 121 KELLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQ 180
           KELLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPK Q
Sbjct: 121 KELLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKSQ 180

Query: 181 SNSKDSQRPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGVR 240
           S SKD Q+PNLQKQPSISSSRRSS+S S+S  +NSYFANLS+QKPSMRKNSGPNNGNGVR
Sbjct: 181 STSKDPQKPNLQKQPSISSSRRSSTSISTS--ANSYFANLSSQKPSMRKNSGPNNGNGVR 240

Query: 241 ISPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
           ISP LNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK
Sbjct: 241 ISPFLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 273

BLAST of Cla97C08G147810 vs. ExPASy TrEMBL
Match: A0A0A0KAS8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G027440 PE=4 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 1.4e-121
Identity = 250/275 (90.91%), Positives = 255/275 (92.73%), Query Frame = 0

Query: 1   MLMAIEVCSEISTVGISPRISFSHDLNQTDLLPSSNC--DRDRLDLSLLESDFDFCIGNL 60
           MLMAI+VCSEISTVGISPRISFSHDLNQTDLLPSSNC  DRDRLDLSLLESDFDFCIGNL
Sbjct: 1   MLMAIDVCSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRDRLDLSLLESDFDFCIGNL 60

Query: 61  LLQDLSSADELFSNGKILPKSIEPNREIL-KPNKSPLLIPPIPPDPSRNSVSSEKKSLKE 120
           LLQDLSSADELFSNGKILPKSI+PNR++L KPNKS  LIPPIPPDPSRNSVSSEKKSLKE
Sbjct: 61  LLQDLSSADELFSNGKILPKSIQPNRQLLSKPNKSHRLIPPIPPDPSRNSVSSEKKSLKE 120

Query: 121 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSN 180
           LLSASFD DEKPQSKSFWQFKRSSSLNCESSKSR LIRSLHFLSRSNSTGS LNPK QSN
Sbjct: 121 LLSASFDGDEKPQSKSFWQFKRSSSLNCESSKSRGLIRSLHFLSRSNSTGSVLNPKQQSN 180

Query: 181 SKDSQRPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGV-RI 240
           SKD QRPNLQKQ S SSSRRSSSSSSSSSFSNSYFAN  +QKPSMRKN G NNGNGV   
Sbjct: 181 SKDCQRPNLQKQGSSSSSRRSSSSSSSSSFSNSYFANTCSQKPSMRKNFGWNNGNGVGSS 240

Query: 241 SPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
           SPLLNLPPPYISKVTVSFFGFGSLFCNGK KKKKK
Sbjct: 241 SPLLNLPPPYISKVTVSFFGFGSLFCNGKTKKKKK 275

BLAST of Cla97C08G147810 vs. ExPASy TrEMBL
Match: A0A1S3BE16 (uncharacterized protein LOC103488594 OS=Cucumis melo OX=3656 GN=LOC103488594 PE=4 SV=1)

HSP 1 Score: 438.7 bits (1127), Expect = 1.7e-119
Identity = 250/277 (90.25%), Positives = 256/277 (92.42%), Query Frame = 0

Query: 1   MLMAIEVCSEISTVGISPRISFSHDLNQTDLLPSSNC--DRDRLDLSLLESDFDFCIGNL 60
           MLMAI+VCSEISTVGISPRISFSHDLNQTDLLPSSNC  DRDRLDLSLLESDFDFCIGNL
Sbjct: 1   MLMAIDVCSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRDRLDLSLLESDFDFCIGNL 60

Query: 61  LLQDLSSADELFSNGKILPKSIEPNREIL-KPNKSPLLIPPIPPDPSRNSVSSEKKSLKE 120
           LLQDLSSADELFSNGKILPKSIEPNR++L KPN+S  LIPPIPPDPSRNSVSSEKKSLKE
Sbjct: 61  LLQDLSSADELFSNGKILPKSIEPNRQVLSKPNQSHRLIPPIPPDPSRNSVSSEKKSLKE 120

Query: 121 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSN 180
           LLSASFD +EK QSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGS LNPK QSN
Sbjct: 121 LLSASFDGEEKQQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSVLNPKQQSN 180

Query: 181 SKDSQRPNLQKQPSISSSRRSSSSSSSS-SFSNSYFANLSTQKPSMRKNSGPNNGNGVRI 240
           SKD QRPNLQKQ SISSSRRSSSSSSSS SFSNSYFAN  +QK SMRKN G NNGNGV I
Sbjct: 181 SKDCQRPNLQKQASISSSRRSSSSSSSSTSFSNSYFANTCSQKHSMRKNFGWNNGNGVGI 240

Query: 241 --SPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
             SPLLNLPPPYISKVTVSFFGFGSLFCNGK KKKKK
Sbjct: 241 SSSPLLNLPPPYISKVTVSFFGFGSLFCNGKTKKKKK 277

BLAST of Cla97C08G147810 vs. ExPASy TrEMBL
Match: A0A6J1FKX8 (uncharacterized protein LOC111445167 OS=Cucurbita moschata OX=3662 GN=LOC111445167 PE=4 SV=1)

HSP 1 Score: 436.0 bits (1120), Expect = 1.1e-118
Identity = 244/274 (89.05%), Positives = 256/274 (93.43%), Query Frame = 0

Query: 3   MAIEVCSEISTVGISPRISFSHDLNQTDLLP-SSNCDRDRLDLSLLESDFDFCIGNLLLQ 62
           MAIEVCSEISTVGISPRISFSHDLNQ DLLP SS+C+R+RLDL+LLESDFDFCIGNLL Q
Sbjct: 1   MAIEVCSEISTVGISPRISFSHDLNQADLLPSSSDCNRERLDLTLLESDFDFCIGNLLRQ 60

Query: 63  DLSSADELFSNGKILP----KSIEPNREILKPNKSPLLIPPIPPDPSRNSVSSEKKSLKE 122
           DLSSADELFSNGKILP    KSIEPNRE+LKP +SP   PP+PPDP+RNSVSSEKKSLKE
Sbjct: 61  DLSSADELFSNGKILPVEIKKSIEPNREVLKPIQSP-PPPPVPPDPARNSVSSEKKSLKE 120

Query: 123 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSN 182
           LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPK QS 
Sbjct: 121 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKSQST 180

Query: 183 SKDSQRPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGVRIS 242
           SKD QRPNLQKQPSISSSRRSS+S S+S  +NSYFANLS+QKPSMRKNSGPNNGNGVRIS
Sbjct: 181 SKDPQRPNLQKQPSISSSRRSSTSISTS--ANSYFANLSSQKPSMRKNSGPNNGNGVRIS 240

Query: 243 PLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
           P LNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK
Sbjct: 241 PFLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 271

BLAST of Cla97C08G147810 vs. ExPASy TrEMBL
Match: A0A5D3BPP8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold109G00020 PE=4 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 4.0e-108
Identity = 231/258 (89.53%), Positives = 238/258 (92.25%), Query Frame = 0

Query: 1   MLMAIEVCSEISTVGISPRISFSHDLNQTDLLPSSNC--DRDRLDLSLLESDFDFCIGNL 60
           MLMAI+VCSEISTVGISPRISFSHDLNQTDLLPSSNC  DRDRLDLSLLESDFDFCIGNL
Sbjct: 1   MLMAIDVCSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRDRLDLSLLESDFDFCIGNL 60

Query: 61  LLQDLSSADELFSNGKILPKSIEPNREIL-KPNKSPLLIPPIPPDPSRNSVSSEKKSLKE 120
           LLQDLSSADELFSNGKILPKSIEPNR++L KPN+S  LIPPIPPDPSRNSVSSEKKSLKE
Sbjct: 61  LLQDLSSADELFSNGKILPKSIEPNRQVLSKPNQSHRLIPPIPPDPSRNSVSSEKKSLKE 120

Query: 121 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSN 180
           LLSASFD +EK QSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGS LNPK QSN
Sbjct: 121 LLSASFDGEEKQQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSVLNPKQQSN 180

Query: 181 SKDSQRPNLQKQPSISSSRRSSSSSSSS-SFSNSYFANLSTQKPSMRKNSGPNNGNGVRI 240
           SKD QRPNLQKQ SISSSRRSSSSSSSS SFSNSYFAN  +QK SMRKN G NNGNGV I
Sbjct: 181 SKDCQRPNLQKQASISSSRRSSSSSSSSTSFSNSYFANTCSQKHSMRKNFGWNNGNGVGI 240

Query: 241 --SPLLNLPPPYISKVTV 253
             SPLLNLPPPYISKVT+
Sbjct: 241 SSSPLLNLPPPYISKVTI 258

BLAST of Cla97C08G147810 vs. ExPASy TrEMBL
Match: A0A6J1DFL1 (uncharacterized protein LOC111020041 OS=Momordica charantia OX=3673 GN=LOC111020041 PE=4 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 4.4e-107
Identity = 226/276 (81.88%), Positives = 243/276 (88.04%), Query Frame = 0

Query: 3   MAIEVCSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRLDLSLLESDFDFCIGNLLLQD 62
           MAIEVCSEIS+VGISPRISFSHDLNQTDLLPSS+  RDRLDL+LLESDFDFCIGNLL+QD
Sbjct: 1   MAIEVCSEISSVGISPRISFSHDLNQTDLLPSSDGSRDRLDLTLLESDFDFCIGNLLIQD 60

Query: 63  LSSADELFSNGKILP----KSIEPNREIL-KPNKSPLLIPPIPPDPSRNSVSSEKKSLKE 122
           LSSADELFSNG+I P    KS++ N +++ KPN+S    P  PP+PSRNS+SSEKKSLKE
Sbjct: 61  LSSADELFSNGRIRPVQIKKSLDANTQLVHKPNES---APIAPPEPSRNSISSEKKSLKE 120

Query: 123 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKHQSN 182
           LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGS+LNPK Q N
Sbjct: 121 LLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSSLNPKQQPN 180

Query: 183 SKD--SQRPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGVR 242
           SKD  SQRPNLQKQPSISS +     SS+SSF NSYF N S+QK S RKN GPNNGN VR
Sbjct: 181 SKDSSSQRPNLQKQPSISSRK-----SSASSFPNSYFTNSSSQKSSTRKNFGPNNGNAVR 240

Query: 243 ISPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
           ISPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK
Sbjct: 241 ISPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKKK 268

BLAST of Cla97C08G147810 vs. TAIR 10
Match: AT1G68330.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48780.1); Has 155 Blast hits to 147 proteins in 23 species: Archae - 0; Bacteria - 0; Metazoa - 19; Fungi - 3; Plants - 126; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 133.7 bits (335), Expect = 2.3e-31
Identity = 132/291 (45.36%), Positives = 164/291 (56.36%), Query Frame = 0

Query: 3   MAIEV-CSEISTVGISPRISFSHDLNQTDLLPSSNCDRDRLDLSLLE--SDFDFCIG-NL 62
           MAI+V CSE S  GISPRISFS+DL+ TD          RLD +LL+  S+FDFC G + 
Sbjct: 1   MAIDVCCSEASGSGISPRISFSYDLDSTD------DGEVRLDSTLLDSGSEFDFCFGSSC 60

Query: 63  LLQDLSSADELFSNGKILPKSIEPNREILKPNKSPLLIPPIPPDPSRNSVSS-------- 122
            +Q++S ADELFS GKILP  I+    +  P      +P      S +S SS        
Sbjct: 61  SVQEVSPADELFSEGKILPVQIKKEESL--PQTVTFRVPRSASLSSSSSSSSSSSSSSRA 120

Query: 123 --EKKSLKE-LLSASFDADEKPQSKSFWQFKRSSSLNCESSK-SRSLIRSLHFLSRSNST 182
             +K  LKE LL+   D ++KP+   F QFKRS SLN + S+ S+ LIRS HFLSRSNST
Sbjct: 121 PEKKMRLKELLLNPESDFEDKPRG-LFLQFKRSISLNYDKSRNSKGLIRSFHFLSRSNST 180

Query: 183 GSALNPKHQSNSKDSQRP----NLQK-QPSISSSRRSSSSSSSSSFSNSYFANLSTQKPS 242
               NP      K++  P    NL K +P +   RRSSS SSSS           ++KP 
Sbjct: 181 P---NPNLDLLPKETHHPHKTHNLPKHKPPL---RRSSSLSSSS-------VPFYSKKPL 240

Query: 243 MRKNSGPNNGNGVRISPLLNLPPP-YISKVTVSFFGFGSLFCNGKIKKKKK 272
            R + G  NG GVR+SP+LN PPP +IS V   FF  GSL CNGK   K K
Sbjct: 241 GRNSFGNGNG-GVRVSPVLNFPPPAFISNVADGFFSIGSL-CNGKTNTKTK 267

BLAST of Cla97C08G147810 vs. TAIR 10
Match: AT1G67050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G38320.1); Has 617 Blast hits to 318 proteins in 80 species: Archae - 0; Bacteria - 16; Metazoa - 141; Fungi - 62; Plants - 128; Viruses - 2; Other Eukaryotes - 268 (source: NCBI BLink). )

HSP 1 Score: 127.1 bits (318), Expect = 2.1e-29
Identity = 115/284 (40.49%), Positives = 158/284 (55.63%), Query Frame = 0

Query: 3   MAIEVCSEISTVGISPRISFSHDLNQTDLLP-------SSNCDRDRLDLSLLESDFDFCI 62
           MA+++ SE S   +SPRISFS D  Q+D +P       SSN     L+ S+   DFDFCI
Sbjct: 1   MAVDLLSENS--NMSPRISFSRDFCQSDAIPIEKRPLRSSNSKPSSLNSSI---DFDFCI 60

Query: 63  ------GNLLLQDLSSADELFSNGKILPKSIEPNREILKPNKSPLLIPPIPPDPSRNSVS 122
                 G    Q   SADELFSNGKILP  I+   E  K    P  +   P    +    
Sbjct: 61  PGGVNSGESFDQGSWSADELFSNGKILPTEIKKKPEPGKKEPEPKPVKSKPDSRKQRKQP 120

Query: 123 SEKKSLKELLSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSA 182
           +E++   +++      +EK  +KSFW FKRSSSLNC S+  RSL   L  L+RSNSTGS 
Sbjct: 121 NEEQQEDDVI---ITTEEKTNTKSFWGFKRSSSLNCGSTYGRSLC-PLPLLNRSNSTGST 180

Query: 183 LNPKHQSNS-KDSQRPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGP 242
            + + QS+S K ++   LQ+     SS  SSSSS+SSS SN+ F+    +K     + G 
Sbjct: 181 SSKQKQSSSRKHNEHVKLQQ-----SSSLSSSSSASSSLSNNGFSKPPLKKSYGGYSYGS 240

Query: 243 NNGNGVRISPLLNLPPPYISKVTVSFFGFGSLFC-NGKIKKKKK 272
           + G G+R+SP++N+ P      + + FGFGS+F  NG+ K KK+
Sbjct: 241 HGGGGIRVSPVINVVP------SGNLFGFGSMFSGNGRDKNKKR 264

BLAST of Cla97C08G147810 vs. TAIR 10
Match: AT1G48780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G18300.1); Has 89 Blast hits to 89 proteins in 11 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 82.4 bits (202), Expect = 6.1e-16
Identity = 99/269 (36.80%), Positives = 123/269 (45.72%), Query Frame = 0

Query: 19  RISFSHDLNQTD------LLPSSNCDRDRLDLSLLESDFDFCIGNLL-LQDLSSADELFS 78
           RISFS DL Q+D      + PS    RD   L    SDF+F I N     D S ADE+F+
Sbjct: 9   RISFSSDLGQSDKAPPPVIEPSGLIRRDETLLDSSNSDFEFHISNSFDPGDSSPADEIFA 68

Query: 79  NGKILP-----KSIEPNREI---LKPNKSPLLIPPIPPDPSRNSVSSEKKSLKELLSASF 138
           +G ILP      S  P R     L P  S L   P+ P P      SEK++      A+ 
Sbjct: 69  DGMILPFHVTAASTVPKRLYKYELPPITSSLSPSPLSPQPLPTK-HSEKETNGRASGANS 128

Query: 139 DADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRSNSTGSALNPKH-QSNSKDSQ 198
           D++ +  SKSFW FKRSSSLNC+  K  SLI S   L+RSNSTGS  N K       ++ 
Sbjct: 129 DSEAEKSSKSFWSFKRSSSLNCDIKK--SLICSFPRLTRSNSTGSVTNSKRAMLRDVNNH 188

Query: 199 RPNLQKQPSISSSRRSSSSSSSSSFSNSYFANLSTQKPSMRKNSGPNNGNGVRISPLLNL 258
           RP                 SS SS  N+Y      QK + +K  G   G    + P+LN 
Sbjct: 189 RP-----------------SSRSSCCNAY--QFRPQKHTGKKGEG---GGSFSVIPVLNG 243

Query: 259 PPPYISKVTVSFFGFGSLFCNGKIKKKKK 272
           P         S FG GS+  +   K K K
Sbjct: 249 P---------STFGLGSILRHSNSKDKTK 243

BLAST of Cla97C08G147810 vs. TAIR 10
Match: AT3G18300.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48780.1); Has 69 Blast hits to 69 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 71.2 bits (173), Expect = 1.4e-12
Identity = 103/293 (35.15%), Positives = 130/293 (44.37%), Query Frame = 0

Query: 11  ISTVGISPRISFSHDLNQTD------LLPSSNCDRDRLDLSLLESDFDFCI-GNLLLQDL 70
           I T     R SF+ DL Q+D        PS    RD   L    SDF+F I  N    D 
Sbjct: 2   ICTESNHQRFSFAGDLGQSDKGTPMEQQPSGPVRRDTTLLDSSNSDFEFHISSNFDPGDS 61

Query: 71  SSADELFSNGKILP--------KSIEPNR----EILKPNKSPLLIPPIPPDPSRNSVSSE 130
           S ADE+F++G ILP         S  P R    E+     +P L   +PP P      S 
Sbjct: 62  SPADEIFADGMILPVLPFQVTATSTMPKRLYKYELPPIVSAPTLSSYLPPLPLPLPEHSR 121

Query: 131 KKSLKEL--------LSASFDADEKPQSKSFWQFKRSSSLNCESSKSRSLIRSLHFLSRS 190
           K S+KE           A+ D++ +  SKSFW FKRSSSLNC+  K  SLI S   L+RS
Sbjct: 122 KYSVKETRGSLNGRGSGANSDSEAEKSSKSFWSFKRSSSLNCDIKK--SLICSFPRLTRS 181

Query: 191 NSTGSALNPKHQS----NSKDSQRPNLQKQPSI--SSSRRSSSSSSSSSFSNSYFANLST 250
           NSTGS    K +     N   SQR  + + P +  SS  R  SS   SS+          
Sbjct: 182 NSTGSVAISKREMLRDINKHSSQRHGVPR-PGVNPSSHMRPPSSFCCSSY------QFRP 241

Query: 251 QKPSMRKNSGPNNGNGVRISPLLNLPPPYISKVTVSFFGFGSLFCNGKIKKKK 271
           QK + +   G   G    I+P++  P P         FG GS+    K KKKK
Sbjct: 242 QKHAGKNGGG--RGGSFWIAPVIGGPSP---------FGLGSILRLTKEKKKK 274

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity  Description
XP_038886056.1  3.1e-123  92.19     uncharacterized protein LOC120076333 [Benincasa hispida]
XP_004151494.1  2.9e-121  90.91     uncharacterized protein LOC101215559 [Cucumis sativus]
XP_008445637.1  3.6e-119  90.25     PREDICTED: uncharacterized protein LOC103488594 [Cucumis melo]
XP_022939173.1  2.3e-118  89.05     uncharacterized protein LOC111445167 [Cucurbita moschata]
KAG6579211.1    1.2e-117  88.41     hypothetical protein SDJN03_23659, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A0A0KAS8      1.4e-121  90.91     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G027440 PE=4 SV=1
A0A1S3BE16      1.7e-119  90.25     uncharacterized protein LOC103488594 OS=Cucumis melo OX=3656 GN=LOC103488594 PE=...
A0A6J1FKX8      1.1e-118  89.05     uncharacterized protein LOC111445167 OS=Cucurbita moschata OX=3662 GN=LOC1114451...
A0A5D3BPP8      4.0e-108  89.53     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1DFL1      4.4e-107  81.88     uncharacterized protein LOC111020041 OS=Momordica charantia OX=3673 GN=LOC111020...

TAIR 10
Match Name      E-value   Identity  Description
AT1G68330.1     2.3e-31   45.36     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G67050.1     2.1e-29   40.49     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G48780.1     6.1e-16   36.80     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G18300.1     1.4e-12   35.15     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term     Source Description                  Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 89..111
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 164..209
None      No IPR available  PANTHER      PTHR31722:SF33  COPPER-BINDING PERIPLASMIC PROTEIN  coord: 3..271
None      No IPR available  PANTHER      PTHR31722       OS06G0675200 PROTEIN                coord: 3..271

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C08G147810.1  Cla97C08G147810.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0071704      organic substance metabolic process
cellular_component  GO:0005576      extracellular region
molecular_function  GO:0016985      mannan endo-1,4-beta-mannosidase activity