Cla97C02G042220 (gene) Watermelon (97103) v2

Name: Cla97C02G042220
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Plant protein of unknown function (DUF868)
Location: Cla97Chr02 : 30148969 .. 30150354 (-)
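The location field can be split into its parts and turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(text):
    """Parse 'SeqID : start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("Cla97Chr02 : 30148969 .. 30150354 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 1386 bp on the minus strand of Cla97Chr02.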
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCCATCTTGTTTTAGCCATTCCTCCATCTCCTCCACACTCTCTAATGAGCCACATCCACCACAATCCCTAATTAGTTGCATTTACCAAACCAATCTCTTCAATTCTCCCACTCTTCTCACCCTCACTTGGTCTCTCTCCCTTTCCTCCCACTCTCTCTCCCTCCATTCCTCTCCCTCTCCCTCTCTCTCCACCACTATCTCTCTCTCCCCTTCTTCTTTCTCCCTTTTCTCTCCCACTTCCAAATCCATCTCCCTCCCTAACTCCCACAAGCTCAAACTCCATTGGGACTTCTCTAAAGCTAAGTACACTCCCAACTCAGCTCAACCTATTTCCTCATTTTACCTCGCCATCTCTTCTGATGCCAACCTTCAATTCTTCATTGGTGATCTGGTTGACGATTTCCTTCGACGAGCTAAGACCATTCTCCTCTTTGATCCTTCCCTACGGGAGGATTCAATACTCTTGTCCCGTCGGGAGCACGTGTTCCAGCGCCGAAATTGCTATGTTTCGAGGGCCGAGTTCTTGGGATCGCAACGTGAGATTGCAGTCGAGTTGTGCGGCGGAATCTTGAAAGTCACCGTCGATGGAGAGGTTAAACTCGTAGTCAAACGGCTTGCGTGGAAGTTTAGAGGGAATGAGAGGTTGGTTATTGAAAATTGTTATGATTTCAATTATGTTTTGATCTCTCGTAATAGTAGTAGTAGTAATGATGATGATGATGGTGGTGGTGGTAGCGAAAGATCAAATCATCGACTTTTAAAATAATAATGGAGGATTGCCTACTAAAATTTTTTAGTTTAAAAATGTAAGATAGCGAAGTTAATATTTTTAATCTTCAAGCAGAATGCAATATACGTCGATTACTGTTGAGATATACTCACTTTTAACGTCTACAATTTTCCCTATTTTCAATGTACATTAAAAAAAAATTAGAAAAATTATCTTTGAAAAACAACATTTGGATGGGAATTGAAAGATTTGATTCCATTGTTTTCTTATTTGCCCCATATTAGGTTCTTCATCAGCGGCAACGCCGTCGACTTCTTTTGGGACGTTTTCAATTGGGTCAAAACTGAAGGCGGCGGTGGCGGGCCGGGAGTTTTTGTTTTCCAAATTGGAGAAGGCGGGGTTTGGCCGGAGGTCATCGGAGCCGAGGGGAAGTTGATGAAGCGGTGCTTCTCATCGTCGGTGGCCGGAACTGGATCGACGCCGGCAGCGGCGTTTCCGGCGATGTCGCCGGCAGGTTCAAGCTCGAGTGTTTTGCAGTGGGCGGAAGAGAGCAGCAGCGACGGCGGAAGAAGCTCGTGTTCGTCGTCGTCAAGGTCGAGCGGAATCAATGGCGGATTCTCTCTATTGTTGTACGCTTGGAGGAAGAACTGA

mRNA sequence

ATGGTTCCATCTTGTTTTAGCCATTCCTCCATCTCCTCCACACTCTCTAATGAGCCACATCCACCACAATCCCTAATTAGTTGCATTTACCAAACCAATCTCTTCAATTCTCCCACTCTTCTCACCCTCACTTGGTCTCTCTCCCTTTCCTCCCACTCTCTCTCCCTCCATTCCTCTCCCTCTCCCTCTCTCTCCACCACTATCTCTCTCTCCCCTTCTTCTTTCTCCCTTTTCTCTCCCACTTCCAAATCCATCTCCCTCCCTAACTCCCACAAGCTCAAACTCCATTGGGACTTCTCTAAAGCTAAGTACACTCCCAACTCAGCTCAACCTATTTCCTCATTTTACCTCGCCATCTCTTCTGATGCCAACCTTCAATTCTTCATTGGTGATCTGGTTGACGATTTCCTTCGACGAGCTAAGACCATTCTCCTCTTTGATCCTTCCCTACGGGAGGATTCAATACTCTTGTCCCGTCGGGAGCACGTGTTCCAGCGCCGAAATTGCTATGTTTCGAGGGCCGAGTTCTTGGGATCGCAACGTGAGATTGCAGTCGAGTTGTGCGGCGGAATCTTGAAAGTCACCGTCGATGGAGAGGTTAAACTCGTAGTCAAACGGCTTGCGTGGAAGTTTAGAGGGAATGAGAGGTTCTTCATCAGCGGCAACGCCGTCGACTTCTTTTGGGACGTTTTCAATTGGGTCAAAACTGAAGGCGGCGGTGGCGGGCCGGGAGTTTTTGTTTTCCAAATTGGAGAAGGCGGGGTTTGGCCGGAGGTCATCGGAGCCGAGGGGAAGTTGATGAAGCGGTGCTTCTCATCGTCGGTGGCCGGAACTGGATCGACGCCGGCAGCGGCGTTTCCGGCGATGTCGCCGGCAGGTTCAAGCTCGAGTGTTTTGCAGTGGGCGGAAGAGAGCAGCAGCGACGGCGGAAGAAGCTCGTGTTCGTCGTCGTCAAGGTCGAGCGGAATCAATGGCGGATTCTCTCTATTGTTGTACGCTTGGAGGAAGAACTGA

Coding sequence (CDS)

ATGGTTCCATCTTGTTTTAGCCATTCCTCCATCTCCTCCACACTCTCTAATGAGCCACATCCACCACAATCCCTAATTAGTTGCATTTACCAAACCAATCTCTTCAATTCTCCCACTCTTCTCACCCTCACTTGGTCTCTCTCCCTTTCCTCCCACTCTCTCTCCCTCCATTCCTCTCCCTCTCCCTCTCTCTCCACCACTATCTCTCTCTCCCCTTCTTCTTTCTCCCTTTTCTCTCCCACTTCCAAATCCATCTCCCTCCCTAACTCCCACAAGCTCAAACTCCATTGGGACTTCTCTAAAGCTAAGTACACTCCCAACTCAGCTCAACCTATTTCCTCATTTTACCTCGCCATCTCTTCTGATGCCAACCTTCAATTCTTCATTGGTGATCTGGTTGACGATTTCCTTCGACGAGCTAAGACCATTCTCCTCTTTGATCCTTCCCTACGGGAGGATTCAATACTCTTGTCCCGTCGGGAGCACGTGTTCCAGCGCCGAAATTGCTATGTTTCGAGGGCCGAGTTCTTGGGATCGCAACGTGAGATTGCAGTCGAGTTGTGCGGCGGAATCTTGAAAGTCACCGTCGATGGAGAGGTTAAACTCGTAGTCAAACGGCTTGCGTGGAAGTTTAGAGGGAATGAGAGGTTCTTCATCAGCGGCAACGCCGTCGACTTCTTTTGGGACGTTTTCAATTGGGTCAAAACTGAAGGCGGCGGTGGCGGGCCGGGAGTTTTTGTTTTCCAAATTGGAGAAGGCGGGGTTTGGCCGGAGGTCATCGGAGCCGAGGGGAAGTTGATGAAGCGGTGCTTCTCATCGTCGGTGGCCGGAACTGGATCGACGCCGGCAGCGGCGTTTCCGGCGATGTCGCCGGCAGGTTCAAGCTCGAGTGTTTTGCAGTGGGCGGAAGAGAGCAGCAGCGACGGCGGAAGAAGCTCGTGTTCGTCGTCGTCAAGGTCGAGCGGAATCAATGGCGGATTCTCTCTATTGTTGTACGCTTGGAGGAAGAACTGA

Protein sequence

MVPSCFSHSSISSTLSNEPHPPQSLISCIYQTNLFNSPTLLTLTWSLSLSSHSLSLHSSPSPSLSTTISLSPSSFSLFSPTSKSISLPNSHKLKLHWDFSKAKYTPNSAQPISSFYLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLREDSILLSRREHVFQRRNCYVSRAEFLGSQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNWVKTEGGGGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSSVAGTGSTPAAAFPAMSPAGSSSSVLQWAEESSSDGGRSSCSSSSRSSGINGGFSLLLYAWRKN
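
The relationship between the coding sequence and the protein sequence can be checked by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and handles only unambiguous upper-case A/C/G/T input.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons and last three codons of the CDS listed above.
print(translate("ATGGTTCCATCTTGTTTT"))
print(translate("AAGAACTGA"))
```

The first call yields the leading residues of the protein (MVPSCF) and the second yields its last two residues followed by the stop codon (KN*).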
BLAST of Cla97C02G042220 vs. NCBI nr
Match: XP_004139221.2 (PREDICTED: uncharacterized protein LOC101219794 [Cucumis sativus] >KGN60828.1 hypothetical protein Csa_2G011610 [Cucumis sativus])

HSP 1 Score: 429.5 bits (1103), Expect = 1.1e-116
Identity = 294/353 (83.29%), Positives = 302/353 (85.55%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEPHPPQSLISCIYQTNLFN--SPTLLT---------XXXXXXX 60
           MVPSCFSHSSISSTLSNEP  PQSLISCIYQTNLFN   PTLLT         XXXXXXX
Sbjct: 1   MVPSCFSHSSISSTLSNEPFQPQSLISCIYQTNLFNHSPPTLLTXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPNSHKLKLHWDFSKAKYTPNSA 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXX           P+SHKLKLHWDFSKAKYTPNSA
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXLFSPTSKSISLPDSHKLKLHWDFSKAKYTPNSA 120

Query: 121 QPISSFYLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLRED-SILLSRREHVFQRRN 180
           QPISSFYLAI+ D  L FFIGDL++DF RRAKTI L DPSLRED S LLSRREHVF+RRN
Sbjct: 121 QPISSFYLAITCDGKLHFFIGDLLEDFARRAKTISLSDPSLREDYSTLLSRREHVFERRN 180

Query: 181 CYVSRAEFLGSQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFW 240
           CYVSR EFLGSQREIAVELC GILKV+VDGEVKLVVKRLAWKFRGNERFFISGNAVDFFW
Sbjct: 181 CYVSRVEFLGSQREIAVELCSGILKVSVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFW 240

Query: 241 DVFNWVKTEG--GGGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSS--VAGTGSTPAA 300
           DVFNWVK+EG  G GGPGVFVFQ+GEGGVWPEVIGAEGKLMKRC SSS   AG GSTPAA
Sbjct: 241 DVFNWVKSEGGAGSGGPGVFVFQVGEGGVWPEVIGAEGKLMKRCLSSSAAAAGIGSTPAA 300

Query: 301 AFPAMSPAGSXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
           AFPAMSPAGS      XXXXXXXXXXXXXXXXXXXXX INGGFSLLLYAWRKN
Sbjct: 301 AFPAMSPAGSNSSVLQXXXXXXXXXXXXXXXXXXXXXXINGGFSLLLYAWRKN 353
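
The Identity and Positives percentages reported for each HSP are the match counts divided by the alignment length. A minimal sketch, using the counts from the hit above; the `percent` helper is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the XP_004139221.2 HSP: 294 identical and 302 positive
# positions over an alignment of 353 columns (gaps included).
print(percent(294, 353))
print(percent(302, 353))
```

The alignment length (353) is larger than the 338-residue query because it counts gap columns as well.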

BLAST of Cla97C02G042220 vs. NCBI nr
Match: XP_008455765.1 (PREDICTED: uncharacterized protein LOC103495865 [Cucumis melo])

HSP 1 Score: 424.5 bits (1090), Expect = 3.4e-115
Identity = 307/348 (88.22%), Positives = 316/348 (90.80%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEPHPPQSLISCIYQTNLFN-SPTLLT----XXXXXXXXXXXXX 60
           MVPSCFSHSSISSTLSNEP  PQSLISCIYQTNLFN SPTLLT    XXXXXXXXXXXXX
Sbjct: 1   MVPSCFSHSSISSTLSNEPLQPQSLISCIYQTNLFNHSPTLLTXXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPNSHKLKLHWDFSKAKYTPNSAQPISSF 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX NSHKLKLHWDFSKAKYTPNSAQPISSF
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHKLKLHWDFSKAKYTPNSAQPISSF 120

Query: 121 YLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLRED--SILLSRREHVFQRRNCYVSR 180
           YLAI+ D  L FFIGDL+DDF RRAKT+ L DPSLRED  + LLSRREHVF+RRNCYVSR
Sbjct: 121 YLAITCDGKLHFFIGDLLDDFARRAKTVSLSDPSLREDYSTTLLSRREHVFERRNCYVSR 180

Query: 181 AEFLGSQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNW 240
            EFLGSQREIAVELC GILKV+VDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNW
Sbjct: 181 VEFLGSQREIAVELCSGILKVSVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNW 240

Query: 241 VKTEG--GGGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSSVA--GTGSTPAAAFPAM 300
           VK+EG  G GGPGVFVFQ+GEGG+WPEVIGAEGKLMKRC SSS A  G GSTPAAAFPAM
Sbjct: 241 VKSEGSAGAGGPGVFVFQVGEGGIWPEVIGAEGKLMKRCLSSSAAAGGIGSTPAAAFPAM 300

Query: 301 SPAGSXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
           SP   XXXXXXXXXXXXXXXXXXXXXXXXXXX INGGFSLLLYAWRKN
Sbjct: 301 SPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGGFSLLLYAWRKN 348

BLAST of Cla97C02G042220 vs. NCBI nr
Match: XP_022999579.1 (uncharacterized protein LOC111493906 [Cucurbita maxima])

HSP 1 Score: 377.1 bits (967), Expect = 6.3e-101
Identity = 273/342 (79.82%), Positives = 290/342 (84.80%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEP-HPPQSLISCIYQTNLFNSPTLLT----XXXXXXXXXXXXX 60
           M+PSCFSHS+ISSTLSN+P  PP S+ISCIYQTNLFNSPTLLT             XXXX
Sbjct: 1   MLPSCFSHSAISSTLSNDPIPPPLSIISCIYQTNLFNSPTLLTLTWSLSLSSHSLSXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPNSHKLKLHWDFSKAKYTPNSAQPISSF 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        HWDFS AKYTPNSAQPISSF
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHWDFSTAKYTPNSAQPISSF 120

Query: 121 YLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLREDSILLSRREHVFQRRNCYVSRAE 180
           YLAIS +  LQFF+GDLVD+F RR K ++     L E+S LLSRREHVF+RRN YVSRAE
Sbjct: 121 YLAISCNGKLQFFLGDLVDEFSRRVKAVV-----LPEESTLLSRREHVFERRNRYVSRAE 180

Query: 181 FLGSQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNWVK 240
           FLGS REIAVELCGG+LKV VDGEVKL VKRLAWKFRGNERFFI+GNAVDFFWDVFNWVK
Sbjct: 181 FLGSLREIAVELCGGVLKVAVDGEVKLAVKRLAWKFRGNERFFIAGNAVDFFWDVFNWVK 240

Query: 241 TEGGGGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSSVAGTGSTPAAAFPAMSPAGSX 300
           +E GG GPGVFVFQIGEGGVWPE IGAEGKLMKR  SS+ AG  STPAAAFPA+SPAG X
Sbjct: 241 SE-GGSGPGVFVFQIGEGGVWPEYIGAEGKLMKRSLSSAAAGNVSTPAAAFPALSPAGXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
           XXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSL LYAWRK+
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLSLYAWRKD 336

BLAST of Cla97C02G042220 vs. NCBI nr
Match: XP_023521544.1 (uncharacterized protein LOC111785364 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023521545.1 uncharacterized protein LOC111785364 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 370.5 bits (950), Expect = 5.9e-99
Identity = 270/342 (78.95%), Positives = 289/342 (84.50%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEP-HPPQSLISCIYQTNLFNSPTLLT----XXXXXXXXXXXXX 60
           M+PSCFSHS++SSTLSN+P  PP S+ISCIYQTNLFNSPTLLT             XXXX
Sbjct: 1   MLPSCFSHSAVSSTLSNDPIPPPLSIISCIYQTNLFNSPTLLTLTWSLSLSSHSLSXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPNSHKLKLHWDFSKAKYTPNSAQPISSF 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        HWDFS AKYTPNSAQPISSF
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHWDFSTAKYTPNSAQPISSF 120

Query: 121 YLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLREDSILLSRREHVFQRRNCYVSRAE 180
           YLAIS +  LQFF+GDL D+F RR K+++     L E+S LLSRREHVF+RRN YVSRAE
Sbjct: 121 YLAISCNGKLQFFLGDLADEFSRRVKSVV-----LPEESTLLSRREHVFERRNRYVSRAE 180

Query: 181 FLGSQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNWVK 240
           FLGS REIAVELCGG+LKV +DGEVKL VKRLAWKFRGNERFFI+G AVDFFWDVFNWVK
Sbjct: 181 FLGSLREIAVELCGGVLKVAIDGEVKLAVKRLAWKFRGNERFFIAGTAVDFFWDVFNWVK 240

Query: 241 TEGGGGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSSVAGTGSTPAAAFPAMSPAGSX 300
           +E GG GPGVFVFQIGEGGVWPE IGAEGKLMKR  S S AG GSTPAAAFPA+SPAG X
Sbjct: 241 SE-GGSGPGVFVFQIGEGGVWPEYIGAEGKLMKR--SLSAAGNGSTPAAAFPALSPAGXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
           XXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSL LYAWRK+
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLSLYAWRKD 334

BLAST of Cla97C02G042220 vs. NCBI nr
Match: XP_023546799.1 (uncharacterized protein LOC111805798 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 328.9 bits (842), Expect = 2.0e-86
Identity = 253/345 (73.33%), Positives = 271/345 (78.55%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEP-HPPQSLISCIYQTNLFNSPTLLT----XXXXXXXXXXXXX 60
           M+PSCFSHS++SSTLSN+P  PP S+ISCIYQTNLFNSPTLLT             XXXX
Sbjct: 1   MLPSCFSHSAVSSTLSNDPIPPPLSIISCIYQTNLFNSPTLLTLTWSLSLSSHSLSXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPNSHKLKLHWDFSKAKYTPNSAQPISSF 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        HWDFS AKYTPNSAQPISSF
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHWDFSTAKYTPNSAQPISSF 120

Query: 121 YLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLREDSILLSRREHVFQRRNCYVSRAE 180
           YLAIS +  LQFF+GDL D+F RR K+++     L E+S LLSRREHVF+RRN YVSRAE
Sbjct: 121 YLAISCNGKLQFFLGDLADEFSRRVKSVV-----LPEESTLLSRREHVFERRNRYVSRAE 180

Query: 181 FLGSQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNWVK 240
           FLGS REIAVELCGG+LKV +DGEVKL VKRLAWKFRGNERFFI+GNAVDFFWDVFNWVK
Sbjct: 181 FLGSLREIAVELCGGVLKVAIDGEVKLAVKRLAWKFRGNERFFIAGNAVDFFWDVFNWVK 240

Query: 241 TEGGGGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCF---SSSVAGTGSTPAAAFPAMSPA 300
           +E GG GPGVFVFQIGEGGVWPE IGAEGKLMKR                          
Sbjct: 241 SE-GGSGPGVFVFQIGEGGVWPEYIGAEGKLMKRSLXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 GSXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
             XXXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSL LYAWRK+
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLSLYAWRKD 339

BLAST of Cla97C02G042220 vs. TrEMBL
Match: tr|A0A0A0LG84|A0A0A0LG84_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G011610 PE=4 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 7.0e-117
Identity = 294/353 (83.29%), Positives = 302/353 (85.55%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEPHPPQSLISCIYQTNLFN--SPTLLT---------XXXXXXX 60
           MVPSCFSHSSISSTLSNEP  PQSLISCIYQTNLFN   PTLLT         XXXXXXX
Sbjct: 1   MVPSCFSHSSISSTLSNEPFQPQSLISCIYQTNLFNHSPPTLLTXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPNSHKLKLHWDFSKAKYTPNSA 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXX           P+SHKLKLHWDFSKAKYTPNSA
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXLFSPTSKSISLPDSHKLKLHWDFSKAKYTPNSA 120

Query: 121 QPISSFYLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLRED-SILLSRREHVFQRRN 180
           QPISSFYLAI+ D  L FFIGDL++DF RRAKTI L DPSLRED S LLSRREHVF+RRN
Sbjct: 121 QPISSFYLAITCDGKLHFFIGDLLEDFARRAKTISLSDPSLREDYSTLLSRREHVFERRN 180

Query: 181 CYVSRAEFLGSQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFW 240
           CYVSR EFLGSQREIAVELC GILKV+VDGEVKLVVKRLAWKFRGNERFFISGNAVDFFW
Sbjct: 181 CYVSRVEFLGSQREIAVELCSGILKVSVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFW 240

Query: 241 DVFNWVKTEG--GGGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSS--VAGTGSTPAA 300
           DVFNWVK+EG  G GGPGVFVFQ+GEGGVWPEVIGAEGKLMKRC SSS   AG GSTPAA
Sbjct: 241 DVFNWVKSEGGAGSGGPGVFVFQVGEGGVWPEVIGAEGKLMKRCLSSSAAAAGIGSTPAA 300

Query: 301 AFPAMSPAGSXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
           AFPAMSPAGS      XXXXXXXXXXXXXXXXXXXXX INGGFSLLLYAWRKN
Sbjct: 301 AFPAMSPAGSNSSVLQXXXXXXXXXXXXXXXXXXXXXXINGGFSLLLYAWRKN 353

BLAST of Cla97C02G042220 vs. TrEMBL
Match: tr|A0A1S3C1L8|A0A1S3C1L8_CUCME (uncharacterized protein LOC103495865 OS=Cucumis melo OX=3656 GN=LOC103495865 PE=4 SV=1)

HSP 1 Score: 424.5 bits (1090), Expect = 2.3e-115
Identity = 307/348 (88.22%), Positives = 316/348 (90.80%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEPHPPQSLISCIYQTNLFN-SPTLLT----XXXXXXXXXXXXX 60
           MVPSCFSHSSISSTLSNEP  PQSLISCIYQTNLFN SPTLLT    XXXXXXXXXXXXX
Sbjct: 1   MVPSCFSHSSISSTLSNEPLQPQSLISCIYQTNLFNHSPTLLTXXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPNSHKLKLHWDFSKAKYTPNSAQPISSF 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX NSHKLKLHWDFSKAKYTPNSAQPISSF
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHKLKLHWDFSKAKYTPNSAQPISSF 120

Query: 121 YLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLRED--SILLSRREHVFQRRNCYVSR 180
           YLAI+ D  L FFIGDL+DDF RRAKT+ L DPSLRED  + LLSRREHVF+RRNCYVSR
Sbjct: 121 YLAITCDGKLHFFIGDLLDDFARRAKTVSLSDPSLREDYSTTLLSRREHVFERRNCYVSR 180

Query: 181 AEFLGSQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNW 240
            EFLGSQREIAVELC GILKV+VDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNW
Sbjct: 181 VEFLGSQREIAVELCSGILKVSVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNW 240

Query: 241 VKTEG--GGGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSSVA--GTGSTPAAAFPAM 300
           VK+EG  G GGPGVFVFQ+GEGG+WPEVIGAEGKLMKRC SSS A  G GSTPAAAFPAM
Sbjct: 241 VKSEGSAGAGGPGVFVFQVGEGGIWPEVIGAEGKLMKRCLSSSAAAGGIGSTPAAAFPAM 300

Query: 301 SPAGSXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
           SP   XXXXXXXXXXXXXXXXXXXXXXXXXXX INGGFSLLLYAWRKN
Sbjct: 301 SPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGGFSLLLYAWRKN 348

BLAST of Cla97C02G042220 vs. TrEMBL
Match: tr|M5W225|M5W225_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G339700 PE=4 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 3.1e-64
Identity = 148/341 (43.40%), Positives = 196/341 (57.48%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEPHPPQSLISCIYQTNLFNSPTLLTXXXXXXXXXXXXXXXXXX 60
           M+P+CFS     +TLS+    PQ+LI+CIYQT L NSPT LT                  
Sbjct: 1   MIPACFSQ---PNTLSSSSQVPQNLITCIYQTQLCNSPTYLTLTWSKNLFSHSLTIHASD 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXPNSH--KLKLHWDFSKAKYTPNSAQPISSFYLA 120
                                       ++H  K+KL+WDF++A +T NSA+P S FY+A
Sbjct: 61  SFSITISLNPSTFSFFRTKPGSKSISLTHNHCQKIKLYWDFTRADFTHNSAEPESCFYIA 120

Query: 121 ISSDANLQFFIGDLVDDFLRRAKTILLFDPSLREDSILLSRREHVFQRRNCYVSRAEFLG 180
           IS +A ++FF+GDL+D+F RR+  ++    S      LLSRREHVF RRN Y+SRA+FLG
Sbjct: 121 ISCNAKIEFFLGDLLDEFTRRSGVVVTHKLS---QPALLSRREHVFGRRN-YISRAQFLG 180

Query: 181 SQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNWVKTEG 240
           S+ EI +E   G LKV VDG + L+VKRLAWKFRGNER F+ G   +F+WDVFNWV    
Sbjct: 181 SKHEIGIECSEGTLKVKVDGVISLIVKRLAWKFRGNERIFVGGVEFEFYWDVFNWVNNHN 240

Query: 241 G--GGGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSSVAGTGSTPAAAFPAMSPAGSXX 300
           G  G G GVFVFQ+G+GGVWPE++G E +LM++  S+S A + S P+    + SP+ S  
Sbjct: 241 GANGNGHGVFVFQVGDGGVWPEMVGPEKRLMRKSLSTS-AASASMPSMTSLSPSPSCS-- 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
                                    G NGGFSLLLYAW+++
Sbjct: 301 -SVLQWAEESSDGGRSSCSSSTRSYGSNGGFSLLLYAWKRD 330

BLAST of Cla97C02G042220 vs. TrEMBL
Match: tr|B9RES4|B9RES4_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_1428180 PE=4 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 3.1e-64
Identity = 147/342 (42.98%), Positives = 193/342 (56.43%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEPHPPQSLISCIYQTNLFNSPTLLTXXXXXXXXXXXXXXXXXX 60
           M+P+CFSH    +TLS+    PQSLI+CIYQT   NSPT LT                  
Sbjct: 2   MIPACFSH---PNTLSSASQLPQSLITCIYQTQFCNSPTYLTLSWSKTLFSHSLTVFASD 61

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXPNSH--KLKLHWDFSKAKYTPNSAQPISSFYLA 120
                                       + H  ++KL+WDF++A++T NSA+P S FY+A
Sbjct: 62  SFSITIPLYPSTFSFFRNKPGSKSICLTHHHYQRIKLYWDFTRAQFTHNSAEPDSCFYIA 121

Query: 121 ISSDANLQFFIGDLVDDFLRRAKTILLFDPSLREDSILLSRREHVFQRRNCYVSRAEFLG 180
           I  +A L+FF+GDL  +  RR  + L+    +  +  LLSRREHVF  ++ Y SRAEFLG
Sbjct: 122 IICNARLEFFLGDLQSELTRRVSSGLVLTRQVVAEPTLLSRREHVFGHKS-YASRAEFLG 181

Query: 181 SQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNWVKTEG 240
           S+ EI +E  GG L V VDGE+ LVVKRLAWKFRGNERFF+ G  V+FFWDVFNWV    
Sbjct: 182 SKHEIGIECNGGALIVKVDGEISLVVKRLAWKFRGNERFFVGGMEVEFFWDVFNWVNDSN 241

Query: 241 GG--GGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSSVAGTGSTPAAAFPAMSPAGSXX 300
           G      GVF+FQ+G+GGVWPE++G E +L+++  SSSV  + +   AA  ++SP+ S  
Sbjct: 242 GNXXXXXGVFIFQVGDGGVWPEMVGLEKRLIRKSLSSSVGQSQTLMPAAMASLSPSPSCS 301

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXGI-NGGFSLLLYAWRKN 338
                                    G  NGGFSLLLYAWR++
Sbjct: 302 SVLQWAEESSDCGRSSCSSSTTRSCGSNNGGFSLLLYAWRRD 339

BLAST of Cla97C02G042220 vs. TrEMBL
Match: tr|A0A061F380|A0A061F380_THECC (Sulfate/thiosulfate import ATP-binding protein cysA OS=Theobroma cacao OX=3641 GN=TCM_026400 PE=4 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 4.1e-64
Identity = 147/341 (43.11%), Positives = 195/341 (57.18%), Query Frame = 0

Query: 1   MVPSCFSHSSISSTLSNEPHPPQSLISCIYQTNLFNSPTLLTXXXXXXXXXXXXXXXXXX 60
           M+P+CFSH    +TLS+    PQ+LI+CIYQT L NSPT LT                  
Sbjct: 1   MIPACFSH---PNTLSSSSQLPQNLITCIYQTQLCNSPTYLTLTWSKTLFSHSLTIYAAD 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXPNSH--KLKLHWDFSKAKYTPNSAQPISSFYLA 120
                                       + H  ++KL+WDF++A +  NSA+P S FY+A
Sbjct: 61  SFSITISLYPSTFSFFRNRPGSKSIYLTHHHYQRIKLYWDFTRADFAENSAEPESCFYIA 120

Query: 121 ISSDANLQFFIGDLVDDFLRRAKTILLFDPSLREDSILLSRREHVFQRRNCYVSRAEFLG 180
           IS +A L+FF+GDL ++  RR+   L+    +  +  LLSRREHVF RR+ Y+SRA+FLG
Sbjct: 121 ISCNARLEFFLGDLQEELTRRSG--LVIARQVLPEPTLLSRREHVFGRRS-YISRAKFLG 180

Query: 181 SQREIAVELCGGILKVTVDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNWVKTEG 240
           S+ EI +E  GG+LKV VDGE  LV+KRLAWKFRGNER +++G  V+FFWDVFNWV ++ 
Sbjct: 181 SKHEIGIECSGGVLKVKVDGETSLVIKRLAWKFRGNERIYVNGIEVEFFWDVFNWVSSDN 240

Query: 241 GG--GGPGVFVFQIGEGGVWPEVIGAEGKLMKRCFSSSVAGTGSTPAAAFPAMSPAGSXX 300
                G GVF+FQ+G+GGVWPE+IG E +LM++  SS+    GS P      +SP+ S  
Sbjct: 241 NSNTNGHGVFIFQVGDGGVWPEMIGPEKRLMRKSLSSA---AGSAPKMPSTTLSPSPS-C 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXGINGGFSLLLYAWRKN 338
                                    G NGGFSLLLYAW K+
Sbjct: 301 SSVLQWAEESSDGGRSSCSSSTRSYGSNGGFSLLLYAWNKD 331

BLAST of Cla97C02G042220 vs. TAIR10
Match: AT5G11000.1 (Plant protein of unknown function (DUF868))

HSP 1 Score: 97.1 bits (240), Expect = 2.3e-20
Identity = 58/148 (39.19%), Positives = 84/148 (56.76%), Query Frame = 0

Query: 90  SHKLKLHWDFSKAKYTPNSAQPISSFYLAISSDANLQFFIGDLVDDFLRRAKTILLFDPS 149
           S K+++ WD SKAK+   S +P S FY+A+  D  +   +GD V +   RAK+     P 
Sbjct: 110 SPKIQVFWDLSKAKFDSGS-EPRSGFYIAVVVDGEMGLLVGDSVKEAYARAKSA---KPP 169

Query: 150 LREDSILLSRREHVFQRRNCYVSRAEFLGSQREIAVELC---GGILKVTVDGEVKLVVKR 209
               ++LL R+EHVF  R  + ++A F G  REI+++        L  +VD +  L +KR
Sbjct: 170 TNPQALLL-RKEHVFGAR-VFTTKARFGGKNREISIDCRVDEDAKLCFSVDSKQVLQIKR 229

Query: 210 LAWKFRGNERFFISGNAVDFFWDVFNWV 235
           L WKFRGNE+  I G  V   WDV+NW+
Sbjct: 230 LRWKFRGNEKVEIDGVHVQISWDVYNWL 251

BLAST of Cla97C02G042220 vs. TAIR10
Match: AT2G27770.1 (Plant protein of unknown function (DUF868))

HSP 1 Score: 92.0 bits (227), Expect = 7.3e-19
Identity = 56/178 (31.46%), Positives = 85/178 (47.75%), Query Frame = 0

Query: 92  KLKLHWDFSKAKYTPN--SAQPISSFYLAISSDANLQFFIGDLVDDFLRRAKTILLFDPS 151
           K+++ WD S AKY  N    +PI+ FY+ +  D  +   +GD  ++ LR+      F   
Sbjct: 116 KIEVFWDLSSAKYDSNLCGPEPINGFYVIVLVDGQMGLLLGDSSEETLRKKG----FSGD 175

Query: 152 LREDSILLSRREHVFQRRNCYVSRAEFL--GSQREIAVELCG------------GILKVT 211
           +  D  L+SR+EH       Y ++  F+  G   EI +  C              +L V 
Sbjct: 176 IGFDFSLVSRQEHFTGNNTFYSTKVRFVETGDSHEIVIR-CNKETEGLKQSNHYPVLSVC 235

Query: 212 VDGEVKLVVKRLAWKFRGNERFFISGNAVDFFWDVFNWVKTEGGGGGPGVFVFQIGEG 254
           +D +  + VKRL W FRGN+  F+ G  VD  WDV +W  +  G  G  VF+F+   G
Sbjct: 236 IDKKTVIKVKRLQWNFRGNQTIFLDGLLVDLMWDVHDWFFSNQGACGRAVFMFRTRNG 288

BLAST of Cla97C02G042220 vs. TAIR10
Match: AT2G04220.1 (Plant protein of unknown function (DUF868))

HSP 1 Score: 91.7 bits (226), Expect = 9.5e-19
Identity = 57/166 (34.34%), Positives = 95/166 (57.23%), Query Frame = 0

Query: 93  LKLHWDFSKAKYTPNSAQPISSFYLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSLRE 152
           ++++WDF  AK+T +S +P S FY+A+ S+  +   +GD      +R K+     P+L E
Sbjct: 99  VEVYWDFRSAKFT-SSPEPSSDFYVALVSEEEVVLLVGDYKKKAFKRTKS----RPALVE 158

Query: 153 DSILLSRREHVFQRRNCYVSRAEFLG--SQREIAVELCGGILK-----VTVDGEVKLVVK 212
            + L  ++E+VF ++ C+ +RA+F     + EI VE      K     +++DG V + VK
Sbjct: 159 -AALFYKKENVFGKK-CFTTRAKFYDRKKEHEIIVESSTSGPKEPEMWISIDGIVLIQVK 218

Query: 213 RLAWKFRGNERFFISGNAVDFFWDVFNWVKTEGGGGGPGVFVFQIG 252
            L WKFRGN+   +    V  FWDV++W+ +   G G G+F+F+ G
Sbjct: 219 NLQWKFRGNQTVLVDKQPVQVFWDVYDWLFSM-PGTGHGLFIFKPG 256

BLAST of Cla97C02G042220 vs. TAIR10
Match: AT4G12690.1 (Plant protein of unknown function (DUF868))

HSP 1 Score: 85.5 bits (210), Expect = 6.8e-17
Identity = 56/169 (33.14%), Positives = 94/169 (55.62%), Query Frame = 0

Query: 91  HKLKLHWDFSKAKYTPNSAQPISSFYLAISSDANLQFFIGDLVDDFLRRAKTILLFDPSL 150
           +++ ++WDF  AK+     +P S FY+A+ S+  +   +GD      +R K+     PSL
Sbjct: 93  NQVDVYWDFRSAKFN-GGPEPSSDFYVALVSEEEVVLLLGDHKKKAFKRTKS----RPSL 152

Query: 151 REDSILLSRREHVFQRRNCYVSRAEFLGSQR--EIAVELCGGI----LKVTVDGEVKLVV 210
             D+ L  ++E+VF ++  + +RA+F   +R  EI VE   G     + ++VDG V + V
Sbjct: 153 -VDAALFYKKENVFGKK-IFSTRAKFHDRKREHEIVVESSTGAKEPEMWISVDGIVLVQV 212

Query: 211 KRLAWKFRGNERFFISGNAVDFFWDVFNWVKTEGGGGGPGVFVFQIGEG 254
           + L WKFRGN+   +    V  FWDV++W+ +   G G G+F+F+   G
Sbjct: 213 RNLQWKFRGNQTVLVDKEPVQVFWDVYDWLFST-PGTGHGLFIFKPESG 253

BLAST of Cla97C02G042220 vs. TAIR10
Match: AT3G13229.1 (Plant protein of unknown function (DUF868))

HSP 1 Score: 84.7 bits (208), Expect = 1.2e-16
Identity = 60/170 (35.29%), Positives = 95/170 (55.88%), Query Frame = 0

Query: 89  NSHKLKLHWDFSKAKYTPNSAQPISSFYLAISSDANLQFFIGDLVDDFLRRAKTILLFDP 148
           N  ++ ++WDF +AK++ N  +P S FY+++ S       IGDL ++ L+R K     +P
Sbjct: 75  NGTRVDVYWDFRQAKFS-NFPEPSSGFYVSLVSQNATVLTIGDLRNEALKRTKK----NP 134

Query: 149 SLREDSILLSRREHVFQRRNCYVSRAEFLGSQR---EIAVE--LCGGI---LKVTVDGEV 208
           S  E + L+S++EHV  +R  Y   A   G  R   E+ +E  L G     + +TVDG  
Sbjct: 135 SATE-AALVSKQEHVHGKRVFYTRTAFGGGESRRENEVVIETSLSGPSDPEMWITVDGVP 194

Query: 209 KLVVKRLAWKFRGNERFFIS-GNAVDFFWDVFNWVKTEGGGGGPGVFVFQ 250
            + +  L W+FRGNE   +S G +++ FWDV +W+  E  G   G+FVF+
Sbjct: 195 AIRIMNLNWRFRGNEVVTVSDGVSLEIFWDVHDWL-FEPSGSSSGLFVFK 237

The following BLAST results are available for this feature:
Match Name                      E-value   Identity  Description
XP_004139221.2                  1.1e-116  83.29     PREDICTED: uncharacterized protein LOC101219794 [Cucumis sativus] >KGN60828.1 hy...
XP_008455765.1                  3.4e-115  88.22     PREDICTED: uncharacterized protein LOC103495865 [Cucumis melo]
XP_022999579.1                  6.3e-101  79.82     uncharacterized protein LOC111493906 [Cucurbita maxima]
XP_023521544.1                  5.9e-99   78.95     uncharacterized protein LOC111785364 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
XP_023546799.1                  2.0e-86   73.33     uncharacterized protein LOC111805798 [Cucurbita pepo subsp. pepo]

Match Name                      E-value   Identity  Description
tr|A0A0A0LG84|A0A0A0LG84_CUCSA  7.0e-117  83.29     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G011610 PE=4 SV=1
tr|A0A1S3C1L8|A0A1S3C1L8_CUCME  2.3e-115  88.22     uncharacterized protein LOC103495865 OS=Cucumis melo OX=3656 GN=LOC103495865 PE=...
tr|M5W225|M5W225_PRUPE          3.1e-64   43.40     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G339700 PE=4 SV=1
tr|B9RES4|B9RES4_RICCO          3.1e-64   42.98     Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_1428180 PE=4 SV=1
tr|A0A061F380|A0A061F380_THECC  4.1e-64   43.11     Sulfate/thiosulfate import ATP-binding protein cysA OS=Theobroma cacao OX=3641 G...

Match Name                      E-value   Identity  Description
AT5G11000.1                     2.3e-20   39.19     Plant protein of unknown function (DUF868)
AT2G27770.1                     7.3e-19   31.46     Plant protein of unknown function (DUF868)
AT2G04220.1                     9.5e-19   34.34     Plant protein of unknown function (DUF868)
AT4G12690.1                     6.8e-17   33.14     Plant protein of unknown function (DUF868)
AT3G13229.1                     1.2e-16   35.29     Plant protein of unknown function (DUF868)

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR008586  DUF868_pln
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005524 ATP binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C02G042220.1  Cla97C02G042220.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                            Source       Source Term    Source Description   Alignment
IPR008586  Protein of unknown function DUF868, plant  PFAM         PF05910        DUF868               coord: 22..335; e-value: 5.7E-66; score: 222.9
IPR008586  Protein of unknown function DUF868, plant  PANTHER      PTHR31972      FAMILY NOT NAMED     coord: 1..337
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 1..21
None       No IPR available                           PANTHER      PTHR31972:SF6  SUBFAMILY NOT NAMED  coord: 1..337