Clc01G16740 (gene) Watermelon (cordophanus) v2

Overview
NameClc01G16740
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionSMI1_KNR4 domain-containing protein
LocationClcChr01: 29522136 .. 29523670 (+)
RNA-Seq ExpressionClc01G16740
SyntenyClc01G16740
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTATAAGTACCTCTACCTTACCTTCCTTTTCTTTCCGAATTCTTCCCCAACTTATACAACTTTCCTCATAAATCCCTCTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTATTTATCACTGCTCCATCCATTCACTTCTGCCTTCCAATGGTGGACGTCGACCGCAGGATGGCCGGTCTCAATCCGGCCCATATCGCTGGACTGAGACGCCTCTCTGCTCGGGCCGCCGCTGTCACTCCTTCTCACCCTTCCCGCGCCGGCCTTCTCTCCTTCTCTTCTCTCGCTGACAAGGTCATCACCCATTTGCGCAACACCGGCGTAGAGGTCCAATCCGGCCTCTCCGTCGCCGAGTTCGCTCGAGCTGAGGCCGAGTTTGGCTTCGTTTTCCCTCCGGATCTCCGAGCCGTACTTTCCGCTGGTTTACCCGTCGGTCCTGGGTTCCCCGATTGGCGAGCCTCTGGTGCCAGGCAGCATCTTAGAGCCACGCTCGATCTTCCTATTGCGGCCATTAGTTTCCAAATCGCCAAGAATACGTTCTGGTCTAAGTCTTGGGGTCCCAGGCCCTTAGACCCGGAAAAGGCCTTGCGGGTCGCTAGAAATGCTCTGAAGAGAGCGCCTCTTTTGATTCCCCTCTTCAATCATTGCTACATTCCCTGCAACCCTTCTCTGGCGGGGAACCCAATCTTCTCCGTCGATGAGAATCGGATCTCATTTTGCGGTCTGGATCTATCCGATTTCTTCGAGCGGGAATTCCTTTTCCGGAGCTCCGAATCTGATGCCCACCTTCTCAAAAAGCAAAGGTCCATTAGTGAAAAATCTGCGGGCTCCTCCTCTAACTTCTCTCGACGGAGTCTGGACACCGGAGCAAGAACGCCGAGGTGGGTTGAATTTTGGAGCGACGCCGTGGTGGACCGGCGGCGAAGAAACTCGTCGTCGTCGTCATCTTCGTCGCCGGATAGGGTAATCGAGATGCCGAGGTCGGGAATTCCGAAATGGGTAAACGAATACATAGAAGAAATAGGATCGACTTTGAGAGAAGGGGGATGGAGCGAAACGGACATCACAGACATAGCACAGGTTTCCGCGTCCGGATTCTTCGAAGGAGCAGCGATGGTATTAGTAGACAACCAAGCGGTTCTAGACGCTCTGCTTCTAAAAACGGATCGGTTTTCGGACGTTCTCCGGAAAGCGGGGTGGAGCTCGGAAGAAGTGTCGTACGCTCTGGGATTCGATCATCGACCGGAAAAGGAACGAAAACCGGCAAAGAAGTTATCCCCAGAACTGGTAGAAAGAATCGGGAAACTGGCGGAGTCGGTTACTCGGTCATAGTACGGGCATCCCAATCTCAATCCCCATCTCCCCAGATTCCTCACTTCGCTTCTTTTTAAAATTTCAAGGTGTTGGGTATACCATCTTCTGCCCTCCAGTCATTACTATTTCTTTCTTTTTTTTTTTCTTTTTTTCTTTTTTTAATATATATATATATATATATATATATATATATATATTATTATTATATAAAT

mRNA sequence

CTATAAGTACCTCTACCTTACCTTCCTTTTCTTTCCGAATTCTTCCCCAACTTATACAACTTTCCTCATAAATCCCTCTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTATTTATCACTGCTCCATCCATTCACTTCTGCCTTCCAATGGTGGACGTCGACCGCAGGATGGCCGGTCTCAATCCGGCCCATATCGCTGGACTGAGACGCCTCTCTGCTCGGGCCGCCGCTGTCACTCCTTCTCACCCTTCCCGCGCCGGCCTTCTCTCCTTCTCTTCTCTCGCTGACAAGGTCATCACCCATTTGCGCAACACCGGCGTAGAGGTCCAATCCGGCCTCTCCGTCGCCGAGTTCGCTCGAGCTGAGGCCGAGTTTGGCTTCGTTTTCCCTCCGGATCTCCGAGCCGTACTTTCCGCTGGTTTACCCGTCGGTCCTGGGTTCCCCGATTGGCGAGCCTCTGGTGCCAGGCAGCATCTTAGAGCCACGCTCGATCTTCCTATTGCGGCCATTAGTTTCCAAATCGCCAAGAATACGTTCTGGTCTAAGTCTTGGGGTCCCAGGCCCTTAGACCCGGAAAAGGCCTTGCGGGTCGCTAGAAATGCTCTGAAGAGAGCGCCTCTTTTGATTCCCCTCTTCAATCATTGCTACATTCCCTGCAACCCTTCTCTGGCGGGGAACCCAATCTTCTCCGTCGATGAGAATCGGATCTCATTTTGCGGTCTGGATCTATCCGATTTCTTCGAGCGGGAATTCCTTTTCCGGAGCTCCGAATCTGATGCCCACCTTCTCAAAAAGCAAAGGTCCATTAGTGAAAAATCTGCGGGCTCCTCCTCTAACTTCTCTCGACGGAGTCTGGACACCGGAGCAAGAACGCCGAGGTGGGTTGAATTTTGGAGCGACGCCGTGGTGGACCGGCGGCGAAGAAACTCGTCGTCGTCGTCATCTTCGTCGCCGGATAGGGTAATCGAGATGCCGAGGTCGGGAATTCCGAAATGGGTAAACGAATACATAGAAGAAATAGGATCGACTTTGAGAGAAGGGGGATGGAGCGAAACGGACATCACAGACATAGCACAGGTTTCCGCGTCCGGATTCTTCGAAGGAGCAGCGATGGTATTAGTAGACAACCAAGCGGTTCTAGACGCTCTGCTTCTAAAAACGGATCGGTTTTCGGACGTTCTCCGGAAAGCGGGGTGGAGCTCGGAAGAAGTGTCGTACGCTCTGGGATTCGATCATCGACCGGAAAAGGAACGAAAACCGGCAAAGAAGTTATCCCCAGAACTGGTAGAAAGAATCGGGAAACTGGCGGAGTCGGTTACTCGGTCATAGTACGGGCATCCCAATCTCAATCCCCATCTCCCCAGATTCCTCACTTCGCTTCTTTTTAAAATTTCAAGGTGTTGGGTATACCATCTTCTGCCCTCCAGTCATTACTATTTCTTTCTTTTTTTTTTTCTTTTTTTCTTTTTTTAATATATATATATATATATATATATATATATATATATTATTATTATATAAAT

Coding sequence (CDS)

ATGGTGGACGTCGACCGCAGGATGGCCGGTCTCAATCCGGCCCATATCGCTGGACTGAGACGCCTCTCTGCTCGGGCCGCCGCTGTCACTCCTTCTCACCCTTCCCGCGCCGGCCTTCTCTCCTTCTCTTCTCTCGCTGACAAGGTCATCACCCATTTGCGCAACACCGGCGTAGAGGTCCAATCCGGCCTCTCCGTCGCCGAGTTCGCTCGAGCTGAGGCCGAGTTTGGCTTCGTTTTCCCTCCGGATCTCCGAGCCGTACTTTCCGCTGGTTTACCCGTCGGTCCTGGGTTCCCCGATTGGCGAGCCTCTGGTGCCAGGCAGCATCTTAGAGCCACGCTCGATCTTCCTATTGCGGCCATTAGTTTCCAAATCGCCAAGAATACGTTCTGGTCTAAGTCTTGGGGTCCCAGGCCCTTAGACCCGGAAAAGGCCTTGCGGGTCGCTAGAAATGCTCTGAAGAGAGCGCCTCTTTTGATTCCCCTCTTCAATCATTGCTACATTCCCTGCAACCCTTCTCTGGCGGGGAACCCAATCTTCTCCGTCGATGAGAATCGGATCTCATTTTGCGGTCTGGATCTATCCGATTTCTTCGAGCGGGAATTCCTTTTCCGGAGCTCCGAATCTGATGCCCACCTTCTCAAAAAGCAAAGGTCCATTAGTGAAAAATCTGCGGGCTCCTCCTCTAACTTCTCTCGACGGAGTCTGGACACCGGAGCAAGAACGCCGAGGTGGGTTGAATTTTGGAGCGACGCCGTGGTGGACCGGCGGCGAAGAAACTCGTCGTCGTCGTCATCTTCGTCGCCGGATAGGGTAATCGAGATGCCGAGGTCGGGAATTCCGAAATGGGTAAACGAATACATAGAAGAAATAGGATCGACTTTGAGAGAAGGGGGATGGAGCGAAACGGACATCACAGACATAGCACAGGTTTCCGCGTCCGGATTCTTCGAAGGAGCAGCGATGGTATTAGTAGACAACCAAGCGGTTCTAGACGCTCTGCTTCTAAAAACGGATCGGTTTTCGGACGTTCTCCGGAAAGCGGGGTGGAGCTCGGAAGAAGTGTCGTACGCTCTGGGATTCGATCATCGACCGGAAAAGGAACGAAAACCGGCAAAGAAGTTATCCCCAGAACTGGTAGAAAGAATCGGGAAACTGGCGGAGTCGGTTACTCGGTCATAG

Protein sequence

MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEVQSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAAISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIFSVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGARTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGWSETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALGFDHRPEKERKPAKKLSPELVERIGKLAESVTRS
Homology
BLAST of Clc01G16740 vs. NCBI nr
Match: XP_038881140.1 (uncharacterized protein LOC120072739 [Benincasa hispida])

HSP 1 Score: 757.7 bits (1955), Expect = 5.1e-215
Identity = 385/393 (97.96%), Postives = 390/393 (99.24%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV 60
           MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV
Sbjct: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV 60

Query: 61  QSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAA 120
           Q GLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWR+SGARQHLRATLDLPIAA
Sbjct: 61  QPGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRSSGARQHLRATLDLPIAA 120

Query: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180
           ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF
Sbjct: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180

Query: 181 SVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGA 240
           SVDENRISFCGLDLSDFFEREFLFRSS+S+AHLLKKQRSISEKSAGSSSNFSRRSLDTGA
Sbjct: 181 SVDENRISFCGLDLSDFFEREFLFRSSDSNAHLLKKQRSISEKSAGSSSNFSRRSLDTGA 240

Query: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300
           RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW
Sbjct: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300

Query: 301 SETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360
           SETDIT+I QVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG
Sbjct: 301 SETDITEIVQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360

Query: 361 FDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           FD+RPEKERKPAKKLSPELVERI KLAESVTRS
Sbjct: 361 FDYRPEKERKPAKKLSPELVERIEKLAESVTRS 393

BLAST of Clc01G16740 vs. NCBI nr
Match: XP_008467037.1 (PREDICTED: uncharacterized protein LOC103504469 [Cucumis melo] >TYK12888.1 uncharacterized protein E5676_scaffold255G004820 [Cucumis melo var. makuwa])

HSP 1 Score: 755.7 bits (1950), Expect = 1.9e-214
Identity = 381/393 (96.95%), Postives = 389/393 (98.98%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV 60
           MVDVDRRMAGLNPAH+AGLRRLSARAAAVTPSHPSRAGLLSFSSLAD VITHLRNTGVEV
Sbjct: 1   MVDVDRRMAGLNPAHVAGLRRLSARAAAVTPSHPSRAGLLSFSSLADNVITHLRNTGVEV 60

Query: 61  QSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAA 120
           Q+GLS+AEFARAEAEFGFVFPPDLRAVLSAGLP+GPGFPDWR+SGARQHLRATLDLPIAA
Sbjct: 61  QNGLSIAEFARAEAEFGFVFPPDLRAVLSAGLPIGPGFPDWRSSGARQHLRATLDLPIAA 120

Query: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180
           ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF
Sbjct: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180

Query: 181 SVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGA 240
           SVDENRISFCGLDLSDFFEREFLFRSS+SDAH LKKQRSISEKSAGSSSNFSRRSLDTGA
Sbjct: 181 SVDENRISFCGLDLSDFFEREFLFRSSQSDAHHLKKQRSISEKSAGSSSNFSRRSLDTGA 240

Query: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300
           RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW
Sbjct: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300

Query: 301 SETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360
           SETDIT+I +VSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG
Sbjct: 301 SETDITEIVEVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360

Query: 361 FDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           FDHR EKERKPAKKLSPELVERIGKLAESVTRS
Sbjct: 361 FDHRAEKERKPAKKLSPELVERIGKLAESVTRS 393

BLAST of Clc01G16740 vs. NCBI nr
Match: XP_004141914.1 (uncharacterized protein LOC101216316 [Cucumis sativus] >KGN48525.1 hypothetical protein Csa_003451 [Cucumis sativus])

HSP 1 Score: 751.1 bits (1938), Expect = 4.7e-213
Identity = 380/393 (96.69%), Postives = 388/393 (98.73%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV 60
           MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLAD VITHLRNTGVEV
Sbjct: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADNVITHLRNTGVEV 60

Query: 61  QSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAA 120
           Q+GLS+A+FARAEAEFGFVFPPDLRAVLSAGLP+GPGFPDWR+SGARQHLRATLDLPIAA
Sbjct: 61  QTGLSIADFARAEAEFGFVFPPDLRAVLSAGLPIGPGFPDWRSSGARQHLRATLDLPIAA 120

Query: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180
           ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF
Sbjct: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180

Query: 181 SVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGA 240
           SVDENRISF GLDLSDFFEREFLFRSS+SDAH LKKQRSISEKSAGSSSNFSRRSLDTGA
Sbjct: 181 SVDENRISFSGLDLSDFFEREFLFRSSQSDAHHLKKQRSISEKSAGSSSNFSRRSLDTGA 240

Query: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300
           RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW
Sbjct: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300

Query: 301 SETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360
           SETDIT+I QVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG
Sbjct: 301 SETDITEIVQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360

Query: 361 FDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           FDHR E+ERKPAKKLSPELVERIGKLAESVTRS
Sbjct: 361 FDHRAERERKPAKKLSPELVERIGKLAESVTRS 393

BLAST of Clc01G16740 vs. NCBI nr
Match: XP_023518713.1 (uncharacterized protein LOC111782143 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 714.9 bits (1844), Expect = 3.8e-202
Identity = 364/398 (91.46%), Postives = 378/398 (94.97%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARA-----AAVTPSHPSRAGLLSFSSLADKVITHLRN 60
           MVDVDRRMAGLNPAHIAGLRRLSARA     AA+TPSHP RAGLLSFSSLA+KV+THLR 
Sbjct: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAAPSAAITPSHPPRAGLLSFSSLAEKVMTHLRK 60

Query: 61  TGVEVQSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLD 120
            GVEVQSGLSVAEFARAEAEFGF FPPDLRAVLSAGLPVGPGFPDWR+SGARQHLRATLD
Sbjct: 61  AGVEVQSGLSVAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRSSGARQHLRATLD 120

Query: 121 LPIAAISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLA 180
           LPIAAIS QIAKNTFWSK WGPRPLDPEKA+RVARNALKRAPLLIPLFNHCYIPCNPSLA
Sbjct: 121 LPIAAISIQIAKNTFWSKCWGPRPLDPEKAIRVARNALKRAPLLIPLFNHCYIPCNPSLA 180

Query: 181 GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRS 240
           GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESD HLLK+ RSISEKSA SSSNF RRS
Sbjct: 181 GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESDTHLLKRHRSISEKSAASSSNFLRRS 240

Query: 241 LDTGARTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTL 300
           LDTGARTPRWVEFWSDAVVDRRRRNSSSS+SSSPDRV EMPRSGIPKWVNEYIEE+GSTL
Sbjct: 241 LDTGARTPRWVEFWSDAVVDRRRRNSSSSTSSSPDRVFEMPRSGIPKWVNEYIEEMGSTL 300

Query: 301 REGGWSETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV 360
           REGGWSETDI+++ QVSASGF EG AM+LVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV
Sbjct: 301 REGGWSETDISEMVQVSASGFLEG-AMLLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV 360

Query: 361 SYALGFDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           SYALGF+HR EKERKPAKKLSPELVERIGKLAESVTR+
Sbjct: 361 SYALGFEHRAEKERKPAKKLSPELVERIGKLAESVTRA 397

BLAST of Clc01G16740 vs. NCBI nr
Match: XP_022962882.1 (uncharacterized protein LOC111463249 [Cucurbita moschata] >KAG6595035.1 hypothetical protein SDJN03_11588, partial [Cucurbita argyrosperma subsp. sororia] >KAG7027058.1 hypothetical protein SDJN02_11067, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 712.2 bits (1837), Expect = 2.4e-201
Identity = 362/398 (90.95%), Postives = 378/398 (94.97%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARA-----AAVTPSHPSRAGLLSFSSLADKVITHLRN 60
           MVDVDRRMAGLNPAHIAGLRRLSARA     AA+TPSHP RAGLLSFSSLA+KV+THLR 
Sbjct: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAAPSAAITPSHPPRAGLLSFSSLAEKVMTHLRK 60

Query: 61  TGVEVQSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLD 120
            GVEVQSGLSVAEFARAEAEFGF FPPDLRAVLSAGLPVGPGFPDWR+SGARQHLRATLD
Sbjct: 61  AGVEVQSGLSVAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRSSGARQHLRATLD 120

Query: 121 LPIAAISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLA 180
           LPIAAIS QIAKNTFWSK WGPRPLDPEKA+RVARNALKRAPLLIPLFNHCYIPCNPSLA
Sbjct: 121 LPIAAISIQIAKNTFWSKCWGPRPLDPEKAIRVARNALKRAPLLIPLFNHCYIPCNPSLA 180

Query: 181 GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRS 240
           GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESD HLLK+ RSISEKSA SSSNF RRS
Sbjct: 181 GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESDTHLLKRHRSISEKSAASSSNFLRRS 240

Query: 241 LDTGARTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTL 300
           LDTGARTPRWVEFWSDAVVDRRRRNSSSS+SSSPDRV EMPRSGIPKWVNEYIEE+GSTL
Sbjct: 241 LDTGARTPRWVEFWSDAVVDRRRRNSSSSTSSSPDRVFEMPRSGIPKWVNEYIEEMGSTL 300

Query: 301 REGGWSETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV 360
           REGGWSETDI+++ QVSASGF EG A++LVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV
Sbjct: 301 REGGWSETDISEMVQVSASGFLEG-AILLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV 360

Query: 361 SYALGFDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           SYALGF+HR EKERKPAKKLSP+LVERIGKLAESVTR+
Sbjct: 361 SYALGFEHRAEKERKPAKKLSPQLVERIGKLAESVTRA 397

BLAST of Clc01G16740 vs. ExPASy TrEMBL
Match: A0A5D3CP02 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004820 PE=4 SV=1)

HSP 1 Score: 755.7 bits (1950), Expect = 9.3e-215
Identity = 381/393 (96.95%), Postives = 389/393 (98.98%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV 60
           MVDVDRRMAGLNPAH+AGLRRLSARAAAVTPSHPSRAGLLSFSSLAD VITHLRNTGVEV
Sbjct: 1   MVDVDRRMAGLNPAHVAGLRRLSARAAAVTPSHPSRAGLLSFSSLADNVITHLRNTGVEV 60

Query: 61  QSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAA 120
           Q+GLS+AEFARAEAEFGFVFPPDLRAVLSAGLP+GPGFPDWR+SGARQHLRATLDLPIAA
Sbjct: 61  QNGLSIAEFARAEAEFGFVFPPDLRAVLSAGLPIGPGFPDWRSSGARQHLRATLDLPIAA 120

Query: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180
           ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF
Sbjct: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180

Query: 181 SVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGA 240
           SVDENRISFCGLDLSDFFEREFLFRSS+SDAH LKKQRSISEKSAGSSSNFSRRSLDTGA
Sbjct: 181 SVDENRISFCGLDLSDFFEREFLFRSSQSDAHHLKKQRSISEKSAGSSSNFSRRSLDTGA 240

Query: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300
           RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW
Sbjct: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300

Query: 301 SETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360
           SETDIT+I +VSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG
Sbjct: 301 SETDITEIVEVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360

Query: 361 FDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           FDHR EKERKPAKKLSPELVERIGKLAESVTRS
Sbjct: 361 FDHRAEKERKPAKKLSPELVERIGKLAESVTRS 393

BLAST of Clc01G16740 vs. ExPASy TrEMBL
Match: A0A1S3CSK6 (uncharacterized protein LOC103504469 OS=Cucumis melo OX=3656 GN=LOC103504469 PE=4 SV=1)

HSP 1 Score: 755.7 bits (1950), Expect = 9.3e-215
Identity = 381/393 (96.95%), Postives = 389/393 (98.98%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV 60
           MVDVDRRMAGLNPAH+AGLRRLSARAAAVTPSHPSRAGLLSFSSLAD VITHLRNTGVEV
Sbjct: 1   MVDVDRRMAGLNPAHVAGLRRLSARAAAVTPSHPSRAGLLSFSSLADNVITHLRNTGVEV 60

Query: 61  QSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAA 120
           Q+GLS+AEFARAEAEFGFVFPPDLRAVLSAGLP+GPGFPDWR+SGARQHLRATLDLPIAA
Sbjct: 61  QNGLSIAEFARAEAEFGFVFPPDLRAVLSAGLPIGPGFPDWRSSGARQHLRATLDLPIAA 120

Query: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180
           ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF
Sbjct: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180

Query: 181 SVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGA 240
           SVDENRISFCGLDLSDFFEREFLFRSS+SDAH LKKQRSISEKSAGSSSNFSRRSLDTGA
Sbjct: 181 SVDENRISFCGLDLSDFFEREFLFRSSQSDAHHLKKQRSISEKSAGSSSNFSRRSLDTGA 240

Query: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300
           RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW
Sbjct: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300

Query: 301 SETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360
           SETDIT+I +VSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG
Sbjct: 301 SETDITEIVEVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360

Query: 361 FDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           FDHR EKERKPAKKLSPELVERIGKLAESVTRS
Sbjct: 361 FDHRAEKERKPAKKLSPELVERIGKLAESVTRS 393

BLAST of Clc01G16740 vs. ExPASy TrEMBL
Match: A0A0A0KFS2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G490890 PE=4 SV=1)

HSP 1 Score: 751.1 bits (1938), Expect = 2.3e-213
Identity = 380/393 (96.69%), Postives = 388/393 (98.73%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV 60
           MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLAD VITHLRNTGVEV
Sbjct: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADNVITHLRNTGVEV 60

Query: 61  QSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAA 120
           Q+GLS+A+FARAEAEFGFVFPPDLRAVLSAGLP+GPGFPDWR+SGARQHLRATLDLPIAA
Sbjct: 61  QTGLSIADFARAEAEFGFVFPPDLRAVLSAGLPIGPGFPDWRSSGARQHLRATLDLPIAA 120

Query: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180
           ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF
Sbjct: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180

Query: 181 SVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGA 240
           SVDENRISF GLDLSDFFEREFLFRSS+SDAH LKKQRSISEKSAGSSSNFSRRSLDTGA
Sbjct: 181 SVDENRISFSGLDLSDFFEREFLFRSSQSDAHHLKKQRSISEKSAGSSSNFSRRSLDTGA 240

Query: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300
           RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW
Sbjct: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300

Query: 301 SETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360
           SETDIT+I QVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG
Sbjct: 301 SETDITEIVQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360

Query: 361 FDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           FDHR E+ERKPAKKLSPELVERIGKLAESVTRS
Sbjct: 361 FDHRAERERKPAKKLSPELVERIGKLAESVTRS 393

BLAST of Clc01G16740 vs. ExPASy TrEMBL
Match: A0A6J1HDS6 (uncharacterized protein LOC111463249 OS=Cucurbita moschata OX=3662 GN=LOC111463249 PE=4 SV=1)

HSP 1 Score: 712.2 bits (1837), Expect = 1.2e-201
Identity = 362/398 (90.95%), Postives = 378/398 (94.97%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARA-----AAVTPSHPSRAGLLSFSSLADKVITHLRN 60
           MVDVDRRMAGLNPAHIAGLRRLSARA     AA+TPSHP RAGLLSFSSLA+KV+THLR 
Sbjct: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAAPSAAITPSHPPRAGLLSFSSLAEKVMTHLRK 60

Query: 61  TGVEVQSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLD 120
            GVEVQSGLSVAEFARAEAEFGF FPPDLRAVLSAGLPVGPGFPDWR+SGARQHLRATLD
Sbjct: 61  AGVEVQSGLSVAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRSSGARQHLRATLD 120

Query: 121 LPIAAISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLA 180
           LPIAAIS QIAKNTFWSK WGPRPLDPEKA+RVARNALKRAPLLIPLFNHCYIPCNPSLA
Sbjct: 121 LPIAAISIQIAKNTFWSKCWGPRPLDPEKAIRVARNALKRAPLLIPLFNHCYIPCNPSLA 180

Query: 181 GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRS 240
           GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESD HLLK+ RSISEKSA SSSNF RRS
Sbjct: 181 GNPIFSVDENRISFCGLDLSDFFEREFLFRSSESDTHLLKRHRSISEKSAASSSNFLRRS 240

Query: 241 LDTGARTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTL 300
           LDTGARTPRWVEFWSDAVVDRRRRNSSSS+SSSPDRV EMPRSGIPKWVNEYIEE+GSTL
Sbjct: 241 LDTGARTPRWVEFWSDAVVDRRRRNSSSSTSSSPDRVFEMPRSGIPKWVNEYIEEMGSTL 300

Query: 301 REGGWSETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV 360
           REGGWSETDI+++ QVSASGF EG A++LVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV
Sbjct: 301 REGGWSETDISEMVQVSASGFLEG-AILLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV 360

Query: 361 SYALGFDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           SYALGF+HR EKERKPAKKLSP+LVERIGKLAESVTR+
Sbjct: 361 SYALGFEHRAEKERKPAKKLSPQLVERIGKLAESVTRA 397

BLAST of Clc01G16740 vs. ExPASy TrEMBL
Match: A0A6J1GFT6 (uncharacterized protein LOC111453560 OS=Cucurbita moschata OX=3662 GN=LOC111453560 PE=4 SV=1)

HSP 1 Score: 710.3 bits (1832), Expect = 4.5e-201
Identity = 359/391 (91.82%), Postives = 376/391 (96.16%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPSRAGLLSFSSLADKVITHLRNTGVEV 60
           MVDVD RMAGLNPAHIAGLRRLSARAAAVTPSHP+RAGLLSF+SLA+KVITH+RNTGV+V
Sbjct: 1   MVDVDSRMAGLNPAHIAGLRRLSARAAAVTPSHPARAGLLSFASLAEKVITHMRNTGVQV 60

Query: 61  QSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAA 120
           Q GLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLD PIAA
Sbjct: 61  QPGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDFPIAA 120

Query: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPIF 180
           ISFQIAKNTFWSKSWGPRPLDPEKALRV+RNALKRAPLLIPLF+HCYIPCNPSLAGNPIF
Sbjct: 121 ISFQIAKNTFWSKSWGPRPLDPEKALRVSRNALKRAPLLIPLFSHCYIPCNPSLAGNPIF 180

Query: 181 SVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGA 240
           SVDENRISFCGLDLSDFFEREFL RSSESDAH LK QRSISEKSAG SSNFSRRSLDTGA
Sbjct: 181 SVDENRISFCGLDLSDFFEREFLIRSSESDAHFLKNQRSISEKSAG-SSNFSRRSLDTGA 240

Query: 241 RTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGW 300
           RTPRWVEFWSDA VDRRRRNS SS+ SSPDRV EMPRSGIPKWV +YIEEIGS LREGGW
Sbjct: 241 RTPRWVEFWSDAAVDRRRRNSLSSACSSPDRVFEMPRSGIPKWVKDYIEEIGSRLREGGW 300

Query: 301 SETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEVSYALG 360
           SETDI+++ QVSASGFFEG AMV+VDNQAVLDALL+KTDRFS++LRKAGWSSEEVSYALG
Sbjct: 301 SETDISEMVQVSASGFFEG-AMVIVDNQAVLDALLVKTDRFSELLRKAGWSSEEVSYALG 360

Query: 361 FDHRPEKERKPAKKLSPELVERIGKLAESVT 392
           FDHRPEKERKPAKKLSPELVERIGKLAESV+
Sbjct: 361 FDHRPEKERKPAKKLSPELVERIGKLAESVS 389

BLAST of Clc01G16740 vs. TAIR 10
Match: AT3G50340.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G67020.1); Has 128 Blast hits to 128 proteins in 39 species: Archae - 0; Bacteria - 46; Metazoa - 0; Fungi - 3; Plants - 76; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )

HSP 1 Score: 565.5 bits (1456), Expect = 3.4e-161
Identity = 296/406 (72.91%), Postives = 338/406 (83.25%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPS-RAGLLSFSSLADKVITHLRNTGVE 60
           MVDVDRRM GL PAH AGLRRLSARAAA  P+ P+ R  L+SFSSLAD+VI+HL  + ++
Sbjct: 1   MVDVDRRMTGLRPAHAAGLRRLSARAAA--PTTPTVRNSLVSFSSLADQVISHLHTSRIQ 60

Query: 61  VQSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIA 120
           VQ GL+ +EFARAEAEF F FPPDLRAVL+AGLPVG GFPDWR+ GAR HLRA +DLPIA
Sbjct: 61  VQPGLTDSEFARAEAEFAFAFPPDLRAVLTAGLPVGAGFPDWRSPGARLHLRAMIDLPIA 120

Query: 121 AISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPI 180
           A+SFQIA+NT WSKSWG RP DPEKALRVARNALKRAPL+IP+F+HCYIPCNPSLAGNP+
Sbjct: 121 AVSFQIARNTLWSKSWGLRPSDPEKALRVARNALKRAPLMIPIFDHCYIPCNPSLAGNPV 180

Query: 181 FSVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAG----SSSNFSRRS 240
           F +DE RI  CG DLSDFFERE +FR S++   +L KQRS+SEKSAG    SSSNFSR S
Sbjct: 181 FYIDETRIFCCGSDLSDFFERESVFRGSDTCPVVLTKQRSVSEKSAGSSSSSSSNFSRMS 240

Query: 241 LDT----GARTPRWVEFWSDAVVDRRRRNS----SSSSSSSPDRVIEMPRSGIPKWVNEY 300
           LD+    G+ TPRWVEFWSDA VDRRRRNS    SSS SSSP+R +++PRS  PKWV++Y
Sbjct: 241 LDSGRVHGSSTPRWVEFWSDAAVDRRRRNSASSMSSSHSSSPERYLDLPRSETPKWVDDY 300

Query: 301 IEEIGSTLREGGWSETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRK 360
           +  IGS LR GGWSE+D+ DI  VSASGFFEG  MV++DNQAVLDALLLK  RFS+ LRK
Sbjct: 301 VNRIGSVLRGGGWSESDVDDIVHVSASGFFEG-EMVILDNQAVLDALLLKAGRFSESLRK 360

Query: 361 AGWSSEEVSYALGFDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           AGWSSEEVS ALGFD RPEKE+KP KKLSPELV+RIGKLAESV+RS
Sbjct: 361 AGWSSEEVSDALGFDFRPEKEKKPVKKLSPELVQRIGKLAESVSRS 403

BLAST of Clc01G16740 vs. TAIR 10
Match: AT5G67020.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50340.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 540.8 bits (1392), Expect = 9.0e-154
Identity = 286/401 (71.32%), Postives = 320/401 (79.80%), Query Frame = 0

Query: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTPSHPS-RAGLLSFSSLADKVITHLRNTGVE 60
           MVDVDRRM GL PAH AGLRRLSARAAA  PS P+ R  L SFS  ADKVI HL+N+G++
Sbjct: 1   MVDVDRRMTGLTPAHAAGLRRLSARAAA--PSTPTIRNSLQSFSPFADKVINHLKNSGIK 60

Query: 61  VQSGLSVAEFARAEAEFGFVFPPDLRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIA 120
           +Q GLS  EFAR EAEFGF FPPDLR +LSAGL VG GFPDWR+ GAR HLRA +DLP+A
Sbjct: 61  IQPGLSDTEFARVEAEFGFTFPPDLRVILSAGLSVGAGFPDWRSPGARLHLRAMIDLPVA 120

Query: 121 AISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGNPI 180
           A+SFQIAKN+ W KSWG +P DPEKALRVARNALKRAPLLIP+F+HCYIPCNPSLAGNP+
Sbjct: 121 AVSFQIAKNSLWCKSWGLKPPDPEKALRVARNALKRAPLLIPIFDHCYIPCNPSLAGNPV 180

Query: 181 FSVDENRISFCGLDLSDFFEREFLFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLD-- 240
           F +DE RI  CG DLS+FFERE  FRSSE    +L KQRS+SEKSAGSSSNFSRRSLD  
Sbjct: 181 FFIDETRIFCCGSDLSEFFERESAFRSSEFFPRILTKQRSVSEKSAGSSSNFSRRSLDLG 240

Query: 241 --TGARTPRWVEFWSDAVVDRRRRNS---SSSSSSSPDRVIEMPRSGIPKWVNEYIEEIG 300
              GA   RWVEFWSDA VDR RRNS   SSSSSSSPD    +P++  PKWVN+Y+  IG
Sbjct: 241 RANGAGKSRWVEFWSDAAVDRCRRNSASTSSSSSSSPD----LPKTETPKWVNQYVNRIG 300

Query: 301 STLREGGWSETDITDIAQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSS 360
           S LR GGWSE+DI +I  VSASGFFEG  MV++DNQ VLD LLLK  R S+ LRK+GWSS
Sbjct: 301 SVLRRGGWSESDIDEIIHVSASGFFEG-EMVIIDNQTVLDVLLLKAGRISESLRKSGWSS 360

Query: 361 EEVSYALGFDHRPEKERKPAKKLSPELVERIGKLAESVTRS 394
           EEVS ALGFD RPEKERKP KKLSP LVE+  KLAE V++S
Sbjct: 361 EEVSDALGFDFRPEKERKPVKKLSPMLVEQFEKLAEWVSQS 394

BLAST of Clc01G16740 vs. TAIR 10
Match: AT2G22790.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G67020.1); Has 111 Blast hits to 111 proteins in 33 species: Archae - 0; Bacteria - 44; Metazoa - 0; Fungi - 0; Plants - 67; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 119.0 bits (297), Expect = 8.5e-27
Identity = 83/291 (28.52%), Postives = 133/291 (45.70%), Query Frame = 0

Query: 25  RAAAVTPSHPSRAGLLSFSSLADKVITHLRN-TGVEVQSGLSVAEFARAEAEFGFVFPPD 84
           R+++V PS P              ++ H ++ TG  V  GL+  E +  E+  GF FP D
Sbjct: 21  RSSSVNPSSP---------VYYKTIVNHFKSQTGNHVSPGLTNQEISAVESSHGFSFPLD 80

Query: 85  LRAVLSAGLPVGPGFPDWRASGARQHLRATLDLPIAAISFQIAKNTFWSKSWGPRPLDPE 144
           LR++L  GLPVG  FP+WR    R +L     LP+  +S  + +N FW  SWG RP +  
Sbjct: 81  LRSILQTGLPVGTNFPNWRTGSNRNNLL----LPLLNLSQHVVRNGFWVDSWGIRPGNDA 140

Query: 145 KALRVARNALKRAPLLIPLFNHCYIP-CNPSLAGNPIFSVDENRISFCGLDLSDFFEREF 204
           +AL + +  ++ AP+L+P++   Y+P   P+LAGNP+F +D + +     D+  F     
Sbjct: 141 EALSLVKKLIEIAPVLVPVYGDFYVPSTTPNLAGNPVFQIDGDGVRELSCDVVGFL---- 200

Query: 205 LFRSSESDAHLLKKQRSISEKSAGSSSNFSRRSLDTGARTPRWVEFWSDAVVDRRRRNSS 264
                          + I      +     R       R PR VEFWSD     R     
Sbjct: 201 ---------------KGIGRSETPTEDRRRR-------RRPRRVEFWSDVAEGWR---FV 260

Query: 265 SSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTLREGGWSETDITDIAQVSA 314
            +   + D    +   G+   +++   +    LRE GW+E D+ D+  + +
Sbjct: 261 VARDYTRDWWSALGFEGLTACLDDAFWK----LREAGWTEDDVRDMMMMDS 265

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038881140.15.1e-21597.96uncharacterized protein LOC120072739 [Benincasa hispida][more]
XP_008467037.11.9e-21496.95PREDICTED: uncharacterized protein LOC103504469 [Cucumis melo] >TYK12888.1 uncha... [more]
XP_004141914.14.7e-21396.69uncharacterized protein LOC101216316 [Cucumis sativus] >KGN48525.1 hypothetical ... [more]
XP_023518713.13.8e-20291.46uncharacterized protein LOC111782143 [Cucurbita pepo subsp. pepo][more]
XP_022962882.12.4e-20190.95uncharacterized protein LOC111463249 [Cucurbita moschata] >KAG6595035.1 hypothet... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3CP029.3e-21596.95Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3CSK69.3e-21596.95uncharacterized protein LOC103504469 OS=Cucumis melo OX=3656 GN=LOC103504469 PE=... [more]
A0A0A0KFS22.3e-21396.69Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G490890 PE=4 SV=1[more]
A0A6J1HDS61.2e-20190.95uncharacterized protein LOC111463249 OS=Cucurbita moschata OX=3662 GN=LOC1114632... [more]
A0A6J1GFT64.5e-20191.82uncharacterized protein LOC111453560 OS=Cucurbita moschata OX=3662 GN=LOC1114535... [more]
Match NameE-valueIdentityDescription
AT3G50340.13.4e-16172.91unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G67020.19.0e-15471.32unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G22790.18.5e-2728.52unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR32011:SF8BNAC08G21750D PROTEINcoord: 1..393
NoneNo IPR availablePANTHERPTHR32011OS08G0472400 PROTEINcoord: 1..393

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc01G16740.1Clc01G16740.1mRNA