Clc02G23620 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc02G23620
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Golgin family A protein
Location: ClcChr02: 35535051 .. 35539651 (+)
RNA-Seq Expression: Clc02G23620
Synteny: Clc02G23620
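
The location above can be turned into a feature length with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, though this record does not state it). The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based inclusive coordinates, so both endpoints count.
    return end - start + 1


chrom, start, end, strand = parse_location("ClcChr02: 35535051 .. 35539651 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 4601 bases on the plus strand of ClcChr02.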
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAAAGAAGTTATTTTTTTAGTATATATATAAAAAGAGAACATTCCAGCCGAGCTGAAGGAGGAGAAAGGAGACAAGAATGTAAATACACAGAACCCAGTCAAGATTCTAAATGGCTCCATTTACACATGCACGTCCCTCATTGATTGTTGAATGCTCACGTGGCTGCCATGTCATTTTCCATTTCTTCTTTCCCTCTTAAGATTTCCGAAACAAAGGTTGTATTCTATTTTCCATTCAAACAAACAAACGCCATCTTTCTTTCTTTCTTTCTTTCTTTCCTACTGGGTTTCGAGATCTCTTTGAGTCAAGACTCAAGACTGACCCGCCCTACCCCCAAAAGTAGAATCTTTAATAATTTCTACAATTTTTCTTCGCTGAATTCAAGGCTTACTGTTCCAGAACTTGCCTTCTTTTATCTCACTCTCACATGAACTTCACCACAATTTCTGCAATACCCACCACCCTGTAAGTAAACTCTTCACAAATTTCTTCAATCTCTTACATTTTCCTTTTCTTTTATTTTGAATCCCCTTTTTGGAAGTTGGGTTTAAATAATGTCGTCGTGGGCAGAGCAGAAAACAGAGAAGAGATGTAAGATTAGAAAACGAGGGTGTTTATCATCGCCATCTTCTTCCACTTTGGTTCGTAAGTACAGATTCAAGAAACCCCCCACCTGGAAAATGAGTACAAAATCCCATTCTTCCAAGTTATCCACCGGCGACCTCCCCAACCGGTCACCGTCATGCTCACTCGACGGCGGCGGAAAAGGGAAAGAAGGCTCCGTTTCAGTGTCAGCACGGAAATCAAATTTACAGAAGTTGAAGAACAATTCAGATGTTGTTGAAGAGAAGAAAGAGGTAACGAAAACCCGGGAATTGGTTTCTCAGATCTCGCATTCGTGTTTATCAGATCCGGATCGGAGTATGAAGAACACAAAAACAGAGGTGAGTTTAAATTTTTTTTTTTTTTTAAAATAAAAAATGGCATTGGGGATTGTGTTAAATTTTGGGTTATTGAAATCTAAGAAGTGAATGAATTTGTAGAAAGTTGAAGTTGGGAGAATACACAGAAGAAGAAGGTCGGCATCTTCTTTGAGGATTGGAATAGAGGAAATGGTGGGTGGTTCGAATTTTCATGGCAATCACTGTTTGATGGAGGTACGATTCATCTATTATTGCTTCCTTTCCACGAAATCATAATTCTTCGTTTCTTCTTCTTCTTCTTCTTTTAATTAATTTATTTAATTCTTTTTTTTTTTCTCCACTGCTTTCAATCAATCCCATCAACCCATCTCTTGCTTTGGTTGCTGTCATTTTCCATCTGTACTTCTGCGGCAACCTTTTTTTTCTTTCCTTTTCATTATTCCCTTCATTTTTTTTTTTTTTTGTTTTGAATCTTCAAAAAAATTTAATTTAAGAAAGAATCTAAATGCTTTAATTGTTTATCTCTTTAATTTCATTACTAAAATAAATTTCTTTTCGGTGTAATTGAATACTTTGGTCAAATTTACAAGTTTTACCTTCTTCAAAACTTAAATCAAATAGAAAATATTATTATTTTTTCCGGAATAGGAAATATTTGTTGATTACCATTATTGTATAACCTATTTTTTCTTTTCATATGGTAACTTATTAATACAGTAGCTTTTTTAAATTGATTTTAATAAAATAGCTACCGCATAACGACCGAGATAATCGGTATGATTTGATTAAATCATAAACCAATTTGTTTCTTTTTTTTTTTTTTTTAAACCGAAAAAACGAATTCGAATATTTAAAAAAAAAAAAAATATAGGACCACGTTGAAAATTTTAAAACCAAAATCTCAGTGTAATAAATCTTGTGAAGAGGAGGCACCAATTTGATGGCCATAAAAGCGGTCGTAGACCCACTCTATTATTTGAATATGTAATATATGGTGCGGTCCATCCATATTTTAGTCTCATTCGATTTTTTTTTTAATTTTTAAAAATTAAGCCAATAAATATTTTTTTTATTT
ATTATTATCTACTTTTATTAATATTTTAAAAATCAAATCAAGTTTTAAAATTTTAAAAGAAGTTGCCTCAAAATTAAAAAAAAAAAAAAATTAAAGAAGTAGTTTGTAAAGATTTATTTTCGTTTTTAGGATTTAAATAAATATTCAACACTCTTACTTAAAAAAAGAAAAAAATTATAAAAAAATTGAGAGGAGAGAAAATAGACTTGATTTTCTTACAATGATTTGCATCTTTCTTTAAGTACATGACTTGAATTCTTAGACAAATTCTAAAAACAAAAATAAGTTTTTAAAAACGACTCTTTTTAGTTTTCAAATTTTGGCTTGGTTTTTTAAAATATGAGTGAAAAGTCAATAATAAAATAAGAAATTTGGAAGTAGTAGAGTTGCTTGTAGGCTTAAATTTTAAAAACTAAAAGCTGAAAACAAAAATAGTTATCAAACGAGTCTGAATTTTGATTTTTGATTTTTATTTTTAAAATTTATTTTTTGTTATACAAATGACCCATATTGGAAGTCCCAACTAAAAGATGATCTGTTTTTTAGTGTATTTCTTGAATTTTTGCTTAGGACCATATAAGGGAGAAATTACTAAAATAAAACATAGTGTATGCAAGTTTTTATATCACTCTACTCCTTAGAGTTTAGGTTTTCTCACTTCTCACTCTCTTTATTCATGTTCCCTAGGGTTTAAGTTTTTCCACTCATCACTCTCCCTATTTCTTCTCACTAGTTGTTTCTCATCTTTAGCTCTTTTAAGTCAGGATCTAAAAGCAAGGATATTCATAAATGAGAACAAGGAACGAGAGCGAGAGTCAATGGTAAATATTGAAAGTAAAGATGTCTAATAGCAACCAGAGTTAAAAGAATCAAAGCTTAACAAGGATGAATAGTAAGATCATTTTTGTAGTTTCCATCCATGATAACGCAATCAAAAAGTCAAAAACTTAAAAAGTGGACAAACTCTATTTTGGGCTCAAAAGTTTTGTCAACCATGCAATTATCTCAATTTCTATGAAACATTTTTCTTTAAAAAAAAAATTGAATCTATTATGAATGAAATGCAGATTGAAAACGGTAATGTGGGGAAAGCGACACGTCGTAAGACAAAATCGACAGTAAAAACACGTTTGAAGGAAGTAAGCAATTGCCTAACGACATCAAAGGAGCTTCTAAGAGTTCTACATCATGTTTTGGGTCATGAAAACCATCGCCCATCTTCAACTTCATCTCTAATTACAGCTCTGAAATCAGAGCTGGATCGAGCCAAAACCCGAGTCGACAATTTAATCAAGGACCAAACCTTCCATGGCGATGAAATTGAAGTCCTAAGAAAGCGATTTGCGGAGGAAAAAGCGGCATGGAAATACAGGGAGAGAGCTAGATTTGGGAGCGCCATTAATTCAATGGCAGAGGAAGTGGAAGTTGAGAAGAAGCTAAGAAGACAAGCAGAGAGATTGAACAAAAGCATTGCGAAGGAACTTGCTGAAGCTAAAGTTTCAGTTTCAAAAGCAATGAAAGATGTTGAAAGGGAAAAAAGAGCGAAGGAGATTTTGGAGCAAATATGTGAGGAATTAGCGAAAGGGATTGGAGAAGACAGAGCGGAATTCGAGGAGTTGAAGAAGGAATCGGCGAAAGTCAGAGAAGAAGTGGAAAAAGAGAGGGAAATGCTTCATTTAGCGGATGTTTTAAGAGAGGAAAGAGTTCAGATGAAGTTATCGGAGGCTAAATATCAGTTCGAGGAGAAAAACGCCGCCGTGGAAAGGCTTAAACAACAACTCCAAGGCTATTTCTTAACCCAATTCGGAAATGAAGAACCAAACGGCGGCGAGAATCAAGAGTATTCTTGCAATGAATTTGAGAAAATCAAGGAGTTGGAAGCGTATTTGAAGAAAATCAATTTTGGGTCGTGTCAAGATTCTGAAAAATTGGGAAGGAAAGAAGAACAAAATGAAGATTGTTCAGATGAGGAGGAGGAAAGCGATTTGCATTCCATTGAA
CTCAATATGGATAACAACAACAAGAGTTACAGATGGAGTTTCGTCCATGGCGAAAAGGCGGATAATAATCAAATTCAAAGCAACAATGGAAGAAAATCGGTTTCGGAGAAAATCCAATGGGGAAGCATTTGTTTGAATACAACCAACAATACCCATCAACAAAATTCAAACACATTTGATTGGGATACATTTTCAGAGCTTTTTACACATAAACAATTGGAGGATCTTCATAAACTGGAGGCCGACGATGATCATGAAATAAAATCCGTTAAGTGTCTTCGAGATATTCTGTTTCCAGAATTAGATCAAAATCATAATGGGGTTGCGAAAATCGATGATGAAGCTTCATCCATGGTGAGGAGATGATGATGATGATGATGCAAAAAAATAGATATAAAAACTATTCTCTTTATTTGGTTTTCCATTGGGAACTACTTTGCTTAAGAATATTTTCTTCTTCTTATATATTATTCAATGCTTCCAAAGGGCTAATTTTGGCTAAAAAGAAGAAAGAAAGGAAAATGTAAATAGAGTTGTATACAGCAAGACCAGACGTCCACCACGTCCAATATATTATATTAATACCATTTTACATTTTTCA

mRNA sequence

AAAAAGAAGTTATTTTTTTAGTATATATATAAAAAGAGAACATTCCAGCCGAGCTGAAGGAGGAGAAAGGAGACAAGAATGTAAATACACAGAACCCAGTCAAGATTCTAAATGGCTCCATTTACACATGCACGTCCCTCATTGATTGTTGAATGCTCACGTGGCTGCCATGTCATTTTCCATTTCTTCTTTCCCTCTTAAGATTTCCGAAACAAAGGTTGTATTCTATTTTCCATTCAAACAAACAAACGCCATCTTTCTTTCTTTCTTTCTTTCTTTCCTACTGGGTTTCGAGATCTCTTTGAGTCAAGACTCAAGACTGACCCGCCCTACCCCCAAAAGTAGAATCTTTAATAATTTCTACAATTTTTCTTCGCTGAATTCAAGGCTTACTGTTCCAGAACTTGCCTTCTTTTATCTCACTCTCACATGAACTTCACCACAATTTCTGCAATACCCACCACCCTGTAAGTAAACTCTTCACAAATTTCTTCAATCTCTTACATTTTCCTTTTCTTTTATTTTGAATCCCCTTTTTGGAAGTTGGGTTTAAATAATGTCGTCGTGGGCAGAGCAGAAAACAGAGAAGAGATGTAAGATTAGAAAACGAGGGTGTTTATCATCGCCATCTTCTTCCACTTTGGTTCGTAAGTACAGATTCAAGAAACCCCCCACCTGGAAAATGAGTACAAAATCCCATTCTTCCAAGTTATCCACCGGCGACCTCCCCAACCGGTCACCGTCATGCTCACTCGACGGCGGCGGAAAAGGGAAAGAAGGCTCCGTTTCAGTGTCAGCACGGAAATCAAATTTACAGAAGTTGAAGAACAATTCAGATGTTGTTGAAGAGAAGAAAGAGGTAACGAAAACCCGGGAATTGGTTTCTCAGATCTCGCATTCGTGTTTATCAGATCCGGATCGGAGTATGAAGAACACAAAAACAGAGAAAGTTGAAGTTGGGAGAATACACAGAAGAAGAAGGTCGGCATCTTCTTTGAGGATTGGAATAGAGGAAATGGTGGGTGGTTCGAATTTTCATGGCAATCACTGTTTGATGGAGATTGAAAACGGTAATGTGGGGAAAGCGACACGTCGTAAGACAAAATCGACAGTAAAAACACGTTTGAAGGAAGTAAGCAATTGCCTAACGACATCAAAGGAGCTTCTAAGAGTTCTACATCATGTTTTGGGTCATGAAAACCATCGCCCATCTTCAACTTCATCTCTAATTACAGCTCTGAAATCAGAGCTGGATCGAGCCAAAACCCGAGTCGACAATTTAATCAAGGACCAAACCTTCCATGGCGATGAAATTGAAGTCCTAAGAAAGCGATTTGCGGAGGAAAAAGCGGCATGGAAATACAGGGAGAGAGCTAGATTTGGGAGCGCCATTAATTCAATGGCAGAGGAAGTGGAAGTTGAGAAGAAGCTAAGAAGACAAGCAGAGAGATTGAACAAAAGCATTGCGAAGGAACTTGCTGAAGCTAAAGTTTCAGTTTCAAAAGCAATGAAAGATGTTGAAAGGGAAAAAAGAGCGAAGGAGATTTTGGAGCAAATATGTGAGGAATTAGCGAAAGGGATTGGAGAAGACAGAGCGGAATTCGAGGAGTTGAAGAAGGAATCGGCGAAAGTCAGAGAAGAAGTGGAAAAAGAGAGGGAAATGCTTCATTTAGCGGATGTTTTAAGAGAGGAAAGAGTTCAGATGAAGTTATCGGAGGCTAAATATCAGTTCGAGGAGAAAAACGCCGCCGTGGAAAGGCTTAAACAACAACTCCAAGGCTATTTCTTAACCCAATTCGGAAATGAAGAACCAAACGGCGGCGAGAATCAAGAGTATTCTTGCAATGAATTTGAGAAAATCAAGGAGTTGGAAGCGTATTTGAAGAAAATCAATTTTGGGTCGTGTCAAGATTCTGAAAAATTGGGAAGGAAAGAAGAACAAAATGAAGATTGTTCAGATGAGGAGGAGGAAAGCGATTTGCATTCCATTGAACTCAATA
TGGATAACAACAACAAGAGTTACAGATGGAGTTTCGTCCATGGCGAAAAGGCGGATAATAATCAAATTCAAAGCAACAATGGAAGAAAATCGGTTTCGGAGAAAATCCAATGGGGAAGCATTTGTTTGAATACAACCAACAATACCCATCAACAAAATTCAAACACATTTGATTGGGATACATTTTCAGAGCTTTTTACACATAAACAATTGGAGGATCTTCATAAACTGGAGGCCGACGATGATCATGAAATAAAATCCGTTAAGTGTCTTCGAGATATTCTGTTTCCAGAATTAGATCAAAATCATAATGGGGTTGCGAAAATCGATGATGAAGCTTCATCCATGGTGAGGAGATGATGATGATGATGATGCAAAAAAATAGATATAAAAACTATTCTCTTTATTTGGTTTTCCATTGGGAACTACTTTGCTTAAGAATATTTTCTTCTTCTTATATATTATTCAATGCTTCCAAAGGGCTAATTTTGGCTAAAAAGAAGAAAGAAAGGAAAATGTAAATAGAGTTGTATACAGCAAGACCAGACGTCCACCACGTCCAATATATTATATTAATACCATTTTACATTTTTCA

Coding sequence (CDS)

ATGTCGTCGTGGGCAGAGCAGAAAACAGAGAAGAGATGTAAGATTAGAAAACGAGGGTGTTTATCATCGCCATCTTCTTCCACTTTGGTTCGTAAGTACAGATTCAAGAAACCCCCCACCTGGAAAATGAGTACAAAATCCCATTCTTCCAAGTTATCCACCGGCGACCTCCCCAACCGGTCACCGTCATGCTCACTCGACGGCGGCGGAAAAGGGAAAGAAGGCTCCGTTTCAGTGTCAGCACGGAAATCAAATTTACAGAAGTTGAAGAACAATTCAGATGTTGTTGAAGAGAAGAAAGAGGTAACGAAAACCCGGGAATTGGTTTCTCAGATCTCGCATTCGTGTTTATCAGATCCGGATCGGAGTATGAAGAACACAAAAACAGAGAAAGTTGAAGTTGGGAGAATACACAGAAGAAGAAGGTCGGCATCTTCTTTGAGGATTGGAATAGAGGAAATGGTGGGTGGTTCGAATTTTCATGGCAATCACTGTTTGATGGAGATTGAAAACGGTAATGTGGGGAAAGCGACACGTCGTAAGACAAAATCGACAGTAAAAACACGTTTGAAGGAAGTAAGCAATTGCCTAACGACATCAAAGGAGCTTCTAAGAGTTCTACATCATGTTTTGGGTCATGAAAACCATCGCCCATCTTCAACTTCATCTCTAATTACAGCTCTGAAATCAGAGCTGGATCGAGCCAAAACCCGAGTCGACAATTTAATCAAGGACCAAACCTTCCATGGCGATGAAATTGAAGTCCTAAGAAAGCGATTTGCGGAGGAAAAAGCGGCATGGAAATACAGGGAGAGAGCTAGATTTGGGAGCGCCATTAATTCAATGGCAGAGGAAGTGGAAGTTGAGAAGAAGCTAAGAAGACAAGCAGAGAGATTGAACAAAAGCATTGCGAAGGAACTTGCTGAAGCTAAAGTTTCAGTTTCAAAAGCAATGAAAGATGTTGAAAGGGAAAAAAGAGCGAAGGAGATTTTGGAGCAAATATGTGAGGAATTAGCGAAAGGGATTGGAGAAGACAGAGCGGAATTCGAGGAGTTGAAGAAGGAATCGGCGAAAGTCAGAGAAGAAGTGGAAAAAGAGAGGGAAATGCTTCATTTAGCGGATGTTTTAAGAGAGGAAAGAGTTCAGATGAAGTTATCGGAGGCTAAATATCAGTTCGAGGAGAAAAACGCCGCCGTGGAAAGGCTTAAACAACAACTCCAAGGCTATTTCTTAACCCAATTCGGAAATGAAGAACCAAACGGCGGCGAGAATCAAGAGTATTCTTGCAATGAATTTGAGAAAATCAAGGAGTTGGAAGCGTATTTGAAGAAAATCAATTTTGGGTCGTGTCAAGATTCTGAAAAATTGGGAAGGAAAGAAGAACAAAATGAAGATTGTTCAGATGAGGAGGAGGAAAGCGATTTGCATTCCATTGAACTCAATATGGATAACAACAACAAGAGTTACAGATGGAGTTTCGTCCATGGCGAAAAGGCGGATAATAATCAAATTCAAAGCAACAATGGAAGAAAATCGGTTTCGGAGAAAATCCAATGGGGAAGCATTTGTTTGAATACAACCAACAATACCCATCAACAAAATTCAAACACATTTGATTGGGATACATTTTCAGAGCTTTTTACACATAAACAATTGGAGGATCTTCATAAACTGGAGGCCGACGATGATCATGAAATAAAATCCGTTAAGTGTCTTCGAGATATTCTGTTTCCAGAATTAGATCAAAATCATAATGGGGTTGCGAAAATCGATGATGAAGCTTCATCCATGGTGAGGAGATGA

Protein sequence

MSSWAEQKTEKRCKIRKRGCLSSPSSSTLVRKYRFKKPPTWKMSTKSHSSKLSTGDLPNRSPSCSLDGGGKGKEGSVSVSARKSNLQKLKNNSDVVEEKKEVTKTRELVSQISHSCLSDPDRSMKNTKTEKVEVGRIHRRRRSASSLRIGIEEMVGGSNFHGNHCLMEIENGNVGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQFGNEEPNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSDEEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGSICLNTTNNTHQQNSNTFDWDTFSELFTHKQLEDLHKLEADDDHEIKSVKCLRDILFPELDQNHNGVAKIDDEASSMVRR
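
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal Python sketch of that step, assuming the standard genetic code (NCBI translation table 1); the record itself does not say which table was used. The function name `translate` is hypothetical, and the two inputs are the first ten codons and the last five codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order:
# first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCGTCGTGGGCAGAGCAGAAAACAGAG"))  # start of the CDS
print(translate("ATGGTGAGGAGATGA"))                 # end of the CDS, with TGA stop
```

The two fragments translate to MSSWAEQKTE and MVRR, which match the first ten and last four residues of the protein sequence.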
Homology
BLAST of Clc02G23620 vs. NCBI nr
Match: XP_038900292.1 (uncharacterized protein At5g41620 isoform X1 [Benincasa hispida])

HSP 1 Score: 1061.2 bits (2743), Expect = 3.3e-306
Identity = 566/604 (93.71%), Positives = 582/604 (96.36%), Query Frame = 0

Query: 1   MSSWAEQKTE-KRCKIRKRGCLSSPSSSTLVRKYRFKKPPTWKMSTKSHSSKLSTGDLPN 60
           MSSWAEQKTE KRCKIRKR CLSSPSSSTLVRKYRFKKPPTWKMSTKSHSSKLSTGDLPN
Sbjct: 1   MSSWAEQKTEKKRCKIRKRVCLSSPSSSTLVRKYRFKKPPTWKMSTKSHSSKLSTGDLPN 60

Query: 61  RSPSCSLDGGGKGKEGSVSVSARKS---NLQKLKNNSDVVEEKKEVTKTRELVSQISHSC 120
           RSPSCSLDGGGKGKEGSVSVSARKS   N QKLKNNSDVVEEKKE+ KTRE+VSQISHSC
Sbjct: 61  RSPSCSLDGGGKGKEGSVSVSARKSAGNNSQKLKNNSDVVEEKKELMKTREMVSQISHSC 120

Query: 121 LSDPDRSMKNTKTEKVEVGRIHRRRRSASSLRIGIEEMVGGSNFHGNHCLMEIENGNVGK 180
           LSDPDRSMKNTKTEK EVGR+HRRR SASSLRIGI EMVGGSNFHGN CLMEIENGNVGK
Sbjct: 121 LSDPDRSMKNTKTEKDEVGRVHRRRGSASSLRIGIGEMVGGSNFHGNDCLMEIENGNVGK 180

Query: 181 ATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAK 240
            TRRKTKST+KTRLKEVSNCLTTSKELLRVLHHVLGHE H PSSTSSLITALKSELDRAK
Sbjct: 181 TTRRKTKSTIKTRLKEVSNCLTTSKELLRVLHHVLGHEEHPPSSTSSLITALKSELDRAK 240

Query: 241 TRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQA 300
           TRVD+LIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAI+SMAEEVEVEKKLRRQA
Sbjct: 241 TRVDHLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAISSMAEEVEVEKKLRRQA 300

Query: 301 ERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKES 360
           ERLNK+IAKELAEAKVSVSKAMK+VEREKRAKEILEQICEELAKGIGEDRAEFEELKKES
Sbjct: 301 ERLNKTIAKELAEAKVSVSKAMKEVEREKRAKEILEQICEELAKGIGEDRAEFEELKKES 360

Query: 361 AKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQFGN 420
           AKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLK QLQ Y +TQFGN
Sbjct: 361 AKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKHQLQAYLVTQFGN 420

Query: 421 EEPNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSDEEEESDLH 480
           EE NGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKL RKEE+NEDCSDEEEESDLH
Sbjct: 421 EEQNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLVRKEEENEDCSDEEEESDLH 480

Query: 481 SIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGSICLNTTNNTHQQNSN 540
           SIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKS+SEKIQWGSICLNTTNNTHQ NSN
Sbjct: 481 SIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSISEKIQWGSICLNTTNNTHQGNSN 540

Query: 541 TFDWDTFSELFTHKQLEDLHKLEADDDHEIKSVKCLRDILFPELDQNHNGVAKIDDEASS 600
           TFDWDTFSELFT KQLEDL +LEADDDH+IKSVKCLRDILFPELDQNHNG+AK+DDEASS
Sbjct: 541 TFDWDTFSELFTQKQLEDLQELEADDDHQIKSVKCLRDILFPELDQNHNGIAKMDDEASS 600
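
The percentages in an HSP summary line are the match counts divided by the alignment length. The following is a minimal Python sketch that recomputes them from a summary line, assuming the `label = matches/length` layout used in this report. The function name `hsp_percentages` is hypothetical.

```python
import re


def hsp_percentages(summary):
    """Return {label: percent} for every 'label = matches/length' pair."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary):
        # Percent of aligned columns, rounded to two decimals as in the report.
        result[label] = round(100.0 * int(matches) / int(length), 2)
    return result


line = "Identity = 566/604 (93.71%), Positives = 582/604 (96.36%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first hit this reproduces the reported 93.71% identity and 96.36% positives. The `Query Frame` field is ignored because it carries no `matches/length` pair.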

BLAST of Clc02G23620 vs. NCBI nr
Match: XP_038900293.1 (uncharacterized protein At5g41620 isoform X2 [Benincasa hispida] >XP_038900294.1 uncharacterized protein At5g41620 isoform X2 [Benincasa hispida])

HSP 1 Score: 985.3 bits (2546), Expect = 2.3e-283
Identity = 525/561 (93.58%), Positives = 541/561 (96.43%), Query Frame = 0

Query: 43  MSTKSHSSKLSTGDLPNRSPSCSLDGGGKGKEGSVSVSARKS---NLQKLKNNSDVVEEK 102
           MSTKSHSSKLSTGDLPNRSPSCSLDGGGKGKEGSVSVSARKS   N QKLKNNSDVVEEK
Sbjct: 1   MSTKSHSSKLSTGDLPNRSPSCSLDGGGKGKEGSVSVSARKSAGNNSQKLKNNSDVVEEK 60

Query: 103 KEVTKTRELVSQISHSCLSDPDRSMKNTKTEKVEVGRIHRRRRSASSLRIGIEEMVGGSN 162
           KE+ KTRE+VSQISHSCLSDPDRSMKNTKTEK EVGR+HRRR SASSLRIGI EMVGGSN
Sbjct: 61  KELMKTREMVSQISHSCLSDPDRSMKNTKTEKDEVGRVHRRRGSASSLRIGIGEMVGGSN 120

Query: 163 FHGNHCLMEIENGNVGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHENHRPS 222
           FHGN CLMEIENGNVGK TRRKTKST+KTRLKEVSNCLTTSKELLRVLHHVLGHE H PS
Sbjct: 121 FHGNDCLMEIENGNVGKTTRRKTKSTIKTRLKEVSNCLTTSKELLRVLHHVLGHEEHPPS 180

Query: 223 STSSLITALKSELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAI 282
           STSSLITALKSELDRAKTRVD+LIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAI
Sbjct: 181 STSSLITALKSELDRAKTRVDHLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAI 240

Query: 283 NSMAEEVEVEKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEELA 342
           +SMAEEVEVEKKLRRQAERLNK+IAKELAEAKVSVSKAMK+VEREKRAKEILEQICEELA
Sbjct: 241 SSMAEEVEVEKKLRRQAERLNKTIAKELAEAKVSVSKAMKEVEREKRAKEILEQICEELA 300

Query: 343 KGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAV 402
           KGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAV
Sbjct: 301 KGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAV 360

Query: 403 ERLKQQLQGYFLTQFGNEEPNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRK 462
           ERLK QLQ Y +TQFGNEE NGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKL RK
Sbjct: 361 ERLKHQLQAYLVTQFGNEEQNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLVRK 420

Query: 463 EEQNEDCSDEEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQW 522
           EE+NEDCSDEEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKS+SEKIQW
Sbjct: 421 EEENEDCSDEEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSISEKIQW 480

Query: 523 GSICLNTTNNTHQQNSNTFDWDTFSELFTHKQLEDLHKLEADDDHEIKSVKCLRDILFPE 582
           GSICLNTTNNTHQ NSNTFDWDTFSELFT KQLEDL +LEADDDH+IKSVKCLRDILFPE
Sbjct: 481 GSICLNTTNNTHQGNSNTFDWDTFSELFTQKQLEDLQELEADDDHQIKSVKCLRDILFPE 540

Query: 583 LDQNHNGVAKIDDEASSMVRR 601
           LDQNHNG+AK+DDEASSMVR+
Sbjct: 541 LDQNHNGIAKMDDEASSMVRK 561

BLAST of Clc02G23620 vs. NCBI nr
Match: XP_011657588.1 (uncharacterized protein At5g41620 isoform X1 [Cucumis sativus] >KGN48038.1 hypothetical protein Csa_002793 [Cucumis sativus])

HSP 1 Score: 873.2 bits (2255), Expect = 1.3e-249
Identity = 505/624 (80.93%), Positives = 545/624 (87.34%), Query Frame = 0

Query: 1   MSSWAEQKTE-KRCKIRKRGCLSSPSSSTLVRKYRFKKPPTWKMST--KSHSSKLS-TGD 60
           MSSWAEQKTE K+CKIRKR CLSSPSSST VRKYRFKKPPTWKMST  KSHSSKLS T D
Sbjct: 1   MSSWAEQKTEKKKCKIRKRVCLSSPSSSTFVRKYRFKKPPTWKMSTKSKSHSSKLSTTDD 60

Query: 61  LPNRSPSCSLDGGGKGKEGSVSVSARKSNLQKLKNNSDVVEEKKEVTKTRELVSQISHSC 120
           + NRSPSCS++   KGKE         S  + LK NS+VVE+     K+RELVS+IS + 
Sbjct: 61  IVNRSPSCSVN---KGKEEEEGGGGGGSVSRILKKNSEVVED-----KSRELVSEISETN 120

Query: 121 LSDPDRSMKNTK-TEKVEVG---RIHRRRRSASS---LRIGIEEMVGGSNFHGNHCL-ME 180
           LSDPDRS+KNTK TEK E+G   R+HRRRRSA++   LRIG  EMVGGSNFHGN CL ME
Sbjct: 121 LSDPDRSVKNTKTTEKDEIGTMKRVHRRRRSAATEPCLRIGNGEMVGGSNFHGNDCLTME 180

Query: 181 IENGNVGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITAL 240
           IENGNV K TRRKTK+TVKTRLKEVSNCLTTSKELLRVLHH+L HE+H PSSTSSLI+AL
Sbjct: 181 IENGNVEKTTRRKTKTTVKTRLKEVSNCLTTSKELLRVLHHILLHEDHLPSSTSSLISAL 240

Query: 241 KSELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEV 300
           KSELDRAKTRVD+LIKDQTF+ DEIEVL++R AEEKAAWKYRERARFGSAI+SMAEE+E+
Sbjct: 241 KSELDRAKTRVDHLIKDQTFNVDEIEVLKRRLAEEKAAWKYRERARFGSAISSMAEEMEI 300

Query: 301 EKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAE 360
           EKKLRRQAERLNKSIAKELAEAKVSVSKAMK+VEREKRAKEILEQICEELAKGIGEDRAE
Sbjct: 301 EKKLRRQAERLNKSIAKELAEAKVSVSKAMKEVEREKRAKEILEQICEELAKGIGEDRAE 360

Query: 361 FEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQG 420
           FEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLK QLQG
Sbjct: 361 FEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKHQLQG 420

Query: 421 YFLTQFGNEEPNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSD 480
           YF+   GNEE N GEN+EYSCNEFEKIKELEAYLKKINFGSCQD+EK+G+KEE N DCSD
Sbjct: 421 YFV--IGNEEQNAGENREYSCNEFEKIKELEAYLKKINFGSCQDTEKMGKKEE-NGDCSD 480

Query: 481 -------EEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGS 540
                  EEEESD+HSIELNMDNNNKSYRWSFV  EKADNNQIQ NNGRKSVSEKIQWGS
Sbjct: 481 EEEEEEEEEEESDMHSIELNMDNNNKSYRWSFV--EKADNNQIQINNGRKSVSEKIQWGS 540

Query: 541 ICLNTT-NNTHQQNSNTFDWDTFSELFTHKQLEDLHKLEADDD----HEIKSVKCLRDIL 600
           ICLNT+ NNTHQQNSN+FDWDTFSELFT K LE+LH    DDD    H+IKSVKCLRDIL
Sbjct: 541 ICLNTSNNNTHQQNSNSFDWDTFSELFTRKNLEELHDQLDDDDGGDNHQIKSVKCLRDIL 600

BLAST of Clc02G23620 vs. NCBI nr
Match: XP_031743163.1 (uncharacterized protein At5g41620 isoform X2 [Cucumis sativus] >XP_031743164.1 uncharacterized protein At5g41620 isoform X2 [Cucumis sativus])

HSP 1 Score: 800.4 bits (2066), Expect = 1.0e-227
Identity = 463/577 (80.24%), Positives = 503/577 (87.18%), Query Frame = 0

Query: 45  TKSHSSKLS-TGDLPNRSPSCSLDGGGKGKEGSVSVSARKSNLQKLKNNSDVVEEKKEVT 104
           +KSHSSKLS T D+ NRSPSCS++   KGKE         S  + LK NS+VVE+     
Sbjct: 5   SKSHSSKLSTTDDIVNRSPSCSVN---KGKEEEEGGGGGGSVSRILKKNSEVVED----- 64

Query: 105 KTRELVSQISHSCLSDPDRSMKNTK-TEKVEVG---RIHRRRRSASS---LRIGIEEMVG 164
           K+RELVS+IS + LSDPDRS+KNTK TEK E+G   R+HRRRRSA++   LRIG  EMVG
Sbjct: 65  KSRELVSEISETNLSDPDRSVKNTKTTEKDEIGTMKRVHRRRRSAATEPCLRIGNGEMVG 124

Query: 165 GSNFHGNHCL-MEIENGNVGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHEN 224
           GSNFHGN CL MEIENGNV K TRRKTK+TVKTRLKEVSNCLTTSKELLRVLHH+L HE+
Sbjct: 125 GSNFHGNDCLTMEIENGNVEKTTRRKTKTTVKTRLKEVSNCLTTSKELLRVLHHILLHED 184

Query: 225 HRPSSTSSLITALKSELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARF 284
           H PSSTSSLI+ALKSELDRAKTRVD+LIKDQTF+ DEIEVL++R AEEKAAWKYRERARF
Sbjct: 185 HLPSSTSSLISALKSELDRAKTRVDHLIKDQTFNVDEIEVLKRRLAEEKAAWKYRERARF 244

Query: 285 GSAINSMAEEVEVEKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQIC 344
           GSAI+SMAEE+E+EKKLRRQAERLNKSIAKELAEAKVSVSKAMK+VEREKRAKEILEQIC
Sbjct: 245 GSAISSMAEEMEIEKKLRRQAERLNKSIAKELAEAKVSVSKAMKEVEREKRAKEILEQIC 304

Query: 345 EELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEK 404
           EELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEK
Sbjct: 305 EELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEK 364

Query: 405 NAAVERLKQQLQGYFLTQFGNEEPNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEK 464
           NAAVERLK QLQGYF+   GNEE N GEN+EYSCNEFEKIKELEAYLKKINFGSCQD+EK
Sbjct: 365 NAAVERLKHQLQGYFV--IGNEEQNAGENREYSCNEFEKIKELEAYLKKINFGSCQDTEK 424

Query: 465 LGRKEEQNEDCSD-------EEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNN 524
           +G+KEE N DCSD       EEEESD+HSIELNMDNNNKSYRWSFV  EKADNNQIQ NN
Sbjct: 425 MGKKEE-NGDCSDEEEEEEEEEEESDMHSIELNMDNNNKSYRWSFV--EKADNNQIQINN 484

Query: 525 GRKSVSEKIQWGSICLNTT-NNTHQQNSNTFDWDTFSELFTHKQLEDLHKLEADDD---- 584
           GRKSVSEKIQWGSICLNT+ NNTHQQNSN+FDWDTFSELFT K LE+LH    DDD    
Sbjct: 485 GRKSVSEKIQWGSICLNTSNNNTHQQNSNSFDWDTFSELFTRKNLEELHDQLDDDDGGDN 544

Query: 585 HEIKSVKCLRDILFPELDQNHNGVAKIDDEASSMVRR 601
           H+IKSVKCLRDILFPEL+QNHNGV K+DDEASSMVR+
Sbjct: 545 HQIKSVKCLRDILFPELEQNHNGVMKMDDEASSMVRK 568

BLAST of Clc02G23620 vs. NCBI nr
Match: XP_016900719.1 (PREDICTED: uncharacterized protein At5g41620 [Cucumis melo])

HSP 1 Score: 731.1 bits (1886), Expect = 7.7e-207
Identity = 410/476 (86.13%), Positives = 440/476 (92.44%), Query Frame = 0

Query: 136 RIHRRRRSA---SSLRIGIEEMV-GGSNFHGNHCL-MEIENGNVGKATRRKTKSTVKTRL 195
           R HRRRRSA   S LR+G  E+V GGSNFHGN CL MEIENGNVGK TRRKTK+TVKTRL
Sbjct: 3   RGHRRRRSAATESCLRMGNGEIVAGGSNFHGNDCLTMEIENGNVGKTTRRKTKTTVKTRL 62

Query: 196 KEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAKTRVDNLIKDQTFHG 255
           KEVSNCLTTSKELLRVLHH+L HE+H PSSTSSLI+ALKSELDRAK+RVD+LIKDQTF+ 
Sbjct: 63  KEVSNCLTTSKELLRVLHHILLHEDHLPSSTSSLISALKSELDRAKSRVDHLIKDQTFNV 122

Query: 256 DEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQAERLNKSIAKELAEA 315
           DEIEV++KR AEEKAAWKYRERARFGSAI+SMAEE+EVEKKLRRQAERLNKSIAKELAEA
Sbjct: 123 DEIEVVKKRLAEEKAAWKYRERARFGSAISSMAEEMEVEKKLRRQAERLNKSIAKELAEA 182

Query: 316 KVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREML 375
           KVSVSKAMK+VEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREML
Sbjct: 183 KVSVSKAMKEVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREML 242

Query: 376 HLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQFGNEEPNGGENQEYSCN 435
           HLADVLREERVQMKLSEAKYQFEEKNAAVERLK QLQGYF+   GNE+ N GEN+EYSCN
Sbjct: 243 HLADVLREERVQMKLSEAKYQFEEKNAAVERLKHQLQGYFV--IGNEDQNAGENREYSCN 302

Query: 436 EFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSD---EEEESDLHSIELNMDNNNK 495
           EFEKIKELEAYLKKINFGSCQD+EK+GRKEE N DCSD   EEEESD+HSIELNMDNNNK
Sbjct: 303 EFEKIKELEAYLKKINFGSCQDTEKIGRKEE-NGDCSDEEEEEEESDMHSIELNMDNNNK 362

Query: 496 SYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGSICLNTT-NNTHQQNSNTFDWDTFSEL 555
           SYRWSFV  EKADNNQIQ NNGRKSVSEKIQWGSICLNT+ NNTHQQN+N+FDWDTFSEL
Sbjct: 363 SYRWSFV--EKADNNQIQINNGRKSVSEKIQWGSICLNTSNNNTHQQNTNSFDWDTFSEL 422

Query: 556 FTHKQLEDLH-KLEADDD-HEIKSVKCLRDILFPELDQNHNGVAKIDDEASSMVRR 601
           FT K L++LH +LE DDD H+IKSVKCLRDILFPEL+QNHNGV K+DDEASSMVR+
Sbjct: 423 FTRKNLDELHDQLEEDDDNHQIKSVKCLRDILFPELEQNHNGVMKMDDEASSMVRK 473

BLAST of Clc02G23620 vs. ExPASy Swiss-Prot
Match: Q66GQ2 (Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 PE=2 SV=2)

HSP 1 Score: 89.4 bits (220), Expect = 1.5e-16
Identity = 117/443 (26.41%), Positives = 209/443 (47.18%), Query Frame = 0

Query: 143 SASSLRIGIEEMV---GGSNFHGNHCLMEIENGNVGKATRRKTKSTVKT----------R 202
           SA SLR  I +M+     S    NH L  +   + G +    T +   T           
Sbjct: 126 SAGSLRRQIGQMLIKHHQSIDRNNHALQPVSPASYGSSLEVTTYNKAVTPSSSLEFRGRP 185

Query: 203 LKEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAKTRVDNLIKDQTFH 262
            +E    L TS ELL+VL+ +   E    S+  SLI ALK+E+  ++ R+  L++ Q   
Sbjct: 186 SREPHYNLKTSTELLKVLNRIWSLEEQHVSNI-SLIKALKTEVAHSRVRIKELLRYQQAD 245

Query: 263 GDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQAERLNKSIAKELAE 322
             E++ + K+ AEEK   K +E  R  SA+ S+ + +E E+KLR+++E L++ +A+EL+E
Sbjct: 246 RHELDSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDERKLRKRSESLHRKMARELSE 305

Query: 323 AKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKV--REEVEKER 382
            K S+S  +K++ER  ++ +++E +C+E AKGI     E   LKK++           ++
Sbjct: 306 VKSSLSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGGGDQ 365

Query: 383 EMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQFGNEEPNGGEN--- 442
            +LH+A+   +ER+QM+L        +  + +++L+ +++  FL +  NE P    N   
Sbjct: 366 LVLHIAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIE-TFLQEKRNEIPRNRRNSLE 425

Query: 443 -----------QEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSDEEEES 502
                      ++  C E     +   +  K    S  D  K  +  + N+D S +E+  
Sbjct: 426 SVPFNTLSAPPRDVDCEEDSGGSDSNCFELKKPAESYGDETK--KPNQHNKDGSIDEKPK 485

Query: 503 DLHSIELNMDNNNKSYRWSF-VHGEKADNNQIQSNNGRKSVSEKIQWGSICLNTTNNTHQ 553
              S ++N ++      W+   +G+K     I+     + V  +        N+ NN   
Sbjct: 486 SPSSFQVNFED---QMAWALSSNGKKKTTRAIEDEEEEEDVKPE--------NSNNNKKP 545

BLAST of Clc02G23620 vs. ExPASy Swiss-Prot
Match: F4I878 (Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 1.5e-06
Identity = 45/127 (35.43%), Positives = 71/127 (55.91%), Query Frame = 0

Query: 279 INSMAEEVEVEKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEEL 338
           I  +  E++ E+K RR+AE + K +A              KDVE E+ A+E  E   + L
Sbjct: 84  IKELKAELDYERKARRRAELMIKKLA--------------KDVEEERMAREAEEMQNKRL 143

Query: 339 AKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAA 398
            K +  +++E   +K+       ++E+ER+M  LA+VLREERVQMKL +A+   EEK + 
Sbjct: 144 FKELSSEKSEMVRMKR-------DLEEERQMHRLAEVLREERVQMKLMDARLFLEEKLSE 189

Query: 399 VERLKQQ 406
           +E   +Q
Sbjct: 204 LEEANRQ 189

BLAST of Clc02G23620 vs. ExPASy TrEMBL
Match: A0A0A0KEV9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G425750 PE=4 SV=1)

HSP 1 Score: 873.2 bits (2255), Expect = 6.1e-250
Identity = 505/624 (80.93%), Positives = 545/624 (87.34%), Query Frame = 0

Query: 1   MSSWAEQKTE-KRCKIRKRGCLSSPSSSTLVRKYRFKKPPTWKMST--KSHSSKLS-TGD 60
           MSSWAEQKTE K+CKIRKR CLSSPSSST VRKYRFKKPPTWKMST  KSHSSKLS T D
Sbjct: 1   MSSWAEQKTEKKKCKIRKRVCLSSPSSSTFVRKYRFKKPPTWKMSTKSKSHSSKLSTTDD 60

Query: 61  LPNRSPSCSLDGGGKGKEGSVSVSARKSNLQKLKNNSDVVEEKKEVTKTRELVSQISHSC 120
           + NRSPSCS++   KGKE         S  + LK NS+VVE+     K+RELVS+IS + 
Sbjct: 61  IVNRSPSCSVN---KGKEEEEGGGGGGSVSRILKKNSEVVED-----KSRELVSEISETN 120

Query: 121 LSDPDRSMKNTK-TEKVEVG---RIHRRRRSASS---LRIGIEEMVGGSNFHGNHCL-ME 180
           LSDPDRS+KNTK TEK E+G   R+HRRRRSA++   LRIG  EMVGGSNFHGN CL ME
Sbjct: 121 LSDPDRSVKNTKTTEKDEIGTMKRVHRRRRSAATEPCLRIGNGEMVGGSNFHGNDCLTME 180

Query: 181 IENGNVGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITAL 240
           IENGNV K TRRKTK+TVKTRLKEVSNCLTTSKELLRVLHH+L HE+H PSSTSSLI+AL
Sbjct: 181 IENGNVEKTTRRKTKTTVKTRLKEVSNCLTTSKELLRVLHHILLHEDHLPSSTSSLISAL 240

Query: 241 KSELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEV 300
           KSELDRAKTRVD+LIKDQTF+ DEIEVL++R AEEKAAWKYRERARFGSAI+SMAEE+E+
Sbjct: 241 KSELDRAKTRVDHLIKDQTFNVDEIEVLKRRLAEEKAAWKYRERARFGSAISSMAEEMEI 300

Query: 301 EKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAE 360
           EKKLRRQAERLNKSIAKELAEAKVSVSKAMK+VEREKRAKEILEQICEELAKGIGEDRAE
Sbjct: 301 EKKLRRQAERLNKSIAKELAEAKVSVSKAMKEVEREKRAKEILEQICEELAKGIGEDRAE 360

Query: 361 FEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQG 420
           FEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLK QLQG
Sbjct: 361 FEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKHQLQG 420

Query: 421 YFLTQFGNEEPNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSD 480
           YF+   GNEE N GEN+EYSCNEFEKIKELEAYLKKINFGSCQD+EK+G+KEE N DCSD
Sbjct: 421 YFV--IGNEEQNAGENREYSCNEFEKIKELEAYLKKINFGSCQDTEKMGKKEE-NGDCSD 480

Query: 481 -------EEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGS 540
                  EEEESD+HSIELNMDNNNKSYRWSFV  EKADNNQIQ NNGRKSVSEKIQWGS
Sbjct: 481 EEEEEEEEEEESDMHSIELNMDNNNKSYRWSFV--EKADNNQIQINNGRKSVSEKIQWGS 540

Query: 541 ICLNTT-NNTHQQNSNTFDWDTFSELFTHKQLEDLHKLEADDD----HEIKSVKCLRDIL 600
           ICLNT+ NNTHQQNSN+FDWDTFSELFT K LE+LH    DDD    H+IKSVKCLRDIL
Sbjct: 541 ICLNTSNNNTHQQNSNSFDWDTFSELFTRKNLEELHDQLDDDDGGDNHQIKSVKCLRDIL 600

BLAST of Clc02G23620 vs. ExPASy TrEMBL
Match: A0A1S4DXK6 (uncharacterized protein At5g41620 OS=Cucumis melo OX=3656 GN=LOC103491425 PE=4 SV=1)

HSP 1 Score: 731.1 bits (1886), Expect = 3.7e-207
Identity = 410/476 (86.13%), Positives = 440/476 (92.44%), Query Frame = 0

Query: 136 RIHRRRRSA---SSLRIGIEEMV-GGSNFHGNHCL-MEIENGNVGKATRRKTKSTVKTRL 195
           R HRRRRSA   S LR+G  E+V GGSNFHGN CL MEIENGNVGK TRRKTK+TVKTRL
Sbjct: 3   RGHRRRRSAATESCLRMGNGEIVAGGSNFHGNDCLTMEIENGNVGKTTRRKTKTTVKTRL 62

Query: 196 KEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAKTRVDNLIKDQTFHG 255
           KEVSNCLTTSKELLRVLHH+L HE+H PSSTSSLI+ALKSELDRAK+RVD+LIKDQTF+ 
Sbjct: 63  KEVSNCLTTSKELLRVLHHILLHEDHLPSSTSSLISALKSELDRAKSRVDHLIKDQTFNV 122

Query: 256 DEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQAERLNKSIAKELAEA 315
           DEIEV++KR AEEKAAWKYRERARFGSAI+SMAEE+EVEKKLRRQAERLNKSIAKELAEA
Sbjct: 123 DEIEVVKKRLAEEKAAWKYRERARFGSAISSMAEEMEVEKKLRRQAERLNKSIAKELAEA 182

Query: 316 KVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREML 375
           KVSVSKAMK+VEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREML
Sbjct: 183 KVSVSKAMKEVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREML 242

Query: 376 HLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQFGNEEPNGGENQEYSCN 435
           HLADVLREERVQMKLSEAKYQFEEKNAAVERLK QLQGYF+   GNE+ N GEN+EYSCN
Sbjct: 243 HLADVLREERVQMKLSEAKYQFEEKNAAVERLKHQLQGYFV--IGNEDQNAGENREYSCN 302

Query: 436 EFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSD---EEEESDLHSIELNMDNNNK 495
           EFEKIKELEAYLKKINFGSCQD+EK+GRKEE N DCSD   EEEESD+HSIELNMDNNNK
Sbjct: 303 EFEKIKELEAYLKKINFGSCQDTEKIGRKEE-NGDCSDEEEEEEESDMHSIELNMDNNNK 362

Query: 496 SYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGSICLNTT-NNTHQQNSNTFDWDTFSEL 555
           SYRWSFV  EKADNNQIQ NNGRKSVSEKIQWGSICLNT+ NNTHQQN+N+FDWDTFSEL
Sbjct: 363 SYRWSFV--EKADNNQIQINNGRKSVSEKIQWGSICLNTSNNNTHQQNTNSFDWDTFSEL 422

Query: 556 FTHKQLEDLH-KLEADDD-HEIKSVKCLRDILFPELDQNHNGVAKIDDEASSMVRR 601
           FT K L++LH +LE DDD H+IKSVKCLRDILFPEL+QNHNGV K+DDEASSMVR+
Sbjct: 423 FTRKNLDELHDQLEEDDDNHQIKSVKCLRDILFPELEQNHNGVMKMDDEASSMVRK 473

BLAST of Clc02G23620 vs. ExPASy TrEMBL
Match: A0A5D3BAL9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold182G00180 PE=4 SV=1)

HSP 1 Score: 719.9 bits (1857), Expect = 8.6e-204
Identity = 397/456 (87.06%), Positives = 426/456 (93.42%), Query Frame = 0

Query: 152 EEMVGGSNFHGNHCL-MEIENGNVGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHV 211
           E + GGSNFHGN CL MEIENGNVGK TRRKTK+TVKTRLKEVSNCLTTSKELLRVLHH+
Sbjct: 5   EIVAGGSNFHGNDCLTMEIENGNVGKTTRRKTKTTVKTRLKEVSNCLTTSKELLRVLHHI 64

Query: 212 LGHENHRPSSTSSLITALKSELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYR 271
           L HE+H PSSTSSLI+ALKSELDRAK+RVD+LIKDQTF+ DEIEV++KR AEEKAAWKYR
Sbjct: 65  LLHEDHLPSSTSSLISALKSELDRAKSRVDHLIKDQTFNVDEIEVVKKRLAEEKAAWKYR 124

Query: 272 ERARFGSAINSMAEEVEVEKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEI 331
           ERARFGSAI+SMAEE+EVEKKLRRQAERLNKSIAKELAEAKVSVSKAMK+VEREKRAKEI
Sbjct: 125 ERARFGSAISSMAEEMEVEKKLRRQAERLNKSIAKELAEAKVSVSKAMKEVEREKRAKEI 184

Query: 332 LEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKY 391
           LEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKY
Sbjct: 185 LEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKY 244

Query: 392 QFEEKNAAVERLKQQLQGYFLTQFGNEEPNGGENQEYSCNEFEKIKELEAYLKKINFGSC 451
           QFEEKNAAVERLK QLQGYF+   GNE+ N GEN+EYSCNEFEKIKELEAYLKKINFGSC
Sbjct: 245 QFEEKNAAVERLKHQLQGYFV--IGNEDQNAGENREYSCNEFEKIKELEAYLKKINFGSC 304

Query: 452 QDSEKLGRKEEQNEDCSD---EEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSN 511
           QD+EK+GRKEE N DCSD   EEEESD+HSIELNMDNNNKSYRWSFV  EKADNNQIQ N
Sbjct: 305 QDTEKIGRKEE-NGDCSDEEEEEEESDMHSIELNMDNNNKSYRWSFV--EKADNNQIQIN 364

Query: 512 NGRKSVSEKIQWGSICLNTT-NNTHQQNSNTFDWDTFSELFTHKQLEDLH-KLEADDD-H 571
           NGRKSVSEKIQWGSICLNT+ NNTHQQN+N+FDWDTFSELFT K L++LH +LE DDD H
Sbjct: 365 NGRKSVSEKIQWGSICLNTSNNNTHQQNTNSFDWDTFSELFTRKNLDELHDQLEEDDDNH 424

Query: 572 EIKSVKCLRDILFPELDQNHNGVAKIDDEASSMVRR 601
           +IKSVKCLRDILFPEL+QNHNGV K+DDEASSMVR+
Sbjct: 425 QIKSVKCLRDILFPELEQNHNGVMKMDDEASSMVRK 455

BLAST of Clc02G23620 vs. ExPASy TrEMBL
Match: A0A6J1K0X3 (uncharacterized protein LOC111490718 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490718 PE=4 SV=1)

HSP 1 Score: 649.4 bits (1674), Expect = 1.4e-182
Identity = 392/601 (65.22%), Positives = 462/601 (76.87%), Query Frame = 0

Query: 3   SWAEQKTEKRCKIRKRGC----LSSPSSSTLVRKYRFKKPPTWKMSTKSHSSKLSTGDLP 62
           SW EQKTE+ CKIRKR C     SS SSSTLV KYRFK  PTWKMSTKSHSS        
Sbjct: 2   SWPEQKTEEICKIRKRRCSSSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSS-------- 61

Query: 63  NRSPSCSLDGGG-KGKEGSVSVS---ARKSNLQKLKNNSDVVEEKKEVTKTRELVSQISH 122
           NRSPSCS+ GGG KGKE SVSVS   + +++ QKLKNN D++E+K+E+ KT++ VSQISH
Sbjct: 62  NRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISH 121

Query: 123 SCLSDPDRSMKNTKTEKVEVGRIHRRRRSASSLRIGIEEMVGGSNFHGNHCLMEIEN-GN 182
           SCLSDPD    ++ ++KVE  R+HRRR SASSLRIG     G +NFHGNHCL+EIEN  N
Sbjct: 122 SCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSLRIG----TGEANFHGNHCLIEIENPSN 181

Query: 183 VGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHEN---HRPSSTSSLITALKS 242
            G+  RRKTK  +KTRLKEVSNCLTTSKEL+RVL+HVL HE+   HRPSS S LITALKS
Sbjct: 182 QGRTARRKTKFMLKTRLKEVSNCLTTSKELVRVLNHVLAHEDNDQHRPSSISPLITALKS 241

Query: 243 ELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEK 302
           E++RAK RVD+LIKDQ+FHGDEIE++ KRF EEK AWK RERAR  S+I SMA+E+E+EK
Sbjct: 242 EMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEK 301

Query: 303 KLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAEFE 362
           KLR+QAERLNK+IAKELAEAK+S+SKAMKD++RE+RAKEI EQIC+ELAKGIGEDRA+FE
Sbjct: 302 KLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFE 361

Query: 363 ELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYF 422
           E KKESAKVREE+E+EREML LADVLREERVQMKLSEAKYQFEEKNAAVERLK +L+ + 
Sbjct: 362 EFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFL 421

Query: 423 LTQFGNEEPNGGENQEYSCNEFEKIKELEAYLKKINFGSCQD-SEKLGRKEEQNEDCSDE 482
           +TQF +E     E ++YS     KIKELEAYLKKINFGS Q+  +  G+ EEQ  +CS E
Sbjct: 422 ITQFRHE---NREEEDYS----GKIKELEAYLKKINFGSVQEHPDGDGKIEEQ--ECS-E 481

Query: 483 EEESDLHSIELNMDNNNKSYRWSFVH-GEKADNNQIQSNNGRKSVSEKIQWGSICLNTTN 542
           E++SDLHSIELNMDNNNKSYRWSFVH G K ++ +    NGRKSVSEKIQWGSICLN   
Sbjct: 482 EDDSDLHSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKA 541

Query: 543 NTHQQN-----------SNTFDWDTFSELFTHKQLEDLHKLEADDDHEIKSVKCLRDILF 579
           +   +N           S   +W+ F+E+F     E      + +    KS KCLRDILF
Sbjct: 542 SNGSKNGDFVGRKSHESSERLEWERFTEVF-----EKEGDNGSAEKKNTKSGKCLRDILF 575
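
The percentages in an HSP summary line follow directly from the residue counts (for example, 392 identical positions over an alignment length of 601). The sketch below is an illustration only and is not part of the database's own tooling: it parses such a summary line and recomputes the percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the line layout shown in the HSP headers on this page.

```python
import re

# Pattern for the HSP summary line as it appears on this page.
# The second label is matched as either "Positives" or the variant "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Recomputed from the raw counts: matches / alignment length * 100.
        "identity": int(ident) / int(ident_len) * 100,
        "positives": int(pos) / int(pos_len) * 100,
        # Values as printed in the line, kept for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 392/601 (65.22%), Positives = 462/601 (76.87%), Query Frame = 0"
)
print(round(stats["identity"], 2), round(stats["positives"], 2))
```

Comparing the recomputed values with the reported ones is a quick sanity check when the summary lines are copied or re-extracted from the page.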

BLAST of Clc02G23620 vs. ExPASy TrEMBL
Match: A0A6J1JCV9 (uncharacterized protein LOC111484580 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484580 PE=4 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 2.4e-182
Identity = 402/606 (66.34%), Positives = 457/606 (75.41%), Query Frame = 0

Query: 3   SWAEQKTEKRCKIRKRGCLSSPSSSTLVRKYRFKKPPTWKMSTKSHSSKLSTGDLPNRSP 62
           SW +QKTEK+CKIRKR CLSSPSSSTLVRKYRFKKPPTWKM+TKSHSSKLS+GDL N+SP
Sbjct: 2   SWTDQKTEKKCKIRKRVCLSSPSSSTLVRKYRFKKPPTWKMNTKSHSSKLSSGDLANQSP 61

Query: 63  SCSLD-GGGKGKEGSVSVSARKSNLQKLKNNSDVVEEKKEVTKTRELVSQISHSCLSDPD 122
           SCS+D GGGKGK+ SVSVSARKS  +K +      + KKEV KT+ELVSQI HSCLSDPD
Sbjct: 62  SCSVDSGGGKGKQCSVSVSARKSAAEKTQ------KLKKEVIKTQELVSQIWHSCLSDPD 121

Query: 123 RSMKNTKTEKVEVGRIHRRRRSASSLRIGIEEMVGGSNFHGNHCLMEIENGNVGKATRRK 182
            S    +TEKVE GR+  RRR+ +  R G  E++GGSNFHG  CLMEIENGNV       
Sbjct: 122 PSFNILETEKVEGGRVQGRRRTTT--RTGTGEVMGGSNFHGKDCLMEIENGNV------- 181

Query: 183 TKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAKTRVDN 242
               VKTRLKEVSN LTTSKELLRVL+HV GHE  R SS  SLI+ALKSELDRAK+ VD 
Sbjct: 182 ----VKTRLKEVSNWLTTSKELLRVLNHVCGHEEQRQSSALSLISALKSELDRAKSGVDE 241

Query: 243 LI-KDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQAERLN 302
           LI K+Q+F  DEIEV+       KAAWK RER     AI SMA+E+EVEKK RRQAERLN
Sbjct: 242 LIVKEQSFRDDEIEVVM------KAAWKNRER-----AITSMADEIEVEKKRRRQAERLN 301

Query: 303 KSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVR 362
           KSIAKELA+ K  VSK  +D++REKRAKEILEQICEELA GIGEDRAE EELK+ESAKVR
Sbjct: 302 KSIAKELADTKALVSKLRRDLQREKRAKEILEQICEELAAGIGEDRAELEELKRESAKVR 361

Query: 363 EEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQFGNEEPN 422
           EEVEKEREML L DVLREERVQMKLSEAK++FEEKNAAVERLK QL+GY +   GN+E  
Sbjct: 362 EEVEKEREMLRLVDVLREERVQMKLSEAKFEFEEKNAAVERLKHQLEGYLV---GNDE-- 421

Query: 423 GGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSDEEEESDLHSIEL 482
               Q++ CN+FEKIKELEAYLK+INFGSC+D       ++Q ++CSD  EESDLHSIEL
Sbjct: 422 --HEQDHCCNKFEKIKELEAYLKRINFGSCRD-------QDQEQECSD-SEESDLHSIEL 481

Query: 483 NMD--NNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGSICLNTTN---NTHQQNS 542
           NMD  ++ KSYRWSFVHG    N+  +SN     +SEKIQWGSICLNTTN    THQQNS
Sbjct: 482 NMDDGDDKKSYRWSFVHGGSQKNSLEKSN----PISEKIQWGSICLNTTNPSTATHQQNS 541

Query: 543 NTFDWDTFSELFTHKQLEDLHKLEADDDHEIKSVKCLRDILF----PELDQ-------NH 591
           N      FSELFTH      +  E  DDH++KS+ CLRDILF    PELDQ       NH
Sbjct: 542 N-----RFSELFTH------NLQEQGDDHQVKSLNCLRDILFPETTPELDQIPAAKTDNH 547

BLAST of Clc02G23620 vs. TAIR 10
Match: AT3G11590.1 (unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G22310.1); Has 22320 Blast hits to 15179 proteins in 1213 species: Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi - 1700; Plants - 1146; Viruses - 65; Other Eukaryotes - 5824 (source: NCBI BLink). )

HSP 1 Score: 268.9 bits (686), Expect = 1.0e-71
Identity = 235/539 (43.60%), Positives = 314/539 (58.26%), Query Frame = 0

Query: 14  KIRKRGCLSSPSSST---LVRKYRFKKP-------------PTWKMSTKSHSSKLSTGDL 73
           KIRKRGC SSP+SST   L   YRFK+              PTW++  +S S + S    
Sbjct: 16  KIRKRGC-SSPTSSTSSILREGYRFKRAIVVGKRGGSTTPVPTWRLMGRSPSPRASGALH 75

Query: 74  PNRSPSCSLDGGGKGK-EGSVSVSARKS-----NLQKLKNNSDVVEEKKEVTKTR-ELVS 133
              SPS S  G   GK      VSARK       + ++ +   V E    + K+R E ++
Sbjct: 76  AAASPS-SHCGSKTGKVSAPAPVSARKLAATLWEMNEMPSPRVVEEAAPMIRKSRKERIA 135

Query: 134 QIS------HS-----CLSDPDRSMKNTKTEKVEVGRIHRRRRS-ASSLRIGIEEMVGGS 193
            +       HS      LSDP  S  + + E+   G   RR  S    LR+G +  VG  
Sbjct: 136 PLPPPRSSVHSGSLPPHLSDPSHSPVSERMERSGTGSRQRRASSTVQKLRLG-DCNVGAR 195

Query: 194 NFHGNHCLMEIENGNVGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHENHRP 253
           +   +   M+IE  +  +     T   VKTRLK+ SN LTTSKELL++++ + G ++ RP
Sbjct: 196 DPINSGSFMDIETRSRVETPTGSTVG-VKTRLKDCSNALTTSKELLKIINRMWGQDD-RP 255

Query: 254 SSTSSLITALKSELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSA 313
           SS+ SL++AL SEL+RA+ +V+ LI +     ++I  L KRFAEEKA WK  E+    +A
Sbjct: 256 SSSMSLVSALHSELERARLQVNQLIHEHKPENNDISYLMKRFAEEKAVWKSNEQEVVEAA 315

Query: 314 INSMAEEVEVEKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEEL 373
           I S+A E+EVE+KLRR+ E LNK + KELAE K ++ KA+K++E EKRA+ ++E++C+EL
Sbjct: 316 IESVAGELEVERKLRRRFESLNKKLGKELAETKSALMKAVKEIENEKRARVMVEKVCDEL 375

Query: 374 AKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAA 433
           A+ I ED+AE EELK+ES KV+EEVEKEREML LAD LREERVQMKLSEAK+Q EEKNAA
Sbjct: 376 ARDISEDKAEVEELKRESFKVKEEVEKEREMLQLADALREERVQMKLSEAKHQLEEKNAA 435

Query: 434 VERLKQQLQGYFLTQFGNEEPNGGENQEYSCNEFEKIKELEAYLK-KINFGSCQDSEKLG 493
           V++L+ QLQ Y   +   E+       +   NE     E   YL   I+FGS    +  G
Sbjct: 436 VDKLRNQLQTYLKAKRCKEKTREPPQTQLH-NE-----EAGDYLNHHISFGSYNIED--G 495

Query: 494 RKEEQNEDCSDEEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEK 517
             E  NE+ S    ESDLHSIELN+D  NKSY+W +  GE+        N GRKS   K
Sbjct: 496 EVENGNEEGSG---ESDLHSIELNID--NKSYKWPY--GEE--------NRGRKSTPRK 526

BLAST of Clc02G23620 vs. TAIR 10
Match: AT5G22310.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11590.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 154.8 bits (390), Expect = 2.1e-37
Identity = 167/536 (31.16%), Positives = 268/536 (50.00%), Query Frame = 0

Query: 7   QKTEKRCKIRKRGCLSSPSSSTLVRKYRFKKPP-TWKMSTKSHS---------SKLSTGD 66
           ++ +K CKIRKRG  SS SSS+L R+ RFK+     K + +            +   T  
Sbjct: 2   EQRKKGCKIRKRGG-SSSSSSSLARRNRFKRAIFAGKRAAQDDGGSGTPVKSITAAKTPV 61

Query: 67  LPNRSPSCSLDGGGKGKEGSVSVSARKSNLQKLKNNSD-VVEEKKEVTKTRE-----LVS 126
           L + SP        + ++  VS     + L ++ +++D  V   K+  ++++        
Sbjct: 62  LLSFSPENLPIDHHQLQKSCVSARKLAATLWEINDDADPPVNSDKDCLRSKKPSRYRAKK 121

Query: 127 QISHSCLSDPDRS---MKNTKTEKVEVGRIHRRRRSASSLRIG-IEEMVGGSNFHGNHCL 186
               S +  P RS   +    +E++++     RRRS +  ++  IE  + G+N       
Sbjct: 122 STEFSSIDFPPRSSDPISRLSSERIDLCDDMIRRRSTNPQKLNPIEYKIIGAN------- 181

Query: 187 MEIENGNVGKATRRKTKSTVKTRLKEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLIT 246
                             +VKTR K VS+ LTTSKEL++VL  +    +   ++++ LI+
Sbjct: 182 ------------------SVKTRFKNVSDGLTTSKELVKVLKRIGELGDDHKTASNRLIS 241

Query: 247 ALKSELDRAKTRVDNLIKDQTFHGDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEV 306
           AL  ELDRA++ + +L+ +     DE E  ++R                   I S+ EE 
Sbjct: 242 ALLCELDRARSSLKHLMSEL----DEEEEEKRRL------------------IESLQEEA 301

Query: 307 EVEKKLRRQAERLNKSIAKELAEAKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDR 366
            VE+KLRR+ E++N+ + +EL EAK +  K  ++++REKRAK++LE++C+EL KGIG+D 
Sbjct: 302 MVERKLRRRTEKMNRRLGRELTEAKETERKMKEEMKREKRAKDVLEEVCDELTKGIGDD- 361

Query: 367 AEFEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQL 426
                        ++E+EKEREM+H+ADVLREERVQMKL+EAK++FE+K AAVERLK++L
Sbjct: 362 -------------KKEMEKEREMMHIADVLREERVQMKLTEAKFEFEDKYAAVERLKKEL 421

Query: 427 QGYFLTQFGNEEPNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDC 486
           +         EE  G               E+   L+ I+                    
Sbjct: 422 R----RVLDGEEGKGS-------------SEIRRILEVIDGSG----------------- 437

Query: 487 SDEEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGSI 523
           SD++EESDL SIELNM++ +K   W +V   K D  +   + G     + ++  S+
Sbjct: 482 SDDDEESDLKSIELNMESGSK---WGYVDSLK-DRRRFDGSGGDDDDDDPVEKRSV 437

BLAST of Clc02G23620 vs. TAIR 10
Match: AT1G50660.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast hits to 15134 proteins in 1325 species: Archae - 461; Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants - 1035; Viruses - 42; Other Eukaryotes - 4809 (source: NCBI BLink). )

HSP 1 Score: 137.5 bits (345), Expect = 3.5e-32
Identity = 109/363 (30.03%), Positives = 207/363 (57.02%), Query Frame = 0

Query: 196 CLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAKTRVDNLIKDQTFHGDEIEV 255
           CL T +E+ ++  ++      +  +  SL+++L++EL+ A  R+++L  ++  H  ++E 
Sbjct: 212 CLDTMEEVHQIYSNM--KRIDQQVNAVSLVSSLEAELEEAHARIEDLESEKRSHKKKLEQ 271

Query: 256 LRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQAERLNKSIAKELAEAKVSVS 315
             ++ +EE+AAW+ RE  +  + I+ M  ++  EKK R++ E +N  +  ELA++K++V 
Sbjct: 272 FLRKVSEERAAWRSREHEKVRAIIDDMKTDMNREKKTRQRLEIVNHKLVNELADSKLAVK 331

Query: 316 KAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADV 375
           + M+D E+E++A+E++E++C+ELAK IGED+AE E LK+ES  +REEV+ ER ML +A+V
Sbjct: 332 RYMQDYEKERKARELIEEVCDELAKEIGEDKAEIEALKRESMSLREEVDDERRMLQMAEV 391

Query: 376 LREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQ---FGNEEPNGGE--NQEYSCN 435
            REERVQMKL +AK   EE+ + + +L   L+ +  ++      +E    E   +  +  
Sbjct: 392 WREERVQMKLIDAKVALEERYSQMNKLVGDLESFLRSRDIVTDVKEVREAELLRETAASV 451

Query: 436 EFEKIKE----------LEAYLKKINFGSCQDSEKLGRKEEQNEDCSDEEEESDLHSIEL 495
             ++IKE          + A  +++N G   D     R+ E++   S    +S +H++ L
Sbjct: 452 NIQEIKEFTYVPANPDDIYAVFEEMNLGEAHD-----REMEKSVAYSPISHDSKVHTVSL 511

Query: 496 NMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGSI--------CLNTTNNTHQ 536
           + +  NK  R S  +  +  + + + ++G ++VS   + GS          +N  N+ H+
Sbjct: 512 DANMMNKKGRHSDAYTHQNGDIE-EDDSGWETVSHLEEQGSSYSPDGSIPSVNNKNHNHR 566

BLAST of Clc02G23620 vs. TAIR 10
Match: AT3G20350.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G50660.1); Has 15095 Blast hits to 11224 proteins in 1051 species: Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi - 1255; Plants - 746; Viruses - 40; Other Eukaryotes - 4245 (source: NCBI BLink). )

HSP 1 Score: 127.1 bits (318), Expect = 4.8e-29
Identity = 106/361 (29.36%), Positives = 196/361 (54.29%), Query Frame = 0

Query: 196 CLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAKTRVDNLIKDQTFHGDEIEV 255
           CL T  ++ ++  +V    N++  +  SL ++++ +L  A+  + +L  ++     ++E 
Sbjct: 189 CLDTRDDVHQIYTNV--KWNNQQVNDVSLASSIELKLQEARACIKDLESEKRSQKKKLEQ 248

Query: 256 LRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQAERLNKSIAKELAEAKVSVS 315
             K+ +EE+AAW+ RE  +  + I+ M  ++  EKK R++ E +N  +  ELA++K++V 
Sbjct: 249 FLKKVSEERAAWRSREHEKVRAIIDDMKADMNQEKKTRQRLEIVNSKLVNELADSKLAVK 308

Query: 316 KAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKVREEVEKEREMLHLADV 375
           + M D ++E++A+E++E++C+ELAK I ED+AE E LK ES  +REEV+ ER ML +A+V
Sbjct: 309 RYMHDYQQERKARELIEEVCDELAKEIEEDKAEIEALKSESMNLREEVDDERRMLQMAEV 368

Query: 376 LREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQ--FGNEEPNGGENQEYSCNEFE 435
            REERVQMKL +AK   EEK + + +L   ++ +  ++   G +E    E    +    +
Sbjct: 369 WREERVQMKLIDAKVTLEEKYSQMNKLVGDMEAFLSSRNTTGVKEVRVAELLRETAASVD 428

Query: 436 KIKELEAY-------------LKKINFGSCQDSEKLGRKEEQNEDCSDEEEESDLHSIEL 495
            I+E++ +              +++N G  QD     R+ EQ    S     S  H++  
Sbjct: 429 NIQEIKEFTYEPAKPDDILMLFEQMNMGENQD-----RESEQYVAYSPVSHASKAHTVSP 488

Query: 496 NMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSVSEKIQWGSI-----CLNTTNNTHQQNS 537
           +++  NK    +    +  +    + ++G ++VS   + GS       +   +NTH +NS
Sbjct: 489 DVNLINKGRHSNAFTDQNGEFE--EDDSGWETVSHSEEHGSSYSPDESIPNISNTHHRNS 540

BLAST of Clc02G23620 vs. TAIR 10
Match: AT5G41620.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, plasma membrane; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: 6 growth stages; BEST Arabidopsis thaliana protein match is: intracellular protein transport protein USO1-related (TAIR:AT1G64180.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 89.4 bits (220), Expect = 1.1e-17
Identity = 117/443 (26.41%), Positives = 209/443 (47.18%), Query Frame = 0

Query: 143 SASSLRIGIEEMV---GGSNFHGNHCLMEIENGNVGKATRRKTKSTVKT----------R 202
           SA SLR  I +M+     S    NH L  +   + G +    T +   T           
Sbjct: 126 SAGSLRRQIGQMLIKHHQSIDRNNHALQPVSPASYGSSLEVTTYNKAVTPSSSLEFRGRP 185

Query: 203 LKEVSNCLTTSKELLRVLHHVLGHENHRPSSTSSLITALKSELDRAKTRVDNLIKDQTFH 262
            +E    L TS ELL+VL+ +   E    S+  SLI ALK+E+  ++ R+  L++ Q   
Sbjct: 186 SREPHYNLKTSTELLKVLNRIWSLEEQHVSNI-SLIKALKTEVAHSRVRIKELLRYQQAD 245

Query: 263 GDEIEVLRKRFAEEKAAWKYRERARFGSAINSMAEEVEVEKKLRRQAERLNKSIAKELAE 322
             E++ + K+ AEEK   K +E  R  SA+ S+ + +E E+KLR+++E L++ +A+EL+E
Sbjct: 246 RHELDSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDERKLRKRSESLHRKMARELSE 305

Query: 323 AKVSVSKAMKDVEREKRAKEILEQICEELAKGIGEDRAEFEELKKESAKV--REEVEKER 382
            K S+S  +K++ER  ++ +++E +C+E AKGI     E   LKK++           ++
Sbjct: 306 VKSSLSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGGGDQ 365

Query: 383 EMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKQQLQGYFLTQFGNEEPNGGEN--- 442
            +LH+A+   +ER+QM+L        +  + +++L+ +++  FL +  NE P    N   
Sbjct: 366 LVLHIAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIE-TFLQEKRNEIPRNRRNSLE 425

Query: 443 -----------QEYSCNEFEKIKELEAYLKKINFGSCQDSEKLGRKEEQNEDCSDEEEES 502
                      ++  C E     +   +  K    S  D  K  +  + N+D S +E+  
Sbjct: 426 SVPFNTLSAPPRDVDCEEDSGGSDSNCFELKKPAESYGDETK--KPNQHNKDGSIDEKPK 485

Query: 503 DLHSIELNMDNNNKSYRWSF-VHGEKADNNQIQSNNGRKSVSEKIQWGSICLNTTNNTHQ 553
              S ++N ++      W+   +G+K     I+     + V  +        N+ NN   
Sbjct: 486 SPSSFQVNFED---QMAWALSSNGKKKTTRAIEDEEEEEDVKPE--------NSNNNKKP 545

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_038900292.1	3.3e-306	93.71	uncharacterized protein At5g41620 isoform X1 [Benincasa hispida]
XP_038900293.1	2.3e-283	93.58	uncharacterized protein At5g41620 isoform X2 [Benincasa hispida] >XP_038900294.1...
XP_011657588.1	1.3e-249	80.93	uncharacterized protein At5g41620 isoform X1 [Cucumis sativus] >KGN48038.1 hypot...
XP_031743163.1	1.0e-227	80.24	uncharacterized protein At5g41620 isoform X2 [Cucumis sativus] >XP_031743164.1 u...
XP_016900719.1	7.7e-207	86.13	PREDICTED: uncharacterized protein At5g41620 [Cucumis melo]
Match Name	E-value	Identity (%)	Description
Q66GQ2	1.5e-16	26.41	Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 P...
F4I878	1.5e-06	35.43	Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1
Match Name	E-value	Identity (%)	Description
A0A0A0KEV9	6.1e-250	80.93	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G425750 PE=4 SV=1
A0A1S4DXK6	3.7e-207	86.13	uncharacterized protein At5g41620 OS=Cucumis melo OX=3656 GN=LOC103491425 PE=4 S...
A0A5D3BAL9	8.6e-204	87.06	Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1K0X3	1.4e-182	65.22	uncharacterized protein LOC111490718 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1JCV9	2.4e-182	66.34	uncharacterized protein LOC111484580 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
Match Name	E-value	Identity (%)	Description
AT3G11590.1	1.0e-71	43.60	unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures;...
AT5G22310.1	2.1e-37	31.16	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G50660.1	3.5e-32	30.03	unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas...
AT3G20350.1	4.8e-29	29.36	unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma mem...
AT5G41620.1	1.1e-17	26.41	FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 279..334
None	No IPR available	COILS	Coil	Coil	coord: 342..369
None	No IPR available	COILS	Coil	Coil	coord: 225..245
None	No IPR available	COILS	Coil	Coil	coord: 86..106
None	No IPR available	COILS	Coil	Coil	coord: 385..405
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..81
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 42..61
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 453..475
None	No IPR available	PANTHER	PTHR31071:SF16	OS04G0382800 PROTEIN	coord: 11..581
IPR043424	Protein BRANCHLESS TRICHOME-like	PANTHER	PTHR31071	GB|AAF24581.1	coord: 11..581
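
The `coord: start..end` entries above are inclusive residue ranges on the protein, and some of them overlap (for example, the disorder predictions 1..81 and 42..61). As a rough illustration, and not as part of the InterPro pipeline, the following sketch parses such entries and counts how many residues a set of ranges covers without double-counting overlaps. The helper names `parse_coord` and `covered_residues` are hypothetical.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a string containing 'coord: start..end'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(match.group(1)), int(match.group(2))


def covered_residues(ranges):
    """Count residues covered by inclusive ranges, merging any overlaps."""
    total = 0
    last_end = 0  # highest residue position already counted
    for start, end in sorted(ranges):
        start = max(start, last_end + 1)  # skip the part already counted
        if end >= start:
            total += end - start + 1
            last_end = end
    return total


coils = [parse_coord(c) for c in (
    "coord: 279..334", "coord: 342..369", "coord: 225..245",
    "coord: 86..106", "coord: 385..405",
)]
disorder = [parse_coord(c) for c in (
    "coord: 1..81", "coord: 42..61", "coord: 453..475",
)]
print(covered_residues(coils), covered_residues(disorder))
```

Sorting the ranges first means each one only needs to be compared against the furthest position counted so far, which keeps the merge to a single pass.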

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Clc02G23620.1	Clc02G23620.1	mRNA