Clc01G05690 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G05690
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: VARLMGL domain-containing protein
Location: ClcChr01: 5347600 .. 5352120 (+)
RNA-Seq Expression: Clc01G05690
Synteny: Clc01G05690
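
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers introduced for illustration.

```python
import re

# Pattern for a location string such as "ClcChr01: 5347600 .. 5352120 (+)".
LOCATION = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


chrom, start, end, strand = parse_location("ClcChr01: 5347600 .. 5352120 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the reported interval spans 4521 bp on the forward strand.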
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGATCATCTTCTCGTCGAGCCACCACCGCCGATCACCCCCGGCAACGCTACCGTCCTTCTCTTCCGAGCTGTATGAGCACCCTTTTTCACTTCTTTGATTTTCGTTCCTCTCGTTTTACTCACATTGTCTTCGATAATCACCACCCGTCCTCCTTCAACCTCCCGCATCATCGCCCTCCCCTACCCAAAGCTTCCCATCATGGTATATACACTTTCAAAATTTTAATTTTCATTTTTTTAAAGAAAGATATTTGGTCAATTTGATCCTCTATGAATGCTTGTTATGCGTGCATGAGTAGATTAGAATATGATTCCATGTGATATATTAGATAAAGTTTTGATAATGTAATAATGTTTGATATTAAAAAAGGGGTCATATTATTACTTAATATTTTTATGGTTAATTGATCTTTAATTATATATATATAAAAGGGCCTATTTTCAAACTATAAAGAAATAAGAAAAAAAGGGAGATTTTTATTTATTTAATTTTTAATTTTTACTTACATTCTCTTTTGGGAAATTCTCAGAAAACTTTGGTACCATCTTTTTTTGTTTTTCAGTTGTTGAAGCACCAAGGAATAGCTTGGAATTAGAGGGAGCTTCAATTTCTTGCTTAAGAAATAAAGAAAAAAATTTGCAACTTCAAGTAAACTTTTCCTTCTCCTTTTTTCATCAAGTCTATTCAAAATGATAGTGTAATTATGCTCAAGAAATCATTTTTTTTTAATGTAAATTTAAAAATTTACTAACATATAATATATATCTTAAAACAGATGGGACTTCAAATCAAAACTAGAAATGGTAGCACAAAATCAAAAGCAAGTGAGCAACAACTTCCAAATAATGATCACATTATTGCATTGGAATCTCCAAGTGGAAAGACACCAAATCTCTTGGCAAGATTAATGGGTCTTGATATTCTCCCTCAAAACCCCTCTTCTTCTTTCAATTGTCGCGGGTCAAATTTCGGAACCCGTTCTCTCCCAGAGAGCCCGAGAGTATCGTCAGCAAGACTATCAGACGTCGATTGTCATCATCGTCGCCTCTCACTCCAAATTATTTCAGACAAAGAAAATATAAATTTTTTTGAAGAGGCTAAGCGAGAAAAGGAAAAAGTGAGCAAGAAAGTTGCCCTCGTTGATATCACCAATAATAACAGAAAAATAGAGTTCGGAAAACAAGAAGTTGGTTTTAGTCAAATTAAAGTCGAGATCAAGTCCTCTAAGAAACTTAAGAAAACAGCCGTTGACGAATCAAGACGTAGTTCGAAAGTCGTGCGTAAAAATCAAGAGGTAATGATTTCAAAGAAGCAAAAGCTAATATCGATGTCGATGCAAAAACCAAAGCGGAGGGCTAGAGAAGGTGAAGCATTTGATTGTCCAACAAGTAATAACCTTGTTAACAACCTCCATCATTCAACCATTTTTCCAGCAAAGAAAGAGCCTTCTCCTCCGGCGATCCAAGCCCCTCGTGAACAGGTACTCAACATATTACTTTTCATGTCCTTTCAATAAATATATATTAAAACAATTATTAAAAAATAAATAAACTTTCCTTTACTTTTTTACCTGTGATTTGTTTCTAATGACCATAAAATATTATCGAAATAAACATGTATGAAACTAGATAATGTAAATAATTAAATACAAATAAACTTTACCCAAGTCAGTCATGTGAGTCAATTCGAAAAAATGGACATCCTCCCAGAGAGAACTCAGGTAAGAAATTCAATAGTACAATAAATAAAACAATATATATTTAAATATATCATCATTTTTATTATTCTTATAATTCTCTTAAAAAATAACATAATCTTCCTCAAAGTTTTTATTTTATTTTATTTTTTTGGGTTCAAAATAAGTAGCAAGAATTTCAACCTGTCAAATTTAATAAAAAATATTAATATTAAATATAAGTTATTTTTAAATATAGTAAAATGAATTGACTTATTTACAAATATAGTAAAA
TATTATTGTCTATCATTGATAGACACTAATATACAATGTAACTTGATTTTACTCGCTACTTAGTAACATTCGTTTTGGCTACAAAAAGTTTCTAGTTAAATTATATTATTCTATTAAATAAAAATTTATAGTTATTTAATAAATTTTGTATTTTTTTCTTTTCAAATATTAGTTGTATGATGATGATAGATGCAATAACATGGTTTGGTAAATTGTCACGACATTACTTTTAAAAATGTCGTAGGTTGAAGTCTATGATCTTCATATTTGTGATGTAATTAGAAAGTATTAGAATTTTTTTTTTTTAAATATCTTTTCAAGAATAAAGTTTATTATTATTTTCTTGTCAACTTTTCTTAAAAACATTTTTGAAATATATAGCCCGATTTTCAGAACTACATTTTATAATTTTTCTTTTAAAATCACATATAGCTAAAGTACTTTTGACATGTTTCTATAAGTAGATTATAATATAAATTATTGCTAATAAACAATTGTTTTTTTTAATTGAAGACCAAACAATAATTATGACAGTTTTTATTTTTCATTTTTATTTATTTTTGTCCTTTAAGTTTACTTTTCTTTTAATATTAGTAGTAATTTCTACTTGAAAGATCACACAAATAAATCTTGAAAATTCTAACCATCAAATATTTTAAAACATAACTGGAAAAAAAAAAAAGACACAAGTACTACTGATACAAAAAGTTGAAACATTGTTTTAATTTATAATAGTGAACAGGACATGTAAGATTTTCTGCAAAAGAATAAATTGATTATTTATGTTGGTCAAGAGGTTCAAATTTTTCTTGTGAAAAATAGGTCAATAACTTTATTGAAGAATTAAATGCAACATTTTATTTTTTCTGCATAAGATTTTTAAGAGAAAAATAAATTTAAGCCTAAAAGTTATTTATCAAGGTTTTTTTTTTTTTTGAATAATTGGTTTACTTCAAATTAAATCTCATGTTTATTCAATTATCATTTGTCTCGCTTTGTTAAATTATTAAATAAATTCAAAATGAAGAGTAATTTAAATAAATTATTTGAATTTAGGCAATTAAATCGAGTCAAATATTTCACCTAAAATTATTTTTCATTAGATAACATGATTTTATTATTATTGTTTTTATTATCATTTATGTGATGATAATTTTAACTATATTCTTGTTTCATGCTAAATAATTGCTAGTGAATGCTAATATTAAACCAAAGAGTTGGTAATAAGTTTGCTTCAACTTTTGGTATTCAAGTCCATGCACTTTAAGAACCATTTTAAAATATAAAGAACTCTATCAAAGATTTAAAACTTGGCTTCAATGGTATTTAACCAGGAACAAAACACCTATATAACAGACTGCTTTTGTTTTTGTTTTTTTTCTTTTTCCTTACAATTTCCACAGTTTTCATATTTCTTAACTAAACATTTAAAACATTATCAAACTTTTTTTTTTTTTTTTATAAGCTCAATTTCAAAAACTAAAAATCAAAAATTGCCATAATTTGATAATCATTTATTATTATTATTTTTTTAAAATTATGTTTGTTTTCTCACATTTTTTTCTCATGGTTTTCATATTTAGACATATTAGAATTCTTAGCTAATTTCAAAAAACAAAAACTTCTTTTTGAAAACTATTTCTTTATACATATACAAAATTTAGCTTTGATTTTGAAAACTCCTAAAACGTAGTCAACAAAACATAGAAAACAATAGATGGAAGTGATGTTTATAGGTTTAATTTTTAAAACATACATTTTAGTTCTACATTTCTTACTTTGTTATTTACTTTCTACTCATGTTTTCAAAATCTAAGCCAATTTTAAAAACTGAAAAGTAATTTTAAAAACTTATTTTTGTTTTTAAAATTTAACTAAGAATTATTGAACTAAGAAACAACGTAAAACCCATGATAAAAAATTCCAAAAACCAAAAACAAAAAATGAAATCACTATCAAACCAACCCATTAAAATTCTTTATTCCAACAATCTCACAAAT
TGTTTCGAAATCAGCCGTGTAGGTACTCAAAGGGCAAGGCGAAGCCGGCGGGCAGAGACGGCGGAGAAAGAAACGCCGTTGACAAGACCACCACCACAGACGGCGGATCAACGGAGTTCGAATACATCAAAAGAATACTAACCAACCACGGCAATTCAAACTCGATCATCTCACCCCCCAATAACCCGACGAACCCCTCAATCTTCCACCACCCAGAAGCGGCGGAGGACCAGCAATGGGGCAGACGACTACTAAACTGTTGGCACGTGCGAAGAGGAATGAAGGGATGGGAATTGGGTGAAGAAGCTGTGAGGGAGAGATCGGTGAAGAAAGAGTACTTCCCACGTGCGAAATATGAAGTAGTGGAAGATATGGATGCTTTAATAATCAACAAGAGAGTGGTGGAGGAAACAGAAGGGATTGTGAAGGTGGTTGAGCTTCACATTTTAGACTCCCTTTTACGAGAAACTGTTGCCCTAATTTCCTCCCTACCAAAATGCTCTCATTTTCCTAATTTCTAA

mRNA sequence

ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGATCATCTTCTCGTCGAGCCACCACCGCCGATCACCCCCGGCAACGCTACCGTCCTTCTCTTCCGAGCTGTATGAGCACCCTTTTTCACTTCTTTGATTTTCGTTCCTCTCGTTTTACTCACATTGTCTTCGATAATCACCACCCGTCCTCCTTCAACCTCCCGCATCATCGCCCTCCCCTACCCAAAGCTTCCCATCATGTTGTTGAAGCACCAAGGAATAGCTTGGAATTAGAGGGAGCTTCAATTTCTTGCTTAAGAAATAAAGAAAAAAATTTGCAACTTCAAATGGGACTTCAAATCAAAACTAGAAATGGTAGCACAAAATCAAAAGCAAGTGAGCAACAACTTCCAAATAATGATCACATTATTGCATTGGAATCTCCAAGTGGAAAGACACCAAATCTCTTGGCAAGATTAATGGGTCTTGATATTCTCCCTCAAAACCCCTCTTCTTCTTTCAATTGTCGCGGGTCAAATTTCGGAACCCGTTCTCTCCCAGAGAGCCCGAGAGTATCGTCAGCAAGACTATCAGACGTCGATTGTCATCATCGTCGCCTCTCACTCCAAATTATTTCAGACAAAGAAAATATAAATTTTTTTGAAGAGGCTAAGCGAGAAAAGGAAAAAGTGAGCAAGAAAGTTGCCCTCGTTGATATCACCAATAATAACAGAAAAATAGAGTTCGGAAAACAAGAAGTTGGTTTTAGTCAAATTAAAGTCGAGATCAAGTCCTCTAAGAAACTTAAGAAAACAGCCGTTGACGAATCAAGACGTAGTTCGAAAGTCGTGCGTAAAAATCAAGAGGTAATGATTTCAAAGAAGCAAAAGCTAATATCGATGTCGATGCAAAAACCAAAGCGGAGGGCTAGAGAAGGTGAAGCATTTGATTGTCCAACAAGTAATAACCTTGTTAACAACCTCCATCATTCAACCATTTTTCCAGCAAAGAAAGAGCCTTCTCCTCCGGCGATCCAAGCCCCTCGTGAACAGGTTGAAGTCTATGATCTTCATATTTGTGATGTAATTAGAAAGTACTCAAAGGGCAAGGCGAAGCCGGCGGGCAGAGACGGCGGAGAAAGAAACGCCGTTGACAAGACCACCACCACAGACGGCGGATCAACGGAGTTCGAATACATCAAAAGAATACTAACCAACCACGGCAATTCAAACTCGATCATCTCACCCCCCAATAACCCGACGAACCCCTCAATCTTCCACCACCCAGAAGCGGCGGAGGACCAGCAATGGGGCAGACGACTACTAAACTGTTGGCACGTGCGAAGAGGAATGAAGGGATGGGAATTGGGTGAAGAAGCTGTGAGGGAGAGATCGGTGAAGAAAGAGTACTTCCCACGTGCGAAATATGAAGTAGTGGAAGATATGGATGCTTTAATAATCAACAAGAGAGTGGTGGAGGAAACAGAAGGGATTGTGAAGGTGGTTGAGCTTCACATTTTAGACTCCCTTTTACGAGAAACTGTTGCCCTAATTTCCTCCCTACCAAAATGCTCTCATTTTCCTAATTTCTAA

Coding sequence (CDS)

ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGATCATCTTCTCGTCGAGCCACCACCGCCGATCACCCCCGGCAACGCTACCGTCCTTCTCTTCCGAGCTGTATGAGCACCCTTTTTCACTTCTTTGATTTTCGTTCCTCTCGTTTTACTCACATTGTCTTCGATAATCACCACCCGTCCTCCTTCAACCTCCCGCATCATCGCCCTCCCCTACCCAAAGCTTCCCATCATGTTGTTGAAGCACCAAGGAATAGCTTGGAATTAGAGGGAGCTTCAATTTCTTGCTTAAGAAATAAAGAAAAAAATTTGCAACTTCAAATGGGACTTCAAATCAAAACTAGAAATGGTAGCACAAAATCAAAAGCAAGTGAGCAACAACTTCCAAATAATGATCACATTATTGCATTGGAATCTCCAAGTGGAAAGACACCAAATCTCTTGGCAAGATTAATGGGTCTTGATATTCTCCCTCAAAACCCCTCTTCTTCTTTCAATTGTCGCGGGTCAAATTTCGGAACCCGTTCTCTCCCAGAGAGCCCGAGAGTATCGTCAGCAAGACTATCAGACGTCGATTGTCATCATCGTCGCCTCTCACTCCAAATTATTTCAGACAAAGAAAATATAAATTTTTTTGAAGAGGCTAAGCGAGAAAAGGAAAAAGTGAGCAAGAAAGTTGCCCTCGTTGATATCACCAATAATAACAGAAAAATAGAGTTCGGAAAACAAGAAGTTGGTTTTAGTCAAATTAAAGTCGAGATCAAGTCCTCTAAGAAACTTAAGAAAACAGCCGTTGACGAATCAAGACGTAGTTCGAAAGTCGTGCGTAAAAATCAAGAGGTAATGATTTCAAAGAAGCAAAAGCTAATATCGATGTCGATGCAAAAACCAAAGCGGAGGGCTAGAGAAGGTGAAGCATTTGATTGTCCAACAAGTAATAACCTTGTTAACAACCTCCATCATTCAACCATTTTTCCAGCAAAGAAAGAGCCTTCTCCTCCGGCGATCCAAGCCCCTCGTGAACAGGTTGAAGTCTATGATCTTCATATTTGTGATGTAATTAGAAAGTACTCAAAGGGCAAGGCGAAGCCGGCGGGCAGAGACGGCGGAGAAAGAAACGCCGTTGACAAGACCACCACCACAGACGGCGGATCAACGGAGTTCGAATACATCAAAAGAATACTAACCAACCACGGCAATTCAAACTCGATCATCTCACCCCCCAATAACCCGACGAACCCCTCAATCTTCCACCACCCAGAAGCGGCGGAGGACCAGCAATGGGGCAGACGACTACTAAACTGTTGGCACGTGCGAAGAGGAATGAAGGGATGGGAATTGGGTGAAGAAGCTGTGAGGGAGAGATCGGTGAAGAAAGAGTACTTCCCACGTGCGAAATATGAAGTAGTGGAAGATATGGATGCTTTAATAATCAACAAGAGAGTGGTGGAGGAAACAGAAGGGATTGTGAAGGTGGTTGAGCTTCACATTTTAGACTCCCTTTTACGAGAAACTGTTGCCCTAATTTCCTCCCTACCAAAATGCTCTCATTTTCCTAATTTCTAA

Protein sequence

MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALIINKRVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHFPNF
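
The protein sequence appears to be the translation of the CDS under the standard genetic code. The following is a minimal sketch of that translation, assuming the standard (NCBI table 1) code applies; the function name `translate` is a hypothetical helper, and only the first nine codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nt of the CDS listed above.
print(translate("ATGGGAAAGGTAGAATGGCACTTTGGA"))
```

The output for this prefix is `MGKVEWHFG`, which matches the first nine residues of the protein sequence shown above.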
Homology
BLAST of Clc01G05690 vs. NCBI nr
Match: XP_011656164.1 (uncharacterized protein LOC105435648 [Cucumis sativus] >KGN50399.1 hypothetical protein Csa_000155 [Cucumis sativus])

HSP 1 Score: 441.8 bits (1135), Expect = 8.1e-120
Identity = 314/550 (57.09%), Positives = 357/550 (64.91%), Query Frame = 0

Query: 1   MGKVE-WHFGGR-SSSRRATTADHPR-QRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNH 60
           MGK E W+FGGR SSSRR TT D    QR   SLPSCMSTLFH FDFRSS FTHIVFDNH
Sbjct: 1   MGKSEWWYFGGRSSSSRRVTTVDIDHFQRRDHSLPSCMSTLFHLFDFRSSHFTHIVFDNH 60

Query: 61  HPSSFNLPHHRPPL--PKASHHVVEAPRNSLELE-GASISCLRNKEKNLQLQMGLQIKTR 120
             SSF+L HH P L   KASHH VEAPRNSLEL+ G SISCLRNKE+NLQLQMGLQIKTR
Sbjct: 61  RSSSFDLSHHHPTLRPTKASHHGVEAPRNSLELDNGDSISCLRNKEENLQLQMGLQIKTR 120

Query: 121 NGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCRGSNFGT 180
           NGSTKSKA+EQQLPNND+IIALESPS  TPNLLARLMGLD  PQ   SSS+N    N GT
Sbjct: 121 NGSTKSKATEQQLPNNDNIIALESPSTNTPNLLARLMGLDNFPQTTFSSSYNHCMPNLGT 180

Query: 181 RSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN--INFFEE-AKREKEKVSK-KVALV 240
           RSL ESPR S +RLSDVD HHRRLSLQI I +KEN  I   EE +KREK+KV + KVAL+
Sbjct: 181 RSLSESPRNSLSRLSDVDYHHRRLSLQINIQEKENNKIKICEEISKREKKKVERPKVALI 240

Query: 241 DITNNNRKIEFGKQEVGFSQI-KVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQ 300
           DITN+  K+    QE+G SQ  KVE+KS KKLKKT  ++S  S  V R NQ+ V++S KQ
Sbjct: 241 DITNSYNKVRSKIQEIGSSQSRKVEMKSLKKLKKTTTNKSSSSKVVCRSNQKNVIVSNKQ 300

Query: 301 KLISMSMQKPK-RRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVY 360
           K ISMSMQ PK RRAREGEA DCP SN L + L HSTIF    +P               
Sbjct: 301 KSISMSMQIPKERRAREGEALDCPRSNKL-DLLDHSTIF----QPC-------------- 360

Query: 361 DLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRI-LTNHGNSNSII 420
                     Y KGKAK A   GGE NAVD  TTTDGGS EF+YIK I +++  NSN ++
Sbjct: 361 ---------SYPKGKAKAA---GGETNAVDTATTTDGGSAEFKYIKTIQISSKENSNWVV 420

Query: 421 SPPNNPTNPSIFHHPEAAEDQQWGRRL---------------LNCWHVRRGMK-GWELGE 480
            P       S F+H  A E+++W +R+                  W  +RG K GWE   
Sbjct: 421 VP------ASRFYHSVAGEERRWKKRVELQQAVVGGDQIPNNKGWWQKQRGRKRGWE--- 480

Query: 481 EAVRERSVKKEYFPRAKYEVVEDMDALIINKR--------VVEETEGIVKVVELHILDSL 510
                       FP  K+E+VE     +INK         + EE EGIVK+VELHILDSL
Sbjct: 481 ------------FPHVKFELVE---YALINKDLEKSKFIIMAEEREGIVKLVELHILDSL 495
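
The percentages on each HSP statistics line are the identical and positive counts divided by the alignment length. The following is a minimal sketch that parses such a line and recomputes both values; `parse_hsp_stats` is a hypothetical helper, and the regular expression deliberately tolerates spelling variants of the positives label.

```python
import re

# Matches e.g. "Identity = 314/550 (57.09%), Positives = 357/550 (64.91%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positives, pos_length = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 314/550 (57.09%), Positives = 357/550 (64.91%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the top hit, 314 identical positions over an alignment length of 550 gives 57.09% identity, and 357 positive positions gives 64.91%, in agreement with the values reported by BLAST.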

BLAST of Clc01G05690 vs. NCBI nr
Match: KAG7035594.1 (hypothetical protein SDJN02_02391, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 255.4 bits (651), Expect = 1.1e-63
Identity = 212/535 (39.63%), Positives = 261/535 (48.79%), Query Frame = 0

Query: 5   EWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNL 64
           +W FGG SS RRA      R R+RPSLPSCM+TLFHFFD  S   TH+  + H PSS   
Sbjct: 4   QWLFGGTSSPRRAPI---DRHRHRPSLPSCMNTLFHFFDSHSFPSTHLAHNKHQPSS--- 63

Query: 65  PHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASE 124
                 L       V APRNSLE  G        +E+N Q+QMGL+I T           
Sbjct: 64  ------LDHVCSSGVVAPRNSLEQLG--------QEQNEQIQMGLEINT----------- 123

Query: 125 QQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSS 184
               N DH  AL+SPS KTPNLLARLMGLDILPQ  +S          TRSLP SPRVSS
Sbjct: 124 ----NFDH-NALDSPSVKTPNLLARLMGLDILPQTTTSP-------SATRSLPNSPRVSS 183

Query: 185 ARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEV 244
           +RLSDVD HH R SL I  D EN    +E K+E+E+V +KVALVDITNNN K+ +GK   
Sbjct: 184 SRLSDVDRHHHRHSLDINLDIENSQICKEMKQEEEQVRRKVALVDITNNNNKLVYGK--- 243

Query: 245 GFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLIS--MSMQKPKRRARE 304
                                          KNQ+V + +K   IS       PKR+ R 
Sbjct: 244 ------------------------------LKNQDVTMFRKHNSISTLTPTPTPKRKPR- 303

Query: 305 GEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAK 364
                             +T    K+E SPP     R + + +    C    ++  GK +
Sbjct: 304 ------------------ATTREEKEEESPPPAAKVRHE-QSFPKQRC----RFPNGKQR 363

Query: 365 PAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAA 424
           PA  + G R       T DGG+ E +YIKRILT+    +     P NP NPSIFHH E +
Sbjct: 364 PAAEEVGRR------ATADGGAGELKYIKRILTSPNWFS-----PTNPLNPSIFHHLETS 412

Query: 425 ----------------EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKY 484
                           +D+  G  ++NC      MKGWEL                RAK 
Sbjct: 424 SAAVGEPRLERWNKDDDDEVLGEMVMNCRTRMMMMKGWELA---------------RAKC 412

Query: 485 EVVEDMDALI---INK-RVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF 518
            V+ED+D+LI   + K + V E EG+V+  + HILDSLLRET A I SL K   F
Sbjct: 484 HVLEDIDSLIDKDLGKWKKVLELEGVVRTFQFHILDSLLRETTATIMSLHKRCRF 412

BLAST of Clc01G05690 vs. NCBI nr
Match: KAA0055152.1 (putative dna repair [Cucumis melo var. makuwa] >TYK00310.1 putative dna repair [Cucumis melo var. makuwa])

HSP 1 Score: 246.5 bits (628), Expect = 5.0e-61
Identity = 190/376 (50.53%), Positives = 233/376 (61.97%), Query Frame = 0

Query: 151 MGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN-- 210
           MGLD  PQ  SSS+   G N  TRSL ESPR SS+RLS+VDCHHRRLSLQI I +KEN  
Sbjct: 1   MGLDNFPQTSSSSYCRCGLNLETRSLTESPRNSSSRLSNVDCHHRRLSLQINIQEKENNG 60

Query: 211 INFFEE-AKREKEKVSKKVALVDITNNNRKIEFGKQEVGFS--QIKVEIKSSKKLKKTAV 270
           I   E+  KREK+KV +KVALVDITN+N KI +  QE+G S    KVE+KS KKL+KT V
Sbjct: 61  IEICEDIIKREKKKVGRKVALVDITNSNNKIGYEIQEIGHSSQSRKVEMKSLKKLEKTTV 120

Query: 271 DESRRSSKVVRKNQE-VMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTI 330
            ES  +SKVV  NQ+  M+SKKQKLISM MQ  K R  E EAFDCPT+N L+  LHH TI
Sbjct: 121 GES-SNSKVVHNNQKNEMVSKKQKLISMPMQILKGRTSEREAFDCPTNNKLL--LHHPTI 180

Query: 331 FPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGG 390
           F    EP                         Y KGK KPA   GGE +AVD TTTTDG 
Sbjct: 181 F----EPC-----------------------SYPKGKPKPA---GGETSAVDITTTTDGE 240

Query: 391 STEFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPE---AAEDQQWGRRLLNCWHVRR 450
           ST+F+YIK I +++  NSN ++        PS FHH E   A ++++W +RL     ++ 
Sbjct: 241 STDFKYIKTIQISSKENSNWVVP-------PSTFHHLETTLAGKERRWKKRL----ELQT 300

Query: 451 GMKGWELGEEAVRERSVKKEYFPRAKYEVV------EDMDALIINKRVVEETEGIVKVVE 510
           G+ G   G+   R+R  +   FP AK  +V      ED++   +   + EE EGIVK+VE
Sbjct: 301 GVVG---GDRRGRKRGWE---FPHAKCGLVEYGLINEDLEKSKLIIIMAEEREGIVKLVE 326

BLAST of Clc01G05690 vs. NCBI nr
Match: KAG6605686.1 (hypothetical protein SDJN03_03003, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 206.5 bits (524), Expect = 5.7e-49
Identity = 191/537 (35.57%), Positives = 240/537 (44.69%), Query Frame = 0

Query: 5   EWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNL 64
           +W FGG SS RRA   D  + R+RPSLPSC                              
Sbjct: 4   QWLFGGTSSPRRA-PIDRHQHRHRPSLPSC------------------------------ 63

Query: 65  PHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASE 124
                         V APRNSLE  G        +E+N Q+QMGL+I T           
Sbjct: 64  --------------VVAPRNSLEQLG--------QEQNEQIQMGLEINT----------- 123

Query: 125 QQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSS 184
               N DH  AL+SPS KTPNLLARLMGLDILPQ  +S          TRSLP SPRVSS
Sbjct: 124 ----NFDH-NALDSPSVKTPNLLARLMGLDILPQTTTSP-------SATRSLPNSPRVSS 183

Query: 185 ARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEV 244
           +RLSDVD HH R SL I  D EN    +E K+E+E+V +KVALVDITNNN K+ +GK   
Sbjct: 184 SRLSDVDRHHHRHSLDINLDIENSQICKEMKQEEEQVRRKVALVDITNNNNKLVYGK--- 243

Query: 245 GFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLIS----MSMQKPKRRA 304
                                          KNQ+V + +K   IS         PKR+ 
Sbjct: 244 ------------------------------LKNQDVTMFRKHNSISTLTPTPTPTPKRKP 303

Query: 305 REGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGK 364
           R+                        KK+     ++ P+ +             ++  GK
Sbjct: 304 RQR---------------------LEKKKKKSLLLRRPKSRC------------RFPNGK 363

Query: 365 AKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPE 424
            +PA  + G R      +T DGG+ E +YIKRILT+    +     P NP NPSIFHH E
Sbjct: 364 QRPAAEEVGRR------STADGGAGELKYIKRILTSPNWFS-----PTNPLNPSIFHHLE 372

Query: 425 AA----------------EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRA 484
            +                +D+  G  ++NC      MKGWEL                RA
Sbjct: 424 TSSAAVGEPRLERWNKDDDDEVLGEMVMNCRTRMMMMKGWELA---------------RA 372

Query: 485 KYEVVEDMDALI---INK-RVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF 518
           K  V+ED+D+LI   + K + V E EG+V+  + HILDSLLRET A I SL K   F
Sbjct: 484 KCHVLEDIDSLIDKDLGKWKKVLELEGVVRTFQFHILDSLLRETTATIMSLHKRCRF 372

BLAST of Clc01G05690 vs. NCBI nr
Match: XP_022958521.1 (uncharacterized protein LOC111459727 [Cucurbita moschata])

HSP 1 Score: 205.7 bits (522), Expect = 9.8e-49
Identity = 181/473 (38.27%), Positives = 222/473 (46.93%), Query Frame = 0

Query: 67  HRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQ 126
           H P LP      V APRNSLE  G        +E+N Q+QMGL+I T             
Sbjct: 25  HHPSLPSC----VVAPRNSLEQLG--------QEQNEQIQMGLEINT------------- 84

Query: 127 LPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSAR 186
             N DH  AL+SPS KTPNLLARLMGLDILPQ  +S          TRSLP SPRVSS R
Sbjct: 85  --NFDH-NALDSPSVKTPNLLARLMGLDILPQTTTSP-------SATRSLPNSPRVSSLR 144

Query: 187 LSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGF 246
           LSDVD HH R SL I  D EN    +E K+E+E+V +KVALVDITNNN K+ +GK     
Sbjct: 145 LSDVDRHHHRHSLDINLDIENSQICKEMKQEEEQVRRKVALVDITNNNNKLVYGK----- 204

Query: 247 SQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMS--MQKPKRRAREGE 306
                                        KNQ+V + +K   IS       PKR+ R   
Sbjct: 205 ----------------------------LKNQDVTMFRKHNSISTQTPTPTPKRKPR--- 264

Query: 307 AFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPA 366
                           +T    K+E SPP     R +             ++  GK +PA
Sbjct: 265 ----------------ATTREEKEEESPPPAAKVRHEQRC----------RFPNGKQRPA 324

Query: 367 GRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAA-- 426
             + G R       T DGG+ E +YIKRILT+    +     P NP NPSIFHH E +  
Sbjct: 325 AEEVGRR------ATADGGAGELKYIKRILTSPNWFS-----PTNPLNPSIFHHLETSNA 374

Query: 427 --------------EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEV 486
                         +D+  G  ++NC      MKGWEL                RAK  V
Sbjct: 385 AVGEPRLERWNKDDDDEVLGEMVMNCRTRMMMMKGWELA---------------RAKCHV 374

Query: 487 VEDMDALI---INK-RVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF 518
           ++D+D+LI   + K + V E EG+V+  E HILDSLLRET A I SL K   F
Sbjct: 445 LKDIDSLIDKDLGKWKKVLELEGVVRTFEFHILDSLLRETTATIMSLHKRCRF 374

BLAST of Clc01G05690 vs. ExPASy TrEMBL
Match: A0A0A0KNC2 (VARLMGL domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G172790 PE=4 SV=1)

HSP 1 Score: 441.8 bits (1135), Expect = 3.9e-120
Identity = 314/550 (57.09%), Positives = 357/550 (64.91%), Query Frame = 0

Query: 1   MGKVE-WHFGGR-SSSRRATTADHPR-QRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNH 60
           MGK E W+FGGR SSSRR TT D    QR   SLPSCMSTLFH FDFRSS FTHIVFDNH
Sbjct: 1   MGKSEWWYFGGRSSSSRRVTTVDIDHFQRRDHSLPSCMSTLFHLFDFRSSHFTHIVFDNH 60

Query: 61  HPSSFNLPHHRPPL--PKASHHVVEAPRNSLELE-GASISCLRNKEKNLQLQMGLQIKTR 120
             SSF+L HH P L   KASHH VEAPRNSLEL+ G SISCLRNKE+NLQLQMGLQIKTR
Sbjct: 61  RSSSFDLSHHHPTLRPTKASHHGVEAPRNSLELDNGDSISCLRNKEENLQLQMGLQIKTR 120

Query: 121 NGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCRGSNFGT 180
           NGSTKSKA+EQQLPNND+IIALESPS  TPNLLARLMGLD  PQ   SSS+N    N GT
Sbjct: 121 NGSTKSKATEQQLPNNDNIIALESPSTNTPNLLARLMGLDNFPQTTFSSSYNHCMPNLGT 180

Query: 181 RSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN--INFFEE-AKREKEKVSK-KVALV 240
           RSL ESPR S +RLSDVD HHRRLSLQI I +KEN  I   EE +KREK+KV + KVAL+
Sbjct: 181 RSLSESPRNSLSRLSDVDYHHRRLSLQINIQEKENNKIKICEEISKREKKKVERPKVALI 240

Query: 241 DITNNNRKIEFGKQEVGFSQI-KVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQ 300
           DITN+  K+    QE+G SQ  KVE+KS KKLKKT  ++S  S  V R NQ+ V++S KQ
Sbjct: 241 DITNSYNKVRSKIQEIGSSQSRKVEMKSLKKLKKTTTNKSSSSKVVCRSNQKNVIVSNKQ 300

Query: 301 KLISMSMQKPK-RRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVY 360
           K ISMSMQ PK RRAREGEA DCP SN L + L HSTIF    +P               
Sbjct: 301 KSISMSMQIPKERRAREGEALDCPRSNKL-DLLDHSTIF----QPC-------------- 360

Query: 361 DLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRI-LTNHGNSNSII 420
                     Y KGKAK A   GGE NAVD  TTTDGGS EF+YIK I +++  NSN ++
Sbjct: 361 ---------SYPKGKAKAA---GGETNAVDTATTTDGGSAEFKYIKTIQISSKENSNWVV 420

Query: 421 SPPNNPTNPSIFHHPEAAEDQQWGRRL---------------LNCWHVRRGMK-GWELGE 480
            P       S F+H  A E+++W +R+                  W  +RG K GWE   
Sbjct: 421 VP------ASRFYHSVAGEERRWKKRVELQQAVVGGDQIPNNKGWWQKQRGRKRGWE--- 480

Query: 481 EAVRERSVKKEYFPRAKYEVVEDMDALIINKR--------VVEETEGIVKVVELHILDSL 510
                       FP  K+E+VE     +INK         + EE EGIVK+VELHILDSL
Sbjct: 481 ------------FPHVKFELVE---YALINKDLEKSKFIIMAEEREGIVKLVELHILDSL 495

BLAST of Clc01G05690 vs. ExPASy TrEMBL
Match: A0A5D3BMU7 (Putative dna repair OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728G00570 PE=4 SV=1)

HSP 1 Score: 246.5 bits (628), Expect = 2.4e-61
Identity = 190/376 (50.53%), Positives = 233/376 (61.97%), Query Frame = 0

Query: 151 MGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN-- 210
           MGLD  PQ  SSS+   G N  TRSL ESPR SS+RLS+VDCHHRRLSLQI I +KEN  
Sbjct: 1   MGLDNFPQTSSSSYCRCGLNLETRSLTESPRNSSSRLSNVDCHHRRLSLQINIQEKENNG 60

Query: 211 INFFEE-AKREKEKVSKKVALVDITNNNRKIEFGKQEVGFS--QIKVEIKSSKKLKKTAV 270
           I   E+  KREK+KV +KVALVDITN+N KI +  QE+G S    KVE+KS KKL+KT V
Sbjct: 61  IEICEDIIKREKKKVGRKVALVDITNSNNKIGYEIQEIGHSSQSRKVEMKSLKKLEKTTV 120

Query: 271 DESRRSSKVVRKNQE-VMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTI 330
            ES  +SKVV  NQ+  M+SKKQKLISM MQ  K R  E EAFDCPT+N L+  LHH TI
Sbjct: 121 GES-SNSKVVHNNQKNEMVSKKQKLISMPMQILKGRTSEREAFDCPTNNKLL--LHHPTI 180

Query: 331 FPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGG 390
           F    EP                         Y KGK KPA   GGE +AVD TTTTDG 
Sbjct: 181 F----EPC-----------------------SYPKGKPKPA---GGETSAVDITTTTDGE 240

Query: 391 STEFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPE---AAEDQQWGRRLLNCWHVRR 450
           ST+F+YIK I +++  NSN ++        PS FHH E   A ++++W +RL     ++ 
Sbjct: 241 STDFKYIKTIQISSKENSNWVVP-------PSTFHHLETTLAGKERRWKKRL----ELQT 300

Query: 451 GMKGWELGEEAVRERSVKKEYFPRAKYEVV------EDMDALIINKRVVEETEGIVKVVE 510
           G+ G   G+   R+R  +   FP AK  +V      ED++   +   + EE EGIVK+VE
Sbjct: 301 GVVG---GDRRGRKRGWE---FPHAKCGLVEYGLINEDLEKSKLIIIMAEEREGIVKLVE 326

BLAST of Clc01G05690 vs. ExPASy TrEMBL
Match: A0A6J1H2A5 (uncharacterized protein LOC111459727 OS=Cucurbita moschata OX=3662 GN=LOC111459727 PE=4 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 4.7e-49
Identity = 181/473 (38.27%), Positives = 222/473 (46.93%), Query Frame = 0

Query: 67  HRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQ 126
           H P LP      V APRNSLE  G        +E+N Q+QMGL+I T             
Sbjct: 25  HHPSLPSC----VVAPRNSLEQLG--------QEQNEQIQMGLEINT------------- 84

Query: 127 LPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSAR 186
             N DH  AL+SPS KTPNLLARLMGLDILPQ  +S          TRSLP SPRVSS R
Sbjct: 85  --NFDH-NALDSPSVKTPNLLARLMGLDILPQTTTSP-------SATRSLPNSPRVSSLR 144

Query: 187 LSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGF 246
           LSDVD HH R SL I  D EN    +E K+E+E+V +KVALVDITNNN K+ +GK     
Sbjct: 145 LSDVDRHHHRHSLDINLDIENSQICKEMKQEEEQVRRKVALVDITNNNNKLVYGK----- 204

Query: 247 SQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMS--MQKPKRRAREGE 306
                                        KNQ+V + +K   IS       PKR+ R   
Sbjct: 205 ----------------------------LKNQDVTMFRKHNSISTQTPTPTPKRKPR--- 264

Query: 307 AFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPA 366
                           +T    K+E SPP     R +             ++  GK +PA
Sbjct: 265 ----------------ATTREEKEEESPPPAAKVRHEQRC----------RFPNGKQRPA 324

Query: 367 GRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAA-- 426
             + G R       T DGG+ E +YIKRILT+    +     P NP NPSIFHH E +  
Sbjct: 325 AEEVGRR------ATADGGAGELKYIKRILTSPNWFS-----PTNPLNPSIFHHLETSNA 374

Query: 427 --------------EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEV 486
                         +D+  G  ++NC      MKGWEL                RAK  V
Sbjct: 385 AVGEPRLERWNKDDDDEVLGEMVMNCRTRMMMMKGWELA---------------RAKCHV 374

Query: 487 VEDMDALI---INK-RVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF 518
           ++D+D+LI   + K + V E EG+V+  E HILDSLLRET A I SL K   F
Sbjct: 445 LKDIDSLIDKDLGKWKKVLELEGVVRTFEFHILDSLLRETTATIMSLHKRCRF 374

BLAST of Clc01G05690 vs. ExPASy TrEMBL
Match: A0A7J8V457 (Uncharacterized protein OS=Gossypium klotzschianum OX=34286 GN=Goklo_009685 PE=4 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 2.9e-30
Identity = 211/696 (30.32%), Positives = 302/696 (43.39%), Query Frame = 0

Query: 6   WHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRF--THIVFDNHHPSS-- 65
           W  G ++S++R   A   +     +   C+S +F FFDF   +F   H      + SS  
Sbjct: 8   WAIGAKTSTKRPPPA---KAETTTTPTGCISAVFQFFDFHHFQFPLNHQTNSGSNSSSSC 67

Query: 66  --FNLPH-----HRPPLPKASHHVVEAPRNSLELE-----------GASISCLRNKE-KN 125
             F  PH     H   +P A     EAPRNSLE E            AS++   +KE ++
Sbjct: 68  GCFKQPHSFISPHSNFVPTALKG-TEAPRNSLESEDESSTSVSASVSASLTTSTSKEDES 127

Query: 126 LQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALE-SPSGKTPNLLARLMGLDILPQNPS 185
           L + MG+QIKT +G  +SK       NND    +  SP  KTP L+ARLMGLD+LP+  S
Sbjct: 128 LNIPMGIQIKT-SGDIRSKVGAS---NNDTFSEISGSPGTKTPTLVARLMGLDLLPETHS 187

Query: 186 SSF---------------NCRGSNF-GTRSLPESPRVSSARLSDVDCHHRRLSLQIISDK 245
            SF               +  G +F GTRSLPE+PR+SSAR SDVD HH R SLQI  +K
Sbjct: 188 PSFSQPKSSSSHLKGRRRSVDGGDFRGTRSLPETPRLSSARRSDVDYHH-RFSLQI--NK 247

Query: 246 ENINFFEEA------------------------KREKEKVSKKVALVDITNNNRKIEFGK 305
           EN++  EE                         K+ KE V +KV + DITN+ R  E  +
Sbjct: 248 ENMSTTEEVMVTRFSKRSEDENKSPGHYARQIMKQVKESVGRKVGM-DITNSVRNREQAR 307

Query: 306 QEVGFSQIKVEIKSSKKLKKTAVDESR-------------------------------RS 365
           +E+  +Q K + K SK + K A D +                                + 
Sbjct: 308 EEL-VNQFKYK-KISKAMSKLAEDSTSNGNGKHSTTPSCSPRLRFLEPKTKDQNPQPPKP 367

Query: 366 SKVVRKNQEVMISKKQKLISMS--------------------MQKPKR-----RAREGEA 425
           S++    Q + + +K KL +++                    ++KP+R     R ++ E 
Sbjct: 368 SEISIHPQPIRVLQKPKLQTVAEEQDDQQTQRSTSKCKKVTKLKKPQRTSDIIRNKQEEP 427

Query: 426 FDCPTSNNLVN----------------NLHHSTIFPAKKEPSPPAIQAPREQV------- 485
           F  P++ N  N                N+  S++FP KK+PSPPA + P++QV       
Sbjct: 428 FVRPSTANRANIPDKKCKKTPLSNDLLNITVSSLFPVKKDPSPPATKIPQKQVLDATRPK 487

Query: 486 --EVYDLHICDVIRKYSKGKAK--PAGRDG--GERNAVDKTTTTDGGSTEF-EYIKRILT 507
                 L  C      +K +A    + RD   G+R   + TTTT G   E+ EYI RIL 
Sbjct: 488 RSNSSQLSSCSSQTYNNKQEATYLHSSRDDNIGDR-CNNVTTTTTGEEAEYHEYIARILR 547

BLAST of Clc01G05690 vs. ExPASy TrEMBL
Match: A0A7J9A4K1 (Uncharacterized protein OS=Gossypium laxum OX=34288 GN=Golax_006691 PE=4 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 3.8e-30
Identity = 210/695 (30.22%), Positives = 298/695 (42.88%), Query Frame = 0

Query: 6   WHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRF--THIVFDNHHPSS-- 65
           W  G ++S++R   A   +     +   C+S +F FFDF   +F   H      + SS  
Sbjct: 8   WATGAKTSTKRPPPA---KAETTTTPTGCISAVFQFFDFHHFQFPLNHQTNSGSNSSSSC 67

Query: 66  --FNLPH-----HRPPLPKASHHVVEAPRNSLELE-----------GASISCLRNKE-KN 125
             F  PH     H   +P A     EAPRNSLE E            AS++   +KE ++
Sbjct: 68  GCFKQPHSFVSPHSNFVPTALKG-TEAPRNSLESEDESSTSVSASVSASLTTSTSKEDES 127

Query: 126 LQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALE-SPSGKTPNLLARLMGLDILPQNPS 185
           L + MG+QIKT +G  +SK       NND    +  SP  KTP L+ARLMGLD+LP+  S
Sbjct: 128 LNIPMGIQIKT-SGDIRSKVGAS---NNDTFSEISGSPGTKTPTLVARLMGLDLLPETHS 187

Query: 186 SSF---------------NCRGSNF-GTRSLPESPRVSSARLSDVDCHHRRLSLQIISDK 245
            SF               +  G +F GTRSLPE+PR+SSAR SDVD HH R SLQI  +K
Sbjct: 188 PSFSQPKSSSSHLKGRRRSVDGGDFRGTRSLPETPRLSSARRSDVDYHH-RFSLQI--NK 247

Query: 246 ENINFFEEA------------------------KREKEKVSKKVALVDITNNNRKIEFGK 305
           EN++  EE                         K+ KE V +KV + DITN  R  E  +
Sbjct: 248 ENMSTTEEVMVTRFSKRSEDENKSPGHYARQIMKQVKESVGRKVGM-DITNTVRNREQAR 307

Query: 306 QEVGFSQIKVEIKSSKKLKKTAVDESR-------------------------------RS 365
           +E+  +Q K + K SK + K A D +                                + 
Sbjct: 308 EEL-VNQFKYK-KISKAMSKLAEDSTSNGNGKQSTTPSCSPRLRLLEPKTKDQNPQPPKP 367

Query: 366 SKVVRKNQEVMISKKQKLISMS------------------MQKPKR-----RAREGEAFD 425
           S++  + Q + + +K KL +++                  ++KP+R     R ++ E F 
Sbjct: 368 SEISIQPQLIRVLQKPKLQTVAEEQDDQQTQRSTSKKVTKLKKPQRTSDIIRNKQEEPFV 427

Query: 426 CPTSNNLVN----------------NLHHSTIFPAKKEPSPPAIQAPREQV--------- 485
            P++ N  N                N+  S++FP KK+PSPPA + P++QV         
Sbjct: 428 RPSTANRANIPDKKCKKTPLSNDLLNITVSSLFPVKKDPSPPATKIPQKQVLDATRPKRS 487

Query: 486 EVYDLHICDVIRKYSKGKA-----KPAGRDGGERNAVDKTTTTDGGSTEF-EYIKRILTN 507
               L  C      +K +A      P    G   N V  TTTT G   E+ EYI RIL  
Sbjct: 488 NSSQLSSCSSQTYNNKQEATYLHSSPHDNIGDRCNNV--TTTTTGEEAEYHEYIARILRR 547

BLAST of Clc01G05690 vs. TAIR 10
Match: AT5G62170.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G51850.1); Has 381 Blast hits to 359 proteins in 81 species: Archae - 0; Bacteria - 16; Metazoa - 101; Fungi - 21; Plants - 99; Viruses - 3; Other Eukaryotes - 141 (source: NCBI BLink). )

HSP 1 Score: 85.5 bits (210), Expect = 1.4e-16
Identity = 182/673 (27.04%), Positives = 266/673 (39.52%), Query Frame = 0

Query: 6   WHFGG--RSSSRRATTADHPRQRYRPSL-------PSCMSTLFHFFDFRSSRFTHIVFDN 65
           W  GG  +SSS+       P Q   PSL         CMS +F+ FDF+     H+    
Sbjct: 7   WLGGGKKKSSSKSKEEDIKPTQPPPPSLAGNTATAAGCMSAVFNIFDFQ-----HL---- 66

Query: 66  HHPSSFNLPHHRPPLPKASHHVVEAPRNSLEL--EGASISCLRNKEKNLQLQMGLQIKTR 125
                F + HH   LPK     V+APRNSLE   E  S S  R K+ NL + MG++IKT+
Sbjct: 67  ----QFPINHHHLHLPKG----VDAPRNSLESTEEETSFSPTR-KDGNLNISMGIKIKTK 126

Query: 126 NGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQN------PSSSFN--- 185
             +  S AS    P   +     SPS KTP L+ARLMGLD++P N      PSSS +   
Sbjct: 127 PQARSSSAS--LTPTETY-----SPSIKTPTLVARLMGLDLVPDNYRSSPTPSSSSSSTL 186

Query: 186 ------CRGSNF---------------GTRSLPESPRVSSARLS-DVDCH-HRRLSL--- 245
                  R S+                GTRSLPE+PR+S  R S DV+C+ H+R SL   
Sbjct: 187 IDLKTPTRSSHAKKHRHYSLQRNSVDGGTRSLPETPRISLGRRSVDVNCYEHQRSSLHLR 246

Query: 246 -----------------------QIISDKENINFFEEAK----REKEKVSKKVAL-VDIT 305
                                  +I  DKEN +  E A+    + KE VS++  +  DIT
Sbjct: 247 DNNINVFPERESGINNVRLTRVKEIHEDKENRSPREYARQIVMQLKENVSRRRRMGTDIT 306

Query: 306 NNN---RKIEFGKQEVGFSQIKVEIKSSK---------KLKKTAVDESRRSSKVV----- 365
           N     R++   K+    + I     SS          K K T++  +  +SK++     
Sbjct: 307 NKETQPREVHESKKASSKTTIITHDVSSSPRLGLTEVPKTKPTSLQTNNVASKILETTAM 366

Query: 366 ---------------------RKNQEVMISKKQKLISMSMQKPKRRAREGEAFDCPTSNN 425
                                ++ +     KK +     + KP +  +E      P  NN
Sbjct: 367 KVQDKTRLPTVHEEPQGTEKEKQRKSTKKCKKPENFKSRLVKPPQSMQEEPFVRSPAINN 426

Query: 426 LVNNLHHSTIF------PAKKEP----------SPPAIQAPREQVEVYDLHICDVIRKYS 485
             NN +   +        +KK P          S P I+    Q      ++     +  
Sbjct: 427 SNNNNNGHLLLIQGDKSSSKKTPLSINHLINFTSVPTIKKKDSQPHHKSSNLKLRETQTP 486

Query: 486 KGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHG-NSNSIIS-----PPNNPT 505
           + +A  +        +        GG  E EYI R L   G + ++ IS      P++P 
Sbjct: 487 RNRASSSELPSFPSQSQHHIAPIAGG--ELEYITRTLRRTGIDRDTPISYAKWFSPSHPL 546

BLAST of Clc01G05690 vs. TAIR 10
Match: AT4G25430.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G51850.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 75.1 bits (183), Expect = 1.9e-13
Identity = 81/236 (34.32%), Positives = 112/236 (47.46%), Query Frame = 0

Query: 1   MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPS 60
           MG+ EW+ GGRS      T     ++       C++ L+HFF F      H  F + H  
Sbjct: 1   MGR-EWYNGGRS------TCSSKSKKNSNEANGCVTALYHFFHFH-----HFYFPSRHHH 60

Query: 61  SFNLPHHRPPL--PKASHHVVEAPRNSLEL-EGASISCLRNKEKNLQLQMGLQIKTRNGS 120
                HH+P +  P  +   + APRNSL+L E + +S     E+      GL I    G 
Sbjct: 61  -----HHQPSIDSPSRTRKGLVAPRNSLDLSEESPLSTNYKLERE-----GLNISV--GG 120

Query: 121 TKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCR---------G 180
            KS      +    H   L  P  KTPN++ARLMGLD+LP N   + + R         G
Sbjct: 121 KKSTLRGLLVDTPSHNCNL--PRTKTPNVVARLMGLDLLPDNLELTRSPRNGVRGHRLSG 180

Query: 181 SNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKK 225
           +  GTRSLP SPR+SS      D  + RLSL++  ++EN N  EE  R + K  K+
Sbjct: 181 NGSGTRSLPASPRISS------DSENHRLSLEL--NREN-NKHEEFVRTRLKELKQ 201

BLAST of Clc01G05690 vs. TAIR 10
Match: AT5G51850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G62170.1); Has 384 Blast hits to 375 proteins in 79 species: Archae - 0; Bacteria - 14; Metazoa - 135; Fungi - 31; Plants - 92; Viruses - 0; Other Eukaryotes - 112 (source: NCBI BLink). )

HSP 1 Score: 65.9 bits (159), Expect = 1.1e-10
Identity = 153/600 (25.50%), Positives = 237/600 (39.50%), Query Frame = 0

Query: 9   GGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHR 68
           G  SSSR   TA+            CM+  +H            +FD+HH          
Sbjct: 21  GAFSSSRSKKTAN-----------GCMAAFYH------------LFDSHH---------- 80

Query: 69  PPLPKASHHVVEAPRNS--LELEGASISCLRNKEKNL-QLQMGLQIKTRNGSTKSKASEQ 128
                  H  +++P  S  L+L   S+     K+K +  + +G+++KT  G+  S+    
Sbjct: 81  -------HLTIDSPSRSKGLKLMEESLPSTTYKDKEISNIPVGMRVKTDTGTKSSRLRAL 140

Query: 129 QLPNNDHIIAL-ESPSGKTPNLLARLMGLDILPQNPSSSFNC--------------RGSN 188
              ++     +  SP  KTPNL+ARLMGLD+LP     + +               R S 
Sbjct: 141 VTDSSTSSSEICNSPGSKTPNLVARLMGLDLLPDKTDLNHSLSDLHTMSSHHITSHRLSK 200

Query: 189 FGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKE----NINFFEE------------A 248
            GTRSLP SPR+SSAR SD D H  RLSLQ+  +KE     +   +E             
Sbjct: 201 KGTRSLPVSPRISSARKSDFDIH--RLSLQLNREKEFGRSRLKEDQEESHSPRDYARQIV 260

Query: 249 KREKEK-VSKKVALVDITNN--NR------------------KIEFGKQEVGFSQIKVEI 308
           K+ KE+ V+++V  +DITN+  NR                  +  F ++E   S      
Sbjct: 261 KQIKERVVTRRVVGMDITNSVKNREARPSHELRRDTTVSCSPRTRFSEKENKQSTSHKPN 320

Query: 309 KSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMSMQKPKR--------------- 368
            SS    +  + + + +  ++ + Q     K+++L  +++ K                  
Sbjct: 321 SSSSSRPEPIIQKPKPTPVILGEKQSQNRVKQRQLKPINLCKKAETETRRPIKPSPTSDI 380

Query: 369 RAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSP------PAIQAPREQVEVYDLHICD- 428
           R R+ E F   + +     LH    F  KK P         A + P +Q+   +  I + 
Sbjct: 381 RNRKRETFLSDSRDVKAKPLHKIKKF--KKIPKSNDLENISATRPPHQQINERERLISNE 440

Query: 429 --VIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNN 488
              IR  S  K +         +  D   T    ++E +YI RI+    N   I S    
Sbjct: 441 AASIRSSSMHKTEKNSPQVARNHKFDDAATEI--NSEQDYIIRIM----NLAGIKSDSQA 500

Query: 489 PTNPSIFHHPEAAEDQQWGRRLLNCWH----------------VRRG-MKGWELGEE--- 510
             + SIF   E   D   G   L C                   RRG  +G EL  E   
Sbjct: 501 MLDLSIFRKLEHFGDYPSGTLALGCNRRLLFDLVNEILIETVAKRRGNYQGSELISELCS 560
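
The percentages in the three HSP summary lines above follow directly from the raw counts (matches divided by alignment length). The sketch below recomputes them as a sanity check. It is not part of the database output: the function name `recompute_hsp` and the regular expression are illustrative assumptions, and the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Assumed pattern for an HSP summary line as printed on this page.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_hsp(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


# The three summary lines reported for the TAIR 10 hits on this page.
summaries = {
    "AT5G62170.1": "Identity = 182/673 (27.04%), Positives = 266/673 (39.52%), Query Frame = 0",
    "AT4G25430.1": "Identity = 81/236 (34.32%), Positives = 112/236 (47.46%), Query Frame = 0",
    "AT5G51850.1": "Identity = 153/600 (25.50%), Positives = 237/600 (39.50%), Query Frame = 0",
}

for hit, line in summaries.items():
    identity_pct, positives_pct = recompute_hsp(line)
    print(hit, identity_pct, positives_pct)
```

The recomputed values can be compared with the percentages printed in parentheses on each summary line; a mismatch would suggest a transcription or extraction error rather than a property of the alignment.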

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_011656164.1  8.1e-120  57.09         uncharacterized protein LOC105435648 [Cucumis sativus] >KGN50399.1 hypothetical ...
KAG7035594.1    1.1e-63   39.63         hypothetical protein SDJN02_02391, partial [Cucurbita argyrosperma subsp. argyro...
KAA0055152.1    5.0e-61   50.53         putative dna repair [Cucumis melo var. makuwa] >TYK00310.1 putative dna repair [...
KAG6605686.1    5.7e-49   35.57         hypothetical protein SDJN03_03003, partial [Cucurbita argyrosperma subsp. sorori...
XP_022958521.1  9.8e-49   38.27         uncharacterized protein LOC111459727 [Cucurbita moschata]

Match Name  E-value   Identity (%)  Description
A0A0A0KNC2  3.9e-120  57.09         VARLMGL domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G172790 PE=...
A0A5D3BMU7  2.4e-61   50.53         Putative dna repair OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728...
A0A6J1H2A5  4.7e-49   38.27         uncharacterized protein LOC111459727 OS=Cucurbita moschata OX=3662 GN=LOC1114597...
A0A7J8V457  2.9e-30   30.32         Uncharacterized protein OS=Gossypium klotzschianum OX=34286 GN=Goklo_009685 PE=4...
A0A7J9A4K1  3.8e-30   30.22         Uncharacterized protein OS=Gossypium laxum OX=34288 GN=Golax_006691 PE=4 SV=1

Match Name   E-value  Identity (%)  Description
AT5G62170.1  1.4e-16  27.04         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G25430.1  1.9e-13  34.32         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G51850.1  1.1e-10  25.50         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                    Source       Source Term  Source Description                                     Alignment
IPR032795  DUF3741-associated sequence motif  PFAM         PF14383      VARLMGL                                                coord: 137..161; e-value: 7.0E-8; score: 31.8
None       No IPR available                   MOBIDB_LITE  mobidb-lite  disorder_prediction                                    coord: 359..374
None       No IPR available                   MOBIDB_LITE  mobidb-lite  disorder_prediction                                    coord: 359..381
None       No IPR available                   MOBIDB_LITE  mobidb-lite  disorder_prediction                                    coord: 1..28
None       No IPR available                   PANTHER      PTHR37751    LOW PROTEIN: M-PHASE INDUCER PHOSPHATASE-LIKE PROTEIN  coord: 251..508; coord: 1..234

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc01G05690.2  Clc01G05690.2  mRNA