Clc01G01860 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G01860
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: protein ZGRF1 isoform X2
Location: ClcChr01: 1681069 .. 1685432 (-)
RNA-Seq Expression: Clc01G01860
Synteny: Clc01G01860
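
The location string above can be turned into a gene span with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'ClcChr01: 1681069 .. 1685432 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }

loc = parse_location("ClcChr01: 1681069 .. 1685432 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 4364 bp on the minus strand of ClcChr01.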
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGGTTTCTATTTCTTTCACACTTTAACTTCACTACACATCTTTTATTTGGACTCCTCGATATGTAATTCTGCAGCAGAACATGTCGAATTTCGGCGATTATGAACTGGAAAGTATGGATTAACTAACATAGTGGCGCTAAATCCTAGGATTGGATGTAGAGCTTTAGAAGCTTCTTGATGGAAATGATAATAGTCGATAGAATTGCTGAGATTTGAATCGGTATTTGGTTTATGTGGTTCTGATTTCGATTCGGAGTAACATTGTGCTGATTTGGTTGATTCTTAGAAATTTGGATTTTCTCATTCTAATTGTTAGCCTAGTCGATTTGTTCTAATGGCTAGATTGACACACAGACAATGCTCTATGATGGGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTACATTCAACAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCATAAGCCGGAGTCTGGTCTGAATTTCCAACCAGGCGATGATAAGATATCTGAAAAATCTGGTTTGCTGCGAGGGAAATTTTTTCGAAATAACTCAGGTTGCTTTGGTAATTAATCTAGTTTTCCCTCCTTGGTTTATGTTTTTAAAATTCTTTCCATAACTAATGAGCACGGGATTGAGTTTTTTCTCTTACTGATTGTTTTATGATGAGTATTATTGTCTTGTTTGTTACTTACAAAACAAAGCAAGTGCGGAGAAGAATAAAGCTCAGCCGAGTCTCAGCCCTTCACATAAAATAATCAGAGGTAACTTGCATTATTTGGCTAACGCACCATTTCTGTTATTGTTGTTCTCCCAATTGCTTATTAAAGCGTTTTTAATCATAATTTGGGCTATAGATATAATCTTACTGGTGATTACCATTTATTTCAGACGTGGTACTCTGATCATGCGTGTACCAACATGCTTTTAGATTAGGTTTTAAAACTGTCATTCCCTTTACATTCCAGAATTTAAGAAGAGAAAATTGAAGTGCTATGGATCGCCACAAAGTATTCCAGACATTAGTAACACAGAAGAGACAGGTTAGTTCATGATTTCTTTCCTGGGTTGTTAGGTTGTCATGATTTTATTTTTTTCTTGGGGAGGGGGGCATATGTGCAATGGAAGTAGAGCACAATGTTGATGAATATGAGTCCTGTCCGTCTTGAGAAAACAATGTAATTGAAATATGTGAAAGTACTCTATATGTTTACGTACGTCAGTTCAATATGGTACACGATATTATAAATAGCCAAATATAGAGTGTAACAAAGGCACGTCCGACAGAACAAATTAATTAACAAGAGTAACTAAAATCAACTACTTTGCCTAATATGCCTTTTTAAGTATATTTAACATTCCTCAAAGAGTCAAATTTAATATGTATCTACCAATTTCTTCATTCTGTCTTCATCCGTTTTCTGTTGCCTTTATTCACATGATTAACCTTATGATGTTTGTAGAGTGGCAGGTTCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCTATTTGTGGATCCCTCGGGAGGCAGGTTAACTCCTTGATCGTTACTTAAATTATCTGGTTTATCAAGCCTACCATACTTCTTGACTATTTGTTAACTATTCAATTTGTGAACTTTATATCTTCTACTCAACTTCGATAAGTATTAGAACTTTTTGTAGGGATGTATGTCATCACGATGAAGTATTTTAATCAATTGAAAGGTAGCCCAAGAGTTTGAGAGGGACCATAAAATGTGTAGGGAGGTCGAGAGATCAGTGAAGTGTCTCTACTTTTTGTAATTATGATATTAGTTTTGATCTTTTAGATTGGAGTCCAATATAATTAGGGTCGGACTTCC
TTGTATTTTTGGCTTGTTTTTTGTATACCCTGACATTCTATTATTTTTCTCAAACTTTCTTACCCCTCCCCCCCCCCCCCCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAAAGAGTTCAATTAAAATGTAAAGTTTCTGCCTGTACCTCTCCTGGTCTTTCTGCAGGCCATGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAGGATGAAACAGTAAAATCTGGAGAATCAATAGCCTTTGATGCTCATTTAGTAGAAATTGGAGAATGTGAGAAGGACCATAAGCCTCCTAAAACTCCCTTAAATCCAGGTAGCAGTTCTGGAGAAGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGCAAAGGTCAGCTGTTTTTGCTACATTCATTTCTATCACGTAATGATCGTAGATGAAATTTGTCGGCATTGTTCATCTTTTATCTCCTTCCGCCATAAGGTGTCATGAAGTCTATTTTCCTCTCGTTGTGCTTCTACTTAGGCATGTGTAGTGTTTTAGTGCAGAATGGAATGTTTTGTACACTAGTCAGATAACTCAGAAGTCCAAGAAATATCACAATGGGATCATCAAAGTTTCCTCCTCTGGCTCTCACCATATGCAGGTTAAGTTACTTTGCATTGAACACCATGTTACCTGCTTTCTGTTGCAGTTGTACGAAGATACAATTCGGTCACTCCCCTCCCCCCTAGAGAAGAATGTAAATTTTAGTACACAATATGAGTTTCATGTGTTCAATTCAACTGTTTGCACAAGCAAACAGTATTTTTGATGCCATGGATACAAATAGATACATGCCTAAAGGTGCAATCGAGATTGTTGAAAAAGAACATCAGAACTACTGGTGGCTTACTCTAACAATAGGTAGTGTACCCAAACTTTCAGCTGTTATCAATTAATTGATCGCTTCTGGGACATGCTGTTCATTATCAAGTTAAAGAACAAATTCTTATCAGATTTGACCGTCACTATTTAAAGTTCTAGAGTTGTCTGATGACTATGAATGCAAGTTTAGATTGTATAGCAGATGTACAAGAAAGGTGGAAGGTTTAGGACAGGATATTAACATATTAGCCAGGGTTTTTTTTTTATCCGTTACTAATGCAATTCAATGAGTTGCAGGGATTTCTAATCATCTTTTTTATGAGACTAACTCTTACATTTATTTCAGGTTACTTTACTGAATGAAGATAGAAGTATATTAAGCAGCAAACACTTAAGTTTATCTAAAAATGTGAGGATGGGGGAGATACTTGAGCTACCGAAATACTTGGTGGAGATTGGCGAGGCATGTGAAAGTGTTAAAGGTCCTATGCTTGTGTAACATTTCAGTTTCTAAACCAATGTTCGTATGGCTATCTAATAACTTTTCTGGCTTGCTATCCGACCTTTGGTCAAATGCAGTAGAGCTCCGCAACAGAAAATTTGATATAAGAAAAGACGCAAGTTCTTGCATTTCTGGTGGAGATGAAAAGGGATCGGGCAGGGAGACTATGAAAAAGTCTTTACGGGATGGTTTGTGAATGGTTACTTGGTGCTGTCTGAACTCATTAGTTGTTTATGCTTGATTCCTTCTGAACTAAATATTTATTTGTTTTATATCATGAAGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAATGAGCCTTTCTTCAGCTTGTACGAATGAAAATATTAGCGTATCTGTTTCGTCATATAACCCTGAACCTTTCCTTGCGGAGGCATTGCATCTTCCTATAGATGACCAATCTCATCAAAGACCAAGTGAAGGACAGGACACGAGGGAATCAACTAAGAACGCAGAAAACAACCAATCCATTGCTCTAATTCAATGTGATCTCGTGCAAATGCAACTCAGTTTAAGTCTTAATTTTCGTAATTTTACTATGAGATTGTCGTGTCAAATTACAGAATTATTTCAACTTT
GATCCTGCAGCAACATTCACTGGTAATGCCGGAGCATTGACGGAGGATGTTGAAATTGGACACTCCAGCCAGGTGTGGTTAAATTTCTTTTTGGTGTCCTTTGAATGTACATTAGATGATTGTCATATAAGATCTTAAGTTTCCCATGACTTGACTCTATTCCTTTCCCGAAATGTGTGCACAGCTTCTTCGGTCAGACCATGTGGAGACCGAAAGTATTTCTCACAGAAATTCAATTTCTAGGACTCGAGCTTCCAGTGACTCTGCTGCTTGTAGCCTTGCTAATGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAAATGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA

mRNA sequence

ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGACAATGCTCTATGATGGGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTACATTCAACAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCATAAGCCGGAGTCTGGTCTGAATTTCCAACCAGGCGATGATAAGATATCTGAAAAATCTGGTTTGCTGCGAGGGAAATTTTTTCGAAATAACTCAGGTTGCTTTGCAAGTGCGGAGAAGAATAAAGCTCAGCCGAGTCTCAGCCCTTCACATAAAATAATCAGAGAATTTAAGAAGAGAAAATTGAAGTGCTATGGATCGCCACAAAGTATTCCAGACATTAGTAACACAGAAGAGACAGAGTGGCAGGTTCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCTATTTGTGGATCCCTCGGGAGGCAGGCCATGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAGGATGAAACAGTAAAATCTGGAGAATCAATAGCCTTTGATGCTCATTTAGTAGAAATTGGAGAATGTGAGAAGGACCATAAGCCTCCTAAAACTCCCTTAAATCCAGGTAGCAGTTCTGGAGAAGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGCAAAGAATGGAATGTTTTGTACACTAGTCAGATAACTCAGAAGTCCAAGAAATATCACAATGGGATCATCAAAGTTTCCTCCTCTGGCTCTCACCATATGCAGGTTACTTTACTGAATGAAGATAGAAGTATATTAAGCAGCAAACACTTAAGTTTATCTAAAAATGTGAGGATGGGGGAGATACTTGAGCTACCGAAATACTTGGTGGAGATTGGCGAGGCATGTGAAAGTGTTAAAGTAGAGCTCCGCAACAGAAAATTTGATATAAGAAAAGACGCAAGTTCTTGCATTTCTGGTGGAGATGAAAAGGGATCGGGCAGGGAGACTATGAAAAAGTCTTTACGGGATGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAATGAGCCTTTCTTCAGCTTGTACGAATGAAAATATTAGCGTATCTGTTTCGTCATATAACCCTGAACCTTTCCTTGCGGAGGCATTGCATCTTCCTATAGATGACCAATCTCATCAAAGACCAAGTGAAGGACAGGACACGAGGGAATCAACTAAGAACGCAGAAAACAACCAATCCATTGCTCTAATTCAATCAACATTCACTGGTAATGCCGGAGCATTGACGGAGGATGTTGAAATTGGACACTCCAGCCAGCTTCTTCGGTCAGACCATGTGGAGACCGAAAGTATTTCTCACAGAAATTCAATTTCTAGGACTCGAGCTTCCAGTGACTCTGCTGCTTGTAGCCTTGCTAATGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAAATGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA

Coding sequence (CDS)

ATGGGAGAAATGAATCGATGGAAGGTGACCTATACCAAGCACCTCAAGCAGAAGCGCAAAGTTTATCACGATGGTTTCTTAGACATCCACCGTTCCAGCAACAAGACAATGCTCTATGATGGGTGCGAGAAGCTCCTCGAATGCAGGATCCTAAAGCAAGATGAAGTTATTTGCTCTGGCGAAACGCTTACATTCAACAGTTACCTTGTTGACATTGACACTCCTCTGGGAGATCATAAGCCGGAGTCTGGTCTGAATTTCCAACCAGGCGATGATAAGATATCTGAAAAATCTGGTTTGCTGCGAGGGAAATTTTTTCGAAATAACTCAGGTTGCTTTGCAAGTGCGGAGAAGAATAAAGCTCAGCCGAGTCTCAGCCCTTCACATAAAATAATCAGAGAATTTAAGAAGAGAAAATTGAAGTGCTATGGATCGCCACAAAGTATTCCAGACATTAGTAACACAGAAGAGACAGAGTGGCAGGTTCTTTACACCACAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTAAAACTTTCTATTTGTGGATCCCTCGGGAGGCAGGCCATGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGATTTATTAAGAAGGATGAAACAGTAAAATCTGGAGAATCAATAGCCTTTGATGCTCATTTAGTAGAAATTGGAGAATGTGAGAAGGACCATAAGCCTCCTAAAACTCCCTTAAATCCAGGTAGCAGTTCTGGAGAAGGGGGAACCAGGGTACTGCATGGACAGAAAAGCTGTTTCAGTGAAAATGAAATATCAACTGGCAAAGAATGGAATGTTTTGTACACTAGTCAGATAACTCAGAAGTCCAAGAAATATCACAATGGGATCATCAAAGTTTCCTCCTCTGGCTCTCACCATATGCAGGTTACTTTACTGAATGAAGATAGAAGTATATTAAGCAGCAAACACTTAAGTTTATCTAAAAATGTGAGGATGGGGGAGATACTTGAGCTACCGAAATACTTGGTGGAGATTGGCGAGGCATGTGAAAGTGTTAAAGTAGAGCTCCGCAACAGAAAATTTGATATAAGAAAAGACGCAAGTTCTTGCATTTCTGGTGGAGATGAAAAGGGATCGGGCAGGGAGACTATGAAAAAGTCTTTACGGGATGCCCATCAAATATTGTCCATTCTTCAAAGGCCAAGGGCTAGAATGAGCCTTTCTTCAGCTTGTACGAATGAAAATATTAGCGTATCTGTTTCGTCATATAACCCTGAACCTTTCCTTGCGGAGGCATTGCATCTTCCTATAGATGACCAATCTCATCAAAGACCAAGTGAAGGACAGGACACGAGGGAATCAACTAAGAACGCAGAAAACAACCAATCCATTGCTCTAATTCAATCAACATTCACTGGTAATGCCGGAGCATTGACGGAGGATGTTGAAATTGGACACTCCAGCCAGCTTCTTCGGTCAGACCATGTGGAGACCGAAAGTATTTCTCACAGAAATTCAATTTCTAGGACTCGAGCTTCCAGTGACTCTGCTGCTTGTAGCCTTGCTAATGATGAAGGGAAAATCTGCGAGGAGATTACATATGAAAGAGAAATGGATGCATGCCCAAGTTTTGATCTTGGAATTTGA

Protein sequence

MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSGETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCFASAEKNKAQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHDGFLKLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKTPLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVSSSGSHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRKFDIRKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSVSSYNPEPFLAEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGALTEDVEIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLANDEGKICEEITYEREMDACPSFDLGI
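
The protein sequence follows from the CDS by codon-wise translation. The sketch below shows that relationship, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, not the tool the database used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS (DNA, 5'->3') into protein, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last five codons of the CDS listed above.
print(translate("ATGGGAGAAATGAATCGATGG"))  # MGEMNRW
print(translate("GATCTTGGAATTTGA"))        # DLGI
```

The two fragments reproduce the start (MGEMNRW) and end (DLGI) of the protein sequence shown above; passing the full CDS string to `translate` works the same way.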
Homology
BLAST of Clc01G01860 vs. NCBI nr
Match: XP_038874789.1 (uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida])

HSP 1 Score: 957.2 bits (2473), Expect = 6.0e-275
Identity = 490/545 (89.91%), Positives = 505/545 (92.66%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSG 60
           MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYD CEKLLECRILKQDEVI SG
Sbjct: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDECEKLLECRILKQDEVIGSG 60

Query: 61  ETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCFASAEKNK 120
           ETL FNSYLVDIDTPLGDHKPES LNFQPGDDKISEKSG+LRGK FRNNS  FAS EKNK
Sbjct: 61  ETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFASTEKNK 120

Query: 121 AQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHDGFL 180
           A+PSLSPSH+IIREFKKR+LKCYGSPQS PD   TEETEWQVLYTTNITQKAKK+HDGFL
Sbjct: 121 ARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFHDGFL 180

Query: 181 KLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKT 240
           KLSICGSLGRQ MLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPK 
Sbjct: 181 KLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKI 240

Query: 241 PLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVSSSG 300
             N GSSSGEGGTRVLHG+K+CFSENEISTGKEWNVLYTSQ+TQKSKKYHNGIIKVSSSG
Sbjct: 241 LSNQGSSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIKVSSSG 300

Query: 301 SHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRKFDI 360
           SH  QVTLLNEDRSILSSKH SLSKNVR+GEILELPKYLVEIGEACE+VKVEL NR FDI
Sbjct: 301 SHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVKVELGNRNFDI 360

Query: 361 RKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSVSSY 420
           RKDAS CISGGDEKGSGRETMKKSLR+AHQILSILQRPRAR+ LSS   +ENISVSVSSY
Sbjct: 361 RKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISVSVSSY 420

Query: 421 NPEPFLAEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGALTEDVE 480
           NPEP LAEALHL IDDQSHQ+PSEGQD RESTKNAENNQSI L Q TFTGNAG LTEDVE
Sbjct: 421 NPEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQPTFTGNAGTLTEDVE 480

Query: 481 IGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLANDEGKICEEITYEREMDACPS 540
           IGHSSQLLRSDH E E IS RNSISRTR SSD+AACSL NDEGKICEEITYEREMDACPS
Sbjct: 481 IGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKICEEITYEREMDACPS 540

Query: 541 FDLGI 546
           FDLGI
Sbjct: 541 FDLGI 545
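
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from such a line; the helper name `hsp_percentages` is hypothetical, and the reading of the denominator as the alignment length (including gaps) is an assumption based on standard BLAST reports.

```python
import re

# Accepts the label spelled either "Positives" or "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 490/545 (89.91%), Positives = 505/545 (92.66%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first hit this gives 89.91 and 92.66, matching the values reported in the summary line.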

BLAST of Clc01G01860 vs. NCBI nr
Match: XP_038874787.1 (uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida])

HSP 1 Score: 950.7 bits (2456), Expect = 5.6e-273
Identity = 490/553 (88.61%), Positives = 506/553 (91.50%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSG 60
           MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYD CEKLLECRILKQDEVI SG
Sbjct: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDECEKLLECRILKQDEVIGSG 60

Query: 61  ETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCFASAEKNK 120
           ETL FNSYLVDIDTPLGDHKPES LNFQPGDDKISEKSG+LRGK FRNNS  FAS EKNK
Sbjct: 61  ETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFASTEKNK 120

Query: 121 AQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHDGFL 180
           A+PSLSPSH+IIREFKKR+LKCYGSPQS PD   TEETEWQVLYTTNITQKAKK+HDGFL
Sbjct: 121 ARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFHDGFL 180

Query: 181 KLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKT 240
           KLSICGSLGRQ MLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPK 
Sbjct: 181 KLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKI 240

Query: 241 PLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVSSSG 300
             N GSSSGEGGTRVLHG+K+CFSENEISTGKEWNVLYTSQ+TQKSKKYHNGIIKVSSSG
Sbjct: 241 LSNQGSSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIKVSSSG 300

Query: 301 SHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRKFDI 360
           SH  QVTLLNEDRSILSSKH SLSKNVR+GEILELPKYLVEIGEACE+VKVEL NR FDI
Sbjct: 301 SHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVKVELGNRNFDI 360

Query: 361 RKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSVSSY 420
           RKDAS CISGGDEKGSGRETMKKSLR+AHQILSILQRPRAR+ LSS   +ENISVSVSSY
Sbjct: 361 RKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISVSVSSY 420

Query: 421 NPEPFLAEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQ--------STFTGNA 480
           NPEP LAEALHL IDDQSHQ+PSEGQD RESTKNAENNQSI L Q        +TFTGNA
Sbjct: 421 NPEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQRDDVQMQLTTFTGNA 480

Query: 481 GALTEDVEIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLANDEGKICEEITYE 540
           G LTEDVEIGHSSQLLRSDH E E IS RNSISRTR SSD+AACSL NDEGKICEEITYE
Sbjct: 481 GTLTEDVEIGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKICEEITYE 540

Query: 541 REMDACPSFDLGI 546
           REMDACPSFDLGI
Sbjct: 541 REMDACPSFDLGI 553

BLAST of Clc01G01860 vs. NCBI nr
Match: XP_038874788.1 (uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida])

HSP 1 Score: 944.5 bits (2440), Expect = 4.0e-271
Identity = 489/553 (88.43%), Positives = 505/553 (91.32%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSG 60
           MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYD CEKLLECRILKQDEVI SG
Sbjct: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDECEKLLECRILKQDEVIGSG 60

Query: 61  ETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCFASAEKNK 120
           ETL FNSYLVDIDTPLGDHKPES LNFQPGDDKISEKSG+LRGK FRNNS  FAS EKNK
Sbjct: 61  ETLIFNSYLVDIDTPLGDHKPESDLNFQPGDDKISEKSGVLRGKNFRNNSVSFASTEKNK 120

Query: 121 AQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHDGFL 180
           A+PSLSPSH+IIREFKKR+LKCYGSPQS PD   TEETEWQVLYTTNITQKAKK+HDGFL
Sbjct: 121 ARPSLSPSHRIIREFKKRRLKCYGSPQSSPDTRKTEETEWQVLYTTNITQKAKKFHDGFL 180

Query: 181 KLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKT 240
           KLSICGSLGRQ MLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPK 
Sbjct: 181 KLSICGSLGRQVMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKI 240

Query: 241 PLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVSSSG 300
             N GSSSGEGGTRVLHG+K+CFSENEISTGKEWNVLYTSQ+TQKSKKYHNGIIKVSSSG
Sbjct: 241 LSNQGSSSGEGGTRVLHGRKNCFSENEISTGKEWNVLYTSQMTQKSKKYHNGIIKVSSSG 300

Query: 301 SHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRKFDI 360
           SH  QVTLLNEDRSILSSKH SLSKNVR+GEILELPKYLVEIGEACE+VK EL NR FDI
Sbjct: 301 SHLRQVTLLNEDRSILSSKHFSLSKNVRIGEILELPKYLVEIGEACENVK-ELGNRNFDI 360

Query: 361 RKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSVSSY 420
           RKDAS CISGGDEKGSGRETMKKSLR+AHQILSILQRPRAR+ LSS   +ENISVSVSSY
Sbjct: 361 RKDASFCISGGDEKGSGRETMKKSLRNAHQILSILQRPRARVILSSGHMDENISVSVSSY 420

Query: 421 NPEPFLAEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQ--------STFTGNA 480
           NPEP LAEALHL IDDQSHQ+PSEGQD RESTKNAENNQSI L Q        +TFTGNA
Sbjct: 421 NPEPSLAEALHLSIDDQSHQQPSEGQDKRESTKNAENNQSIVLTQRDDVQMQLTTFTGNA 480

Query: 481 GALTEDVEIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLANDEGKICEEITYE 540
           G LTEDVEIGHSSQLLRSDH E E IS RNSISRTR SSD+AACSL NDEGKICEEITYE
Sbjct: 481 GTLTEDVEIGHSSQLLRSDHEEAEIISLRNSISRTRTSSDTAACSLVNDEGKICEEITYE 540

Query: 541 REMDACPSFDLGI 546
           REMDACPSFDLGI
Sbjct: 541 REMDACPSFDLGI 552

BLAST of Clc01G01860 vs. NCBI nr
Match: XP_016903664.1 (PREDICTED: uncharacterized protein LOC103482830 isoform X4 [Cucumis melo])

HSP 1 Score: 880.6 bits (2274), Expect = 7.2e-252
Identity = 462/548 (84.31%), Positives = 489/548 (89.23%), Query Frame = 0

Query: 4   MNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSGETL 63
           MNRWKVTYT HLKQKRKVYHDGFLDIHRSSNKTMLYD CEKLLECRIL++DEVICSGETL
Sbjct: 1   MNRWKVTYTNHLKQKRKVYHDGFLDIHRSSNKTMLYDECEKLLECRILRKDEVICSGETL 60

Query: 64  TFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCFASAEKNKAQP 123
            FNS+LVDIDTPLGDHKPE GLNFQ GDDKISEKSG+LRGK  RNNS CFASAEKNK +P
Sbjct: 61  IFNSFLVDIDTPLGDHKPEFGLNFQEGDDKISEKSGVLRGKSIRNNSVCFASAEKNKTRP 120

Query: 124 SLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHDGFLKLS 183
           SLSPSH+IIREFKKR+LK YGSPQ+ PD   TEETEWQVLYTTNITQKAKK+HDGFLKLS
Sbjct: 121 SLSPSHQIIREFKKRRLKSYGSPQTSPDTRKTEETEWQVLYTTNITQKAKKFHDGFLKLS 180

Query: 184 ICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKTPLN 243
           ICGSLG Q MLFDENRKLL+SRFIKK ETVKSGESIAFDAHLVEIGECEKDHKP K PLN
Sbjct: 181 ICGSLGSQVMLFDENRKLLNSRFIKKHETVKSGESIAFDAHLVEIGECEKDHKPSKIPLN 240

Query: 244 PGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVSSSGSHH 303
            G+SS EGG RVLHGQKSCFSENEIS GKEW+VLYTSQITQKSKKY NGIIK+SSSGSH 
Sbjct: 241 EGTSSKEGGDRVLHGQKSCFSENEISAGKEWHVLYTSQITQKSKKYQNGIIKISSSGSHQ 300

Query: 304 MQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRKFDIRKD 363
           MQVTLLNEDR+ILS KHLSLSKNV++GE LELPKYLVEIGEACESVKVEL NR FDIRKD
Sbjct: 301 MQVTLLNEDRNILSRKHLSLSKNVKVGEKLELPKYLVEIGEACESVKVELGNRNFDIRKD 360

Query: 364 ASSCISGGDEKGSGRETMKKSLRD-----AHQILSILQRPRARMSLSSACTNENISVSVS 423
           AS CISGGDEKGSGRET +KSLRD     AHQILSILQRPRAR++LSS  T+ENISVSVS
Sbjct: 361 ASFCISGGDEKGSGRETTQKSLRDVLYHEAHQILSILQRPRARVNLSSGHTDENISVSVS 420

Query: 424 SYNPEPFLAEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGALTED 483
           S NP+P +AEALHLPIDDQSHQ+PSEGQ+TRES KNAEN+QSIAL QSTFTGNA  LTED
Sbjct: 421 SRNPKPSVAEALHLPIDDQSHQKPSEGQNTRESIKNAENSQSIALTQSTFTGNAETLTED 480

Query: 484 VEIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLAN-DEGKICEEITYEREMDA 543
            EIG SS+ LRSDHVE ESIS RNSI R    SDSAA SL N DEGKIC+EITYEREM A
Sbjct: 481 GEIGQSSK-LRSDHVEAESISLRNSIPR---KSDSAAYSLVNDDEGKICQEITYEREMHA 540

Query: 544 CPSFDLGI 546
            PSFDLGI
Sbjct: 541 FPSFDLGI 544

BLAST of Clc01G01860 vs. NCBI nr
Match: XP_011654694.1 (uncharacterized protein LOC101209453 isoform X1 [Cucumis sativus] >KGN49631.2 hypothetical protein Csa_000494 [Cucumis sativus])

HSP 1 Score: 877.9 bits (2267), Expect = 4.6e-251
Identity = 459/547 (83.91%), Positives = 483/547 (88.30%), Query Frame = 0

Query: 1   MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICS 60
           MGE+ NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYD CEKLLECR+LKQDEVICS
Sbjct: 1   MGEITNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDECEKLLECRMLKQDEVICS 60

Query: 61  GETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCFASAEKN 120
           GETL FNS+LVDIDTPLGD KPESGLNFQ GDDKISE SG++RGK   NNS C A AEKN
Sbjct: 61  GETLIFNSFLVDIDTPLGDQKPESGLNFQEGDDKISENSGVVRGKSILNNSVCSAGAEKN 120

Query: 121 KAQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHDGF 180
           K +PS SPS +IIREFKKR+LKCYGSPQ+  D   TEETEWQVLYTTNITQKAKK+HDGF
Sbjct: 121 KTRPSFSPSQQIIREFKKRRLKCYGSPQTSLDTRKTEETEWQVLYTTNITQKAKKFHDGF 180

Query: 181 LKLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPK 240
           LKLSICGSLG Q MLFDENRKLLDSRFIKK ETVKSGESIAFDAHLVEIGECEKDHKP K
Sbjct: 181 LKLSICGSLGSQVMLFDENRKLLDSRFIKKHETVKSGESIAFDAHLVEIGECEKDHKPSK 240

Query: 241 TPLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVSSS 300
            PLN G+SS EGG  VLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIK+SSS
Sbjct: 241 IPLNEGTSSKEGGASVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKISSS 300

Query: 301 GSHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRKFD 360
           GSH MQVTLLNEDR+ILS KHLSLSKNVR+GE LELPKYLVEIGEACESVKVEL +RK D
Sbjct: 301 GSHQMQVTLLNEDRNILSRKHLSLSKNVRVGEKLELPKYLVEIGEACESVKVELGDRKCD 360

Query: 361 IRKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSVSS 420
           IRKDAS CISGGDE GSGRET +KSLRDAHQILSILQRPR R++LSS  T+ENISVSVSS
Sbjct: 361 IRKDASFCISGGDENGSGRETTQKSLRDAHQILSILQRPRGRVNLSSGHTDENISVSVSS 420

Query: 421 YNPEPFLAEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGALTEDV 480
            NP+P LAEALHLP D QSHQ+PSEGQ+TRES KN EN+QSIAL QSTFTGNA  LTED 
Sbjct: 421 RNPKPSLAEALHLPKDYQSHQKPSEGQNTRESIKNTENSQSIALTQSTFTGNAETLTEDG 480

Query: 481 EIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLAN-DEGKICEEITYEREMDAC 540
           E G SS+LLRSDHVE ESIS RNSI R    SDS ACSL N DEGKICEEITYEREM A 
Sbjct: 481 EFGQSSKLLRSDHVEAESISLRNSIPR---RSDSTACSLVNDDEGKICEEITYEREMHAF 540

Query: 541 PSFDLGI 546
           PSFDLGI
Sbjct: 541 PSFDLGI 544

BLAST of Clc01G01860 vs. ExPASy TrEMBL
Match: A0A1S4E617 (uncharacterized protein LOC103482830 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103482830 PE=4 SV=1)

HSP 1 Score: 880.6 bits (2274), Expect = 3.5e-252
Identity = 462/548 (84.31%), Positives = 489/548 (89.23%), Query Frame = 0

Query: 4   MNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSGETL 63
           MNRWKVTYT HLKQKRKVYHDGFLDIHRSSNKTMLYD CEKLLECRIL++DEVICSGETL
Sbjct: 1   MNRWKVTYTNHLKQKRKVYHDGFLDIHRSSNKTMLYDECEKLLECRILRKDEVICSGETL 60

Query: 64  TFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCFASAEKNKAQP 123
            FNS+LVDIDTPLGDHKPE GLNFQ GDDKISEKSG+LRGK  RNNS CFASAEKNK +P
Sbjct: 61  IFNSFLVDIDTPLGDHKPEFGLNFQEGDDKISEKSGVLRGKSIRNNSVCFASAEKNKTRP 120

Query: 124 SLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHDGFLKLS 183
           SLSPSH+IIREFKKR+LK YGSPQ+ PD   TEETEWQVLYTTNITQKAKK+HDGFLKLS
Sbjct: 121 SLSPSHQIIREFKKRRLKSYGSPQTSPDTRKTEETEWQVLYTTNITQKAKKFHDGFLKLS 180

Query: 184 ICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPKTPLN 243
           ICGSLG Q MLFDENRKLL+SRFIKK ETVKSGESIAFDAHLVEIGECEKDHKP K PLN
Sbjct: 181 ICGSLGSQVMLFDENRKLLNSRFIKKHETVKSGESIAFDAHLVEIGECEKDHKPSKIPLN 240

Query: 244 PGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVSSSGSHH 303
            G+SS EGG RVLHGQKSCFSENEIS GKEW+VLYTSQITQKSKKY NGIIK+SSSGSH 
Sbjct: 241 EGTSSKEGGDRVLHGQKSCFSENEISAGKEWHVLYTSQITQKSKKYQNGIIKISSSGSHQ 300

Query: 304 MQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRKFDIRKD 363
           MQVTLLNEDR+ILS KHLSLSKNV++GE LELPKYLVEIGEACESVKVEL NR FDIRKD
Sbjct: 301 MQVTLLNEDRNILSRKHLSLSKNVKVGEKLELPKYLVEIGEACESVKVELGNRNFDIRKD 360

Query: 364 ASSCISGGDEKGSGRETMKKSLRD-----AHQILSILQRPRARMSLSSACTNENISVSVS 423
           AS CISGGDEKGSGRET +KSLRD     AHQILSILQRPRAR++LSS  T+ENISVSVS
Sbjct: 361 ASFCISGGDEKGSGRETTQKSLRDVLYHEAHQILSILQRPRARVNLSSGHTDENISVSVS 420

Query: 424 SYNPEPFLAEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGALTED 483
           S NP+P +AEALHLPIDDQSHQ+PSEGQ+TRES KNAEN+QSIAL QSTFTGNA  LTED
Sbjct: 421 SRNPKPSVAEALHLPIDDQSHQKPSEGQNTRESIKNAENSQSIALTQSTFTGNAETLTED 480

Query: 484 VEIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLAN-DEGKICEEITYEREMDA 543
            EIG SS+ LRSDHVE ESIS RNSI R    SDSAA SL N DEGKIC+EITYEREM A
Sbjct: 481 GEIGQSSK-LRSDHVEAESISLRNSIPR---KSDSAAYSLVNDDEGKICQEITYEREMHA 540

Query: 544 CPSFDLGI 546
            PSFDLGI
Sbjct: 541 FPSFDLGI 544

BLAST of Clc01G01860 vs. ExPASy TrEMBL
Match: A0A0A0KQR3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G148820 PE=4 SV=1)

HSP 1 Score: 872.8 bits (2254), Expect = 7.2e-250
Identity = 458/547 (83.73%), Positives = 483/547 (88.30%), Query Frame = 0

Query: 1   MGEM-NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICS 60
           MGE+ NRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYD CEKLLECR+LKQDEVICS
Sbjct: 1   MGEITNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDECEKLLECRMLKQDEVICS 60

Query: 61  GETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCFASAEKN 120
           GETL FNS+LVDIDTPLGD KPESGLNFQ GDDKISE SG++RGK   NNS C + AEKN
Sbjct: 61  GETLIFNSFLVDIDTPLGDQKPESGLNFQEGDDKISENSGVVRGKSILNNSVC-SGAEKN 120

Query: 121 KAQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHDGF 180
           K +PS SPS +IIREFKKR+LKCYGSPQ+  D   TEETEWQVLYTTNITQKAKK+HDGF
Sbjct: 121 KTRPSFSPSQQIIREFKKRRLKCYGSPQTSLDTRKTEETEWQVLYTTNITQKAKKFHDGF 180

Query: 181 LKLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKPPK 240
           LKLSICGSLG Q MLFDENRKLLDSRFIKK ETVKSGESIAFDAHLVEIGECEKDHKP K
Sbjct: 181 LKLSICGSLGSQVMLFDENRKLLDSRFIKKHETVKSGESIAFDAHLVEIGECEKDHKPSK 240

Query: 241 TPLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVSSS 300
            PLN G+SS EGG  VLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIK+SSS
Sbjct: 241 IPLNEGTSSKEGGASVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKISSS 300

Query: 301 GSHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRKFD 360
           GSH MQVTLLNEDR+ILS KHLSLSKNVR+GE LELPKYLVEIGEACESVKVEL +RK D
Sbjct: 301 GSHQMQVTLLNEDRNILSRKHLSLSKNVRVGEKLELPKYLVEIGEACESVKVELGDRKCD 360

Query: 361 IRKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSVSS 420
           IRKDAS CISGGDE GSGRET +KSLRDAHQILSILQRPR R++LSS  T+ENISVSVSS
Sbjct: 361 IRKDASFCISGGDENGSGRETTQKSLRDAHQILSILQRPRGRVNLSSGHTDENISVSVSS 420

Query: 421 YNPEPFLAEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGALTEDV 480
            NP+P LAEALHLP D QSHQ+PSEGQ+TRES KN EN+QSIAL QSTFTGNA  LTED 
Sbjct: 421 RNPKPSLAEALHLPKDYQSHQKPSEGQNTRESIKNTENSQSIALTQSTFTGNAETLTEDG 480

Query: 481 EIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLAN-DEGKICEEITYEREMDAC 540
           E G SS+LLRSDHVE ESIS RNSI R    SDS ACSL N DEGKICEEITYEREM A 
Sbjct: 481 EFGQSSKLLRSDHVEAESISLRNSIPR---RSDSTACSLVNDDEGKICEEITYEREMHAF 540

Query: 541 PSFDLGI 546
           PSFDLGI
Sbjct: 541 PSFDLGI 543

BLAST of Clc01G01860 vs. ExPASy TrEMBL
Match: A0A6J1ESH9 (uncharacterized protein LOC111435594 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111435594 PE=4 SV=1)

HSP 1 Score: 845.1 bits (2182), Expect = 1.6e-241
Identity = 437/550 (79.45%), Positives = 482/550 (87.64%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSG 60
           M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNKTMLYD CEKLLECRILKQ+EV+CSG
Sbjct: 1   MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTMLYDECEKLLECRILKQEEVVCSG 60

Query: 61  ETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCF---ASAE 120
           ETL FNSYLVDIDTPLGDHKPESGLNFQ G DKI EKSG+LRGK FRNNS CF   ASAE
Sbjct: 61  ETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGVLRGKNFRNNSVCFENKASAE 120

Query: 121 KNKAQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHD 180
           KNK +P+LSPS KIIREFKK +LKCYGSPQS PD   TEETEWQVL+T+NITQKAKKYHD
Sbjct: 121 KNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETEWQVLHTSNITQKAKKYHD 180

Query: 181 GFLKLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKP 240
           GFLKL ICGSLGRQ MLFDENRKLLDSRF+KKDETVKSGESIAFDAHLV+IGECE++HKP
Sbjct: 181 GFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDETVKSGESIAFDAHLVDIGECEREHKP 240

Query: 241 PKTPLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVS 300
           PK PL+ GSS G+ GTRVL+  K CFSENEISTGKEW+VLYTSQITQKSKKYHNGIIK+S
Sbjct: 241 PKIPLSQGSSFGDRGTRVLNEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKIS 300

Query: 301 SSGSHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRK 360
           SSGSHHMQVTLLNEDR+ILSSKHLSLSK + MGEILELPKYLVEIGEAC +VKVE+ NR 
Sbjct: 301 SSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVEIANRD 360

Query: 361 FDIRKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSV 420
           FDIRKD S CISG DEKGS R TMKKSLRDAH+ILSILQRP+AR+SLSS  +++NI VSV
Sbjct: 361 FDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSV 420

Query: 421 -SSYNPEPFL-AEALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGAL 480
            SS  PEP L AEAL LP+DD+SH++PSE  DTR+STKNAENNQSIAL  ST       L
Sbjct: 421 PSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENNQSIALTPST-------L 480

Query: 481 TEDVEIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLANDEGKICEEITYEREM 540
           TE++EIGHS+QLL+++HVE ES S R++ISRT+ +S  AAC L NDEGK+CEEITYERE 
Sbjct: 481 TEELEIGHSNQLLQTEHVEAESSSLRDTISRTQGTSQFAACELVNDEGKMCEEITYERET 540

Query: 541 DACPSFDLGI 546
             CPSFDLGI
Sbjct: 541 GTCPSFDLGI 543

BLAST of Clc01G01860 vs. ExPASy TrEMBL
Match: A0A6J1I1P0 (protein ZGRF1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1)

HSP 1 Score: 839.0 bits (2166), Expect = 1.2e-239
Identity = 433/550 (78.73%), Positives = 481/550 (87.45%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSG 60
           M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNKTMLYD CEKLLECRILKQ+EV+CSG
Sbjct: 1   MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTMLYDECEKLLECRILKQEEVVCSG 60

Query: 61  ETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCF---ASAE 120
           ETL FNSYLV+IDTPLGD+KPESGLNFQ G D+ISEKSG+LRGK FRNNS CF   ASAE
Sbjct: 61  ETLIFNSYLVEIDTPLGDNKPESGLNFQAGHDEISEKSGVLRGKNFRNNSVCFENKASAE 120

Query: 121 KNKAQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHD 180
           KNK +P+LSPS KIIREFKK +LKCYGSPQ+ PD   TEETEWQVLYT+NITQKAKKYHD
Sbjct: 121 KNKTRPTLSPSCKIIREFKKSRLKCYGSPQTSPDTRQTEETEWQVLYTSNITQKAKKYHD 180

Query: 181 GFLKLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKP 240
           GFLKL ICGSLGRQ MLFDENRKLLDSRF+KKDE VKSGESIAFDAHLV+IGECE++HKP
Sbjct: 181 GFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDERVKSGESIAFDAHLVDIGECEREHKP 240

Query: 241 PKTPLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVS 300
           PK P++ GSS G+ GTRVLH  K CFSENEISTGKEW+VLYTSQITQKSKKYHNGIIK+S
Sbjct: 241 PKIPVSQGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKIS 300

Query: 301 SSGSHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRK 360
           SSGSHHMQVTLLNEDR ILSSKH+SLSK + MGEILELPKYLVEIGEACE+VKVEL NR 
Sbjct: 301 SSGSHHMQVTLLNEDRIILSSKHISLSKKLGMGEILELPKYLVEIGEACENVKVELANRD 360

Query: 361 FDIRKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSV 420
           FDIRKDAS CISG DEKGS R TMKKSLRDAH+ILSILQRP+AR+SLSS  +++NISVSV
Sbjct: 361 FDIRKDASFCISGEDEKGSARATMKKSLRDAHEILSILQRPKARVSLSSGQSDKNISVSV 420

Query: 421 SSYN-PEPFLA-EALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGAL 480
           SS   PEP LA EAL LP+D++SHQ+PSE  DTRESTKNAE+NQS AL QST T      
Sbjct: 421 SSSKVPEPSLATEALDLPMDERSHQKPSENLDTRESTKNAESNQSFALTQSTLT------ 480

Query: 481 TEDVEIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLANDEGKICEEITYEREM 540
             ++EIGHS+QLL++++VE ES S R++IS T+ +S  AAC L NDEGK+CEEITYERE 
Sbjct: 481 --ELEIGHSNQLLQTEYVEAESSSLRDTISWTQGTSQFAACKLVNDEGKMCEEITYERET 540

Query: 541 DACPSFDLGI 546
             CPSFDLGI
Sbjct: 541 GTCPSFDLGI 542

BLAST of Clc01G01860 vs. ExPASy TrEMBL
Match: A0A6J1HXZ5 (protein ZGRF1 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1)

HSP 1 Score: 829.7 bits (2142), Expect = 7.0e-237
Identity = 431/550 (78.36%), Positives = 478/550 (86.91%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSG 60
           M EMNRWKVTYTKHLKQ+RKVYHDGFLD+HRSSNKTMLYD CEKLLECRILKQ+EV+CSG
Sbjct: 1   MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTMLYDECEKLLECRILKQEEVVCSG 60

Query: 61  ETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKSGLLRGKFFRNNSGCF---ASAE 120
           ETL FNSYLV+IDTPLGD+KPESGLNFQ G D+ISEKSG+LRGK FRNNS CF   ASAE
Sbjct: 61  ETLIFNSYLVEIDTPLGDNKPESGLNFQAGHDEISEKSGVLRGKNFRNNSVCFENKASAE 120

Query: 121 KNKAQPSLSPSHKIIREFKKRKLKCYGSPQSIPDISNTEETEWQVLYTTNITQKAKKYHD 180
           KNK +P+LSPS KIIREFKK +LKCYGSPQ+ PD   TEETEWQVLYT+NITQKAKKYHD
Sbjct: 121 KNKTRPTLSPSCKIIREFKKSRLKCYGSPQTSPDTRQTEETEWQVLYTSNITQKAKKYHD 180

Query: 181 GFLKLSICGSLGRQAMLFDENRKLLDSRFIKKDETVKSGESIAFDAHLVEIGECEKDHKP 240
           GFLKL ICGSLGRQ MLFDENRKLLDSRF+KKDE VKSGESIAFDAHLV+IGECE++HKP
Sbjct: 181 GFLKLLICGSLGRQVMLFDENRKLLDSRFMKKDERVKSGESIAFDAHLVDIGECEREHKP 240

Query: 241 PKTPLNPGSSSGEGGTRVLHGQKSCFSENEISTGKEWNVLYTSQITQKSKKYHNGIIKVS 300
           PK P++ GSS G+ GTRVLH  K CFSENEISTGKEW+VLYTSQITQKSKKYHNGIIK+S
Sbjct: 241 PKIPVSQGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKIS 300

Query: 301 SSGSHHMQVTLLNEDRSILSSKHLSLSKNVRMGEILELPKYLVEIGEACESVKVELRNRK 360
           SSGSHHMQVTLLNEDR ILSSKH+SLSK + MGEILELPKYLVEIGEACE+VKVEL NR 
Sbjct: 301 SSGSHHMQVTLLNEDRIILSSKHISLSKKLGMGEILELPKYLVEIGEACENVKVELANRD 360

Query: 361 FDIRKDASSCISGGDEKGSGRETMKKSLRDAHQILSILQRPRARMSLSSACTNENISVSV 420
           FDIRKDAS CISG DEKGS R TMKKSLRDAH+ILSILQRP+AR+SLSS  +++NISVSV
Sbjct: 361 FDIRKDASFCISGEDEKGSARATMKKSLRDAHEILSILQRPKARVSLSSGQSDKNISVSV 420

Query: 421 SSYN-PEPFLA-EALHLPIDDQSHQRPSEGQDTRESTKNAENNQSIALIQSTFTGNAGAL 480
           SS   PEP LA EAL LP+D++SHQ+PSE  DTRESTKNAE+NQS AL QST T      
Sbjct: 421 SSSKVPEPSLATEALDLPMDERSHQKPSENLDTRESTKNAESNQSFALTQSTLT------ 480

Query: 481 TEDVEIGHSSQLLRSDHVETESISHRNSISRTRASSDSAACSLANDEGKICEEITYEREM 540
             ++EIGHS+Q   +++VE ES S R++IS T+ +S  AAC L NDEGK+CEEITYERE 
Sbjct: 481 --ELEIGHSNQ---TEYVEAESSSLRDTISWTQGTSQFAACKLVNDEGKMCEEITYERET 539

Query: 541 DACPSFDLGI 546
             CPSFDLGI
Sbjct: 541 GTCPSFDLGI 539

BLAST of Clc01G01860 vs. TAIR 10
Match: AT4G10890.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439 (InterPro:IPR018838); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 128.6 bits (322), Expect = 1.5e-29
Identity = 75/149 (50.34%), Positives = 91/149 (61.07%), Query Frame = 0

Query: 1   MGEMNRWKVTYTKHLKQKRKVYHDGFLDIHRSSNKTMLYDGCEKLLECRILKQDEVICSG 60
           M E  RW   YTKHLKQKRKVYHDGFLD+H +  K MLYD  + LLE R LK  EV+ +G
Sbjct: 241 MAEKQRWIAMYTKHLKQKRKVYHDGFLDLHIARKKVMLYDEDDNLLESRTLKACEVVNTG 300

Query: 61  ETLTFNSYLVDIDTPLGDHKPESGLNFQPGDDKISEKS-GLLRGKFFRNNSGCFASAEK- 120
           ETLTF +YLVDI  P    K  S    +P D K + K   +LR  F +++  C       
Sbjct: 301 ETLTFQAYLVDICDPKDGSKASSEPKVEPSDQKCARKPFTVLRPNFKKSSLRCDEKKPDL 360

Query: 121 -NK-AQPSLSPSHKIIREFKKRKLKCYGS 146
            NK +  SLSPSH +IR FKKR+L  YG+
Sbjct: 361 VNKFSSKSLSPSHNMIRVFKKRELHKYGA 389
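
The percentages on each HSP statistics line are the match count divided by the alignment length. The following is a minimal sketch of that arithmetic, not part of the database output; the function name hsp_percentages and the regular expression are illustrative assumptions. Labels are captured as they appear on the line.

```python
import re

# Matches "Label = matches/length" pairs in a BLAST HSP statistics line.
# "Query Frame = 0" has no "/length" part, so it is skipped.
PAIR = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(stats_line):
    """Return {label: percent} for each 'label = n/length' pair (2 decimals)."""
    return {
        label: round(100 * int(n) / int(length), 2)
        for label, n, length in PAIR.findall(stats_line)
    }

# Statistics line of the TAIR 10 hit AT4G10890.1 shown above.
line = "Identity = 75/149 (50.34%), Positives = 91/149 (61.07%), Query Frame = 0"
result = hsp_percentages(line)
print(result)
```

Applied to the TrEMBL hit A0A6J1HXZ5 (431/550 and 478/550), the same calculation gives 78.36 and 86.91.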

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038874789.1  6.0e-275  89.91     uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]
XP_038874787.1  5.6e-273  88.61     uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida]
XP_038874788.1  4.0e-271  88.43     uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida]
XP_016903664.1  7.2e-252  84.31     PREDICTED: uncharacterized protein LOC103482830 isoform X4 [Cucumis melo]
XP_011654694.1  4.6e-251  83.91     uncharacterized protein LOC101209453 isoform X1 [Cucumis sativus] >KGN49631.2 hy...

Match Name      E-value   Identity  Description
A0A1S4E617      3.5e-252  84.31     uncharacterized protein LOC103482830 isoform X4 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0KQR3      7.2e-250  83.73     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G148820 PE=4 SV=1
A0A6J1ESH9      1.6e-241  79.45     uncharacterized protein LOC111435594 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1I1P0      1.2e-239  78.73     protein ZGRF1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1
A0A6J1HXZ5      7.0e-237  78.36     protein ZGRF1 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111469041 PE=4 SV=1

Match Name      E-value   Identity  Description
AT4G10890.1     1.5e-29   50.34     unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439...
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                     Source       Source Term  Source Description                 Alignment
IPR018838  Domain of unknown function DUF2439  PFAM         PF10382      DUF2439                            coord: 159..234, e-value: 7.3E-20, score: 71.1
IPR018838  Domain of unknown function DUF2439  PFAM         PF10382      DUF2439                            coord: 272..348, e-value: 3.1E-15, score: 56.2
IPR018838  Domain of unknown function DUF2439  PFAM         PF10382      DUF2439                            coord: 5..74, e-value: 1.2E-14, score: 54.3
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                coord: 241..255
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                coord: 232..255
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                coord: 433..458
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                coord: 434..451
None       No IPR available                    PANTHER      PTHR28535    ZINC FINGER GRF-TYPE CONTAINING 1  coord: 269..352
None       No IPR available                    PANTHER      PTHR28535    ZINC FINGER GRF-TYPE CONTAINING 1  coord: 1..262
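
The three PFAM PF10382 (DUF2439) hits in the table above can be ordered along the protein and checked for overlap. This is a minimal sketch, assuming the coordinates are 1-based, inclusive residue positions; the helper names span_length and non_overlapping are hypothetical and not part of any InterPro tooling.

```python
# PF10382 (DUF2439) hit coordinates copied from the InterPro table above.
# Assumption: coordinates are 1-based and inclusive residue positions.
hits = [(159, 234), (272, 348), (5, 74)]

def span_length(start, end):
    """Number of residues covered by an inclusive start..end span."""
    return end - start + 1

def non_overlapping(spans):
    """True if no span shares a residue with the next one in sorted order."""
    ordered = sorted(spans)
    return all(
        prev_end < next_start
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
    )

ordered_hits = sorted(hits)
lengths = [span_length(start, end) for start, end in ordered_hits]
print(ordered_hits, lengths, non_overlapping(hits))
```

Under that assumption, the hits span 70, 76 and 77 residues and do not overlap, which is consistent with three separate copies of the domain in the predicted protein.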

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc01G01860.2  Clc01G01860.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006302      double-strand break repair
cellular_component  GO:0035861      site of double-strand break