Clc02G03770 (gene) Watermelon (cordophanus) v2

Overview
NameClc02G03770
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionArabidopsis protein of unknown function (DUF241)
LocationClcChr02: 3287357 .. 3288617 (+)
RNA-Seq ExpressionClc02G03770
SyntenyClc02G03770
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AGGAGACAAGCTTTTCTGCATAAGAGAAATCCGTTCAAGTTTCATGGCTGTCTTTCAGTAAACTAATGGCTTTCCTAAACAAAAGGGAAATATTAAGAGGTTACAGGCCAGCTTTTAACATGGTGGAAGGGAGACATGGCAGAAACATTGCTGATATGGTGGTTGCAGATCCATATTACAATCATATAACTTAAGGATTTTATTTTATTTCTTCAGAATTTCAATAATGTAGTTGAAGGTTCCTTCTGTCCCCTTGGCCTTATATAAATTTCCCTGATGCAAGAAAATGATCATAAACTGAACATATTCAATTGATTTGTAATTCAAAAGCTAGGCCTTTGTGTCTTCTCAAGATTCAAATCAAGAAAGAAAGATGGATACCTCTGCTTTGAACCAAAAAAACTCTCATCATGTTCGTTCAAACAGCTTGCCTTCAAAGCCACACCCGTTCATCGATCAAGTCGATGAGCAGTTATGCAGATTGAAGGAGGCTTCAGACGCTACATCTTCTGCTTCATCATCTGAACTATGCCATAAATTAAATGGCCTTCAGGATTTGCATGATTCCATTGATAGGTTGCTTCTGTTGCCTCTCACCCAACATGTTCTTGTTGAAGAGGGTGATAAGAAATCGTTCAACGATTTACTTGAAGGATCTATCAGACTCTTGGATTTGTGTGACATAGCTAAGGATGCATTGTTGCAGACAAAAGAATGTGTGCATGAACTAGAATCCGTTTTGCGCAGAAGAAGGGGAGGTGAAATGTTCATAGCAAGTGAGGTTCAGAGATGCTTAATTTCAAGGAAGTTGATAAAGAAAACAATCTATAAAGCCTTAAAGGCAATTGAAAGCAAAAGTTGTCAGAAAATTCAAGCCACTCCAGCCATTGTTAGCTCGTTGAAACAGGCTGAAGTGGTTGGTTACAACGTCGTCGAATCCTTGTTGTCTTTCCTAGCAGGACCGAAACTTCCATCAAATTCAAGCCGTTGGTCTCTTGTTGGAAAGCTTGTGCAATCCAAAAGGGTAGCTTGTGAAGTTGAAGAAACAAGTAGAAATGAAGTGGCACTGGTTGATGCTGCCTTGCATTCGATTGCCAGTCAAAAAACAAAGAAATCTGATTTCCTTGTTCATGTTGACAATTTGCAGAGCTCATTGAAGATATTTGGTTCAAACATTAAAGAACTTGAAGGCGATCTCGAAGCTCTATACAGGCGCCTTATTAAAATTAGAGTCTCACTTCTCAACATCTACAACTACTAA

mRNA sequence

AGGAGACAAGCTTTTCTGCATAAGAGAAATCCGTTCAAGTTTCATGGCTGTCTTTCAGTAAACTAATGGCTTTCCTAAACAAAAGGGAAATATTAAGAGGTTACAGGCCAGCTTTTAACATGGTGGAAGGGAGACATGGCAGAAACATTGCTGATATGGTGGTTGCAGATCCATATTACAATCATATAACTTAAGGATTTTATTTTATTTCTTCAGAATTTCAATAATGTAGTTGAAGGTTCCTTCTGTCCCCTTGGCCTTATATAAATTTCCCTGATGCAAGAAAATGATCATAAACTGAACATATTCAATTGATTTGTAATTCAAAAGCTAGGCCTTTGTGTCTTCTCAAGATTCAAATCAAGAAAGAAAGATGGATACCTCTGCTTTGAACCAAAAAAACTCTCATCATGTTCGTTCAAACAGCTTGCCTTCAAAGCCACACCCGTTCATCGATCAAGTCGATGAGCAGTTATGCAGATTGAAGGAGGCTTCAGACGCTACATCTTCTGCTTCATCATCTGAACTATGCCATAAATTAAATGGCCTTCAGGATTTGCATGATTCCATTGATAGGTTGCTTCTGTTGCCTCTCACCCAACATGTTCTTGTTGAAGAGGGTGATAAGAAATCGTTCAACGATTTACTTGAAGGATCTATCAGACTCTTGGATTTGTGTGACATAGCTAAGGATGCATTGTTGCAGACAAAAGAATGTGTGCATGAACTAGAATCCGTTTTGCGCAGAAGAAGGGGAGGTGAAATGTTCATAGCAAGTGAGGTTCAGAGATGCTTAATTTCAAGGAAGTTGATAAAGAAAACAATCTATAAAGCCTTAAAGGCAATTGAAAGCAAAAGTTGTCAGAAAATTCAAGCCACTCCAGCCATTGTTAGCTCGTTGAAACAGGCTGAAGTGGTTGGTTACAACGTCGTCGAATCCTTGTTGTCTTTCCTAGCAGGACCGAAACTTCCATCAAATTCAAGCCGTTGGTCTCTTGTTGGAAAGCTTGTGCAATCCAAAAGGGTAGCTTGTGAAGTTGAAGAAACAAGTAGAAATGAAGTGGCACTGGTTGATGCTGCCTTGCATTCGATTGCCAGTCAAAAAACAAAGAAATCTGATTTCCTTGTTCATGTTGACAATTTGCAGAGCTCATTGAAGATATTTGGTTCAAACATTAAAGAACTTGAAGGCGATCTCGAAGCTCTATACAGGCGCCTTATTAAAATTAGAGTCTCACTTCTCAACATCTACAACTACTAA

Coding sequence (CDS)

ATGGATACCTCTGCTTTGAACCAAAAAAACTCTCATCATGTTCGTTCAAACAGCTTGCCTTCAAAGCCACACCCGTTCATCGATCAAGTCGATGAGCAGTTATGCAGATTGAAGGAGGCTTCAGACGCTACATCTTCTGCTTCATCATCTGAACTATGCCATAAATTAAATGGCCTTCAGGATTTGCATGATTCCATTGATAGGTTGCTTCTGTTGCCTCTCACCCAACATGTTCTTGTTGAAGAGGGTGATAAGAAATCGTTCAACGATTTACTTGAAGGATCTATCAGACTCTTGGATTTGTGTGACATAGCTAAGGATGCATTGTTGCAGACAAAAGAATGTGTGCATGAACTAGAATCCGTTTTGCGCAGAAGAAGGGGAGGTGAAATGTTCATAGCAAGTGAGGTTCAGAGATGCTTAATTTCAAGGAAGTTGATAAAGAAAACAATCTATAAAGCCTTAAAGGCAATTGAAAGCAAAAGTTGTCAGAAAATTCAAGCCACTCCAGCCATTGTTAGCTCGTTGAAACAGGCTGAAGTGGTTGGTTACAACGTCGTCGAATCCTTGTTGTCTTTCCTAGCAGGACCGAAACTTCCATCAAATTCAAGCCGTTGGTCTCTTGTTGGAAAGCTTGTGCAATCCAAAAGGGTAGCTTGTGAAGTTGAAGAAACAAGTAGAAATGAAGTGGCACTGGTTGATGCTGCCTTGCATTCGATTGCCAGTCAAAAAACAAAGAAATCTGATTTCCTTGTTCATGTTGACAATTTGCAGAGCTCATTGAAGATATTTGGTTCAAACATTAAAGAACTTGAAGGCGATCTCGAAGCTCTATACAGGCGCCTTATTAAAATTAGAGTCTCACTTCTCAACATCTACAACTACTAA

Protein sequence

MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQDLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELESVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAEVVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSIASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY
Homology
BLAST of Clc02G03770 vs. NCBI nr
Match: XP_038888781.1 (uncharacterized protein LOC120078575 [Benincasa hispida])

HSP 1 Score: 496.1 bits (1276), Expect = 2.1e-136
Identity = 261/295 (88.47%), Postives = 277/295 (93.90%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SA+N  NSHHVRSNSLPSKPHPFI+QVDEQLCRL+EAS+ATS  SSSE+CHKLNGLQ
Sbjct: 1   MDASAMNPNNSHHVRSNSLPSKPHPFIEQVDEQLCRLEEASEATS--SSSEICHKLNGLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           DLHDSIDRLLLLPLT+HV VEE  +KSF+DLLEGS R+LDLCDIAKDALLQTKECV ELE
Sbjct: 61  DLHDSIDRLLLLPLTEHVFVEESCEKSFDDLLEGSFRILDLCDIAKDALLQTKECVQELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRR+GGEMFIASE+Q+CL SRKLIKKTIYKALKAIE KSC+K QAT AIVSSLKQAE
Sbjct: 121 SVLRRRKGGEMFIASEIQKCLNSRKLIKKTIYKALKAIERKSCEKSQATSAIVSSLKQAE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
           VVGYNVV+SLLSFLAGPKLPSNSSRWSLV KLVQSKRVACEVEETSRNEVALVDAAL SI
Sbjct: 181 VVGYNVVKSLLSFLAGPKLPSNSSRWSLVSKLVQSKRVACEVEETSRNEVALVDAALQSI 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
           ASQKTKKSDFLV VDNLQSSLK+FGSNI+ELEGDLEALYRRLIK RVSLLNIYNY
Sbjct: 241 ASQKTKKSDFLVQVDNLQSSLKVFGSNIEELEGDLEALYRRLIKTRVSLLNIYNY 293

BLAST of Clc02G03770 vs. NCBI nr
Match: XP_004144946.1 (uncharacterized protein LOC101212572 [Cucumis sativus] >KGN43324.1 hypothetical protein Csa_020274 [Cucumis sativus])

HSP 1 Score: 482.6 bits (1241), Expect = 2.4e-132
Identity = 257/295 (87.12%), Postives = 272/295 (92.20%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SALN KNSHHVRSNSLPSKPHPFIDQVDEQLCRLKE S ATS  SSSELCHKLN LQ
Sbjct: 1   MDASALNPKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEVSKATS--SSSELCHKLNDLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           DLHDSIDR+LLL  TQH+LVEE DKKSFNDLLEGSI+LLDLCDIAKDALLQ+KECVHELE
Sbjct: 61  DLHDSIDRMLLLSHTQHILVEESDKKSFNDLLEGSIKLLDLCDIAKDALLQSKECVHELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRRRGGEMFIASEVQ+ L SRKLIKKTI KALKAIE+KSC+K QA+ AIVSSLKQAE
Sbjct: 121 SVLRRRRGGEMFIASEVQKGLSSRKLIKKTINKALKAIETKSCEKSQASSAIVSSLKQAE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
           VVGYNVV+SLLS+LAGPK  SNSS WSLV KLVQSKRVACEVEET+RNEVALVDAALHSI
Sbjct: 181 VVGYNVVKSLLSYLAGPKFSSNSSHWSLVSKLVQSKRVACEVEETNRNEVALVDAALHSI 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
           ASQKTKKSDF V VDNLQ++LKIFGSNI++LEGDLEALYR LIK RVSLLNIYNY
Sbjct: 241 ASQKTKKSDFRVQVDNLQNALKIFGSNIEDLEGDLEALYRHLIKTRVSLLNIYNY 293

BLAST of Clc02G03770 vs. NCBI nr
Match: XP_022136146.1 (uncharacterized protein LOC111007910 [Momordica charantia])

HSP 1 Score: 478.0 bits (1229), Expect = 5.8e-131
Identity = 248/295 (84.07%), Postives = 269/295 (91.19%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SALN +NSHH+RSNS PSKPHPFIDQVDE+LCRLKEAS+ATSS SSSELCHKLNGLQ
Sbjct: 1   MDASALNPRNSHHIRSNSWPSKPHPFIDQVDERLCRLKEASEATSS-SSSELCHKLNGLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           DLHD IDRLLLLPLTQ  + +E DKK F+DLLEGS+RLLDLCDIAKDALLQTKECVHELE
Sbjct: 61  DLHDCIDRLLLLPLTQQAIAQESDKKWFDDLLEGSVRLLDLCDIAKDALLQTKECVHELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRRRGGE+FIASE+Q+CL SRKLIKKTIYKALK IESKSC+K QATPAIVS LKQ E
Sbjct: 121 SVLRRRRGGELFIASELQKCLSSRKLIKKTIYKALKNIESKSCEKSQATPAIVSLLKQVE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
            V YNV+ESLLSF+AGPKLPSNSSRWSLV K+VQ KRVACEVEE S NEVA+VDA L+S+
Sbjct: 181 AVSYNVIESLLSFIAGPKLPSNSSRWSLVSKIVQPKRVACEVEEASTNEVAIVDATLYSV 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
           ASQKTKKSD LV VDNLQSSLKIFGSNI+E+EGDLEALYR LIK RVSLLNI+NY
Sbjct: 241 ASQKTKKSDLLVQVDNLQSSLKIFGSNIQEVEGDLEALYRLLIKTRVSLLNIFNY 294

BLAST of Clc02G03770 vs. NCBI nr
Match: XP_022969195.1 (uncharacterized protein LOC111468263 [Cucurbita maxima])

HSP 1 Score: 476.9 bits (1226), Expect = 1.3e-130
Identity = 250/295 (84.75%), Postives = 269/295 (91.19%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SA+NQ NSHHVRSNSLPSK HPFIDQVDE L RLKEAS+ATSS SSSEL HKLNGLQ
Sbjct: 1   MDFSAMNQTNSHHVRSNSLPSKLHPFIDQVDEHLHRLKEASEATSSCSSSELSHKLNGLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           +LHD ID+LLLLPLTQHVLVE+ D K F+DLL+GSIRLLDLCDIAKDALLQTKECV ELE
Sbjct: 61  ELHDYIDKLLLLPLTQHVLVEDSDTKPFDDLLKGSIRLLDLCDIAKDALLQTKECVQELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRR+G E FIASE+Q+CL SRKLIKKTIYKALK +++KSC+K QATPAIVSSLKQAE
Sbjct: 121 SVLRRRKGSETFIASELQKCLSSRKLIKKTIYKALKTVQTKSCEKTQATPAIVSSLKQAE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
            VGYNVVESLLSFLAGPK  SNSSRWSLV KLVQSKRVACEVEETSRNEVALVDAALHS+
Sbjct: 181 FVGYNVVESLLSFLAGPKFRSNSSRWSLVSKLVQSKRVACEVEETSRNEVALVDAALHSV 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
            SQKTKK DFL  V+NLQSSLK+FGSNI+ELEGDLEALYRRLIK RVS+LNIYNY
Sbjct: 241 CSQKTKKHDFLAQVENLQSSLKMFGSNIQELEGDLEALYRRLIKTRVSILNIYNY 295

BLAST of Clc02G03770 vs. NCBI nr
Match: XP_023554559.1 (uncharacterized protein LOC111811765 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 475.3 bits (1222), Expect = 3.8e-130
Identity = 248/295 (84.07%), Postives = 268/295 (90.85%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SA+NQ +SHH+RSNSLPSKPHPFIDQVDE L R KEAS+ATSS SSSEL HKLNGLQ
Sbjct: 1   MDFSAMNQTDSHHLRSNSLPSKPHPFIDQVDEHLHRFKEASEATSSCSSSELSHKLNGLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           +LHD ID LLLLPLTQHVLVE+ D K F+DLLEGSIRLLDLCDIAKDALLQTKECV ELE
Sbjct: 61  ELHDCIDNLLLLPLTQHVLVEDSDTKPFDDLLEGSIRLLDLCDIAKDALLQTKECVQELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRR+G E FIASE+Q+CL SRKLIKKTIYKALK +++KSC++ QATPAIVSSLKQAE
Sbjct: 121 SVLRRRKGSETFIASELQKCLSSRKLIKKTIYKALKTVQTKSCEQTQATPAIVSSLKQAE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
            VGYNVVESLLSFLAGPK  SNSSRWSLV KLVQSKRVACEVEETSRNEVALVDAALHS+
Sbjct: 181 DVGYNVVESLLSFLAGPKFRSNSSRWSLVSKLVQSKRVACEVEETSRNEVALVDAALHSV 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
            SQKTKK DFL  V+NLQSSLK+FGSNI+ELEGDLEALYRRLIK RVS+LNIYNY
Sbjct: 241 CSQKTKKHDFLAQVENLQSSLKVFGSNIQELEGDLEALYRRLIKTRVSVLNIYNY 295

BLAST of Clc02G03770 vs. ExPASy TrEMBL
Match: A0A0A0K2T0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G023980 PE=4 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 1.1e-132
Identity = 257/295 (87.12%), Postives = 272/295 (92.20%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SALN KNSHHVRSNSLPSKPHPFIDQVDEQLCRLKE S ATS  SSSELCHKLN LQ
Sbjct: 1   MDASALNPKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEVSKATS--SSSELCHKLNDLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           DLHDSIDR+LLL  TQH+LVEE DKKSFNDLLEGSI+LLDLCDIAKDALLQ+KECVHELE
Sbjct: 61  DLHDSIDRMLLLSHTQHILVEESDKKSFNDLLEGSIKLLDLCDIAKDALLQSKECVHELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRRRGGEMFIASEVQ+ L SRKLIKKTI KALKAIE+KSC+K QA+ AIVSSLKQAE
Sbjct: 121 SVLRRRRGGEMFIASEVQKGLSSRKLIKKTINKALKAIETKSCEKSQASSAIVSSLKQAE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
           VVGYNVV+SLLS+LAGPK  SNSS WSLV KLVQSKRVACEVEET+RNEVALVDAALHSI
Sbjct: 181 VVGYNVVKSLLSYLAGPKFSSNSSHWSLVSKLVQSKRVACEVEETNRNEVALVDAALHSI 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
           ASQKTKKSDF V VDNLQ++LKIFGSNI++LEGDLEALYR LIK RVSLLNIYNY
Sbjct: 241 ASQKTKKSDFRVQVDNLQNALKIFGSNIEDLEGDLEALYRHLIKTRVSLLNIYNY 293

BLAST of Clc02G03770 vs. ExPASy TrEMBL
Match: A0A6J1C2Q0 (uncharacterized protein LOC111007910 OS=Momordica charantia OX=3673 GN=LOC111007910 PE=4 SV=1)

HSP 1 Score: 478.0 bits (1229), Expect = 2.8e-131
Identity = 248/295 (84.07%), Postives = 269/295 (91.19%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SALN +NSHH+RSNS PSKPHPFIDQVDE+LCRLKEAS+ATSS SSSELCHKLNGLQ
Sbjct: 1   MDASALNPRNSHHIRSNSWPSKPHPFIDQVDERLCRLKEASEATSS-SSSELCHKLNGLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           DLHD IDRLLLLPLTQ  + +E DKK F+DLLEGS+RLLDLCDIAKDALLQTKECVHELE
Sbjct: 61  DLHDCIDRLLLLPLTQQAIAQESDKKWFDDLLEGSVRLLDLCDIAKDALLQTKECVHELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRRRGGE+FIASE+Q+CL SRKLIKKTIYKALK IESKSC+K QATPAIVS LKQ E
Sbjct: 121 SVLRRRRGGELFIASELQKCLSSRKLIKKTIYKALKNIESKSCEKSQATPAIVSLLKQVE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
            V YNV+ESLLSF+AGPKLPSNSSRWSLV K+VQ KRVACEVEE S NEVA+VDA L+S+
Sbjct: 181 AVSYNVIESLLSFIAGPKLPSNSSRWSLVSKIVQPKRVACEVEEASTNEVAIVDATLYSV 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
           ASQKTKKSD LV VDNLQSSLKIFGSNI+E+EGDLEALYR LIK RVSLLNI+NY
Sbjct: 241 ASQKTKKSDLLVQVDNLQSSLKIFGSNIQEVEGDLEALYRLLIKTRVSLLNIFNY 294

BLAST of Clc02G03770 vs. ExPASy TrEMBL
Match: A0A6J1HZA2 (uncharacterized protein LOC111468263 OS=Cucurbita maxima OX=3661 GN=LOC111468263 PE=4 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 6.3e-131
Identity = 250/295 (84.75%), Postives = 269/295 (91.19%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SA+NQ NSHHVRSNSLPSK HPFIDQVDE L RLKEAS+ATSS SSSEL HKLNGLQ
Sbjct: 1   MDFSAMNQTNSHHVRSNSLPSKLHPFIDQVDEHLHRLKEASEATSSCSSSELSHKLNGLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           +LHD ID+LLLLPLTQHVLVE+ D K F+DLL+GSIRLLDLCDIAKDALLQTKECV ELE
Sbjct: 61  ELHDYIDKLLLLPLTQHVLVEDSDTKPFDDLLKGSIRLLDLCDIAKDALLQTKECVQELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRR+G E FIASE+Q+CL SRKLIKKTIYKALK +++KSC+K QATPAIVSSLKQAE
Sbjct: 121 SVLRRRKGSETFIASELQKCLSSRKLIKKTIYKALKTVQTKSCEKTQATPAIVSSLKQAE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
            VGYNVVESLLSFLAGPK  SNSSRWSLV KLVQSKRVACEVEETSRNEVALVDAALHS+
Sbjct: 181 FVGYNVVESLLSFLAGPKFRSNSSRWSLVSKLVQSKRVACEVEETSRNEVALVDAALHSV 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
            SQKTKK DFL  V+NLQSSLK+FGSNI+ELEGDLEALYRRLIK RVS+LNIYNY
Sbjct: 241 CSQKTKKHDFLAQVENLQSSLKMFGSNIQELEGDLEALYRRLIKTRVSILNIYNY 295

BLAST of Clc02G03770 vs. ExPASy TrEMBL
Match: A0A6J1GLF0 (uncharacterized protein LOC111455338 OS=Cucurbita moschata OX=3662 GN=LOC111455338 PE=4 SV=1)

HSP 1 Score: 474.9 bits (1221), Expect = 2.4e-130
Identity = 247/295 (83.73%), Postives = 267/295 (90.51%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SA+NQ NSHH+RSNSLPSKPHPFIDQVDE L R KEAS+ATSS SSSEL HKLNGLQ
Sbjct: 1   MDFSAMNQTNSHHLRSNSLPSKPHPFIDQVDEHLHRFKEASEATSSCSSSELSHKLNGLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           +LHD ID LLLLPLTQHVLVE+ D K F+DLLEGSIRLLDLCDIAK+ALLQTKECV ELE
Sbjct: 61  ELHDCIDNLLLLPLTQHVLVEDSDTKPFDDLLEGSIRLLDLCDIAKEALLQTKECVQELE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRR+G E FIASE+Q+CL SRKLIKKTIYKALK +++KSC+K QATPA+VSSLKQAE
Sbjct: 121 SVLRRRKGSETFIASELQKCLSSRKLIKKTIYKALKTVQTKSCEKTQATPAVVSSLKQAE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
            V YNVVESLLSFLAGPK  SNSSRWSLV KLVQSKRVACEVEETSRNEVALVDAALHS+
Sbjct: 181 FVSYNVVESLLSFLAGPKFRSNSSRWSLVSKLVQSKRVACEVEETSRNEVALVDAALHSV 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
            SQKTKK DFL  V+NLQSSLK+FGSNI+ELEGDLEALYRRLIK RVS+LNIYNY
Sbjct: 241 CSQKTKKHDFLAQVENLQSSLKMFGSNIQELEGDLEALYRRLIKTRVSILNIYNY 295

BLAST of Clc02G03770 vs. ExPASy TrEMBL
Match: A0A5D3DJA0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G003570 PE=4 SV=1)

HSP 1 Score: 464.9 bits (1195), Expect = 2.5e-127
Identity = 249/295 (84.41%), Postives = 266/295 (90.17%), Query Frame = 0

Query: 1   MDTSALNQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQ 60
           MD SALN KNSHHVRSNSLPSK HPFIDQVDEQLCRLK+AS ATS  SSSELCHKLN LQ
Sbjct: 1   MDPSALNPKNSHHVRSNSLPSKLHPFIDQVDEQLCRLKQASKATS--SSSELCHKLNDLQ 60

Query: 61  DLHDSIDRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELE 120
           DLH+SIDR+LLL  TQH+ VEE DKKSFNDLLEGSI+LLDLCDIAKDALLQ+KECVH+LE
Sbjct: 61  DLHESIDRMLLLSHTQHIQVEESDKKSFNDLLEGSIKLLDLCDIAKDALLQSKECVHKLE 120

Query: 121 SVLRRRRGGEMFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAE 180
           SVLRRR GGE  IASEVQ+CL SRKLIKKTI KALKAIE+KSC+K QA+ AIVSSLKQAE
Sbjct: 121 SVLRRRGGGETLIASEVQKCLSSRKLIKKTINKALKAIETKSCEKSQASSAIVSSLKQAE 180

Query: 181 VVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSI 240
           V G NVVESLLS+LAGPK  +NSS WSLV KLVQSKRVACEVEETSRNEVALVDAALHSI
Sbjct: 181 VAGSNVVESLLSYLAGPKFSTNSSHWSLVSKLVQSKRVACEVEETSRNEVALVDAALHSI 240

Query: 241 ASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
           ASQKTK SDF V VDNLQ++LKIFGSNI++LEGDLEALYR LIK RVSLLNIYNY
Sbjct: 241 ASQKTKNSDFHVQVDNLQNALKIFGSNIEDLEGDLEALYRHLIKTRVSLLNIYNY 293

BLAST of Clc02G03770 vs. TAIR 10
Match: AT2G17080.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 193.0 bits (489), Expect = 3.5e-49
Identity = 122/285 (42.81%), Postives = 181/285 (63.51%), Query Frame = 0

Query: 11  SHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQDLHDSIDRLL 70
           S HVRSNS PS+ HP    VDEQL RL+ +S+  SS+SSS +C +L+ LQ+LH+S+D+L+
Sbjct: 4   SFHVRSNSFPSRSHPQAAHVDEQLARLR-SSEQASSSSSSSICQRLDNLQELHESLDKLI 63

Query: 71  LLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELESVLRRRRGGE 130
             P+TQ  L +E +KK+   LL+GS+R+LDLC+I+KDAL + KE + E++S+LRR+RG  
Sbjct: 64  SRPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEGLMEIQSILRRKRGD- 123

Query: 131 MFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAEVVGYNVVESL 190
             ++ EV++ L SRK +KK+  K  K++  K  Q        ++   +AE +  ++ +SL
Sbjct: 124 --LSEEVKKYLTSRKSLKKSFQKVQKSL--KVTQAEDNNDDTLAVFGEAEAITLSLFDSL 183

Query: 191 LSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSIASQKTKKSDF 250
           LS+++G K     S+WS+V KL+  K+V CE +E   NE   VD+      S+KT K D 
Sbjct: 184 LSYMSGSK---TCSKWSVVSKLMNKKKVTCEAQE---NEFTKVDS---EFQSEKTLKMD- 243

Query: 251 LVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
              V NL+S        I++LE  LE+L + LIK RVS LNI  +
Sbjct: 244 --DVQNLESC-------IQDLEDGLESLSKSLIKYRVSFLNILGH 263

BLAST of Clc02G03770 vs. TAIR 10
Match: AT2G17070.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 188.7 bits (478), Expect = 6.5e-48
Identity = 117/285 (41.05%), Postives = 179/285 (62.81%), Query Frame = 0

Query: 11  SHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQDLHDSIDRLL 70
           S HVRS+S PS PHP    VDEQL RL+ +S+ TS++SSS +C +L+ LQ+LH+S+D+L+
Sbjct: 4   SFHVRSHSYPSIPHPQAAHVDEQLARLR-SSEETSTSSSSSICQRLDNLQELHESLDKLI 63

Query: 71  LLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELESVLRRRRGGE 130
            LP+TQ  L +E +KK    LL+GS+++LD+C+I+KDAL Q KE + E++S+LRR+RG  
Sbjct: 64  RLPVTQQALGQEKNKKDVEQLLDGSLKILDVCNISKDALSQMKEGLMEIQSILRRKRGD- 123

Query: 131 MFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAEVVGYNVVESL 190
             ++ EV++ L SRK  KKT  K  K++  K+ Q        ++   +AE V   + +SL
Sbjct: 124 --LSGEVKKYLASRKSFKKTFQKVQKSL--KAAQAEDNKDKSLAVFGEAEAVTIAMFDSL 183

Query: 191 LSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSIASQKTKKSDF 250
            S+++G K     S+WS+V KL+  K++ CE +E   NE   VD+      S+KT K + 
Sbjct: 184 FSYMSGSK---TCSKWSVVSKLMNKKKITCEAQE---NEFTKVDS---EFQSEKTLKME- 243

Query: 251 LVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIYNY 296
                     ++I  S I++ E  LE+L + LIK RVS+LN + +
Sbjct: 244 ---------DVQILESCIQDFEDGLESLSKSLIKYRVSILNSFGH 263

BLAST of Clc02G03770 vs. TAIR 10
Match: AT4G35200.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 179.1 bits (453), Expect = 5.2e-45
Identity = 117/282 (41.49%), Postives = 175/282 (62.06%), Query Frame = 0

Query: 11  SHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQDLHDSIDRLL 70
           S HVRSNS PS+ HP    VDEQL RL+    ++ SASSS +C +L+ LQDLHDS+++++
Sbjct: 4   SFHVRSNSYPSRQHPQAAHVDEQLTRLR----SSDSASSSSICQRLSNLQDLHDSLEKMI 63

Query: 71  LLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELESVLRRRRGGE 130
            L +T   L ++  +K    LL+GS+R+LDLC+IAKDA+ Q KE + E++S+LRR+ G  
Sbjct: 64  RLSVTNLALSQDQIEK----LLDGSLRILDLCNIAKDAISQMKEGLMEIQSILRRKPGD- 123

Query: 131 MFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAEVVGYNVVESL 190
             ++ EV++ L+SRK +KK++ K +K++  K CQ   +T A +    +AE V   + ESL
Sbjct: 124 --LSGEVKKYLVSRKFLKKSLQKVIKSL--KVCQSKDSTNASLVVFGRAEAVTMALFESL 183

Query: 191 LSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSIASQKTKKSDF 250
            SF++G K      +WSLV K++   +V CE E    NE   +D+   S      +KS  
Sbjct: 184 FSFMSGSKA---CGKWSLVSKMMSQNKVTCEAE---ANEFTRIDSEFQS------EKSLQ 243

Query: 251 LVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNI 293
           +  V NL+S        I++LE  +E+L + LIK RVS+LNI
Sbjct: 244 MEDVQNLESC-------IQDLEDGIESLSKSLIKYRVSILNI 253

BLAST of Clc02G03770 vs. TAIR 10
Match: AT4G35210.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 160.6 bits (405), Expect = 1.9e-39
Identity = 109/282 (38.65%), Postives = 167/282 (59.22%), Query Frame = 0

Query: 11  SHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQDLHDSIDRLL 70
           S HVRS+S PS+ HP    VDEQL RL+    ++ +ASSS +C +L+ LQDLHDS+++++
Sbjct: 4   SFHVRSSSYPSRQHPQAAHVDEQLTRLR----SSGTASSSSICQRLSNLQDLHDSLEKMI 63

Query: 71  LLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELESVLRRRRGGE 130
            L +T   L ++  +K    LL+GSI++LDLC I+KD L Q KE + E++S++RR+RG  
Sbjct: 64  RLSVTNQALSQDQIEK----LLDGSIKILDLCSISKDGLSQMKESLKEIQSIVRRKRGD- 123

Query: 131 MFIASEVQRCLISRKLIKKTIYKALKAIESKSCQKIQATPAIVSSLKQAEVVGYNVVESL 190
             +++EV++ L SRK +KK+  K LK++++      Q     ++   +AE V   + ESL
Sbjct: 124 --LSAEVKKYLASRKFLKKSFEKVLKSLKTS-----QNKNDALAVFGEAETVTIALFESL 183

Query: 191 LSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALVDAALHSIASQKTKKSDF 250
            SF++G K      +WSLV K++   +  CE E    NE   VD    S      +KS  
Sbjct: 184 FSFMSGSKA---CGKWSLVSKMMSQSKGTCEAE---ANEFTRVDMEFQS------EKSLQ 243

Query: 251 LVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNI 293
           +  V NL+         I++LE  + +L + LIK RVS+LNI
Sbjct: 244 MEDVQNLEIC-------IQDLEDGIGSLSKSLIKYRVSILNI 250

BLAST of Clc02G03770 vs. TAIR 10
Match: AT4G35690.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 100.9 bits (250), Expect = 1.8e-21
Identity = 83/302 (27.48%), Postives = 154/302 (50.99%), Query Frame = 0

Query: 7   NQKNSHHVRSNSLPSKPHPFIDQVDEQLCRLKEASDATSSASSSELCHKLNGLQDLHDSI 66
           N    + +RS SLPS  HP    ++E L ++K  +  T + SS  +   L GL++L++  
Sbjct: 3   NMLVKNQLRSISLPSSSHPSTTGIEESLNKVKTIN--TMTGSSESVLMGLEGLEELYNCT 62

Query: 67  DRLLLLPLTQHVLVEEGDKKSFNDLLEGSIRLLDLCDIAKDALLQTKECVHELESVLRRR 126
           +  L +  TQ V+      +   ++L+GS+RL+D+C +++D +++T+E V  ++S +RR+
Sbjct: 63  EDFLKMGSTQRVMSSSDGSEFMEEMLDGSLRLMDICSVSRDLMVETQEHVRGVQSCVRRK 122

Query: 127 R--GGEMFIASEVQRCLISRKLIKKTIYKALKAIES-----------KSCQKIQATPAIV 186
           +  GGE  +   V   +  RK ++K   + L ++++            + ++ +    +V
Sbjct: 123 KVVGGEDQLDVAVAGYVGFRKNMRKEAKRLLGSLKNIDGGLSSSSSVNNGEQEEHLVVVV 182

Query: 187 SSLKQAEVVGYNVVESLLSFLAGPKLPSNSSRWSLVGKLVQSKRVACEVEETSRNEVALV 246
            +++Q   V   V+ S L FL+G +  +  S+ + V K    K+    VEET +NE+  +
Sbjct: 183 DAMRQVVSVSVAVLRSFLEFLSGRRQSNIKSKLASVLK----KKKVHHVEET-KNELENL 242

Query: 247 DAALHSIASQKTKKSDFLVHVDNLQSSLKIFGSNIKELEGDLEALYRRLIKIRVSLLNIY 296
           D              +     ++LQ  L+    +I   E  LE L+RRLI+ R SLLNI 
Sbjct: 243 DL-------------EIFCSRNDLQKKLEEVEMSIDGFEKKLEGLFRRLIRTRASLLNII 284

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038888781.12.1e-13688.47uncharacterized protein LOC120078575 [Benincasa hispida][more]
XP_004144946.12.4e-13287.12uncharacterized protein LOC101212572 [Cucumis sativus] >KGN43324.1 hypothetical ... [more]
XP_022136146.15.8e-13184.07uncharacterized protein LOC111007910 [Momordica charantia][more]
XP_022969195.11.3e-13084.75uncharacterized protein LOC111468263 [Cucurbita maxima][more]
XP_023554559.13.8e-13084.07uncharacterized protein LOC111811765 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0K2T01.1e-13287.12Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G023980 PE=4 SV=1[more]
A0A6J1C2Q02.8e-13184.07uncharacterized protein LOC111007910 OS=Momordica charantia OX=3673 GN=LOC111007... [more]
A0A6J1HZA26.3e-13184.75uncharacterized protein LOC111468263 OS=Cucurbita maxima OX=3661 GN=LOC111468263... [more]
A0A6J1GLF02.4e-13083.73uncharacterized protein LOC111455338 OS=Cucurbita moschata OX=3662 GN=LOC1114553... [more]
A0A5D3DJA02.5e-12784.41Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
Match NameE-valueIdentityDescription
AT2G17080.13.5e-4942.81Arabidopsis protein of unknown function (DUF241) [more]
AT2G17070.16.5e-4841.05Arabidopsis protein of unknown function (DUF241) [more]
AT4G35200.15.2e-4541.49Arabidopsis protein of unknown function (DUF241) [more]
AT4G35210.11.9e-3938.65Arabidopsis protein of unknown function (DUF241) [more]
AT4G35690.11.8e-2127.48Arabidopsis protein of unknown function (DUF241) [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004320Protein of unknown function DUF241, plantPFAMPF03087DUF241coord: 55..292
e-value: 1.1E-64
score: 218.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..18
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..25
NoneNo IPR availablePANTHERPTHR31509:SF109SUBFAMILY NOT NAMEDcoord: 1..294
NoneNo IPR availablePANTHERPTHR31509BPS1-LIKE PROTEINcoord: 1..294

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc02G03770.1Clc02G03770.1mRNA