ClCG02G010920 (gene) Watermelon (Charleston Gray)

NameClCG02G010920
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionArabidopsis protein of unknown function (DUF241) LENGTH=250
LocationCG_Chr02 : 22434309 .. 22435956 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAGGTTCCTCCAATTACCATGCTCGTTCCAACAGCCTGCCGACGAGGCCTCACCCCCTTGTGACAGAGTGCGACGAACACCTATGCCGGCTAAAGGCACTGGACTCGGATCCATGGACGGCTTCGGGGATGGCAAAGAAGCTGGCGGGGCTGCAGGACTTGCAGGAGTGCGTGAACAAGCTGCTGCTGCTGCGGCGGACGAGGGACGCCTTTGCGGCACAGCGCCGGGAGAAATGGGTGGATGAGGTGTTTGATGGATCCTTGAGATTGCTGGACTTGTGTAGCGCTTCTAAGGATGGTGTTATTCATACTAAGGAGTGTGTGAGGGAGTTGCAATCTTTGGTTAGGAGAAGATCCTCTTCTTGGTGTTGTGGAAATGGTATTGCTAATCAGGTTTGTATTTCCAATTCTTATGCACACAAAGTTAATAAGTTGGAGTAAATTTATTTTCAAGAGAATTGATTGATTTTTACGTCTGGAATAATATTATTGAAAAATGTGTTGATAAGTCTTGATTTGATTTGATTTTGGAGTTGATAAATATCGCGATGTTTTTTTAGCTTAAACTTGGATGAGTATTGCAATGGACAATTTGGTCTTCTTGCAATCGCCATATATTTATATCGTAGAGTATATGGGACGATTCTAGAGAGAGAAAGATGTTCAAATTGGATTTTGTTAAACTTTGTGAGTTTTGAGAGTTTTATTCAATATTGTTTGATTTGGGTTCATTTTTGCGGTAAAAAATAAAATTCTTTTAATTGAAATTGGTGATTTTTGAGATAGTTTTTTTTAGTTTAATGATTTGAGAGCAAGGAGAGATTCAAACTTCTAATCTCTTGGTCAAAAGATAAACACGATGATAATTAAGTTATACTCAACTCAATATGATTTTTTTAGAAATAGTGTTTACATTATCTTTTTTTTATTTTTTTTTTATTTTTTATTTTTTATTTTTTATATTTTATATTTGATTAGTCAATGGATGTATTAGAAATATAAAGCTAGTGGTAAAAAATAATACACCTGTCCAAATTTCTTAGTTTTCCTGAATTTTGTACCTGTCCTATTCCAATACCATATTAAACTATCTGATGAACACAAAAACTGATAACTTTAACCTTTTGATTAACAGGTGGAGAAATATTTAGCCTCAAGAAAAGTAGTGAAAAGAGCCATCCAAAAGGCTTTAGCCAGCGTGAAGACTTATGGGGTAAAACCAAGTTCAATCTCCATTAAAGATGCCGAAACCATGGCCTTAATAACCCTATTATTTGACGTAGAAGTGGCAAGCGTGAATGTTTTTGAAGCCCTCTTGTGCTACGTTTTGGGGAAGAAAGGGAAGGCCAAGGGTAGCGGTTGGGCGCTAGTTTCGAAGCTGATGCGCTCCAAACGAGCGCTTTTAACGGAGGAGGGTGTAGAAGGAGAGGCAAATGAGTTCGCCGCTGTGGATGCAGCGGTCGACACGGTGGCGAGCAGATTAGCCTCTGATAGCACGGCTGTAGGCAGTGGCGGCGTGGAGAGCATGGGGGATCAATTAGGGAAGTTGGAGGCATGTGTTCAAGATTTAGAAGGAGGATTGGAGGGTTTGTTTAGGAGATTGATAAGAAATAGGGTTTCCCTTTTGAATATTATCAATAACTAA

mRNA sequence

ATGGGAGGTTCCTCCAATTACCATGCTCGTTCCAACAGCCTGCCGACGAGGCCTCACCCCCTTGTGACAGAGTGCGACGAACACCTATGCCGGCTAAAGGCACTGGACTCGGATCCATGGACGGCTTCGGGGATGGCAAAGAAGCTGGCGGGGCTGCAGGACTTGCAGGAGTGCGTGAACAAGCTGCTGCTGCTGCGGCGGACGAGGGACGCCTTTGCGGCACAGCGCCGGGAGAAATGGGTGGATGAGGTGTTTGATGGATCCTTGAGATTGCTGGACTTGTGTAGCGCTTCTAAGGATGGTGTTATTCATACTAAGGAGTGTGTGAGGGAGTTGCAATCTTTGGTTAGGAGAAGATCCTCTTCTTGGTGTTGTGGAAATGGTATTGCTAATCAGGTGGAGAAATATTTAGCCTCAAGAAAAGTAGTGAAAAGAGCCATCCAAAAGGCTTTAGCCAGCGTGAAGACTTATGGGGTAAAACCAAGTTCAATCTCCATTAAAGATGCCGAAACCATGGCCTTAATAACCCTATTATTTGACGTAGAAGTGGCAAGCGTGAATGTTTTTGAAGCCCTCTTGTGCTACGTTTTGGGGAAGAAAGGGAAGGCCAAGGGTAGCGGTTGGGCGCTAGTTTCGAAGCTGATGCGCTCCAAACGAGCGCTTTTAACGGAGGAGGGTGTAGAAGGAGAGGCAAATGAGTTCGCCGCTGTGGATGCAGCGGTCGACACGGTGGCGAGCAGATTAGCCTCTGATAGCACGGCTGTAGGCAGTGGCGGCGTGGAGAGCATGGGGGATCAATTAGGGAAGTTGGAGGCATGTGTTCAAGATTTAGAAGGAGGATTGGAGGGTTTGTTTAGGAGATTGATAAGAAATAGGGTTTCCCTTTTGAATATTATCAATAACTAA

Coding sequence (CDS)

ATGGGAGGTTCCTCCAATTACCATGCTCGTTCCAACAGCCTGCCGACGAGGCCTCACCCCCTTGTGACAGAGTGCGACGAACACCTATGCCGGCTAAAGGCACTGGACTCGGATCCATGGACGGCTTCGGGGATGGCAAAGAAGCTGGCGGGGCTGCAGGACTTGCAGGAGTGCGTGAACAAGCTGCTGCTGCTGCGGCGGACGAGGGACGCCTTTGCGGCACAGCGCCGGGAGAAATGGGTGGATGAGGTGTTTGATGGATCCTTGAGATTGCTGGACTTGTGTAGCGCTTCTAAGGATGGTGTTATTCATACTAAGGAGTGTGTGAGGGAGTTGCAATCTTTGGTTAGGAGAAGATCCTCTTCTTGGTGTTGTGGAAATGGTATTGCTAATCAGGTGGAGAAATATTTAGCCTCAAGAAAAGTAGTGAAAAGAGCCATCCAAAAGGCTTTAGCCAGCGTGAAGACTTATGGGGTAAAACCAAGTTCAATCTCCATTAAAGATGCCGAAACCATGGCCTTAATAACCCTATTATTTGACGTAGAAGTGGCAAGCGTGAATGTTTTTGAAGCCCTCTTGTGCTACGTTTTGGGGAAGAAAGGGAAGGCCAAGGGTAGCGGTTGGGCGCTAGTTTCGAAGCTGATGCGCTCCAAACGAGCGCTTTTAACGGAGGAGGGTGTAGAAGGAGAGGCAAATGAGTTCGCCGCTGTGGATGCAGCGGTCGACACGGTGGCGAGCAGATTAGCCTCTGATAGCACGGCTGTAGGCAGTGGCGGCGTGGAGAGCATGGGGGATCAATTAGGGAAGTTGGAGGCATGTGTTCAAGATTTAGAAGGAGGATTGGAGGGTTTGTTTAGGAGATTGATAAGAAATAGGGTTTCCCTTTTGAATATTATCAATAACTAA

Protein sequence

MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVNKLLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSWCCGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFDVEVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAAVDTVASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIINN
BLAST of ClCG02G010920 vs. TrEMBL
Match: A0A0A0KG43_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G127330 PE=4 SV=1)

HSP 1 Score: 552.7 bits (1423), Expect = 2.7e-154
Identity = 280/302 (92.72%), Postives = 288/302 (95.36%), Query Frame = 1

Query: 1   MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVN 60
           MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKA+DS PWTA GMA KLAGLQDLQECVN
Sbjct: 1   MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKAMDSAPWTAFGMANKLAGLQDLQECVN 60

Query: 61  KLLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS 120
           KLLLLRRTRDAFAA RREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS
Sbjct: 61  KLLLLRRTRDAFAAHRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS 120

Query: 121 SSWCCGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFD 180
           SSWCCGNG+ANQVEKYLASRKVVKRAIQKALASVKTYG +PSSISIKD ET+ALITLLFD
Sbjct: 121 SSWCCGNGVANQVEKYLASRKVVKRAIQKALASVKTYGARPSSISIKDTETIALITLLFD 180

Query: 181 VEVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAA 240
           VEVASVNVFEALL YVLGKKGKAKGSGWALVSKLMRSKR LLTE+G EGEANEFA +DAA
Sbjct: 181 VEVASVNVFEALLSYVLGKKGKAKGSGWALVSKLMRSKRVLLTEDGAEGEANEFATIDAA 240

Query: 241 VDTVASRLASD-STAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNII 300
           VD VASRLASD ST +G GG+ESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNII
Sbjct: 241 VDVVASRLASDSSTIIGGGGIESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNII 300

Query: 301 NN 302
           NN
Sbjct: 301 NN 302

BLAST of ClCG02G010920 vs. TrEMBL
Match: W9RKX3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014372 PE=4 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 5.2e-73
Identity = 163/303 (53.80%), Postives = 210/303 (69.31%), Query Frame = 1

Query: 5   SNYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVNKLLL 64
           S YH RSNS PTRPHPL   CDEHLCRL A  S+  ++S ++ KL+GL+DL +CV KL  
Sbjct: 6   SQYHVRSNSFPTRPHPLFQRCDEHLCRLGA--SEATSSSSLSHKLSGLEDLHDCVEKLFQ 65

Query: 65  LRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSWC 124
           L  T+ AF   R EKWVDE  DGSLRLLD+CSA+KD V+HTKEC RE+QS++RRR  +  
Sbjct: 66  LPLTQQAFVHSRHEKWVDEQVDGSLRLLDMCSAAKDAVLHTKECAREVQSIMRRRRGAEV 125

Query: 125 CGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSIS--IKD--AETMALITLLFD 184
              G+A +V KYLASRKVVK+AI+KAL ++K   VK SS S   KD   ET AL+ +L +
Sbjct: 126 ---GLAGEVTKYLASRKVVKKAIRKALENMKA-AVKSSSSSPTNKDMTVETAALVGVLRE 185

Query: 185 VEVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRAL-LTEEGVEGEANEFAAVDA 244
           VE  S+ VFE+LLC++ G K + K SGW+LVSKLM +K+ +   +E  E + NEFA V+ 
Sbjct: 186 VESVSLAVFESLLCFISGPKAQTKLSGWSLVSKLMNNKKKVGCDQEAQETDVNEFAKVEE 245

Query: 245 AVDTVASRLASDSTAV-GSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNI 302
           A+  +   +  D++ V  S  +E+   +L  LE CVQD E  LE LFRRLI+NRVSLLNI
Sbjct: 246 ALQRL---MCHDTSKVENSLHIENAQTELQSLELCVQDFEERLERLFRRLIKNRVSLLNI 299

BLAST of ClCG02G010920 vs. TrEMBL
Match: M5VME9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025877mg PE=4 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 7.0e-70
Identity = 149/301 (49.50%), Postives = 212/301 (70.43%), Query Frame = 1

Query: 5   SNYHARSNSLPTRPHPLVTECDEHLCRLKALD---SDPWTASGMAKKLAGLQDLQECVNK 64
           SNYH RS SLP++PHPL  +C++HL R+ A D   S  +++S +++KL+GL DL  C+N+
Sbjct: 11  SNYHIRSISLPSKPHPLFQQCEDHLLRIAASDASSSSSYSSSSISQKLSGLLDLHNCLNE 70

Query: 65  LLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSS 124
           L  L  T++AF  ++ EKWVDE+ DGSLRLLD+C+A+KD +IHTKEC RE+QS++RRR  
Sbjct: 71  LFQLPLTQEAFVREQNEKWVDELLDGSLRLLDVCTAAKDALIHTKECAREIQSIMRRRRG 130

Query: 125 SWCCGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPS-SISIKDAETMALITLLFD 184
                +G  N+V KYLASRKVVK+A+ KAL +++T   K + S + KD  T+ALI +L +
Sbjct: 131 G---KSGFTNEVRKYLASRKVVKKAVCKALGTLRTSQKKSTFSSTNKDNVTVALIGVLRE 190

Query: 185 VEVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAA 244
           VE  S+ VFE+LL ++ G K  +K  GW+ VSKLM +K+    E+  + E NEFA VDAA
Sbjct: 191 VEAVSLTVFESLLSFISGAKSASKMRGWSFVSKLMLTKKVGCEED--KTEINEFADVDAA 250

Query: 245 VDTVASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIIN 302
           + ++  +  S+S ++     E++  +L +LE C QDLE GLE LFRRLI+NRVSLLN ++
Sbjct: 251 LSSLVCQETSNSDSMVDS--ENVQSELQQLEMCSQDLEEGLECLFRRLIKNRVSLLNTLS 304

BLAST of ClCG02G010920 vs. TrEMBL
Match: W9S9W8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014368 PE=4 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 2.6e-69
Identity = 150/300 (50.00%), Postives = 210/300 (70.00%), Query Frame = 1

Query: 4   SSNYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVNKLL 63
           +S++H RSNSLP++ +PL+ +C+E+L RL+A  +   ++S + ++L+GL+DL ECV KLL
Sbjct: 5   NSHFHVRSNSLPSQSNPLLLQCNENLSRLEARGATS-SSSFITQRLSGLEDLHECVEKLL 64

Query: 64  LLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSW 123
           LL  T+ AF   R+E+W+D++ DGSLRLLDLCS++KD V+HTKEC R++QS++RRR  + 
Sbjct: 65  LLPSTQQAFVQGRQEQWIDQLVDGSLRLLDLCSSAKDAVLHTKECARDIQSIMRRRRGAE 124

Query: 124 CCGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKP--SSISIKDAETMALITLLFDV 183
               G+ +++ KYLASRKVVKRAI KAL S+K+   K   S +S KD ET+ALI++L +V
Sbjct: 125 V---GLESEISKYLASRKVVKRAINKALGSLKSVETKSSFSDLSNKDGETIALISVLKEV 184

Query: 184 EVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAAV 243
           EV ++ VFE+LL ++ G K + K  GW+LVSK+M SKR    E   E + NEFA VDAA+
Sbjct: 185 EVVALAVFESLLSFISGPKAQTKLGGWSLVSKIMHSKRVGCEE---EQQVNEFAKVDAAM 244

Query: 244 DTVASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIINN 302
             + S             +E +  +L  LE CVQD E  LE LFRRLI+ RVSLLNI+NN
Sbjct: 245 HRLMSHKMQ---------IEKVQSELQILEMCVQDFEERLECLFRRLIKTRVSLLNILNN 288

BLAST of ClCG02G010920 vs. TrEMBL
Match: W9RW89_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014365 PE=4 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 2.5e-67
Identity = 146/298 (48.99%), Postives = 206/298 (69.13%), Query Frame = 1

Query: 6   NYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVNKLLLL 65
           ++H RSNSLP++ +PL+ +C+E+L RL+A  +   ++S + +KL+GL+DL ECV KLLL 
Sbjct: 7   HFHVRSNSLPSQSNPLLLQCNENLSRLEACGATS-SSSFITQKLSGLEDLHECVEKLLLF 66

Query: 66  RRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSWCC 125
             T+ AF   R+E+W+D++ DGSLRLLDLCS++KD V+HTKEC R++QS++ RR  +   
Sbjct: 67  PSTQQAFVQGRQEQWIDQLVDGSLRLLDLCSSAKDAVLHTKECARDIQSIMHRRRGAEV- 126

Query: 126 GNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKP--SSISIKDAETMALITLLFDVEV 185
             G+ +++ KYLASRKVVKRA+ KAL S+K+   K   S +S KD ET+ALI++L +VEV
Sbjct: 127 --GLESEISKYLASRKVVKRALNKALGSLKSVETKSSFSDLSNKDGETIALISVLKEVEV 186

Query: 186 ASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAAVDT 245
            ++ VFE+LL ++ G K + K  GW+LVSK+M SKR    E   E + NEFA VDAA+  
Sbjct: 187 VTLAVFESLLSFISGPKAQTKLGGWSLVSKIMHSKRVGCEE---EQQVNEFAKVDAAMHR 246

Query: 246 VASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIINN 302
           + S             +E +  +L  LE CVQD E  LE LFRRLI+ RVSLLNI+N+
Sbjct: 247 LMSHKMQ---------IEKVQSELQILEMCVQDFEERLECLFRRLIKTRVSLLNILND 288

BLAST of ClCG02G010920 vs. TAIR10
Match: AT2G17080.1 (AT2G17080.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 162.9 bits (411), Expect = 3.0e-40
Identity = 103/297 (34.68%), Postives = 166/297 (55.89%), Query Frame = 1

Query: 6   NYHARSNSLPTRPHPLVTECDEHLCRLKALD-SDPWTASGMAKKLAGLQDLQECVNKLLL 65
           ++H RSNS P+R HP     DE L RL++ + +   ++S + ++L  LQ+L E ++KL+ 
Sbjct: 4   SFHVRSNSFPSRSHPQAAHVDEQLARLRSSEQASSSSSSSICQRLDNLQELHESLDKLIS 63

Query: 66  LRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSWC 125
              T+ A + +  +K V+++ DGSLR+LDLC+ SKD +   KE + E+QS++RR+     
Sbjct: 64  RPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEGLMEIQSILRRKRGD-- 123

Query: 126 CGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFDVEVA 185
               ++ +V+KYL SRK +K++ QK   S+K      +     + +T+A+     + E  
Sbjct: 124 ----LSEEVKKYLTSRKSLKKSFQKVQKSLKV-----TQAEDNNDDTLAVFG---EAEAI 183

Query: 186 SVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAAVDTV 245
           ++++F++LL Y+ G K  +K   W++VSKLM  K+        E + NEF  VD      
Sbjct: 184 TLSLFDSLLSYMSGSKTCSK---WSVVSKLMNKKKVT-----CEAQENEFTKVD------ 243

Query: 246 ASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIINN 302
            S   S+ T           D +  LE+C+QDLE GLE L + LI+ RVS LNI+ +
Sbjct: 244 -SEFQSEKTL--------KMDDVQNLESCIQDLEDGLESLSKSLIKYRVSFLNILGH 263

BLAST of ClCG02G010920 vs. TAIR10
Match: AT4G35200.1 (AT4G35200.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 159.8 bits (403), Expect = 2.5e-39
Identity = 102/293 (34.81%), Postives = 158/293 (53.92%), Query Frame = 1

Query: 6   NYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVNKLLLL 65
           ++H RSNS P+R HP     DE L RL++ DS   ++S + ++L+ LQDL + + K++ L
Sbjct: 4   SFHVRSNSYPSRQHPQAAHVDEQLTRLRSSDSA--SSSSICQRLSNLQDLHDSLEKMIRL 63

Query: 66  RRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSWCC 125
             T  A +  + EK    + DGSLR+LDLC+ +KD +   KE + E+QS++RR+      
Sbjct: 64  SVTNLALSQDQIEK----LLDGSLRILDLCNIAKDAISQMKEGLMEIQSILRRKPGD--- 123

Query: 126 GNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFDVEVAS 185
              ++ +V+KYL SRK +K+++QK + S+K    K S        T A + +    E  +
Sbjct: 124 ---LSGEVKKYLVSRKFLKKSLQKVIKSLKVCQSKDS--------TNASLVVFGRAEAVT 183

Query: 186 VNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAAVDTVA 245
           + +FE+L  ++ G K   K   W+LVSK+M   +        E EANEF  +D+   +  
Sbjct: 184 MALFESLFSFMSGSKACGK---WSLVSKMMSQNKVT-----CEAEANEFTRIDSEFQSEK 243

Query: 246 SRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNI 299
           S    D               +  LE+C+QDLE G+E L + LI+ RVS+LNI
Sbjct: 244 SLQMED---------------VQNLESCIQDLEDGIESLSKSLIKYRVSILNI 253

BLAST of ClCG02G010920 vs. TAIR10
Match: AT4G35210.1 (AT4G35210.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 157.9 bits (398), Expect = 9.7e-39
Identity = 101/293 (34.47%), Postives = 157/293 (53.58%), Query Frame = 1

Query: 6   NYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVNKLLLL 65
           ++H RS+S P+R HP     DE L RL++  S   ++S + ++L+ LQDL + + K++ L
Sbjct: 4   SFHVRSSSYPSRQHPQAAHVDEQLTRLRS--SGTASSSSICQRLSNLQDLHDSLEKMIRL 63

Query: 66  RRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSWCC 125
             T  A +  + EK    + DGS+++LDLCS SKDG+   KE ++E+QS+VRR+      
Sbjct: 64  SVTNQALSQDQIEK----LLDGSIKILDLCSISKDGLSQMKESLKEIQSIVRRKRGD--- 123

Query: 126 GNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFDVEVAS 185
              ++ +V+KYLASRK +K++ +K L S+KT   K  ++++             + E  +
Sbjct: 124 ---LSAEVKKYLASRKFLKKSFEKVLKSLKTSQNKNDALAV-----------FGEAETVT 183

Query: 186 VNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAAVDTVA 245
           + +FE+L  ++ G K   K   W+LVSK+M   +        E EANEF  VD    +  
Sbjct: 184 IALFESLFSFMSGSKACGK---WSLVSKMMSQSKGT-----CEAEANEFTRVDMEFQSEK 243

Query: 246 SRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNI 299
           S    D               +  LE C+QDLE G+  L + LI+ RVS+LNI
Sbjct: 244 SLQMED---------------VQNLEICIQDLEDGIGSLSKSLIKYRVSILNI 250

BLAST of ClCG02G010920 vs. TAIR10
Match: AT2G17070.1 (AT2G17070.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 152.5 bits (384), Expect = 4.1e-37
Identity = 98/293 (33.45%), Postives = 160/293 (54.61%), Query Frame = 1

Query: 6   NYHARSNSLPTRPHPLVTECDEHLCRLKALD-SDPWTASGMAKKLAGLQDLQECVNKLLL 65
           ++H RS+S P+ PHP     DE L RL++ + +   ++S + ++L  LQ+L E ++KL+ 
Sbjct: 4   SFHVRSHSYPSIPHPQAAHVDEQLARLRSSEETSTSSSSSICQRLDNLQELHESLDKLIR 63

Query: 66  LRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSWC 125
           L  T+ A   ++ +K V+++ DGSL++LD+C+ SKD +   KE + E+QS++RR+     
Sbjct: 64  LPVTQQALGQEKNKKDVEQLLDGSLKILDVCNISKDALSQMKEGLMEIQSILRRKRGD-- 123

Query: 126 CGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFDVEVA 185
               ++ +V+KYLASRK  K+  QK   S+K    + +    KD      + +  + E  
Sbjct: 124 ----LSGEVKKYLASRKSFKKTFQKVQKSLKAAQAEDN----KDKS----LAVFGEAEAV 183

Query: 186 SVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAAVDTV 245
           ++ +F++L  Y+ G K  +K   W++VSKLM  K+        E + NEF  VD      
Sbjct: 184 TIAMFDSLFSYMSGSKTCSK---WSVVSKLMNKKKIT-----CEAQENEFTKVD------ 243

Query: 246 ASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLN 298
            S   S+ T           + +  LE+C+QD E GLE L + LI+ RVS+LN
Sbjct: 244 -SEFQSEKTL--------KMEDVQILESCIQDFEDGLESLSKSLIKYRVSILN 259

BLAST of ClCG02G010920 vs. TAIR10
Match: AT4G35710.1 (AT4G35710.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 112.1 bits (279), Expect = 6.1e-25
Identity = 85/295 (28.81%), Postives = 152/295 (51.53%), Query Frame = 1

Query: 10  RSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVNKLLLLRRTR 69
           RS SLP+R  P  +  +E L ++K +++   ++  +   LAGL++L   + + L +   +
Sbjct: 11  RSISLPSRSQPSTSGLEESLNKIKTINTTTGSSESILMGLAGLEELYIFLEEFLKMGSKQ 70

Query: 70  DAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS-SSWCCGNG 129
              ++   E +++E+ DGSLRL+D+CS S+D ++ T E VR +QS VRR+  S    G+ 
Sbjct: 71  RVMSSGGSE-FMEEMLDGSLRLMDICSVSRDLMVETHEHVRGVQSYVRRKKVSGGGGGDK 130

Query: 130 IANQVEKYLASRKVVKRAIQKALASVKTY--GVKPSSISIKDAETMALITLLFDVEVASV 189
           I   V  Y+  RK +++  +K L S+K    G +      +D + +A+I  +  V   SV
Sbjct: 131 IDVAVSDYVGFRKNMRKEAKKLLGSLKKVDGGTRSCDNDHEDEQLVAVIDRVRRVVSVSV 190

Query: 190 NVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAAVDTVAS 249
            V ++ L  +  +K   K      ++ +++ K+            +  A     ++T+ S
Sbjct: 191 VVLKSFLELLSRRKSNIKSK----LASVLKMKK------------DNHAPAKNVLETLDS 250

Query: 250 RLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIINN 302
            +  D  +      + + ++L ++E C+   E  LEGLFRRLIR R S+LNII++
Sbjct: 251 AIFGDFLS-----HDDLQNELEEVEMCIGGFERNLEGLFRRLIRTRASILNIISH 283

BLAST of ClCG02G010920 vs. NCBI nr
Match: gi|659094947|ref|XP_008448320.1| (PREDICTED: uncharacterized protein LOC103490547 [Cucumis melo])

HSP 1 Score: 563.1 bits (1450), Expect = 2.8e-157
Identity = 284/301 (94.35%), Postives = 290/301 (96.35%), Query Frame = 1

Query: 1   MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVN 60
           MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKA+DS PWTASGMA KLAGLQDLQECVN
Sbjct: 1   MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKAMDSAPWTASGMANKLAGLQDLQECVN 60

Query: 61  KLLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS 120
           KLLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS
Sbjct: 61  KLLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS 120

Query: 121 SSWCCGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFD 180
           SSWCCGNG+ANQVEKYLASRKVVKRAIQKALASVKTYG KPSSISIKD ET+ALITLLFD
Sbjct: 121 SSWCCGNGVANQVEKYLASRKVVKRAIQKALASVKTYGAKPSSISIKDTETIALITLLFD 180

Query: 181 VEVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAA 240
           VEVASVNVFEALL YVLGKKGKAKGSGWALVSKLMRSKR LLTE+G EGEANEFA VDAA
Sbjct: 181 VEVASVNVFEALLSYVLGKKGKAKGSGWALVSKLMRSKRVLLTEDGAEGEANEFATVDAA 240

Query: 241 VDTVASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIIN 300
           VD VASRLASDST +G GG+ESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIIN
Sbjct: 241 VDVVASRLASDSTIIGGGGIESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIIN 300

Query: 301 N 302
           N
Sbjct: 301 N 301

BLAST of ClCG02G010920 vs. NCBI nr
Match: gi|449444560|ref|XP_004140042.1| (PREDICTED: uncharacterized protein LOC101209279 [Cucumis sativus])

HSP 1 Score: 552.7 bits (1423), Expect = 3.8e-154
Identity = 280/302 (92.72%), Postives = 288/302 (95.36%), Query Frame = 1

Query: 1   MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVN 60
           MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKA+DS PWTA GMA KLAGLQDLQECVN
Sbjct: 1   MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKAMDSAPWTAFGMANKLAGLQDLQECVN 60

Query: 61  KLLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS 120
           KLLLLRRTRDAFAA RREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS
Sbjct: 61  KLLLLRRTRDAFAAHRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRS 120

Query: 121 SSWCCGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFD 180
           SSWCCGNG+ANQVEKYLASRKVVKRAIQKALASVKTYG +PSSISIKD ET+ALITLLFD
Sbjct: 121 SSWCCGNGVANQVEKYLASRKVVKRAIQKALASVKTYGARPSSISIKDTETIALITLLFD 180

Query: 181 VEVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVDAA 240
           VEVASVNVFEALL YVLGKKGKAKGSGWALVSKLMRSKR LLTE+G EGEANEFA +DAA
Sbjct: 181 VEVASVNVFEALLSYVLGKKGKAKGSGWALVSKLMRSKRVLLTEDGAEGEANEFATIDAA 240

Query: 241 VDTVASRLASD-STAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNII 300
           VD VASRLASD ST +G GG+ESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNII
Sbjct: 241 VDVVASRLASDSSTIIGGGGIESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNII 300

Query: 301 NN 302
           NN
Sbjct: 301 NN 302

BLAST of ClCG02G010920 vs. NCBI nr
Match: gi|703125547|ref|XP_010103332.1| (hypothetical protein L484_014372 [Morus notabilis])

HSP 1 Score: 282.7 bits (722), Expect = 7.4e-73
Identity = 163/303 (53.80%), Postives = 210/303 (69.31%), Query Frame = 1

Query: 5   SNYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASGMAKKLAGLQDLQECVNKLLL 64
           S YH RSNS PTRPHPL   CDEHLCRL A  S+  ++S ++ KL+GL+DL +CV KL  
Sbjct: 6   SQYHVRSNSFPTRPHPLFQRCDEHLCRLGA--SEATSSSSLSHKLSGLEDLHDCVEKLFQ 65

Query: 65  LRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSSSWC 124
           L  T+ AF   R EKWVDE  DGSLRLLD+CSA+KD V+HTKEC RE+QS++RRR  +  
Sbjct: 66  LPLTQQAFVHSRHEKWVDEQVDGSLRLLDMCSAAKDAVLHTKECAREVQSIMRRRRGAEV 125

Query: 125 CGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSIS--IKD--AETMALITLLFD 184
              G+A +V KYLASRKVVK+AI+KAL ++K   VK SS S   KD   ET AL+ +L +
Sbjct: 126 ---GLAGEVTKYLASRKVVKKAIRKALENMKA-AVKSSSSSPTNKDMTVETAALVGVLRE 185

Query: 185 VEVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRAL-LTEEGVEGEANEFAAVDA 244
           VE  S+ VFE+LLC++ G K + K SGW+LVSKLM +K+ +   +E  E + NEFA V+ 
Sbjct: 186 VESVSLAVFESLLCFISGPKAQTKLSGWSLVSKLMNNKKKVGCDQEAQETDVNEFAKVEE 245

Query: 245 AVDTVASRLASDSTAV-GSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNI 302
           A+  +   +  D++ V  S  +E+   +L  LE CVQD E  LE LFRRLI+NRVSLLNI
Sbjct: 246 ALQRL---MCHDTSKVENSLHIENAQTELQSLELCVQDFEERLERLFRRLIKNRVSLLNI 299

BLAST of ClCG02G010920 vs. NCBI nr
Match: gi|1009150627|ref|XP_015893119.1| (PREDICTED: uncharacterized protein LOC107427255 [Ziziphus jujuba])

HSP 1 Score: 282.7 bits (722), Expect = 7.4e-73
Identity = 160/301 (53.16%), Postives = 210/301 (69.77%), Query Frame = 1

Query: 5   SNYHARSNSLPTRPHPLVTECDEHLCRLKALDSDPWTASG---MAKKLAGLQDLQECVNK 64
           S YH RSNSLPTRPHPL+  C+ HLCRL A D    T+S    ++ KL+GL+DL +CV+K
Sbjct: 11  SQYHVRSNSLPTRPHPLIQLCNTHLCRLGASDGTSSTSSSSSSISHKLSGLEDLHDCVDK 70

Query: 65  LLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRRSS 124
           LLLL  T+ AF   R EKWVDE+ DGSLRLLD CSA+KD V+HTKEC RE+QS++R R  
Sbjct: 71  LLLLPLTQKAFVQGRNEKWVDELLDGSLRLLDACSAAKDAVLHTKECAREVQSIIRTRRG 130

Query: 125 SWCCGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPSSISIKDAETMALITLLFDV 184
                 G+ ++V KYLASRKV+K+A+ KAL ++K    K  S+S KD ET+AL+ +L +V
Sbjct: 131 G---EGGLVSEVRKYLASRKVMKKAVSKALGNLKGVRTK-CSVSSKDNETVALVGVLREV 190

Query: 185 EVASVNVFEALLCYVLGKKGKAKGSGWALVSKL-MRSKRALLTEEGVEGEANEFAAVDAA 244
           E  ++ VFE+LL +  G K  +K SGW+LVS L M SK+    EE  E +ANEFA VDAA
Sbjct: 191 EAVTLAVFESLLSFTSGTKAGSKLSGWSLVSNLVMNSKKVGCDEE--ETDANEFAKVDAA 250

Query: 245 VDTVASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNIIN 302
           + ++     ++ +   +  VE++  +L KLE CVQDLEG LE +FRRLI+ RVSLLNI+N
Sbjct: 251 LQSLLC-YDTEKSEQYNPQVENVRSELQKLELCVQDLEGRLEYIFRRLIKIRVSLLNILN 304

BLAST of ClCG02G010920 vs. NCBI nr
Match: gi|657980968|ref|XP_008382492.1| (PREDICTED: uncharacterized protein LOC103445280 [Malus domestica])

HSP 1 Score: 275.0 bits (702), Expect = 1.5e-70
Identity = 151/303 (49.83%), Postives = 213/303 (70.30%), Query Frame = 1

Query: 1   MGGSSNYHARSNSLPTRPHPLVTECDEHLCRLKALD-SDPWTASGMAKKLAGLQDLQECV 60
           +   SNYH RS SLP++PHPL  +C++HL R+ A D S  +++S ++ +L+ L DL  CV
Sbjct: 7   LNSKSNYHIRSISLPSKPHPLFQQCEDHLLRIAASDVSSSFSSSSISHRLSSLLDLHNCV 66

Query: 61  NKLLLLRRTRDAFAAQRREKWVDEVFDGSLRLLDLCSASKDGVIHTKECVRELQSLVRRR 120
           N+L  L  T++AF  +R EKWVDE+ DGSLRLLD+C+A+KD +IHTKECVRE+QS++RRR
Sbjct: 67  NELFQLPLTQEAFVRERNEKWVDELLDGSLRLLDVCTAAKDALIHTKECVREIQSIMRRR 126

Query: 121 SSSWCCGNGIANQVEKYLASRKVVKRAIQKALASVKTYGVKPS-SISIKDAETMALITLL 180
                  +G  N+V KY ASRKVVK+AI KAL ++++   K + S + KD   +ALI  L
Sbjct: 127 RGGI---SGFTNEVRKYSASRKVVKKAICKALGTLRSSQKKGTFSSTNKDNVAVALIGAL 186

Query: 181 FDVEVASVNVFEALLCYVLGKKGKAKGSGWALVSKLMRSKRALLTEEGVEGEANEFAAVD 240
            +VE  ++ VFE+LL ++ G K ++K SGW+ VSKLM +K+    EE  + + NEFA VD
Sbjct: 187 REVEAVTLTVFESLLSFISGAKSQSKMSGWSFVSKLMLTKKVACDEED-KADLNEFADVD 246

Query: 241 AAVDTVASRLASDSTAVGSGGVESMGDQLGKLEACVQDLEGGLEGLFRRLIRNRVSLLNI 300
           AA+++++ + AS S  V     E++  +L +LE C QDLE GLEGLFRRLI+NRVSLLN 
Sbjct: 247 AALNSLSCQEASKSDNVVDS--ENVQSELQQLELCSQDLEEGLEGLFRRLIKNRVSLLNT 303

Query: 301 INN 302
           ++N
Sbjct: 307 LSN 303

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0KG43_CUCSA2.7e-15492.72Uncharacterized protein OS=Cucumis sativus GN=Csa_6G127330 PE=4 SV=1[more]
W9RKX3_9ROSA5.2e-7353.80Uncharacterized protein OS=Morus notabilis GN=L484_014372 PE=4 SV=1[more]
M5VME9_PRUPE7.0e-7049.50Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025877mg PE=4 SV=1[more]
W9S9W8_9ROSA2.6e-6950.00Uncharacterized protein OS=Morus notabilis GN=L484_014368 PE=4 SV=1[more]
W9RW89_9ROSA2.5e-6748.99Uncharacterized protein OS=Morus notabilis GN=L484_014365 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G17080.13.0e-4034.68 Arabidopsis protein of unknown function (DUF241)[more]
AT4G35200.12.5e-3934.81 Arabidopsis protein of unknown function (DUF241)[more]
AT4G35210.19.7e-3934.47 Arabidopsis protein of unknown function (DUF241)[more]
AT2G17070.14.1e-3733.45 Arabidopsis protein of unknown function (DUF241)[more]
AT4G35710.16.1e-2528.81 Arabidopsis protein of unknown function (DUF241)[more]
Match NameE-valueIdentityDescription
gi|659094947|ref|XP_008448320.1|2.8e-15794.35PREDICTED: uncharacterized protein LOC103490547 [Cucumis melo][more]
gi|449444560|ref|XP_004140042.1|3.8e-15492.72PREDICTED: uncharacterized protein LOC101209279 [Cucumis sativus][more]
gi|703125547|ref|XP_010103332.1|7.4e-7353.80hypothetical protein L484_014372 [Morus notabilis][more]
gi|1009150627|ref|XP_015893119.1|7.4e-7353.16PREDICTED: uncharacterized protein LOC107427255 [Ziziphus jujuba][more]
gi|657980968|ref|XP_008382492.1|1.5e-7049.83PREDICTED: uncharacterized protein LOC103445280 [Malus domestica][more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006413 translational initiation
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0003743 translation initiation factor activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG02G010920.1ClCG02G010920.1mRNA


The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
ClCG02G010920Silver-seed gourdcarwcgB0195
ClCG02G010920Cucurbita pepo (Zucchini)cpewcgB235