CmaCh02G005660 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh02G005660
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: protein root UVB sensitive 4
Location: Cma_Chr02: 3214259 .. 3219163 (-)
RNA-Seq Expression: CmaCh02G005660
Synteny: CmaCh02G005660
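The Location field gives a chromosome, an inclusive start and end coordinate, and a strand. As a minimal sketch (the function name `parse_location` and the regular expression are illustrative assumptions, not part of the database's tooling), the value can be parsed and the genomic span computed like this:

```python
import re

# Hypothetical helper: parses a location value such as
# "Cma_Chr02: 3214259 .. 3219163 (-)" into its components.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text: str):
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    # Coordinates are assumed to be 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("Cma_Chr02: 3214259 .. 3219163 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 4905 bp on the minus strand of Cma_Chr02.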
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAACCCTTGTTTCTTGGCGGGAGGTAATCATCATCGACCTCATTGTTGGACCCTAGTTTCCACTTCGCACTTCGATTGTATAAGCACTTCACAATTTCTGGAAGGGGAATCCGCTGCTTGGAGCGGACCATATCACGCCCTTTTGCGTCTAACGCCATCTCCCTCTCTCTCCCCCGCAGCGACAAGCCCAAAATTTCACTCCCTCATGCAAAGCAGCTCCTTCAACCCAATCTCGAATTCCCTTCAATTTCAACGCCCATGGATTTTCCCCGAGACCCACTTCACCCTTGGTATTAGGGTTCCAAATAAGCCTAGATTCCGCCTTACACCTCGAATTGTTATCAGAACTTCAAGAACTCGGTACAGAGCCGATGAAGGCCTTGATGATACACCCGGTCCCAGCACCCCGGTGCGCTTCCCCGTCGTCCTTCTCCGGTCCGGTAGGGTCTCTCAGTACGTGTGGGATGGGTTTTCTCTACAGTTGGTCGGCGTTGATGGTGGTGCTTCATCTGTGTCTTTTGATTTCGTTGATGGGTTTCGAACGATGTACAGGGCATGTGGTTTGGCTGTTAAGGGTTTTTTTATTCCCAAGAATGTTAGTGAGCATTATGTACTTTATGTCAAATGGAAGTTCTTGCATCGGGTTTTCAGTTCCGCTCTGCAGGTTATTGCTACTCAGGTCAGTGGATGAAACTTAATTGTTTAATTTATTCGAGTTTTCTAGTTGCGAGAAGAAATTAGTCTGATAGAGTTTAAGGATTAGACTCATAGTTTTATTTAACTATAGACCATTTGATGTTTGTTCATATTCTTCTTAGTTTATTTCTTATAGTTGGTTTCTGTTGGGACAACATTTTAGCTTGAGCACATCCATATAGCAGAACTTTGAAGTGTTTTTTGGTGTGTGGATTGAATTGTTTACCTGATGAAAGTATTTGGAGTGTTAATTTTTATGGCTCTGCACTATAGTTGTTAGCTCTGATTTTATTGATGTCCACCTAATGGTTTAATGATAAGTATTCTCTTCAAATTATAAGGAAAATTCTTATTGATAGGCAATGTTTCGGGCTATAGGAGTTGGGCGCACTCGTTCGCTAGCATCTGCAGCAGCTCTGAATTGGGTCCTGAAGGATGGACTTGGACGGTTAAGTCGGTGTATTTACACTGCTACGATAGCATCTTCATTCGATACAAATTTAAAGGTTTGCTTCTGTTTGTTAATTTCTTATGAACAAGGAGTTTTATTTGAAGGTCATCCTTGAAAGTGCAGTGTTTGTGTTTCCATTGTATCTTATTGTTTGATGAACAATTCTTTTTTTCTTTTTAATGTTTCTCGTTTCAAGAGAGTCAGGTTTTCGACAGCGATTCTATTCAGTTTAAGCATTGGAGTTGAGTTGCTGACTCCAGCATTTCCTCAGTACTTCTTGCTTCTTGCATCGATTGCAAACATCGTGAAACAAATAAGTCTTGGATGCTACTTATCAACGGCTGTAAGGGTTCACAATTTTATCTAGTCCAATTTTTGTGCATTATATGTTTGACATACTTTCTTCACGAAAAATCGATCACCTCTTTATGATAATAACCAATATTTCTTTCTGTTGGTAAGAGGTGTTTGTTGGATTACTCTGTGTTGTTCGTTTGTCGAGAAACGTTACTCCAAGTTGTTGATCACTTAAGGTTTGGAATGACTTTCAAAGTGCTTAGAAAAAGGTTTTTAAGACTCAATTTTCCAAATATCCATTTGGTTCATTTATGAAGTTTCTGGAAGACTCAGTTTCAAAAGCTCTAATTATGCTGGTTAAGCATTCCTAAAAGTTTCATTTCAATAGATATTATTCTTTTTGCTTCACGACTTCGGATTTTATTAGTGTCGTCTTATAGTTCTCACTCTTTGTAATCAGTCTGCTGTTCATAGAAGTTTTGCAGTGACAGATAACCTTGGTGAAGTTTCTGCCAAGGCACAGGTTGGCTATAATCCTTCCTGAAACAGATCACAA
TTTTTCGGTATCCTTTACGTTCATTCAACTATCCATTTTGTGCAGATTCAATCAGTGTGCTTTGATAATCTTGGTCTCATACTCGCCGTCCTTCTGAACTTCTTGTCCAAGAATGATCAAAGGTTTTTTTTCCCCTGATTGTTCTTTAGACTAACGTTTGTTCTTAAATAAGTTACAAATACTCAGAGATTATCAATTCATGTCTGTTTTATCATGATGTACATGTTTTCTCGTTTTTGGAATTTATCATTTCAAAGGTTTTCTAACTCATGATTTTTTTTTTTATGTTAGATTGCAAGCAGCTTTACCTTTTGTAGTTTACCCCATTTTTGCTGCAATGGATCTTTTCGGAACATATCAAGGGCTTAAACATGTCCATTTGCAAACACTAACGAAGGTAGGTTTTGTGTTATTCAACTTTCCACAATATTTGGAGGCTAGGTATGTAACGGCCCAAGCCCTCTTTAGCAAATATTGTCATCTTTGGGTTTTTCTTTTCGGGCTTTCTCTCAAGGTTTTTAAAACGCGTATGCTAGGGAGAGGTTTCCACACTCTCATAAAGAATATTTGTTCTCCTCTCCAACTGATGTGGGATCTCACAATCCATCCCCCTTCGAGGTCCAGCGTCCTCGCTGGCACTCGTTCCCTTTGGTCCAATCGATGTGGGACCCCCAATTCACCCCCTTCAGGGCCCAACATCCTTGCTGGCACACCACCTCATGTCCACCCTCCTTCGAGGCTCAGCCTCATCGCTGACACATCGTCCGGTGTCTGGCTTTGATACTATTTGTAACGACCCAAGCCCACTGCTAGTAGATATTGTCCTCTTTGAACTTTCCTTTTCGGACTTCCCCTCAAGATTTTTAAAAAGTGCCTACTAGGGAGAGGTTTTCACACCCTTATAAAGAATGTTTCGTTCTCCTCCCCAACCGATGTGGGATCTCACAAGTTGAGTGTTCGAATTTGCTGTTTTGTGCTTACTTGATGGAGCAAAATTGTCTTTTATAAGATGTTTGCTTAATGGAACTAATGATTTATTTATTCTCTCTTCACTACTTGAGTTCGGAATGACTTGAGTTTTGATAGAGTGCTTTTTAGGTGAAACACTTTCTTCCTATGTTGTGATGCTATGAAAAAAAAAAAAAAAAAACAAACACTTTTTACTATTCCAAAAGTCATAGTAAACTCAATTGAAGTGAATTTTTCATCTCAACCGTTAATTTCTTCATGTTCAGGATCGACTTGAGATCATATTATGCACTTGGATCGAGCAAGGATACGTGCCTACACCAGCCGAAGTAAGTGAGATGGAAGGAATTGATTTACTATGCAGAAAGGGTGAGAGTCTTGTCTATGTATTGATTCCTCTATACATAGAAGGTTAACCAATCGTAGACTTATGTTGAACTACAGTTAATGAATATAACTTCAATATTCAGGTAAGGTTTCATGGCCAATCAGAATAGGTTGCTTGAATCTGGAGTCTCAAATACCAAAGTTGTCAATGCTTGCAATGCGATCTGTATGTAACAAGGACTATTATTTTATATGCATGGATGTCTTCTGCCGGGGCTTAACAACAAATACAGTGAGTAGATATACTTCGATTAACCTTTACGAGCTGTCCATATTTCTTTTGCTCACACTTTTCTTTTTCGAAATCGTAGCACGGTATTCTTCTTTGTCTCCGTGAAGGGGCTCGTGCAGCAGATATCAGCATGGGGTTATTGCAGGTAGTAGAAAATTGCTTAAATTCATTTTCATCTGTTTGGTTTGCGCTTGATTTGAATATGATATCTATTGTTCGAAAAGTTTCAGTTGGGTCGGGTGATTTAGTGTAACAACCTAAACCCACAGCGAGCAAATATTGTTTTCTTTGGACTTCCCTTCAAGGTTTTTAAAATGCGTCTTCTAGAGAGAGTTTCTATCGTTTGTAACAGTCCAAGTCCATCGCTAGCAGATATTGTCCTCTTTGAGTTTTCCCTTTCAAGTTTCCACCA
AGATTTTTAAAATGCGACTATTAGGGAGAGGTTTCCACACCTTTGTAAAGAATGTTTTGTTTCCCTCTCCAACCGATACTCAGTGTCCTCGCTGGCATTCGTTCCCCTTTTCAATCGAGGTAAGATCTTACATTTAGTTAAATCTGTCAAACGACCCTTGATTTACCAAGTGTTTCATAATTTTTTTATTTTGTTTTTCTTTCAATAACATATTTTTTTTTCAAATGCATTAAGGAAACATGGTAAAAGTCATGTAAAAGACAACGGTCAACTTTTGGTGAGGTTAATGATGTTTAGAGATATAACAAAACCAGAAAGACCAAAGCGAAACATTTAAAATCAGAAGGATGAAAGCATTAAGGAAATCTCTTCGAAACGGGATGTCACAACATTCTTTTAGTCCTATGCTTTATATTCAGCCGTTCTTTGATGATTTTGAAAGTCTGATGTAACGAGTCTTGACTTGAATAGGCATGCTTCATCCGCAAAGTGATTGTATCGAACACGAGCGTTTGGGACAAAGAAATCATGAAAGGTATTAACTTTTCAGATGCAATGGCGAAGGAGTGGGTTGGTTTGGTTGGGGATAGCAAGAAATATGCAGAAGAAAATGGTTGTAATTTGCTTAAACAAATGTCAAGCTTAGGATGGGCTGTCAAGAACGTTCTGCTGAGTACAAATGAGCAAATACGATACAGCTTTGTTGATGACTGAGGGAGCTTGTTTCGAAGGACCAAGATGCTGATTAACCCAAACTTATTACTCATCATTTTTTTCTTTTACCCTTATAAGGATACTGAACTCTGGGGATAAATTTAGAAATTGTTAATTATGTATTTTTTATATATAAAAAAAACTATAATTGTTTACACCCAATAAAATTATGGAAAAATATTAATTTCTAT

mRNA sequence

AAACCCTTGTTTCTTGGCGGGAGGTAATCATCATCGACCTCATTGTTGGACCCTAGTTTCCACTTCGCACTTCGATTGTATAAGCACTTCACAATTTCTGGAAGGGGAATCCGCTGCTTGGAGCGGACCATATCACGCCCTTTTGCGTCTAACGCCATCTCCCTCTCTCTCCCCCGCAGCGACAAGCCCAAAATTTCACTCCCTCATGCAAAGCAGCTCCTTCAACCCAATCTCGAATTCCCTTCAATTTCAACGCCCATGGATTTTCCCCGAGACCCACTTCACCCTTGGTATTAGGGTTCCAAATAAGCCTAGATTCCGCCTTACACCTCGAATTGTTATCAGAACTTCAAGAACTCGGTACAGAGCCGATGAAGGCCTTGATGATACACCCGGTCCCAGCACCCCGGTGCGCTTCCCCGTCGTCCTTCTCCGGTCCGGTAGGGTCTCTCAGTACGTGTGGGATGGGTTTTCTCTACAGTTGGTCGGCGTTGATGGTGGTGCTTCATCTGTGTCTTTTGATTTCGTTGATGGGTTTCGAACGATGTACAGGGCATGTGGTTTGGCTGTTAAGGGTTTTTTTATTCCCAAGAATGTTAGTGAGCATTATGTACTTTATGTCAAATGGAAGTTCTTGCATCGGGTTTTCAGTTCCGCTCTGCAGGTTATTGCTACTCAGGCAATGTTTCGGGCTATAGGAGTTGGGCGCACTCGTTCGCTAGCATCTGCAGCAGCTCTGAATTGGGTCCTGAAGGATGGACTTGGACGGTTAAGTCGGTGTATTTACACTGCTACGATAGCATCTTCATTCGATACAAATTTAAAGAGAGTCAGGTTTTCGACAGCGATTCTATTCAGTTTAAGCATTGGAGTTGAGTTGCTGACTCCAGCATTTCCTCAGTACTTCTTGCTTCTTGCATCGATTGCAAACATCGTGAAACAAATAAGTCTTGGATGCTACTTATCAACGGCTTCTGCTGTTCATAGAAGTTTTGCAGTGACAGATAACCTTGGTGAAGTTTCTGCCAAGGCACAGATTCAATCAGTGTGCTTTGATAATCTTGGTCTCATACTCGCCGTCCTTCTGAACTTCTTGTCCAAGAATGATCAAAGATTGCAAGCAGCTTTACCTTTTGTAGTTTACCCCATTTTTGCTGCAATGGATCTTTTCGGAACATATCAAGGGCTTAAACATGATCGACTTGAGATCATATTATGCACTTGGATCGAGCAAGGATACGTGCCTACACCAGCCGAAGTAAGTGAGATGGAAGGAATTGATTTACTATGCAGAAAGGGTAAGGTTTCATGGCCAATCAGAATAGGTTGCTTGAATCTGGAGTCTCAAATACCAAAGTTGTCAATGCTTGCAATGCGATCTGTATGTAACAAGGACTATTATTTTATATGCATGGATGTCTTCTGCCGGGGCTTAACAACAAATACACACGGTATTCTTCTTTGTCTCCGTGAAGGGGCTCGTGCAGCAGATATCAGCATGGGGTTATTGCAGGCATGCTTCATCCGCAAAGTGATTGTATCGAACACGAGCGTTTGGGACAAAGAAATCATGAAAGGTATTAACTTTTCAGATGCAATGGCGAAGGAGTGGGTTGGTTTGGTTGGGGATAGCAAGAAATATGCAGAAGAAAATGGTTGTAATTTGCTTAAACAAATGTCAAGCTTAGGATGGGCTGTCAAGAACGTTCTGCTGAGTACAAATGAGCAAATACGATACAGCTTTGTTGATGACTGAGGGAGCTTGTTTCGAAGGACCAAGATGCTGATTAACCCAAACTTATTACTCATCATTTTTTTCTTTTACCCTTATAAGGATACTGAACTCTGGGGATAAATTTAGAAATTGTTAATTATGTATTTTTTATATATAAAAAAAACTATAATTGTTTACACCCAATAAAATTATGGAAAAATATTAATTTCTAT

Coding sequence (CDS)

ATGCAAAGCAGCTCCTTCAACCCAATCTCGAATTCCCTTCAATTTCAACGCCCATGGATTTTCCCCGAGACCCACTTCACCCTTGGTATTAGGGTTCCAAATAAGCCTAGATTCCGCCTTACACCTCGAATTGTTATCAGAACTTCAAGAACTCGGTACAGAGCCGATGAAGGCCTTGATGATACACCCGGTCCCAGCACCCCGGTGCGCTTCCCCGTCGTCCTTCTCCGGTCCGGTAGGGTCTCTCAGTACGTGTGGGATGGGTTTTCTCTACAGTTGGTCGGCGTTGATGGTGGTGCTTCATCTGTGTCTTTTGATTTCGTTGATGGGTTTCGAACGATGTACAGGGCATGTGGTTTGGCTGTTAAGGGTTTTTTTATTCCCAAGAATGTTAGTGAGCATTATGTACTTTATGTCAAATGGAAGTTCTTGCATCGGGTTTTCAGTTCCGCTCTGCAGGTTATTGCTACTCAGGCAATGTTTCGGGCTATAGGAGTTGGGCGCACTCGTTCGCTAGCATCTGCAGCAGCTCTGAATTGGGTCCTGAAGGATGGACTTGGACGGTTAAGTCGGTGTATTTACACTGCTACGATAGCATCTTCATTCGATACAAATTTAAAGAGAGTCAGGTTTTCGACAGCGATTCTATTCAGTTTAAGCATTGGAGTTGAGTTGCTGACTCCAGCATTTCCTCAGTACTTCTTGCTTCTTGCATCGATTGCAAACATCGTGAAACAAATAAGTCTTGGATGCTACTTATCAACGGCTTCTGCTGTTCATAGAAGTTTTGCAGTGACAGATAACCTTGGTGAAGTTTCTGCCAAGGCACAGATTCAATCAGTGTGCTTTGATAATCTTGGTCTCATACTCGCCGTCCTTCTGAACTTCTTGTCCAAGAATGATCAAAGATTGCAAGCAGCTTTACCTTTTGTAGTTTACCCCATTTTTGCTGCAATGGATCTTTTCGGAACATATCAAGGGCTTAAACATGATCGACTTGAGATCATATTATGCACTTGGATCGAGCAAGGATACGTGCCTACACCAGCCGAAGTAAGTGAGATGGAAGGAATTGATTTACTATGCAGAAAGGGTAAGGTTTCATGGCCAATCAGAATAGGTTGCTTGAATCTGGAGTCTCAAATACCAAAGTTGTCAATGCTTGCAATGCGATCTGTATGTAACAAGGACTATTATTTTATATGCATGGATGTCTTCTGCCGGGGCTTAACAACAAATACACACGGTATTCTTCTTTGTCTCCGTGAAGGGGCTCGTGCAGCAGATATCAGCATGGGGTTATTGCAGGCATGCTTCATCCGCAAAGTGATTGTATCGAACACGAGCGTTTGGGACAAAGAAATCATGAAAGGTATTAACTTTTCAGATGCAATGGCGAAGGAGTGGGTTGGTTTGGTTGGGGATAGCAAGAAATATGCAGAAGAAAATGGTTGTAATTTGCTTAAACAAATGTCAAGCTTAGGATGGGCTGTCAAGAACGTTCTGCTGAGTACAAATGAGCAAATACGATACAGCTTTGTTGATGACTGA

Protein sequence

MQSSSFNPISNSLQFQRPWIFPETHFTLGIRVPNKPRFRLTPRIVIRTSRTRYRADEGLDDTPGPSTPVRFPVVLLRSGRVSQYVWDGFSLQLVGVDGGASSVSFDFVDGFRTMYRACGLAVKGFFIPKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAIGVGRTRSLASAAALNWVLKDGLGRLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVELLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFDNLGLILAVLLNFLSKNDQRLQAALPFVVYPIFAAMDLFGTYQGLKHDRLEIILCTWIEQGYVPTPAEVSEMEGIDLLCRKGKVSWPIRIGCLNLESQIPKLSMLAMRSVCNKDYYFICMDVFCRGLTTNTHGILLCLREGARAADISMGLLQACFIRKVIVSNTSVWDKEIMKGINFSDAMAKEWVGLVGDSKKYAEENGCNLLKQMSSLGWAVKNVLLSTNEQIRYSFVDD
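The protein sequence above corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and this is not the pipeline that produced the record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# at the first, second and third position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First twelve codons of the CDS listed above.
print(translate("ATGCAAAGCAGCTCCTTCAACCCAATCTCGAATTCC"))
```

Applied to the start of the CDS, this yields MQSSSFNPISNS, which matches the first twelve residues of the protein sequence.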
Homology
BLAST of CmaCh02G005660 vs. ExPASy Swiss-Prot
Match: Q67YT8 (Protein root UVB sensitive 4 OS=Arabidopsis thaliana OX=3702 GN=RUS4 PE=2 SV=1)

HSP 1 Score: 528.5 bits (1360), Expect = 8.6e-149
Identity = 286/482 (59.34%), Positives = 352/482 (73.03%), Query Frame = 0

Query: 46  IRTSRTRYRADEGLDDTPGPSTPV-RFPVVLLRSGRVSQYVWDGFSLQLVGVD---GGAS 105
           +RTS    +     +D   PS    R P+++ +SG+VS+Y   G SL+L+ VD     ++
Sbjct: 39  LRTSIDYKQEGASKEDLVVPSNVARRLPIIIKKSGKVSRYFIKGDSLELLCVDEEEDDST 98

Query: 106 SVSFDFVDGFRTMYRACGLAVKGFFIPKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMF 165
           S      DGF  + R    A K FF+PK VS++Y+ YVKWKFLHRVFSSALQV+ATQAMF
Sbjct: 99  SFCLGLDDGFWKLIRLTSSAAKDFFLPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMF 158

Query: 166 RAIGVGRTRSLASAAALNWVLKDGLGRLSRCIYTATIASSFDTNLKRVRFSTAILFSLSI 225
           RAIG+G++RSLAS+AA NW+LKDGLGRLSRCIYTA++AS+FDTNLKRVRFST++LFSLSI
Sbjct: 159 RAIGIGQSRSLASSAAFNWILKDGLGRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSI 218

Query: 226 GVELLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSV 285
           GVEL+TP FPQYFLLLASIANI KQISL CYL+T SAVHRSFAV DNLGEVSAKAQIQ+V
Sbjct: 219 GVELMTPVFPQYFLLLASIANIAKQISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTV 278

Query: 286 CFDNLGLILAVLLNFLSKNDQRLQAALPFVVYPIFAAMDLFGTYQGLKH--------DRL 345
           CFDNLGL+LAVLLN L +++QRLQA LPFV+YPIF+  DL G YQGLKH        DRL
Sbjct: 279 CFDNLGLLLAVLLNMLFQHNQRLQACLPFVLYPIFSTFDLLGIYQGLKHINLQTLTKDRL 338

Query: 346 EIILCTWIEQGYVPTPAEVSEMEGIDLLCRKG-KVSWPIRIGCLNLESQIPKLSMLAMRS 405
           EIIL  WIE   VP+PAEVSE EGI LL  +G K  WPIRIGCL+ ++QIP LSM+AM+S
Sbjct: 339 EIILERWIEFRQVPSPAEVSEEEGIGLLGSRGSKRVWPIRIGCLDPKAQIPTLSMMAMQS 398

Query: 406 VCNKDYYFICMDVFCRGL-TTNTHGILLCLREGARAADISMGLLQACFIRKVIVSNTSVW 465
           +C+ D YFI M++  +G       GI++CLREGA + D+   LLQ C+IRK + +N    
Sbjct: 399 LCSDDGYFITMELSSQGFRRIPKSGIVICLREGANSVDVITSLLQTCYIRKSLGAN---- 458

Query: 466 DKEIMKGINFSDAMAKEWVGLVGDSKKYAEENGCNLLKQMSSLGWAVKNVLLSTNEQIRY 514
            +     ++FSD   ++W  L  +SK+ A ++   L KQM   GW VKNVLLS  EQIRY
Sbjct: 459 -RTKRSYLSFSDLTLQDWTLLTRESKRAARDDNIALNKQMQEQGWIVKNVLLSAEEQIRY 515
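The percentages in an HSP header are the match counts divided by the alignment length. A small sketch of that arithmetic, using the counts reported for the Q67YT8 hit (the helper names `hsp_fractions` and `percent` are illustrative assumptions):

```python
import re

def hsp_fractions(header: str):
    """Return every 'matches/length' pair in an HSP header line as integers."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)/(\d+)", header)]

def percent(matches: int, alignment_length: int) -> float:
    return round(100.0 * matches / alignment_length, 2)

header = "Identity = 286/482 (59.34%), Positives = 352/482 (73.03%), Query Frame = 0"
(identical, length), (positive, _) = hsp_fractions(header)
print(percent(identical, length), percent(positive, length))
```

Here 286 of 482 aligned positions are identical (59.34%) and 352 of 482 are identical or conservative substitutions (73.03%).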

BLAST of CmaCh02G005660 vs. ExPASy Swiss-Prot
Match: Q93YU2 (Protein root UVB sensitive 6 OS=Arabidopsis thaliana OX=3702 GN=RUS6 PE=2 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 1.4e-26
Identity = 83/262 (31.68%), Positives = 136/262 (51.91%), Query Frame = 0

Query: 105 FDFVDGFRTMYRACGLAVKGFFIPKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAI 164
           FD V  F   Y    +  +GF  P +V+E YV Y+ W+ L   F  A+ V  TQ +  ++
Sbjct: 101 FDEVGSFLRSY----VVPEGF--PGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 160

Query: 165 GVGRTRSLASAAALNWVLKDGLGRLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVE 224
           G  R  S ++A A+NW+LKDG GR+ + ++ A     FD +LK++RF+  +L  L  GVE
Sbjct: 161 GASRNSSASAAVAINWILKDGAGRVGKMLF-ARQGKKFDYDLKQLRFAGDLLMELGAGVE 220

Query: 225 LLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFD 284
           L T A P  FL LA  AN+VK ++     ST + ++++FA  +N+G+V+AK +      D
Sbjct: 221 LATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 280

Query: 285 NLGLILAVLLNFLSKNDQRLQAALPFVVYPIFAAMDLFGTYQ--------GLKHDRLEII 344
            +G   ++L   +SK +  L        + + +   L  +YQ         L   R  + 
Sbjct: 281 LMGTGFSIL---ISKRNPSL-----VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVA 340

Query: 345 LCTWIEQGYVPTPAEVSEMEGI 359
           + ++++ G VP+  E +  E I
Sbjct: 341 VESFLKTGRVPSLQEGNIQEKI 347

BLAST of CmaCh02G005660 vs. ExPASy Swiss-Prot
Match: Q84JB8 (Protein root UVB sensitive 3 OS=Arabidopsis thaliana OX=3702 GN=RUS3 PE=2 SV=1)

HSP 1 Score: 102.4 bits (254), Expect = 1.5e-20
Identity = 92/392 (23.47%), Positives = 173/392 (44.13%), Query Frame = 0

Query: 128 PKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAIGVGRTRSLASAAALNWVLKDGLG 187
           P +V+  YV +  W  L  + +    +++TQA+  AIGVG   +    A   W L+D  G
Sbjct: 63  PGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTG 122

Query: 188 RLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVELLTPAFPQYFLLLASIANIVKQI 247
            L   ++T    S+ D+N K  R    ++  + + ++LL+P FP  F+++  + ++ +  
Sbjct: 123 MLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSF 182

Query: 248 SLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFDNLGLILAVLLNFLSKNDQRLQAA 307
           +     +T +A+ + FA+ DN  ++SAK   Q      +G+ L +LL        R  + 
Sbjct: 183 TGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLL-------ARFTSG 242

Query: 308 LPFVVYPIFAAMDLFGTY-----------QGLKHDRLEIILCTWIEQGYVPTPAEVSEME 367
            P  ++  F ++ +F  Y             L  +R  I+L  +I+ G V +P +VS ME
Sbjct: 243 NPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSME 302

Query: 368 GIDLL---CRKGKVSWPI--RIGCLNLESQIPKLSMLAMRSVCNKDYYFICMDVFCRGLT 427
           G+  L     +   S P+  R+      S +P+L ML + +      Y        + L 
Sbjct: 303 GVLPLWATSLRSTNSKPLHKRVQLGVRVSSLPRLDMLQLLNGVGASSY-----KNAKYLL 362

Query: 428 TNTHG-ILLCLREGARAADISMGLLQACFIRKVIVSNTSVWDKEIMKGINFSDAMAKEWV 487
            +  G + + L + ++ AD+    + A  +  ++  +TS + +             + W+
Sbjct: 363 AHIKGNVSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYSE------------GEAWI 420

Query: 488 GLVGDSKKYAEENGCNLLKQMSSLGWAVKNVL 503
                 K Y E     LL ++ S GW  + +L
Sbjct: 423 -----DKHYDE-----LLHKLRSGGWKTERLL 420

BLAST of CmaCh02G005660 vs. ExPASy Swiss-Prot
Match: Q7X6P3 (Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=RUS1 PE=1 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 3.4e-20
Identity = 76/267 (28.46%), Positives = 133/267 (49.81%), Query Frame = 0

Query: 120 LAVKGFFIPKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAIGVGRTRSLASAAALN 179
           L  +GF  P +V+  Y+ Y  W+ +  + S    V+ATQ++  A+G+G+  ++ +AAA+N
Sbjct: 198 LLPEGF--PNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGK-GAIPTAAAIN 257

Query: 180 WVLKDGLGRLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVELLTPAFPQYFLLLAS 239
           WVLKDG+G LS+ I  +     FD + K  R    +L + + G+E+LTP FPQ+F+++ +
Sbjct: 258 WVLKDGIGYLSK-IMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGA 317

Query: 240 IANIVKQISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFDNLGLILAVLLNFLSK 299
            A   +  +     +T S  +  FA   N  EV AK + Q +   ++G++L +++     
Sbjct: 318 AAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIG 377

Query: 300 NDQRLQAALPFVVYPIFAAMDLFGTYQ-----GLKHDRLEIILCTWIEQGYVPTPAEVSE 359
               L  A   VV  I    +L  +YQ      L   R  ++   ++  G  P   EV++
Sbjct: 378 TSTSLALAAFGVVTTIHMYTNL-KSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVND 437

Query: 360 MEGIDLLCRKGKVSWPIRIGCLNLESQ 382
            E +    R   +  P ++    L S+
Sbjct: 438 EEPLFPTVRFSNMKSPEKLQDFVLSSE 459

BLAST of CmaCh02G005660 vs. ExPASy Swiss-Prot
Match: Q86K80 (RUS family member 1 OS=Dictyostelium discoideum OX=44689 GN=rusf1 PE=3 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 3.4e-20
Identity = 67/242 (27.69%), Positives = 132/242 (54.55%), Query Frame = 0

Query: 128 PKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAIGVGRTRSLASAAALNWVLKDGLG 187
           P +V+  Y  Y  W  +  + S+    +AT+A+ +  GVG + +  ++A   W+++DG+G
Sbjct: 69  PDSVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASATTQWLIRDGMG 128

Query: 188 RLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVELLTPAF-PQYFLLLASIANIVKQ 247
            + R ++     +  D N K+ R++  IL ++ +  E+++P F  Q FL L+ I  I K 
Sbjct: 129 MIGRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFLPLSCIGLIAKS 188

Query: 248 ISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFDNLGLILAVLL-NFLSKNDQRLQ 307
           I       T +++ + FA  DNL +VSAK   Q    + +G++L+V++ +F++ N   + 
Sbjct: 189 ICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVSSFINDNTSLI- 248

Query: 308 AALPFVVYPIFAAMDLFGTYQGLKHDRLE--------IILCTWI-EQGYVPTPAEVSEME 359
             + ++V+  F ++ LF  Y+ +   +L+        +I   +I  QG +P+P+E+S++E
Sbjct: 249 --VTWLVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPSPSEISKLE 307

BLAST of CmaCh02G005660 vs. TAIR 10
Match: AT2G23470.1 (Protein of unknown function, DUF647)

HSP 1 Score: 528.5 bits (1360), Expect = 6.1e-150
Identity = 286/482 (59.34%), Positives = 352/482 (73.03%), Query Frame = 0

Query: 46  IRTSRTRYRADEGLDDTPGPSTPV-RFPVVLLRSGRVSQYVWDGFSLQLVGVD---GGAS 105
           +RTS    +     +D   PS    R P+++ +SG+VS+Y   G SL+L+ VD     ++
Sbjct: 39  LRTSIDYKQEGASKEDLVVPSNVARRLPIIIKKSGKVSRYFIKGDSLELLCVDEEEDDST 98

Query: 106 SVSFDFVDGFRTMYRACGLAVKGFFIPKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMF 165
           S      DGF  + R    A K FF+PK VS++Y+ YVKWKFLHRVFSSALQV+ATQAMF
Sbjct: 99  SFCLGLDDGFWKLIRLTSSAAKDFFLPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMF 158

Query: 166 RAIGVGRTRSLASAAALNWVLKDGLGRLSRCIYTATIASSFDTNLKRVRFSTAILFSLSI 225
           RAIG+G++RSLAS+AA NW+LKDGLGRLSRCIYTA++AS+FDTNLKRVRFST++LFSLSI
Sbjct: 159 RAIGIGQSRSLASSAAFNWILKDGLGRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSI 218

Query: 226 GVELLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSV 285
           GVEL+TP FPQYFLLLASIANI KQISL CYL+T SAVHRSFAV DNLGEVSAKAQIQ+V
Sbjct: 219 GVELMTPVFPQYFLLLASIANIAKQISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTV 278

Query: 286 CFDNLGLILAVLLNFLSKNDQRLQAALPFVVYPIFAAMDLFGTYQGLKH--------DRL 345
           CFDNLGL+LAVLLN L +++QRLQA LPFV+YPIF+  DL G YQGLKH        DRL
Sbjct: 279 CFDNLGLLLAVLLNMLFQHNQRLQACLPFVLYPIFSTFDLLGIYQGLKHINLQTLTKDRL 338

Query: 346 EIILCTWIEQGYVPTPAEVSEMEGIDLLCRKG-KVSWPIRIGCLNLESQIPKLSMLAMRS 405
           EIIL  WIE   VP+PAEVSE EGI LL  +G K  WPIRIGCL+ ++QIP LSM+AM+S
Sbjct: 339 EIILERWIEFRQVPSPAEVSEEEGIGLLGSRGSKRVWPIRIGCLDPKAQIPTLSMMAMQS 398

Query: 406 VCNKDYYFICMDVFCRGL-TTNTHGILLCLREGARAADISMGLLQACFIRKVIVSNTSVW 465
           +C+ D YFI M++  +G       GI++CLREGA + D+   LLQ C+IRK + +N    
Sbjct: 399 LCSDDGYFITMELSSQGFRRIPKSGIVICLREGANSVDVITSLLQTCYIRKSLGAN---- 458

Query: 466 DKEIMKGINFSDAMAKEWVGLVGDSKKYAEENGCNLLKQMSSLGWAVKNVLLSTNEQIRY 514
            +     ++FSD   ++W  L  +SK+ A ++   L KQM   GW VKNVLLS  EQIRY
Sbjct: 459 -RTKRSYLSFSDLTLQDWTLLTRESKRAARDDNIALNKQMQEQGWIVKNVLLSAEEQIRY 515

BLAST of CmaCh02G005660 vs. TAIR 10
Match: AT5G49820.1 (Protein of unknown function, DUF647)

HSP 1 Score: 122.5 bits (306), Expect = 1.0e-27
Identity = 83/262 (31.68%), Positives = 136/262 (51.91%), Query Frame = 0

Query: 105 FDFVDGFRTMYRACGLAVKGFFIPKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAI 164
           FD V  F   Y    +  +GF  P +V+E YV Y+ W+ L   F  A+ V  TQ +  ++
Sbjct: 101 FDEVGSFLRSY----VVPEGF--PGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 160

Query: 165 GVGRTRSLASAAALNWVLKDGLGRLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVE 224
           G  R  S ++A A+NW+LKDG GR+ + ++ A     FD +LK++RF+  +L  L  GVE
Sbjct: 161 GASRNSSASAAVAINWILKDGAGRVGKMLF-ARQGKKFDYDLKQLRFAGDLLMELGAGVE 220

Query: 225 LLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFD 284
           L T A P  FL LA  AN+VK ++     ST + ++++FA  +N+G+V+AK +      D
Sbjct: 221 LATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 280

Query: 285 NLGLILAVLLNFLSKNDQRLQAALPFVVYPIFAAMDLFGTYQ--------GLKHDRLEII 344
            +G   ++L   +SK +  L        + + +   L  +YQ         L   R  + 
Sbjct: 281 LMGTGFSIL---ISKRNPSL-----VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVA 340

Query: 345 LCTWIEQGYVPTPAEVSEMEGI 359
           + ++++ G VP+  E +  E I
Sbjct: 341 VESFLKTGRVPSLQEGNIQEKI 347

BLAST of CmaCh02G005660 vs. TAIR 10
Match: AT1G13770.1 (Protein of unknown function, DUF647)

HSP 1 Score: 102.4 bits (254), Expect = 1.1e-21
Identity = 92/392 (23.47%), Positives = 173/392 (44.13%), Query Frame = 0

Query: 128 PKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAIGVGRTRSLASAAALNWVLKDGLG 187
           P +V+  YV +  W  L  + +    +++TQA+  AIGVG   +    A   W L+D  G
Sbjct: 63  PGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTG 122

Query: 188 RLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVELLTPAFPQYFLLLASIANIVKQI 247
            L   ++T    S+ D+N K  R    ++  + + ++LL+P FP  F+++  + ++ +  
Sbjct: 123 MLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSF 182

Query: 248 SLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFDNLGLILAVLLNFLSKNDQRLQAA 307
           +     +T +A+ + FA+ DN  ++SAK   Q      +G+ L +LL        R  + 
Sbjct: 183 TGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLL-------ARFTSG 242

Query: 308 LPFVVYPIFAAMDLFGTY-----------QGLKHDRLEIILCTWIEQGYVPTPAEVSEME 367
            P  ++  F ++ +F  Y             L  +R  I+L  +I+ G V +P +VS ME
Sbjct: 243 NPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSME 302

Query: 368 GIDLL---CRKGKVSWPI--RIGCLNLESQIPKLSMLAMRSVCNKDYYFICMDVFCRGLT 427
           G+  L     +   S P+  R+      S +P+L ML + +      Y        + L 
Sbjct: 303 GVLPLWATSLRSTNSKPLHKRVQLGVRVSSLPRLDMLQLLNGVGASSY-----KNAKYLL 362

Query: 428 TNTHG-ILLCLREGARAADISMGLLQACFIRKVIVSNTSVWDKEIMKGINFSDAMAKEWV 487
            +  G + + L + ++ AD+    + A  +  ++  +TS + +             + W+
Sbjct: 363 AHIKGNVSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYSE------------GEAWI 420

Query: 488 GLVGDSKKYAEENGCNLLKQMSSLGWAVKNVL 503
                 K Y E     LL ++ S GW  + +L
Sbjct: 423 -----DKHYDE-----LLHKLRSGGWKTERLL 420

BLAST of CmaCh02G005660 vs. TAIR 10
Match: AT3G45890.1 (Protein of unknown function, DUF647)

HSP 1 Score: 101.3 bits (251), Expect = 2.4e-21
Identity = 76/267 (28.46%), Positives = 133/267 (49.81%), Query Frame = 0

Query: 120 LAVKGFFIPKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAIGVGRTRSLASAAALN 179
           L  +GF  P +V+  Y+ Y  W+ +  + S    V+ATQ++  A+G+G+  ++ +AAA+N
Sbjct: 198 LLPEGF--PNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGK-GAIPTAAAIN 257

Query: 180 WVLKDGLGRLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVELLTPAFPQYFLLLAS 239
           WVLKDG+G LS+ I  +     FD + K  R    +L + + G+E+LTP FPQ+F+++ +
Sbjct: 258 WVLKDGIGYLSK-IMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGA 317

Query: 240 IANIVKQISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFDNLGLILAVLLNFLSK 299
            A   +  +     +T S  +  FA   N  EV AK + Q +   ++G++L +++     
Sbjct: 318 AAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIG 377

Query: 300 NDQRLQAALPFVVYPIFAAMDLFGTYQ-----GLKHDRLEIILCTWIEQGYVPTPAEVSE 359
               L  A   VV  I    +L  +YQ      L   R  ++   ++  G  P   EV++
Sbjct: 378 TSTSLALAAFGVVTTIHMYTNL-KSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVND 437

Query: 360 MEGIDLLCRKGKVSWPIRIGCLNLESQ 382
            E +    R   +  P ++    L S+
Sbjct: 438 EEPLFPTVRFSNMKSPEKLQDFVLSSE 459

BLAST of CmaCh02G005660 vs. TAIR 10
Match: AT5G01510.1 (Protein of unknown function, DUF647)

HSP 1 Score: 100.5 bits (249), Expect = 4.1e-21
Identity = 82/297 (27.61%), Positives = 141/297 (47.47%), Query Frame = 0

Query: 128 PKNVSEHYVLYVKWKFLHRVFSSALQVIATQAMFRAIGVG-------RTRSLASAAALNW 187
           P +VS+ Y+ Y+ W+F   +      V+ T ++ +A+GVG          + ASAAA+ W
Sbjct: 125 PGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASAAAIRW 184

Query: 188 VLKDGLGRLSRCIYTATIASSFDTNLKRVRFSTAILFSLSIGVELLTPAFPQYFLLLASI 247
           V KDG+G L R +      S FD + K+ R     + S     +L T  +P  FLLLAS 
Sbjct: 185 VSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLLAST 244

Query: 248 ANIVKQISLGCYLSTASAVHRSFAVTDNLGEVSAKAQIQSVCFDNLGLILAVLLNFLSKN 307
            N+ K ++ G    +   +   FA++ NLGEV+AK ++  V    +GL   +L+     +
Sbjct: 245 GNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILI----ID 304

Query: 308 DQRLQAALPFVV--YPIFAAMDLFGTYQGL--------KHDRLEIILCTWIEQGYVPTPA 367
              L  + PFV+  +     + L+  YQ L           R  II+ + +    VP   
Sbjct: 305 TPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVPGYV 364

Query: 368 EVSEMEGIDLLCR--KGKVSWPIRIGCLN-LESQIPKLSMLAMRSVCNKDYYFICMD 405
           + ++ E I L  R  K ++ + + +  L+ LE  + K+   A+  +  K+ Y + ++
Sbjct: 365 DCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVK--ALLKMYTKEKYILTLN 415

The following BLAST results are available for this feature:
Match Name     E-value     Identity    Description
Q67YT8         8.6e-149    59.34       Protein root UVB sensitive 4 OS=Arabidopsis thaliana OX=3702 GN=RUS4 PE=2 SV=1
Q93YU2         1.4e-26     31.68       Protein root UVB sensitive 6 OS=Arabidopsis thaliana OX=3702 GN=RUS6 PE=2 SV=1
Q84JB8         1.5e-20     23.47       Protein root UVB sensitive 3 OS=Arabidopsis thaliana OX=3702 GN=RUS3 PE=2 SV=1
Q7X6P3         3.4e-20     28.46       Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=R...
Q86K80         3.4e-20     27.69       RUS family member 1 OS=Dictyostelium discoideum OX=44689 GN=rusf1 PE=3 SV=1

Match Name     E-value     Identity    Description
AT2G23470.1    6.1e-150    59.34       Protein of unknown function, DUF647
AT5G49820.1    1.0e-27     31.68       Protein of unknown function, DUF647
AT1G13770.1    1.1e-21     23.47       Protein of unknown function, DUF647
AT3G45890.1    2.4e-21     28.46       Protein of unknown function, DUF647
AT5G01510.1    4.1e-21     27.61       Protein of unknown function, DUF647
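One way to work with a summary like this is to filter hits by E-value and percent identity. The sketch below copies the five Swiss-Prot rows from the table; the cut-offs (`max_evalue=1e-25`, `min_identity=30.0`) and the function name `filter_hits` are arbitrary assumptions chosen for illustration, not thresholds used by the database.

```python
# Swiss-Prot hits from the summary table: (accession, E-value, percent identity)
swissprot_hits = [
    ("Q67YT8", 8.6e-149, 59.34),
    ("Q93YU2", 1.4e-26, 31.68),
    ("Q84JB8", 1.5e-20, 23.47),
    ("Q7X6P3", 3.4e-20, 28.46),
    ("Q86K80", 3.4e-20, 27.69),
]

def filter_hits(hits, max_evalue=1e-25, min_identity=30.0):
    """Keep hits at or below the E-value cut-off and at or above the identity cut-off."""
    return [
        hit for hit in hits
        if hit[1] <= max_evalue and hit[2] >= min_identity
    ]

for accession, evalue, identity in filter_hits(swissprot_hits):
    print(accession, evalue, identity)
```

With these illustrative cut-offs only Q67YT8 (RUS4) and Q93YU2 (RUS6) are retained.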
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term     IPR Description             Source     Source Term       Source Description              Alignment
IPR006968    Root UVB sensitive family   PFAM       PF04884           DUF647                          coord: 124..334; e-value: 7.8E-54; score: 182.8
IPR006968    Root UVB sensitive family   PANTHER    PTHR12770         RUS1 FAMILY PROTEIN C16ORF58    coord: 61..516
None         No IPR available            PANTHER    PTHR12770:SF29    PROTEIN ROOT UVB SENSITIVE 4    coord: 61..516

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh02G005660.1    CmaCh02G005660.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0048653        anther development
cellular_component    GO:0009507        chloroplast