Cp4.1LG12g10330 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG12g10330
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: UPF0420 C16orf58-like protein
Location: Cp4.1LG12 : 9457053 .. 9461805 (-)
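The length of the region covered by the Location field can be checked directly from the coordinates. The following is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG12 : 9457053 .. 9461805 (-)")
span = end - start + 1  # assumes inclusive coordinates
print(seqid, strand, span)  # Cp4.1LG12 - 4753
```

The `(-)` strand marker indicates that the feature lies on the reverse strand of the linkage group.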
The following sequences are available for this feature:

Gene sequence (with intron)

TCTGCCACTTCGGAGAAACAGATAAAACTTCCTTATTCGGCGGCAATTGTGGATTTCAACTCCGATTCTCGCGGCGGCGTTTTCACCTCTGGAACGAAACGGTCATTAGGCAAATTGCGAGTCTTCCTCCATGGATTTCCTGGTACATTTCAATTTTATTATCATTCCAGTCCCTATCTTCTTCTGAATCTCCTCCACTACGCAGAACAAGTTCAACGTGCGCAAGGACGCAGAGAAATCGCCTTCTGGTCCTGTTTCCTGGATTGAAATCTCCGAGTCTGTCTCACGCCGCTGCCAATTTCAACCCGATGGTCAACTTTCTGTGAGCTTCTGGTTTAGCTGACATTAACGTTTTCTGAGCTGTGTTTTAAATAACTGATGGTTTTTCGCTTTCTGGGCATTGGTTATTTCATTCCAAATTTTCATTCATTCTACTTTTCTTTATGGAAATTCCGTGGTGTTCTTGCTTTTTTTAGTGTAAATGTGGGCTATCTTGGTAGCCGGATATGCTTGATGTTTAGTACCATGCTTCATAGTTTTATATGGAGTTGAAAGACAATCTGAGGCGTGAGGCGTTAGGCATGATGTGATTTTCCACTTGTGCAATCATATGGTGCTGATCCACTTAAGCTACTACTATCAGTAATATGATTATAGAATGAATTACCATCAGAAGTTTATAGGATTAGTGGCTTGCCTTTGGTTATGCGGTTCATTGAGTGAACAGGCTTCAGTGCATCATAAAATTGAATGGCCAATGGAGTGTCGTGTGATATAATCTTCTGGTTCTCATTGCTGCTCTATATGATGCAAATGGGCCAGTGTCAAATCATGATTTTGAAACCTAGGTTGATCTTAGCTTCTCTGCCTTTTTTTTAGCATGAATGGATCAGCTAATAGTGAGCATGGGATTTAGATCAGAACAATTATCCAAAAGAGACTCGACATCTATACCGTTAGGCATCGGGGAAGAAAAAAATATTGCTGCCTTTGGTTTAGTTTGTGCACTTTACAAAAATACATACGAAAAGCACATCTCTCTGAACCTAAACTCTAAAACACCCTGTCGTAGAGTCTTCAATGTTCAAAAAGACCCTTCTTAATGGTACCCTTATGATAACTAACTCACTGTTTCCACCTAACTAATGGTACCCACCCCCTTACACATGCTCTGACAAAAATTTCTCATAAACTCGAGTTTCTTTTACCATTGAATTGCTTTTTTAAAAGGAAAGAAAATGTGGTTAGAGAGGAACAATAGAATTTTTAGAGGGCAGAGAGATCTTTAAATGAGGTTTTGGCGGGGGAGGTCTAAAGCCTGTTTGTGAGTGTTTGTTCGTCGATTTTTGTAATTATCATCTTATTCTTCTTCTTTCAGATTTGAGCCCGTTCTTATAGGTTCTTGAGGGATTTTTTTGTGGGCTTGGTATTTGTTGCCATTGTATATTCTTTCATTTTTTTTTCACTGAAAGCTTAGTTTCTTATCCAAAGAAAAAAAAATATGGAAAGAAAAAGGAAATCCTGTTCAAATTGGTCGGATTAAAGTAGATCGTTCTCTTAAAACAAACCTAAGTTCTGAATAAAATTACATTGAGAGGCATTTGGAAGAGGAAAATACCTAAAAAGACCAAACTCTTGCTTTGGAAAAATAGAATATGAAGAGTGATTGATGAAGGGAAATTATTTCGAAGCGTATGACAAAATTTCTGATTCAACCATGATCGTTTTGAAGTTAAAATTAGGAAATGGTTTTTTTTTTTTTTTTTTCCCAGTTGTGTAACCTTCAAATTGGATGCTAATATAAAAAAGAAATTGATCTCTATAGGTGAAGATTATTGATGACTCAAGACCAGCAATTCAACGTGTTGTTGATTCCTTCCTCAATACATTTTTTCCCTCAGGATATCCATACAGGTAAGCTGCATGATTTATTGTCACTAGTCTAAACAATTACACCAAGATTTGCATAAATCTCACCACCAGTCAAGTAAGGTGGAGGA
TTGTTGTATTTATAATAAATTGAAAGAGTTATTGTGAGATTCCACATCGGTTGGGGAGGAGAATGAAGCACTCTTTATAAGGGTGTGGAAACCTCCCCCTAGCAGACGCGTTTTATAACCTTGAGGGAAAACCCGGAAGAAAAAGTCCAAAGAAGACAATGTCTGCTAGCGGTGGACTTGAGCTGTTACAGTTATAGTAGTAAATGAACAAGTGTCTTTTGTTAGCTACTTGATTATTGCCGTCTTACTTGACAGTGTCAATGAAGGATATTTGAGGTATACACAATTTCGAGCATTGCAACATGTTACTAGTGCAGCTTTGTCAGTGCTGTCAACTCAGGTTTTTATCCAAATCTTTCCATCTTTGGTATTGCATTATGATTACATAATAATTCTTAACATTAGTAATATGATAATTACCTGTCAGCTTCAAATGATAGAATGGTGGTCATTGATATAAGTTCTTAATTCTGTTCAGTCACTGCTGTTTGCTGCAGGCTTGAGGCCAACTGCGGCACAGGCTACTGTTGTTAGTTGGGTAAGTTCAACAATTGTTTGTTATGGATCCATATGCTTGGTGACATATACTTCTCTCTTTCTTTTTTCTTCCTGGAAGGTTTTAAAGGATGGGATGCAACATGTTGGCAAACTCATATGCAGCAATTTAGGTACAAGAATGGATTCAGAGCCCAAAAGGTGGAGAGTTATAGGTATGCATCTTTGTATCAATTTGTTATTCATTTTAAGATAGCATTTCCATACAGACATTATGTTGTTTCTTGGAGTCGCATCGTGTGTGTGTGCACTTATGGTTTCTTAAGTGACAAGTGACCATCGCTTCTAAAAAACGATCGGATTTTATGCTGTAGCTGATGTCCTCTATGACTTTGGTGCTGGCTTGGAAGTTATTTCTCCCTTATGCCCTCATCTTTTTCTTCAAATGGCCGGCCTAGGCAACTTTGCAAAGGTTTAAGTTGCTTCTCTCTTCAGCAAATTGTTTGAAGTATTATCAGTATTCGCCCGTTCTAGTTTCCGAGTACAAGAATGTTCTTTATTTTGTGATTTTGAATTTCTCTTCTGCTTTTTCTGGGGCTGGCTAGGGAATGGCTGTAGTTGCTGCAAGAGCAACAAGATTACCAATATATTCTTCATTTGCTAAAGAAGGCAATCTTAGTGACCTTTTTGCCAAAGGGGAAGCCATCTCCACTCTCTTCAATGTTGTTGGAATCGGAGCTGGGCTACAATTAGCGTCAACTATTTGTTCATCAATACAAGGAAAGGTAGGTTCCCTTGAATTTCCACTGTCAGTTGATTTTTTCTCTATGTTCTTATGCATTTTTCACTGTTATAAAAAGATACAAAAATAAACTGGTGATAAATTTATGCATTTTTCACACGGCTTGATATCCGGCTCTGATACTTATGCATTTTTCAATGATTATCGAGTGATATTACTCTGTTTTCTGCATTTTTTGGTAGGCTTATTGTAGACACTGATCCTGAACTCTAGGGGAATGGTATTTTTTGTGTTGTTGAGGAAATGACTGAATATGGACATGAGAAAAATGTACAAATGTGAACTGACGGTGTTACGTAACGGGTTAAAGCGGACAATATTTGCTAACGGTGGGCTTGGACTGTTACAAATAGTATCAGAGCCAGATATCAAGCGGTGTGTCAACAATGACGTTGGCCCCCAAGGGGGTGGATTTTGAGATCCCATATCAGTTGGAGAGGGGAATGAAACATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTAAAATCGTGAGGCTGATGACAATACGTAACGGGCTAAAATGGATAATATCTGTTAGGGGTGAGCTCGTTTGTTAACCTTTTTTTATCTTGCTTATCATTTGTCTAAATCTGGTTCAAGCATGAGATACATTCGACTCGATGTGAGTACCGTTTTTAGTTGGTTTTAATTTCTATTCCTTGCAGTTAGTTGCAGCCCCTCTCCTTTCAA
TTGTACATGTCTATTGCGTTGTAGAGCAAATGCGAGCAACTCCAATAAACACATTGAATCCACAGAGGACGGCAATGATCGTTGCTGATTTCGTTAAGTCGGGAAGGATACCGAGTCCCGCTGACCTAAGGTACCATGAAGATCTTGTTTTTCCTGGAAGACTCATAGAGGATGCTGGAAGTGTAAAAGTAGGAAGGGCTTTACATGAGGTTATTAAGCCATCAAAACTTGTTGAAATGAAACAAATGTTCCCCGAGGAGAAGTTTGTTTTGAACCAAACTCACAAATGGGTTGACATGGTGTTGGAGCATGATGCCTCAGGCGAGGATGCATTACGGGGATGGCTAGTAGCTGCATATACTGCAAACATAAAAGGGCCTTCTCATGAGCCAACTGCCAGCGTGTTGCTCGAGGCTTACGAAAAGATGAACGATGTGTTCACTCCGTTCGTATCTGAACTTCAGGCCAAAGGGTGGCATACTGATCGTTTTCTCGACGGAGTGGGAACTCGTTTTGCATGGTAGTTCTACATTCTCTATACTATCTTTAATCTTAGGTTCATATAGGTTTGTAGATCTTAGAACAGGCAATCATTTTCTCACGAGAAGTCGAGTTGTTTTGCTTGATGGTTCTTCATTTGTCGGACTACTTTGATTCATGCAGGTTTGTTGATATCAGAACTATATACTATATGAAGTAGAAATAGATCAAACGTTGTTACTTTTGTTAGAACAATGATGTGGGAGCTTTACA

mRNA sequence

TCTGCCACTTCGGAGAAACAGATAAAACTTCCTTATTCGGCGGCAATTGTGGATTTCAACTCCGATTCTCGCGGCGGCGTTTTCACCTCTGGAACGAAACGGTCATTAGGCAAATTGCGAGTCTTCCTCCATGGATTTCCTGAACAAGTTCAACGTGCGCAAGGACGCAGAGAAATCGCCTTCTGGTCCTGTTTCCTGGATTGAAATCTCCGAGTCTGTCTCACGCCGCTGCCAATTTCAACCCGATGGTCAACTTTCTGTGAAGATTATTGATGACTCAAGACCAGCAATTCAACGTGTTGTTGATTCCTTCCTCAATACATTTTTTCCCTCAGGATATCCATACAGTGTCAATGAAGGATATTTGAGGTATACACAATTTCGAGCATTGCAACATGTTACTAGTGCAGCTTTGTCAGTGCTGTCAACTCAGTCACTGCTGTTTGCTGCAGGCTTGAGGCCAACTGCGGCACAGGCTACTGTTGTTAGTTGGGTTTTAAAGGATGGGATGCAACATGTTGGCAAACTCATATGCAGCAATTTAGGTACAAGAATGGATTCAGAGCCCAAAAGGTGGAGAGTTATAGCTGATGTCCTCTATGACTTTGGTGCTGGCTTGGAAGTTATTTCTCCCTTATGCCCTCATCTTTTTCTTCAAATGGCCGGCCTAGGCAACTTTGCAAAGGGAATGGCTGTAGTTGCTGCAAGAGCAACAAGATTACCAATATATTCTTCATTTGCTAAAGAAGGCAATCTTAGTGACCTTTTTGCCAAAGGGGAAGCCATCTCCACTCTCTTCAATGTTGTTGGAATCGGAGCTGGGCTACAATTAGCGTCAACTATTTGTTCATCAATACAAGGAAAGTTAGTTGCAGCCCCTCTCCTTTCAATTGTACATGTCTATTGCGTTGTAGAGCAAATGCGAGCAACTCCAATAAACACATTGAATCCACAGAGGACGGCAATGATCGTTGCTGATTTCGTTAAGTCGGGAAGGATACCGAGTCCCGCTGACCTAAGGTACCATGAAGATCTTGTTTTTCCTGGAAGACTCATAGAGGATGCTGGAAGTGTAAAAGTAGGAAGGGCTTTACATGAGGTTATTAAGCCATCAAAACTTGTTGAAATGAAACAAATGTTCCCCGAGGAGAAGTTTGTTTTGAACCAAACTCACAAATGGGTTGACATGGTGTTGGAGCATGATGCCTCAGGCGAGGATGCATTACGGGGATGGCTAGTAGCTGCATATACTGCAAACATAAAAGGGCCTTCTCATGAGCCAACTGCCAGCGTGTTGCTCGAGGCTTACGAAAAGATGAACGATGTGTTCACTCCGTTCGTATCTGAACTTCAGGCCAAAGGGTGGCATACTGATCGTTTTCTCGACGGAGTGGGAACTCGTTTTGCATGGTAGTTCTACATTCTCTATACTATCTTTAATCTTAGGTTCATATAGGTTTGTAGATCTTAGAACAGGCAATCATTTTCTCACGAGAAGTCGAGTTGTTTTGCTTGATGGTTCTTCATTTGTCGGACTACTTTGATTCATGCAGGTTTGTTGATATCAGAACTATATACTATATGAAGTAGAAATAGATCAAACGTTGTTACTTTTGTTAGAACAATGATGTGGGAGCTTTACA

Coding sequence (CDS)

ATGGATTTCCTGAACAAGTTCAACGTGCGCAAGGACGCAGAGAAATCGCCTTCTGGTCCTGTTTCCTGGATTGAAATCTCCGAGTCTGTCTCACGCCGCTGCCAATTTCAACCCGATGGTCAACTTTCTGTGAAGATTATTGATGACTCAAGACCAGCAATTCAACGTGTTGTTGATTCCTTCCTCAATACATTTTTTCCCTCAGGATATCCATACAGTGTCAATGAAGGATATTTGAGGTATACACAATTTCGAGCATTGCAACATGTTACTAGTGCAGCTTTGTCAGTGCTGTCAACTCAGTCACTGCTGTTTGCTGCAGGCTTGAGGCCAACTGCGGCACAGGCTACTGTTGTTAGTTGGGTTTTAAAGGATGGGATGCAACATGTTGGCAAACTCATATGCAGCAATTTAGGTACAAGAATGGATTCAGAGCCCAAAAGGTGGAGAGTTATAGCTGATGTCCTCTATGACTTTGGTGCTGGCTTGGAAGTTATTTCTCCCTTATGCCCTCATCTTTTTCTTCAAATGGCCGGCCTAGGCAACTTTGCAAAGGGAATGGCTGTAGTTGCTGCAAGAGCAACAAGATTACCAATATATTCTTCATTTGCTAAAGAAGGCAATCTTAGTGACCTTTTTGCCAAAGGGGAAGCCATCTCCACTCTCTTCAATGTTGTTGGAATCGGAGCTGGGCTACAATTAGCGTCAACTATTTGTTCATCAATACAAGGAAAGTTAGTTGCAGCCCCTCTCCTTTCAATTGTACATGTCTATTGCGTTGTAGAGCAAATGCGAGCAACTCCAATAAACACATTGAATCCACAGAGGACGGCAATGATCGTTGCTGATTTCGTTAAGTCGGGAAGGATACCGAGTCCCGCTGACCTAAGGTACCATGAAGATCTTGTTTTTCCTGGAAGACTCATAGAGGATGCTGGAAGTGTAAAAGTAGGAAGGGCTTTACATGAGGTTATTAAGCCATCAAAACTTGTTGAAATGAAACAAATGTTCCCCGAGGAGAAGTTTGTTTTGAACCAAACTCACAAATGGGTTGACATGGTGTTGGAGCATGATGCCTCAGGCGAGGATGCATTACGGGGATGGCTAGTAGCTGCATATACTGCAAACATAAAAGGGCCTTCTCATGAGCCAACTGCCAGCGTGTTGCTCGAGGCTTACGAAAAGATGAACGATGTGTTCACTCCGTTCGTATCTGAACTTCAGGCCAAAGGGTGGCATACTGATCGTTTTCTCGACGGAGTGGGAACTCGTTTTGCATGGTAG

Protein sequence

MDFLNKFNVRKDAEKSPSGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVDSFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVVSWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDASGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLDGVGTRFAW
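The protein sequence above is consistent with translating the CDS under the standard genetic code. A minimal sketch of that translation follows, applied only to the first 24 bases of the CDS; the `translate` helper and the way the codon table is built are illustrative assumptions, not the database's own pipeline.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGATTTCCTGAACAAGTTCAAC"  # first 24 bases of the CDS above
print(translate(cds_start))  # MDFLNKFN
```

Passing the full CDS string to `translate` in the same way should reproduce the complete protein, since the CDS ends in the stop codon `TAG`.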
BLAST of Cp4.1LG12g10330 vs. Swiss-Prot
Match: RUS2_ARATH (Protein root UVB sensitive 2, chloroplastic OS=Arabidopsis thaliana GN=RUS2 PE=1 SV=2)

HSP 1 Score: 649.8 bits (1675), Expect = 2.0e-185
Identity = 312/414 (75.36%), Positives = 363/414 (87.68%), Query Frame = 1

Query: 15  KSPSG-PVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVDSFLNTFFPSGYPYS 74
           KSP   PV W E S+SVS R QFQ DG LS+K++DD+RP  Q++V+SFLN FFPSGYPYS
Sbjct: 20  KSPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQKMVESFLNKFFPSGYPYS 79

Query: 75  VNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVVSWVLKDGMQHVGKL 134
           VNEGYLRYTQFRALQH +SAALSVLSTQSLLFAAGLRPT AQATVVSW+LKDGMQHVGKL
Sbjct: 80  VNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 139

Query: 135 ICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMAVVAAR 194
           ICSNLG RMDSEPKRWR++ADVLYD G GLE++SPLCPHLFL+MAGLGNFAKGMA VAAR
Sbjct: 140 ICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEMAGLGNFAKGMATVAAR 199

Query: 195 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLS 254
           ATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV GIGAG+QLASTICSS++GKLV   +LS
Sbjct: 200 ATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLASTICSSMEGKLVVGSILS 259

Query: 255 IVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFPGRLIEDAG 314
           +VHVY VVEQMR  PINTLNPQRTA+IVA+F+K+G++PSP DLR+ EDL+FP R I+DAG
Sbjct: 260 VVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLRFQEDLMFPERPIQDAG 319

Query: 315 SVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDASGEDALRGWLVAAY 374
           +VKVGRALH+ +KPS++  +KQ+F EEKF+L+    W DMVLEHDA+GEDALRGWLVAAY
Sbjct: 320 NVKVGRALHKAVKPSEVQRLKQVFVEEKFLLSHGKSWTDMVLEHDATGEDALRGWLVAAY 379

Query: 375 TANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLDGVGTRFAW 428
             ++    ++P   +L +AY+KMNDVF PF+S++QAKGW+TDRFLDG GTRFAW
Sbjct: 380 VKSMTKIYNDPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTDRFLDGTGTRFAW 433
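The percentages in the HSP header are the match counts divided by the alignment length. A small sketch using the counts reported for this hit (the `percent` helper is a hypothetical name):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100 * matches / aligned_length, 2)

identity = percent(312, 414)   # identical residues / alignment length
positives = percent(363, 414)  # identical or conservatively substituted residues
print(identity, positives)  # 75.36 87.68
```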

BLAST of Cp4.1LG12g10330 vs. Swiss-Prot
Match: RUS6_ARATH (Protein root UVB sensitive 6 OS=Arabidopsis thaliana GN=RUS6 PE=2 SV=1)

HSP 1 Score: 174.5 bits (441), Expect = 2.5e-42
Identity = 110/358 (30.73%), Positives = 193/358 (53.91%), Query Frame = 1

Query: 58  VDSFLNTFF-PSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGL--RPTAA 117
           V SFL ++  P G+P SVNE Y+ Y  +RAL+H    A+ V +TQ+LL + G     +A+
Sbjct: 104 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 163

Query: 118 QATVVSWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLF 177
            A  ++W+LKDG   VGK++ +  G + D + K+ R   D+L + GAG+E+ +   PHLF
Sbjct: 164 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 223

Query: 178 LQMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQL 237
           L +A   N  K +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  + +
Sbjct: 224 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSILI 283

Query: 238 ASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPA 297
           +    S +        LLS  ++    +++R+  ++TLN  R  + V  F+K+GR+PS  
Sbjct: 284 SKRNPSLV----TTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 343

Query: 298 DLRYHEDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVL--NQTHKWVD 357
           +    E  +F    ++D   +   R       PS  + +K  F +E++++  + T   V 
Sbjct: 344 EGNIQEK-IFTFPWVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVY 403

Query: 358 MVLEHDASGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAK 411
            +L+H A+ +D L+    AA+ A++       +      + E+++  F P   EL+++
Sbjct: 404 ALLKHQANSDDILK----AAFHAHVLLHFMNQSKDGNPRSVEQLDPAFAPTEYELESR 452

BLAST of Cp4.1LG12g10330 vs. Swiss-Prot
Match: RUS1_ARATH (Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana GN=RUS1 PE=1 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 5.6e-34
Identity = 80/240 (33.33%), Positives = 135/240 (56.25%), Query Frame = 1

Query: 67  PSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ-ATVVSWVLKD 126
           P G+P SV   YL Y+ +R +Q + S    VL+TQSLL+A GL   A   A  ++WVLKD
Sbjct: 200 PEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKD 259

Query: 127 GMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAK 186
           G+ ++ K++ S  G   D  PK WR+ AD+L +   G+E+++P+ P  F+ +       +
Sbjct: 260 GIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGR 319

Query: 187 GMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGK 246
             A +   ATR    + FA + N +++ AKGEA   +   VGI  G+ +A+ I +S    
Sbjct: 320 SAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTSTSLA 379

Query: 247 LVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFP 306
           L A  +++ +H+Y  ++  +   + TLNP R +++ ++++ SG+ P   ++   E L FP
Sbjct: 380 LAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPL-FP 438

BLAST of Cp4.1LG12g10330 vs. Swiss-Prot
Match: RUS3_ARATH (Protein root UVB sensitive 3 OS=Arabidopsis thaliana GN=RUS3 PE=2 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 1.5e-31
Identity = 100/394 (25.38%), Positives = 193/394 (48.98%), Query Frame = 1

Query: 46  IIDDSRPAIQRVVDSF-------LNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVL 105
           I   S  +IQR  + F       L  F P G+P SV   Y+ +  +  LQ +++    +L
Sbjct: 31  ITASSSLSIQRSANRFNHVWRRVLQAFVPEGFPGSVTPDYVGFQLWDTLQGLSTYTKMML 90

Query: 106 STQSLLFAAGLRPTAAQA--TVVSWVLKDGMQHVGKLICSNL-GTRMDSEPKRWRVIADV 165
           STQ+LL A G+   +A        W L+D    +G ++ +   G+ +DS  K WR++AD+
Sbjct: 91  STQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADL 150

Query: 166 LYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAK 225
           + D G  ++++SPL P  F+ +  LG+ ++    VA+ ATR  +   FA + N +D+ AK
Sbjct: 151 MNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAK 210

Query: 226 GEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQ 285
             +  T+  ++G+  G+ LA     +     ++   L++ H+Y     +R   +N+LN +
Sbjct: 211 EGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFE 270

Query: 286 RTAMIVADFVKSGRIPSPADLRYHED-LVFPGRLIEDAGSVKVGRALHEVIKPSKL--VE 345
           R+++++  F+++G++ SP  +   E  L      +    S  + + +   ++ S L  ++
Sbjct: 271 RSSILLTHFIQTGQVLSPEQVSSMEGVLPLWATSLRSTNSKPLHKRVQLGVRVSSLPRLD 330

Query: 346 MKQM--------FPEEKFVLNQTHKWVDMVLEHDASGEDALRGWLVAAYTANIKGPSHEP 405
           M Q+        +   K++L      V ++L  D+   D L+ ++ A   AN+     E 
Sbjct: 331 MLQLLNGVGASSYKNAKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVLANLM----EK 390

Query: 406 TASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFL 419
           + S   E    ++  +   + +L++ GW T+R L
Sbjct: 391 STSFYSEGEAWIDKHYDELLHKLRSGGWKTERLL 420

BLAST of Cp4.1LG12g10330 vs. Swiss-Prot
Match: RUS1_RAT (RUS1 family protein C16orf58 homolog OS=Rattus norvegicus PE=2 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 2.0e-31
Identity = 104/379 (27.44%), Positives = 187/379 (49.34%), Query Frame = 1

Query: 63  NTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ--ATVVS 122
           +   P G+P SV+  YL+Y  + ++Q   S+    L+TQ++L   G+    A   A   +
Sbjct: 73  SVLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATST 132

Query: 123 WVLKDGMQHVGKLICSNL-GTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAG 182
           W++KD    +G++I +   G+++D   K+WR+ AD+L D    LE+++P+ P  F     
Sbjct: 133 WLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVS 192

Query: 183 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTIC 242
             N AK +  VA  ATR  +    A+  N++D+ AK  +  T+ N+ G+   L +   + 
Sbjct: 193 TSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVS 252

Query: 243 SSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYH 302
             +   L    LL+ +H+Y     +RA  + TLN  R  +++  F++ G +  PA     
Sbjct: 253 DCLSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQM 312

Query: 303 EDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFP--EEKFVL--NQTHKWVDMVL 362
           E L + G     + S+ +G  LH ++  S + E+KQ+    +E ++L  NQ+   V + L
Sbjct: 313 EPL-WTG--FWPSLSLSLGVPLHHLV--SSVSELKQLVEGHQEPYLLCWNQSQNQVQVAL 372

Query: 363 EHDASGEDALR----GWLVAAY---------TANIK-----GPSHEPTASVLLEAYEKMN 417
              A  E  LR    G ++ A           A ++     GP +E +  ++ E ++ ++
Sbjct: 373 SQVAGPETVLRAATHGLILGALQEDGPLPGELAELRDMVQAGPKNE-SWILVRETHQVLD 432

BLAST of Cp4.1LG12g10330 vs. TrEMBL
Match: K4C001_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 688.7 bits (1776), Expect = 4.4e-195
Identity = 330/429 (76.92%), Positives = 386/429 (89.98%), Query Frame = 1

Query: 1   MDFLNKFNV-RKDA-EKSPSGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVV 60
           M  L+K  + RK++ E+ P  P+SW+EIS S+SR+ QFQPDG+LSVK++DDSRPA QRV+
Sbjct: 1   MQILDKIKMQRKESDERPPELPISWVEISNSISRQYQFQPDGKLSVKMVDDSRPAAQRVM 60

Query: 61  DSFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATV 120
           +SFLN FFPSGYPYSVNEGY+RYTQFRALQH TSA+LSVLSTQSLLFAAGLRPT AQAT 
Sbjct: 61  ESFLNKFFPSGYPYSVNEGYMRYTQFRALQHFTSASLSVLSTQSLLFAAGLRPTPAQATA 120

Query: 121 VSWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMA 180
           VSW+L+DGMQHVGKLICSNLG RMDSEPKRWR++ADVLYDFG GLEV+SPLCPHLFL++A
Sbjct: 121 VSWILRDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDFGTGLEVMSPLCPHLFLEVA 180

Query: 181 GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTI 240
           GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAG+ LASTI
Sbjct: 181 GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGAGIHLASTI 240

Query: 241 CSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRY 300
           CSS+QGKLV APLLS++H+Y V E+MRA P+NTLNPQRTAMIVADFVK+GRI SPADLRY
Sbjct: 241 CSSMQGKLVVAPLLSVIHIYSVCEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 300

Query: 301 HEDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHD 360
            EDL+FPGRLIEDAG VKVGR+LHEV++PSKL + K+ F EEKF+LN   +W DM+LEH+
Sbjct: 301 REDLLFPGRLIEDAGKVKVGRSLHEVVRPSKLKQFKEAFLEEKFLLNHGSRWTDMILEHN 360

Query: 361 ASGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFL 420
           A+GEDALRGWLVAAY ++++   HEP+A++L EAY+KMN  F+PF++ELQAKGWHTDRFL
Sbjct: 361 ATGEDALRGWLVAAYASDMERLVHEPSANILQEAYDKMNSTFSPFLAELQAKGWHTDRFL 420

Query: 421 DGVGTRFAW 428
           DG G RFA+
Sbjct: 421 DGTGNRFAF 429

BLAST of Cp4.1LG12g10330 vs. TrEMBL
Match: B9H8L8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s24540g PE=4 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 7.5e-195
Identity = 328/428 (76.64%), Positives = 383/428 (89.49%), Query Frame = 1

Query: 1   MDFLNKFNV-RKDAEKSPSGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVD 60
           M+ L+K  + +K+ +K+P  PV WIE S+SVSR  QF+PDGQLS+K++DD+RP  +RVV+
Sbjct: 1   MNLLDKIKMQKKEPDKTPEIPVYWIETSDSVSRHFQFEPDGQLSMKVVDDARPVYRRVVE 60

Query: 61  SFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVV 120
           SFLN FFPSGYPYSVNEGYLRYTQFRALQH +SAALSVLSTQSLLFAAGLRPT AQAT V
Sbjct: 61  SFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATAV 120

Query: 121 SWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAG 180
           SW+LKDGMQH GKLICSNLG RMDSEPKRWR++ADVLYD G GLEV+SPLCPHLFL++AG
Sbjct: 121 SWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFLEVAG 180

Query: 181 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTIC 240
           LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G G+QLAST+C
Sbjct: 181 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLASTVC 240

Query: 241 SSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYH 300
           SS+QGK VA PLLSIVHV CV+E+MRATP+NTLNPQRTAM+VADFVK+G+I SPADLRYH
Sbjct: 241 SSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPADLRYH 300

Query: 301 EDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDA 360
           EDL+FPGRLIE+AG+VKVG+ALH  ++PSKL E+K++FP EKF+L+  +KW D+VLE +A
Sbjct: 301 EDLLFPGRLIENAGNVKVGQALHRAVRPSKLRELKEIFPGEKFILSPGNKWTDLVLEQNA 360

Query: 361 SGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLD 420
           SGEDALR WLVAAY +++K  SHE T+  L +AYEKMN VF PF+SELQAKGWHTDRFLD
Sbjct: 361 SGEDALRAWLVAAYASSMKKSSHESTSVTLQDAYEKMNSVFDPFLSELQAKGWHTDRFLD 420

Query: 421 GVGTRFAW 428
           G G+RF+W
Sbjct: 421 GTGSRFSW 428

BLAST of Cp4.1LG12g10330 vs. TrEMBL
Match: A0A0B0NUH9_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_00350 PE=4 SV=1)

HSP 1 Score: 680.2 bits (1754), Expect = 1.6e-192
Identity = 323/408 (79.17%), Positives = 374/408 (91.67%), Query Frame = 1

Query: 20  PVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVDSFLNTFFPSGYPYSVNEGYL 79
           PV W+E S++VSRR +F+PDG LSVK+++DSRP   RVV+SFLN FFPSGYPYSVNEGYL
Sbjct: 68  PVYWLETSDTVSRRYEFEPDGYLSVKVVNDSRPVYHRVVESFLNKFFPSGYPYSVNEGYL 127

Query: 80  RYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVVSWVLKDGMQHVGKLICSNLG 139
           RYTQFRALQH+TSAALSVLSTQSLLFAAGLRPT AQAT VSW+LKDGMQH+GKLICSNLG
Sbjct: 128 RYTQFRALQHMTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHMGKLICSNLG 187

Query: 140 TRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMAVVAARATRLPI 199
            RMDSEPKRWR++ADVLYD G GLEV+SPLCPHLFL++AGLGNFAKGMAVVAARATRLPI
Sbjct: 188 ARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFLEVAGLGNFAKGMAVVAARATRLPI 247

Query: 200 YSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYC 259
           YSSFAKEGNLSDLFAKGEAISTLFNVVG+G G+QLAST+CSS+QGKL+A PLLSI+HV+ 
Sbjct: 248 YSSFAKEGNLSDLFAKGEAISTLFNVVGLGVGIQLASTVCSSMQGKLIAGPLLSIIHVFS 307

Query: 260 VVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFPGRLIEDAGSVKVGR 319
           VVE+MRA PINTLNPQRTAMIVADF+K+G++ SPADLRY EDL+FPGRLIEDAG+VKVGR
Sbjct: 308 VVEEMRAAPINTLNPQRTAMIVADFLKTGKVSSPADLRYREDLLFPGRLIEDAGNVKVGR 367

Query: 320 ALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDASGEDALRGWLVAAYTANIKG 379
           ALH+V+KPSKL E K++FPEEKFVL+  +KW DM+LEH+A+ EDALRGWLVAAY  +++ 
Sbjct: 368 ALHKVVKPSKLQEWKEIFPEEKFVLSHGNKWTDMLLEHNATAEDALRGWLVAAYATSMEK 427

Query: 380 PSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLDGVGTRFAW 428
             HEP+AS+L +AY+KMN +FTPF+ ELQAKGWHTDRFLDG G+RFA+
Sbjct: 428 SFHEPSASMLQDAYDKMNSIFTPFLCELQAKGWHTDRFLDGTGSRFAF 475

BLAST of Cp4.1LG12g10330 vs. TrEMBL
Match: A0A0D2M2Y2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G253600 PE=4 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 6.0e-192
Identity = 322/408 (78.92%), Positives = 373/408 (91.42%), Query Frame = 1

Query: 20  PVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVDSFLNTFFPSGYPYSVNEGYL 79
           PV W+E S++VSRR +F+PDG LSVK+++DSRP   RVV+SFLN FFPSGYPYSVNEGYL
Sbjct: 26  PVYWLETSDTVSRRYEFEPDGYLSVKVVNDSRPVYHRVVESFLNKFFPSGYPYSVNEGYL 85

Query: 80  RYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVVSWVLKDGMQHVGKLICSNLG 139
           RYTQFRALQH+TSAALSVLSTQSLLFAAGLRPT AQAT VSW+LKDGMQH+GKLICSNLG
Sbjct: 86  RYTQFRALQHMTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHMGKLICSNLG 145

Query: 140 TRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMAVVAARATRLPI 199
            RMDSEPKRWR++ADVLYD G GLEV+SPLCPHLFL++AGLGNFAKGMAVVAARATRLPI
Sbjct: 146 ARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFLEVAGLGNFAKGMAVVAARATRLPI 205

Query: 200 YSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYC 259
           YSSFAKEGNLSDLFAKGEAISTLFNVVG+G G+ LAST+CSS+QGKL+A PLLSI+HV+ 
Sbjct: 206 YSSFAKEGNLSDLFAKGEAISTLFNVVGLGVGIHLASTVCSSMQGKLIAGPLLSIIHVFS 265

Query: 260 VVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFPGRLIEDAGSVKVGR 319
           VVE+MRA PINTLNPQRTAMIVADF+K+G++ SPADLRY EDL+FPGRLIEDAG+VKVGR
Sbjct: 266 VVEEMRAAPINTLNPQRTAMIVADFLKTGKVSSPADLRYREDLLFPGRLIEDAGNVKVGR 325

Query: 320 ALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDASGEDALRGWLVAAYTANIKG 379
           ALH+V+KPSKL E K+ FPEEKFVL+  +KW DM+LEH+A+ EDALRGWLVAAY  +++ 
Sbjct: 326 ALHKVVKPSKLQEWKETFPEEKFVLSHGNKWTDMLLEHNATAEDALRGWLVAAYATSMEK 385

Query: 380 PSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLDGVGTRFAW 428
             HEP+AS+L +AY+KMN +FTPF++ELQAKGWHTDRFLDG G+RFA+
Sbjct: 386 SFHEPSASMLQDAYDKMNSIFTPFLNELQAKGWHTDRFLDGTGSRFAF 433

BLAST of Cp4.1LG12g10330 vs. TrEMBL
Match: A0A067JUM8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17696 PE=4 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 6.0e-192
Identity = 331/419 (79.00%), Positives = 372/419 (88.78%), Query Frame = 1

Query: 10  RKDAEKSP-SGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVDSFLNTFFPS 69
           +K+ +KSP   PV WIE SESVSR  QFQPDG+LS+K++DD+R    +VV+SF N FFPS
Sbjct: 3   KKEPDKSPIEVPVYWIETSESVSRHFQFQPDGRLSMKVVDDARSTFHKVVESFQNKFFPS 62

Query: 70  GYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVVSWVLKDGMQ 129
           GYPYSVNEGYLRYTQFRALQH +SAALSVLSTQSLLFAAGLRPT AQAT +SWVLKDGMQ
Sbjct: 63  GYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATAISWVLKDGMQ 122

Query: 130 HVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMA 189
           HVGKLICSNLG RMDSEPKRWR++ADVLYD G GLEV+SPLCPHLFL++AGLGNFAKGMA
Sbjct: 123 HVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFLEVAGLGNFAKGMA 182

Query: 190 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVA 249
           VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAG+QLASTICSSIQGKLV 
Sbjct: 183 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLASTICSSIQGKLVV 242

Query: 250 APLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFPGRL 309
            PLLS+VHVY V+E+MRA P+NTLNPQRTAM+VADFVK+G+I SPADLRY EDLVFPGRL
Sbjct: 243 GPLLSVVHVYSVIEEMRAAPVNTLNPQRTAMVVADFVKTGKISSPADLRYREDLVFPGRL 302

Query: 310 IEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDASGEDALRGW 369
           I DAG+VKVGRALH+V  PSKL E+K +FPEEKF+LN+  KW DMVLE +ASGEDALRGW
Sbjct: 303 IGDAGNVKVGRALHKVFTPSKLREVKDIFPEEKFLLNRGTKWTDMVLEQNASGEDALRGW 362

Query: 370 LVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLDGVGTRFAW 428
           LVAAY AN+   S + +ASVL +AY+KMN  F  F++ELQAKGWHTDRFLDG G+RFAW
Sbjct: 363 LVAAYAANMDASSQKSSASVLQDAYDKMNSTFDSFLTELQAKGWHTDRFLDGTGSRFAW 421

BLAST of Cp4.1LG12g10330 vs. TAIR10
Match: AT2G31190.1 (AT2G31190.1 Protein of unknown function, DUF647)

HSP 1 Score: 649.8 bits (1675), Expect = 1.2e-186
Identity = 312/414 (75.36%), Positives = 363/414 (87.68%), Query Frame = 1

Query: 15  KSPSG-PVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVDSFLNTFFPSGYPYS 74
           KSP   PV W E S+SVS R QFQ DG LS+K++DD+RP  Q++V+SFLN FFPSGYPYS
Sbjct: 20  KSPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQKMVESFLNKFFPSGYPYS 79

Query: 75  VNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVVSWVLKDGMQHVGKL 134
           VNEGYLRYTQFRALQH +SAALSVLSTQSLLFAAGLRPT AQATVVSW+LKDGMQHVGKL
Sbjct: 80  VNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 139

Query: 135 ICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMAVVAAR 194
           ICSNLG RMDSEPKRWR++ADVLYD G GLE++SPLCPHLFL+MAGLGNFAKGMA VAAR
Sbjct: 140 ICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEMAGLGNFAKGMATVAAR 199

Query: 195 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLS 254
           ATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV GIGAG+QLASTICSS++GKLV   +LS
Sbjct: 200 ATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLASTICSSMEGKLVVGSILS 259

Query: 255 IVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFPGRLIEDAG 314
           +VHVY VVEQMR  PINTLNPQRTA+IVA+F+K+G++PSP DLR+ EDL+FP R I+DAG
Sbjct: 260 VVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLRFQEDLMFPERPIQDAG 319

Query: 315 SVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDASGEDALRGWLVAAY 374
           +VKVGRALH+ +KPS++  +KQ+F EEKF+L+    W DMVLEHDA+GEDALRGWLVAAY
Sbjct: 320 NVKVGRALHKAVKPSEVQRLKQVFVEEKFLLSHGKSWTDMVLEHDATGEDALRGWLVAAY 379

Query: 375 TANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLDGVGTRFAW 428
             ++    ++P   +L +AY+KMNDVF PF+S++QAKGW+TDRFLDG GTRFAW
Sbjct: 380 VKSMTKIYNDPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTDRFLDGTGTRFAW 433

BLAST of Cp4.1LG12g10330 vs. TAIR10
Match: AT5G49820.1 (AT5G49820.1 Protein of unknown function, DUF647)

HSP 1 Score: 174.5 bits (441), Expect = 1.4e-43
Identity = 110/358 (30.73%), Positives = 193/358 (53.91%), Query Frame = 1

Query: 58  VDSFLNTFF-PSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGL--RPTAA 117
           V SFL ++  P G+P SVNE Y+ Y  +RAL+H    A+ V +TQ+LL + G     +A+
Sbjct: 104 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 163

Query: 118 QATVVSWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLF 177
            A  ++W+LKDG   VGK++ +  G + D + K+ R   D+L + GAG+E+ +   PHLF
Sbjct: 164 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 223

Query: 178 LQMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQL 237
           L +A   N  K +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  + +
Sbjct: 224 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSILI 283

Query: 238 ASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPA 297
           +    S +        LLS  ++    +++R+  ++TLN  R  + V  F+K+GR+PS  
Sbjct: 284 SKRNPSLV----TTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 343

Query: 298 DLRYHEDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVL--NQTHKWVD 357
           +    E  +F    ++D   +   R       PS  + +K  F +E++++  + T   V 
Sbjct: 344 EGNIQEK-IFTFPWVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVY 403

Query: 358 MVLEHDASGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAK 411
            +L+H A+ +D L+    AA+ A++       +      + E+++  F P   EL+++
Sbjct: 404 ALLKHQANSDDILK----AAFHAHVLLHFMNQSKDGNPRSVEQLDPAFAPTEYELESR 452

BLAST of Cp4.1LG12g10330 vs. TAIR10
Match: AT3G45890.1 (AT3G45890.1 Protein of unknown function, DUF647)

HSP 1 Score: 146.7 bits (369), Expect = 3.2e-35
Identity = 80/240 (33.33%), Positives = 135/240 (56.25%), Query Frame = 1

Query: 67  PSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ-ATVVSWVLKD 126
           P G+P SV   YL Y+ +R +Q + S    VL+TQSLL+A GL   A   A  ++WVLKD
Sbjct: 200 PEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKD 259

Query: 127 GMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAK 186
           G+ ++ K++ S  G   D  PK WR+ AD+L +   G+E+++P+ P  F+ +       +
Sbjct: 260 GIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGR 319

Query: 187 GMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGK 246
             A +   ATR    + FA + N +++ AKGEA   +   VGI  G+ +A+ I +S    
Sbjct: 320 SAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTSTSLA 379

Query: 247 LVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFP 306
           L A  +++ +H+Y  ++  +   + TLNP R +++ ++++ SG+ P   ++   E L FP
Sbjct: 380 LAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPL-FP 438

BLAST of Cp4.1LG12g10330 vs. TAIR10
Match: AT1G13770.1 (AT1G13770.1 Protein of unknown function, DUF647)

HSP 1 Score: 138.7 bits (348), Expect = 8.6e-33
Identity = 100/394 (25.38%), Positives = 193/394 (48.98%), Query Frame = 1

Query: 46  IIDDSRPAIQRVVDSF-------LNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVL 105
           I   S  +IQR  + F       L  F P G+P SV   Y+ +  +  LQ +++    +L
Sbjct: 31  ITASSSLSIQRSANRFNHVWRRVLQAFVPEGFPGSVTPDYVGFQLWDTLQGLSTYTKMML 90

Query: 106 STQSLLFAAGLRPTAAQA--TVVSWVLKDGMQHVGKLICSNL-GTRMDSEPKRWRVIADV 165
           STQ+LL A G+   +A        W L+D    +G ++ +   G+ +DS  K WR++AD+
Sbjct: 91  STQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADL 150

Query: 166 LYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAK 225
           + D G  ++++SPL P  F+ +  LG+ ++    VA+ ATR  +   FA + N +D+ AK
Sbjct: 151 MNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAK 210

Query: 226 GEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQ 285
             +  T+  ++G+  G+ LA     +     ++   L++ H+Y     +R   +N+LN +
Sbjct: 211 EGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFE 270

Query: 286 RTAMIVADFVKSGRIPSPADLRYHED-LVFPGRLIEDAGSVKVGRALHEVIKPSKL--VE 345
           R+++++  F+++G++ SP  +   E  L      +    S  + + +   ++ S L  ++
Sbjct: 271 RSSILLTHFIQTGQVLSPEQVSSMEGVLPLWATSLRSTNSKPLHKRVQLGVRVSSLPRLD 330

Query: 346 MKQM--------FPEEKFVLNQTHKWVDMVLEHDASGEDALRGWLVAAYTANIKGPSHEP 405
           M Q+        +   K++L      V ++L  D+   D L+ ++ A   AN+     E 
Sbjct: 331 MLQLLNGVGASSYKNAKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVLANLM----EK 390

Query: 406 TASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFL 419
           + S   E    ++  +   + +L++ GW T+R L
Sbjct: 391 STSFYSEGEAWIDKHYDELLHKLRSGGWKTERLL 420

BLAST of Cp4.1LG12g10330 vs. TAIR10
Match: AT5G01510.1 (AT5G01510.1 Protein of unknown function, DUF647)

HSP 1 Score: 132.9 bits (333), Expect = 4.7e-31
Identity = 103/371 (27.76%), Positives = 179/371 (48.25%), Query Frame = 1

Query: 66  FPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGL---------RPTAAQA 125
           FPSG+P SV++ YL Y  ++   ++T    +VL T SLL A G+            AA A
Sbjct: 120 FPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASA 179

Query: 126 TVVSWVLKDGMQHVGK-LICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFL 185
             + WV KDG+  +G+ LI    G+  D +PK+WR+ AD +   G+  ++ + L P  FL
Sbjct: 180 AAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFL 239

Query: 186 QMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLA 245
            +A  GN AK +A      +   I + FA  GNL ++ AK E       ++G+G G+ + 
Sbjct: 240 LLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILII 299

Query: 246 ST--ICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSP 305
            T  +  S    L+    + +VH++   + +     NT+N +R  +IV   V    +P  
Sbjct: 300 DTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVPGY 359

Query: 306 ADLRYHEDLVFPGRLIEDAGSVKVGRALHEVI----KPSKLVEMKQMFPEEKFV--LNQT 365
            D    E+++   R ++    +  G +L E+       SK+  + +M+ +EK++  LN+ 
Sbjct: 360 VDCNKRENILLWQRFMKP--RIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTLNKL 419

Query: 366 HKWVDMVLEH--DASGEDALRGWLVAAYTANIKGPSHEPTASV---LLEAYEKMNDVFTP 414
           +K  +  +    +A+  D LR    A +       S +   SV   L ++  +M++ F  
Sbjct: 420 NKDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEESFKDKDSVFHWLKQSLSEMDNKFDD 479

BLAST of Cp4.1LG12g10330 vs. NCBI nr
Match: gi|659100750|ref|XP_008451249.1| (PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis melo])

HSP 1 Score: 808.5 bits (2087), Expect = 5.5e-231
Identity = 403/428 (94.16%), Positives = 414/428 (96.73%), Query Frame = 1

Query: 1   MDFLNKFNVRK-DAEKSPSGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVD 60
           MD LNKFN RK D  KSPS PVSWIE+S+SVSRRCQFQPDG LSVKIIDDSRPAIQR+VD
Sbjct: 1   MDLLNKFNARKKDPVKSPSLPVSWIEVSDSVSRRCQFQPDGHLSVKIIDDSRPAIQRIVD 60

Query: 61  SFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVV 120
           SFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVV
Sbjct: 61  SFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVV 120

Query: 121 SWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAG 180
           SWVLKDGMQHVGKLICSNLG RMDSEPKRWRVIADVLYD GAGLEVISPLCPHLFL+MAG
Sbjct: 121 SWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFLEMAG 180

Query: 181 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTIC 240
           LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTIC
Sbjct: 181 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTIC 240

Query: 241 SSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYH 300
           SSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPAD+RY 
Sbjct: 241 SSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADIRYQ 300

Query: 301 EDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDA 360
           EDLVFPGRLIE+AG+VKVGRALHEVIKPSKLVEMKQ+FPEEKFVLNQ+ KWVDMVLEHDA
Sbjct: 301 EDLVFPGRLIEEAGNVKVGRALHEVIKPSKLVEMKQIFPEEKFVLNQSQKWVDMVLEHDA 360

Query: 361 SGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLD 420
           SGEDALRGWLVAAYTANIKGPSHEPTAS LLEAYEKMNDVFTPF+SELQ KGWHTDRFLD
Sbjct: 361 SGEDALRGWLVAAYTANIKGPSHEPTASALLEAYEKMNDVFTPFLSELQGKGWHTDRFLD 420

Query: 421 GVGTRFAW 428
           G G+RFAW
Sbjct: 421 GAGSRFAW 428

BLAST of Cp4.1LG12g10330 vs. NCBI nr
Match: gi|449462449|ref|XP_004148953.1| (PREDICTED: protein root UVB sensitive 2, chloroplastic [Cucumis sativus])

HSP 1 Score: 799.3 bits (2063), Expect = 3.3e-228
Identity = 399/428 (93.22%), Positives = 414/428 (96.73%), Query Frame = 1

Query: 1   MDFLNKFNVR-KDAEKSPSGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVD 60
           MD LNKF+ R KD EKSPS PVSWIE+S+SVSRRCQFQPDG LSVKIIDDSRPAIQR+VD
Sbjct: 1   MDLLNKFSARNKDPEKSPSLPVSWIEVSDSVSRRCQFQPDGHLSVKIIDDSRPAIQRIVD 60

Query: 61  SFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVV 120
           SFLNTFFPSGYPYSV+EGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVV
Sbjct: 61  SFLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVV 120

Query: 121 SWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAG 180
           SWVLKDGMQHVGKLICSNLG RMDSEPKRWRVIADVLYD GAGLEVISPLCPHLFL+MAG
Sbjct: 121 SWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFLEMAG 180

Query: 181 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTIC 240
           LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTIC
Sbjct: 181 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTIC 240

Query: 241 SSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYH 300
           SSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVK+GRIPSPAD+RY 
Sbjct: 241 SSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPADIRYQ 300

Query: 301 EDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDA 360
           EDLVFPGRLIE+AG+VKVGRALHEVIKPSKLVEMKQ+FP EKFVLNQ+ KWVDMVLEHDA
Sbjct: 301 EDLVFPGRLIEEAGNVKVGRALHEVIKPSKLVEMKQIFPGEKFVLNQSKKWVDMVLEHDA 360

Query: 361 SGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLD 420
           SGEDALRGWLVAAYT NIK PSHEPTASVLLEAYEKMNDVFTPF+SELQAKGW+TDRFLD
Sbjct: 361 SGEDALRGWLVAAYTTNIKEPSHEPTASVLLEAYEKMNDVFTPFLSELQAKGWYTDRFLD 420

Query: 421 GVGTRFAW 428
           G G+RFAW
Sbjct: 421 GAGSRFAW 428

BLAST of Cp4.1LG12g10330 vs. NCBI nr
Match: gi|565342089|ref|XP_006338194.1| (PREDICTED: protein root UVB sensitive 2, chloroplastic isoform X1 [Solanum tuberosum])

HSP 1 Score: 691.4 bits (1783), Expect = 9.8e-196
Identity = 331/429 (77.16%), Positives = 386/429 (89.98%), Query Frame = 1

Query: 1   MDFLNKFNV-RKDA-EKSPSGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVV 60
           M  L+K  + RK++ E+ P  P+SW+EIS S+SR+ QFQPDG+LSVK++DDSRPA QRV+
Sbjct: 1   MQMLDKIKMQRKESDERPPELPISWVEISNSISRQYQFQPDGKLSVKMVDDSRPAAQRVM 60

Query: 61  DSFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATV 120
           +SFLN FFPSGYPYSVNEGY+RYTQFRALQH TSAALSVLSTQSLLFAAGLRPT AQAT 
Sbjct: 61  ESFLNKFFPSGYPYSVNEGYMRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATA 120

Query: 121 VSWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMA 180
           VSW+L+DGMQHVGKLICSNLG RMDSEPKRWR++ADVLYDFG GLEV+SPLCPHLFL++A
Sbjct: 121 VSWILRDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDFGTGLEVMSPLCPHLFLEVA 180

Query: 181 GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTI 240
           GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G G+ LASTI
Sbjct: 181 GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGTGIHLASTI 240

Query: 241 CSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRY 300
           CSS+QGKLV APLLS++HVY V E+MRA P+NTLNPQRTAMIVADFVK+G+I SPADLRY
Sbjct: 241 CSSMQGKLVVAPLLSVIHVYSVCEEMRAAPVNTLNPQRTAMIVADFVKTGKISSPADLRY 300

Query: 301 HEDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHD 360
            EDL+FPGRLIEDAG VKVGR+LHEV++PSKL + K+ FPEEKF+LN   +W DM+LEH+
Sbjct: 301 REDLLFPGRLIEDAGKVKVGRSLHEVVRPSKLQQFKEAFPEEKFLLNHGSRWTDMILEHN 360

Query: 361 ASGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFL 420
           A+GEDALRGWLVAAY ++++   HEP+A++L EAY+KMN  F+PF++ELQAKGWHTDRFL
Sbjct: 361 ATGEDALRGWLVAAYASDMERLVHEPSANILQEAYDKMNSTFSPFLAELQAKGWHTDRFL 420

Query: 421 DGVGTRFAW 428
           DG G RFA+
Sbjct: 421 DGTGNRFAF 429

BLAST of Cp4.1LG12g10330 vs. NCBI nr
Match: gi|970030405|ref|XP_015076484.1| (PREDICTED: protein root UVB sensitive 2, chloroplastic isoform X1 [Solanum pennellii])

HSP 1 Score: 689.5 bits (1778), Expect = 3.7e-195
Identity = 330/429 (76.92%), Positives = 386/429 (89.98%), Query Frame = 1

Query: 1   MDFLNKFNV-RKDA-EKSPSGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVV 60
           M  L+K  + RK++ E+ P  P+SW+EIS S+SR+ QFQPDG+LSVK++DDSRPA QRV+
Sbjct: 1   MQILDKIKMQRKESDERPPELPISWVEISNSISRQYQFQPDGKLSVKMVDDSRPAAQRVM 60

Query: 61  DSFLNTFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATV 120
           +SFLN FFPSGYPYSVNEGY+RYTQFRALQH TSA+LSVLSTQSLLFAAGLRPT AQAT 
Sbjct: 61  ESFLNKFFPSGYPYSVNEGYMRYTQFRALQHFTSASLSVLSTQSLLFAAGLRPTPAQATA 120

Query: 121 VSWVLKDGMQHVGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMA 180
           VSW+L+DGMQHVGKLICSNLG RMDSEPKRWR++ADVLYDFG GLEV+SPLCPHLFL++A
Sbjct: 121 VSWILRDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDFGTGLEVMSPLCPHLFLEVA 180

Query: 181 GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTI 240
           GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAG+ LASTI
Sbjct: 181 GLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGAGIHLASTI 240

Query: 241 CSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRY 300
           CSS+QGKLV AP LS++H+Y V E+MRA P+NTLNPQRTAMIVADFVK+GRI SPADLRY
Sbjct: 241 CSSMQGKLVVAPFLSVIHIYSVCEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 300

Query: 301 HEDLVFPGRLIEDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHD 360
            EDL+FPGRLIEDAG VKVGR+LHEV++PSKL + K+ FPEEKF+LN   +W DM+LEH+
Sbjct: 301 REDLLFPGRLIEDAGKVKVGRSLHEVVRPSKLKKFKEAFPEEKFLLNHGSRWTDMILEHN 360

Query: 361 ASGEDALRGWLVAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFL 420
           A+GEDALRGWLVAAY ++++   HEP+A++L EAY+KMN  F+PF++ELQAKGWHTDRFL
Sbjct: 361 ATGEDALRGWLVAAYASDMERLVHEPSANILQEAYDKMNFTFSPFLAELQAKGWHTDRFL 420

Query: 421 DGVGTRFAW 428
           DG G RFA+
Sbjct: 421 DGTGNRFAF 429

BLAST of Cp4.1LG12g10330 vs. NCBI nr
Match: gi|971534504|ref|XP_015161485.1| (PREDICTED: protein root UVB sensitive 2, chloroplastic isoform X2 [Solanum tuberosum])

HSP 1 Score: 689.5 bits (1778), Expect = 3.7e-195
Identity = 326/418 (77.99%), Positives = 379/418 (90.67%), Query Frame = 1

Query: 10  RKDAEKSPSGPVSWIEISESVSRRCQFQPDGQLSVKIIDDSRPAIQRVVDSFLNTFFPSG 69
           ++  E+ P  P+SW+EIS S+SR+ QFQPDG+LSVK++DDSRPA QRV++SFLN FFPSG
Sbjct: 4   KESDERPPELPISWVEISNSISRQYQFQPDGKLSVKMVDDSRPAAQRVMESFLNKFFPSG 63

Query: 70  YPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQATVVSWVLKDGMQH 129
           YPYSVNEGY+RYTQFRALQH TSAALSVLSTQSLLFAAGLRPT AQAT VSW+L+DGMQH
Sbjct: 64  YPYSVNEGYMRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILRDGMQH 123

Query: 130 VGKLICSNLGTRMDSEPKRWRVIADVLYDFGAGLEVISPLCPHLFLQMAGLGNFAKGMAV 189
           VGKLICSNLG RMDSEPKRWR++ADVLYDFG GLEV+SPLCPHLFL++AGLGNFAKGMAV
Sbjct: 124 VGKLICSNLGARMDSEPKRWRILADVLYDFGTGLEVMSPLCPHLFLEVAGLGNFAKGMAV 183

Query: 190 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAA 249
           VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G G+ LASTICSS+QGKLV A
Sbjct: 184 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGTGIHLASTICSSMQGKLVVA 243

Query: 250 PLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKSGRIPSPADLRYHEDLVFPGRLI 309
           PLLS++HVY V E+MRA P+NTLNPQRTAMIVADFVK+G+I SPADLRY EDL+FPGRLI
Sbjct: 244 PLLSVIHVYSVCEEMRAAPVNTLNPQRTAMIVADFVKTGKISSPADLRYREDLLFPGRLI 303

Query: 310 EDAGSVKVGRALHEVIKPSKLVEMKQMFPEEKFVLNQTHKWVDMVLEHDASGEDALRGWL 369
           EDAG VKVGR+LHEV++PSKL + K+ FPEEKF+LN   +W DM+LEH+A+GEDALRGWL
Sbjct: 304 EDAGKVKVGRSLHEVVRPSKLQQFKEAFPEEKFLLNHGSRWTDMILEHNATGEDALRGWL 363

Query: 370 VAAYTANIKGPSHEPTASVLLEAYEKMNDVFTPFVSELQAKGWHTDRFLDGVGTRFAW 428
           VAAY ++++   HEP+A++L EAY+KMN  F+PF++ELQAKGWHTDRFLDG G RFA+
Sbjct: 364 VAAYASDMERLVHEPSANILQEAYDKMNSTFSPFLAELQAKGWHTDRFLDGTGNRFAF 421
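
The percentages in each HSP statistics line can be rechecked from the raw counts it reports. The following is a minimal sketch in Python, assuming the statistics line is available as plain text exactly as shown in the alignments above. The helper name `parse_hsp_stats` and the returned field names are hypothetical and are not part of BLAST or of this database's tooling.

```python
import re

# Hypothetical helper for illustration only; not part of BLAST itself.
# The pattern accepts "Positives" as well as the variant spelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and recomputed percentages from an HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, _, pos, pos_length, _ = m.groups()
    ident, length = int(ident), int(length)
    pos, pos_length = int(pos), int(pos_length)
    return {
        "identical": ident,
        "positive": pos,
        "aligned_length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * ident / length, 2),
        "positive_pct": round(100 * pos / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 100/394 (25.38%), Positives = 193/394 (48.98%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Recomputing from the counts, rather than trusting the printed percentage, gives a quick consistency check between an alignment block and the corresponding row of the summary tables.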

The following BLAST results are available for this feature:
Match Name          E-value   Identity (%)  Description
RUS2_ARATH          2.0e-185  75.36         Protein root UVB sensitive 2, chloroplastic OS=Arabidopsis thaliana GN=RUS2 PE=1...
RUS6_ARATH          2.5e-42   30.73         Protein root UVB sensitive 6 OS=Arabidopsis thaliana GN=RUS6 PE=2 SV=1
RUS1_ARATH          5.6e-34   33.33         Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana GN=RUS1 PE=1...
RUS3_ARATH          1.5e-31   25.38         Protein root UVB sensitive 3 OS=Arabidopsis thaliana GN=RUS3 PE=2 SV=1
RUS1_RAT            2.0e-31   27.44         RUS1 family protein C16orf58 homolog OS=Rattus norvegicus PE=2 SV=1

Match Name          E-value   Identity (%)  Description
K4C001_SOLLC        4.4e-195  76.92         Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1
B9H8L8_POPTR        7.5e-195  76.64         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s24540g PE=4 SV=1
A0A0B0NUH9_GOSAR    1.6e-192  79.17         Uncharacterized protein OS=Gossypium arboreum GN=F383_00350 PE=4 SV=1
A0A0D2M2Y2_GOSRA    6.0e-192  78.92         Uncharacterized protein OS=Gossypium raimondii GN=B456_001G253600 PE=4 SV=1
A0A067JUM8_JATCU    6.0e-192  79.00         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17696 PE=4 SV=1

Match Name          E-value   Identity (%)  Description
AT2G31190.1         1.2e-186  75.36         Protein of unknown function, DUF647
AT5G49820.1         1.4e-43   30.73         Protein of unknown function, DUF647
AT3G45890.1         3.2e-35   33.33         Protein of unknown function, DUF647
AT1G13770.1         8.6e-33   25.38         Protein of unknown function, DUF647
AT5G01510.1         4.7e-31   27.76         Protein of unknown function, DUF647

Match Name                       E-value   Identity (%)  Description
gi|659100750|ref|XP_008451249.1| 5.5e-231  94.16         PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis melo]
gi|449462449|ref|XP_004148953.1| 3.3e-228  93.22         PREDICTED: protein root UVB sensitive 2, chloroplastic [Cucumis sativus]
gi|565342089|ref|XP_006338194.1| 9.8e-196  77.16         PREDICTED: protein root UVB sensitive 2, chloroplastic isoform X1 [Solanum tuber...
gi|970030405|ref|XP_015076484.1| 3.7e-195  76.92         PREDICTED: protein root UVB sensitive 2, chloroplastic isoform X1 [Solanum penne...
gi|971534504|ref|XP_015161485.1| 3.7e-195  77.99         PREDICTED: protein root UVB sensitive 2, chloroplastic isoform X2 [Solanum tuber...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR006968  RUS_fam
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0009926      auxin polar transport
biological_process  GO:0010224      response to UV-B
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0009941      chloroplast envelope
cellular_component  GO:0005739      mitochondrion
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG12g10330.1  Cp4.1LG12g10330.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description            Source   Source Term    Source Description   Alignment
IPR006968  Root UVB sensitive family  PANTHER  PTHR12770      FAMILY NOT NAMED     coord: 10..422, score: 1.5E
IPR006968  Root UVB sensitive family  PFAM     PF04884        DUF647               coord: 55..287, score: 1.0
None       No IPR available           PANTHER  PTHR12770:SF5  SUBFAMILY NOT NAMED  coord: 10..422, score: 1.5E