Cp4.1LG04g02050 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG04g02050
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionU11/U12 small nuclear ribonucleoprotein 48 kDa protein, putative isoform 1
LocationCp4.1LG04 : 1031 .. 6634 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGCCGATGTCTCACTCTTCCCTCCATTTCACTCCGAGCAATGAACCCGTCTCTTCCATCTTCCAACCTAACCTTCCCCAACTTTCTGCTCTCAAACCCTAACCCTAACCCTAATTTCTCTCTTCCCGAATCTCGTGATCCTCCTCTGGACTTGTCTTCTTCTCTTTCCTCCCTCAGAAACCTCATCCATGTCGCCAACCAGACCCTTCAATCTCTCTCCTACCTCACCCTCATCCAAACCCCATCTGCTAAGCCTGACGATTCCGGTTTCGTTCAGTGCAATTTTGATCGGCGCCACCGTGTTCCGCCGCATTCTCTCTTCCACCATTCCCTTCTTTGTCCTTCCGCTCCTCCGCCTCGTATCGACCCGACTCAACTCCTCCAGTCCTTGCTTTACCCTCGAACGCTTCAATCGTCTCGTGAATTGGTTAACGAAAAGCGCTTCCGTCAAACGATGCCGGATTCTGATGCCGACCTTTGTTTCTCTCTCAATGATTATTCTGATGGAAGTTCCAATTTCTTCTATCTCGATTGCCCTGGCGTGGTTGCCTTGTCTAACCGGGATGAAATGTCTAAAGTCTTCACTCTCCCTCGTGTTTTGGCTGTTGAATGTGCTAATTTTGTTTGTGATGATGATGGTGATCGAGAGATGAAGTGTTCGTTGAAGGGGATTCGAATGCTGCCCTCTGATCTGTGGGCTCTTAGAAGTGAAGTAGAAATGTGGAATGACTACCCCAGTGTGTATTCGCATATTGTTATACGGTCCATATTGGGCTCAGAGATGGCCGTGGACAACCGTTTGAAGACATGGATCATTGCAAATTCTCCTCGGTATGGTGTTATTATTGATGTTGCTATGAGGGATCACATATTTTTGCTGTTTAGGCTGTGTTTTATGGCAATTTCAAAGGAAGCTATGGGATTTCAAATTGCGCTGGAAAATGGAAATGGAATGGAGGGTGGATCAGGGAACCTCAGTTTTAAGTGTCCGATTTTAGTACAAGTACTATTGTGGCTGGCATCTCAGCTTTCCGTATTGTATGGGGAGATGAACGGGAAGTTCTTTGCTATTAACATGCTTAGACAACGTATACTATATGCTGCATCGGGCTTGTCGCTTTTGCCATACGAACAAAAGCGAGTGGAGAGTCCAACTTTGGTAGAGGGCTCTCATAATCTAGTATCTAGTTTTAGTGACACACAGAGTGTGAATTTGAAAGAGTTGGATAAGAAGGTTATAAATAATGTCTATGTTACTGGAGAAGAAACGGTCAACTGCAGGGTGATTTTTGTGTCCCAAGTTGCTGCGGCTGTTGCAGCATTGCATGAACGTTTCCTACTTGAAGAAAAGATCAAAGCTTTACGCTTTTCTCATCTGCAAACTAAACATCAGCTGTATGTTACTTCTCCATATTCTGTTTTTTGGTTACTAGTTTTGATGAACTTGATGTTTTCCTTATTGCTATGATAGTGATATTCATTGATCCAACAAATTTATGCTTGTATCATTCATGAGGTGGAAAATAATTGTTAGGAGACTTAGAGGAAATGGGTTTAAATCCATGTCTTCCTACCTAGGATATAAAAATTTTACGAGTTTCTTTGGCAACCAAATATTGTAGGGTAGGATGTTTGTCCTGTTATATTAGAAAGTGAACTCAAATTTGTCCGAACTTTCATAGTTATTAAGAAAAAAAAGATGATCGAAGGGATTCTACAGTCTTGTGTCTAAAAGAGATGGCCGAACAGATTCTAGAATCTGTCTTTGGAGCCCTTGTCATAAATATGCTTAAACATTTTAAGGACATCATTTCTATTTATACTAGCTGTGTCTGAAAATTTTGTCAACAGAATTCTTAGCACCTGGGGATTTAAGTTGCCTAGCCACTTAACTGGCTTTTCAAACTCAAATTAATGGTTGAACCTAGCACTATCAGCTCTAGAAAGTGGACTTTAGTTTATGTTTTTTTTACAGCAATAATTCTTGGAATTTAGGGTTTAAATTCTTGAAGTAATCTAGAAGTAATAGAAGTTTTCCCCTTCTATTGAATACAATTAATCTTCATTTTTCTTTCAGTTGCAATTTTATGGAAAAGGATGTTCTGAGTTTTCATCTGCCTCTTTCATCCAATTTAATCTGCACGTTTTGTTCCACTTGTCTTCATGATTTCATCAAAAGAATTTTCGTACTAAGAAAAGGATCTCACTTTTCAAGTAGGCAACTGTTAGAAACAGACTAATCCTTTAACTCTTTTAGGGTTTCTGAACATAACTATATCTGTCAAAGAGCTTGTGAAGAGCGTAAAAGACGGTGCAATTATAGACCTATAATTGAGCATGATGGACTCCCAAAGCAGCAGTCTCATGATGAGGTATTTCTGAACTCTGTAATGGTCTAGTCATGAATTATGTAGTTGGTATAATCAAGCATGTTATTTTCAGTTATCCTTCCACCTGGATTACTCTTCCTACCAATTTTCTCCATTCATTTGTTGGTGATTTGAAGTCTCGCCGAATCTTCTTATTTACTGCCCTGAATGGTACATCATGTTCCGAAATTTTTTTTAAAAAATATATATAAAAAAAAGCTTTCAATTCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATATATATATATATATATATATATATTTTTTTTTTTTTTTAAAAAAATTAACTTCTTGGAACCAAATCAAGCTGGTGCCTCTATATACTTATTTTTTGAGATGGATGTTTTGCCGTTCTTTCCTTCATTTTGTGCATCTCCTCCCATTTTATTTTATGTTAAAAGAATATGTTTAGCAATAAATGATGAAGCAGGTGCTGATCATCTGCTTATTGTTCACTTACCCATGATGTTTCTTGGCTATATTTTATATCTTATGGGTTTAATTTTAATAAAAGCAGGATAGAGACAAGACCAAAACAAAAGAGGAATTGTTGGCTGAAGAAAGAGATTATAAACGTCGAAGAATGTCATACCGTGGGAAAAAGGCAAAGCGATCAACTTTACAGGTAAGCAAGCAGCTTGTTCAATTTTACGTCAGCCATTGTTTTTGTTATCTCTCGTCTCTATTGGTTATTAGCCTAGGGTCTAACTGGGTATGGCATGATGATTCCTTTTCTTATAGCCTTTTATGGAAAATTGACAAAGTTCTTAATTAAATGATAATGCTTATCAAGAACATTTGGTGGTAATATCTTCTTAGGCGTACATTGGTCACTTTTTAGAATATTATTCACAACTTTCAAGCTACTAAGATTGCAGCAACAATGGCCGAATTATGGAAGAGGTGAATCTGCAATGCTAAATAGACCATGAAACTATGTGATACCCATTATGTATTTTCGGTTCCTGAAATATGTTTTCTGACATCTGTTTTCCAACTTTTTCTATCAAACGTTTTTATGTTAGATGATATCCACACTCACGTGCTTACTAATTTTACCTTTCATGGAAGGTTACAAGAGATATTATTGAAGAATACATGGACGAAATTATGAAAGCTGGAGGGATTGGATACTTTGTGAAGGGAACTGAAGAGAGAGGGATAAAATCTGAACAACCAACTGATCATTACATTACAAGAGATAGTATTGCTGATGGGCACACAAAAGGAAGCAACAGCTCATATGGAGAAACTAGGCATCATACCTCGAGTCATTCCCGGAAGCAGTCTAACTATGATAACAGATATTCAGCATCCGAAAAGCCCCCAAAAGGTAGGTATGGGCACTATGGCTCTCCAGAAGATGAAAGGAAAAGTGTCGGTAAAGACAAATATGATCGAGAACACTATCATAGATTCTCAGATCGAAGCAGTAGTCCTAGCGAGTCACACAAATGGAAGAGATATCCAAGTGATCGAGATGATGAGGAGCCAGCAGAAACCAGGCATCATAAATCTCGAGGACTGTCTTCTAGCAGCTCTAATTATCATGGTTTTAGATCATCTTCATCCTCAAGATCGGGGGGTGATTCTAGTGCAAGGAAGGATGGTCACAAGTTAAGAGCTAGTGATAGTTGGAAAAGGAACACAGCTGATAATCATAGTTCAGATTCTTCGGTGCTTAATTCATTTAATGATAGATATACCCCTTCCAAGTGTCATGACCAATTAGAAGATGATCACTCCACTGGCAGAAGATATGTAAATCCAGACGTTTAGTTTGCTGATCCTCAAATGAGGGGCTGTCCCGCAAATTGTGCCATACTTCTTAACATTACCAATGTACCTTGTGGTAAGTCTTTCTTTCATGCTACATCCCATGTGCCTTTAACTGTGACTTCTCTGACTTCTTTTTTTAACATGTTTTTATTCACACAACCTTGCAGCATATGGGCTTCAGATATTTTATCCTCACGTGTATCCTTCATAGTATCCTTGAAGTTTATCCCATTTGCTTATACATTAAATCTTGCAAAACTTATTCTTTCCTTGTTTTTTTTTTGTCCCCTTAGCAGGGATATTTGCCCTGAATGTTGCAGTAGCATAATTTGAACTCATCGTCACTATATCAAAAATGCCAATTTTACATCATCCTTTTCTGTTCCCGGGTTTTCTAATATTTCTGGACTAGTCACAATGACAGATAAACAAAGTGGTCGTATTGGCGTGGATTTTCCATGTCGTACGATTTTTTCTTTCTGTCCTCGTTTGCTATGGCTTCCACTACTCTGGGAAATGTTTTGGTGATGCATAACTTGTCCTTTTCCTGTTCTCTTCAGTCATGATTGGTATCGGTCGTTTAAAGTCTTACAACGAATAATTAAAAAAACTATCGATGATTTCACTTCAAAAACTTACATGTAGGTCATTTAGAACGGTACAGATCATTAATCTAGCATCTTAACAAACATTTTACAGTACTACACTGCTTGAGTTCAACCTTATTATTGATCTTATATTAGTTCTAACTTTTGTATTACTTTTAGTGTTCTACCAACAGCTCACAAGATCAGAGAAGTATGAAGGCAGTCTGCAAATAAACTTGATCAGAAACCACAATCCACAGAGTTTCAGATCGTCGGTTGCAGGCCGTTCCATCCTACATCAGGCTGATAATTGGCTTCTTATCAGTTATAGAAGGGGTATGAATTCTGACTCCCTGGTATGTTTTTCTCCTTTTGGGCAGCCCATGTGAATATTAGATACATGTGTACAAAGCCTTTGGAGGAAAGGGATATGCAGAAATCTCTTACAATAAGATAGCTTAGGAGTTTGTATAGGTAGGTATGAAACAGTTAGCAGGTAAAGTAGGTAGTTTCTTCCATTTACTTACAAGTAGGAAATGTCTCAATTTCATACAAATGTCTTGGGAAATATTGTTAAAATCTGGATGTTGATAGATATTTTTGGAAAATTTATGAAAAGTAAAAAAAGAAAATCAAGAACATCAACTATGCAGTATTTCAATGAAGGATGATGAAATTGAATCTCCAATTGTAAATTGCAATTTAGTATAGAACAATATGGTTTGATTACATTAAGTTGTCGACAAACTATTGATGTATCAGATGTAATTGAAGTTTTATTTAGAAATAAACTAGTGATACATTGATAACATACTTGGAAGTTATATTATAGATATATTATTGATAA

mRNA sequence

CGCCGATGTCTCACTCTTCCCTCCATTTCACTCCGAGCAATGAACCCGTCTCTTCCATCTTCCAACCTAACCTTCCCCAACTTTCTGCTCTCAAACCCTAACCCTAACCCTAATTTCTCTCTTCCCGAATCTCGTGATCCTCCTCTGGACTTGTCTTCTTCTCTTTCCTCCCTCAGAAACCTCATCCATGTCGCCAACCAGACCCTTCAATCTCTCTCCTACCTCACCCTCATCCAAACCCCATCTGCTAAGCCTGACGATTCCGGTTTCGTTCAGTGCAATTTTGATCGGCGCCACCGTGTTCCGCCGCATTCTCTCTTCCACCATTCCCTTCTTTGTCCTTCCGCTCCTCCGCCTCGTATCGACCCGACTCAACTCCTCCAGTCCTTGCTTTACCCTCGAACGCTTCAATCGTCTCGTGAATTGGTTAACGAAAAGCGCTTCCGTCAAACGATGCCGGATTCTGATGCCGACCTTTGTTTCTCTCTCAATGATTATTCTGATGGAAGTTCCAATTTCTTCTATCTCGATTGCCCTGGCGTGGTTGCCTTGTCTAACCGGGATGAAATGTCTAAAGTCTTCACTCTCCCTCGTGTTTTGGCTGTTGAATGTGCTAATTTTGTTTGTGATGATGATGGTGATCGAGAGATGAAGTGTTCGTTGAAGGGGATTCGAATGCTGCCCTCTGATCTGTGGGCTCTTAGAAGTGAAGTAGAAATGTGGAATGACTACCCCAGTGTGTATTCGCATATTGTTATACGGTCCATATTGGGCTCAGAGATGGCCGTGGACAACCGTTTGAAGACATGGATCATTGCAAATTCTCCTCGGTATGGTGTTATTATTGATGTTGCTATGAGGGATCACATATTTTTGCTGTTTAGGCTGTGTTTTATGGCAATTTCAAAGGAAGCTATGGGATTTCAAATTGCGCTGGAAAATGGAAATGGAATGGAGGGTGGATCAGGGAACCTCAGTTTTAAGTGTCCGATTTTAGTACAAGTACTATTGTGGCTGGCATCTCAGCTTTCCGTATTGTATGGGGAGATGAACGGGAAGTTCTTTGCTATTAACATGCTTAGACAACGTATACTATATGCTGCATCGGGCTTGTCGCTTTTGCCATACGAACAAAAGCGAGTGGAGAGTCCAACTTTGGTAGAGGGCTCTCATAATCTAGTATCTAGTTTTAGTGACACACAGAGTGTGAATTTGAAAGAGTTGGATAAGAAGGTTATAAATAATGTCTATGTTACTGGAGAAGAAACGGTCAACTGCAGGGTGATTTTTGTGTCCCAAGTTGCTGCGGCTGTTGCAGCATTGCATGAACGTTTCCTACTTGAAGAAAAGATCAAAGCTTTACGCTTTTCTCATCTGCAAACTAAACATCAGCTGGTTTCTGAACATAACTATATCTGTCAAAGAGCTTGTGAAGAGCGTAAAAGACGGTGCAATTATAGACCTATAATTGAGCATGATGGACTCCCAAAGCAGCAGTCTCATGATGAGGATAGAGACAAGACCAAAACAAAAGAGGAATTGTTGGCTGAAGAAAGAGATTATAAACGTCGAAGAATGTCATACCGTGGGAAAAAGGCAAAGCGATCAACTTTACAGGTTACAAGAGATATTATTGAAGAATACATGGACGAAATTATGAAAGCTGGAGGGATTGGATACTTTGTGAAGGGAACTGAAGAGAGAGGGATAAAATCTGAACAACCAACTGATCATTACATTACAAGAGATAGTATTGCTGATGGGCACACAAAAGGAAGCAACAGCTCATATGGAGAAACTAGGCATCATACCTCGAGTCATTCCCGGAAGCAGTCTAACTATGATAACAGATATTCAGCATCCGAAAAGCCCCCAAAAGGTAGGTATGGGCACTATGGCTCTCCAGAAGATGAAAGGAAAAGTGTCGGTAAAGACAAATATGATCGAGAACACTATCATAGATTCTCAGATCGAAGCAGTAGTCCTAGCGAGTCACACAAATGGAAGAGATATCCAAGTGATCGAGATGATGAGGAGCCAGCAGAAACCAGGCATCATAAATCTCGAGGACTGTCTTCTAGCAGCTCTAATTATCATGGTTTTAGATCATCTTCATCCTCAAGATCGGGGGGTGATTCTAGTGCAAGGAAGGATGGTCACAAGTTAAGAGCTAGTGATAGTTGGAAAAGGAACACAGCTGATAATCATAGTTCAGATTCTTCGGTGCTTAATTCATTTAATGATAGATATACCCCTTCCAAGTGTCATGACCAATTAGAAGATGATCACTCCACTGGCAGAAGATATGTAAATCCAGACGTTTAGTTTGCTGATCCTCAAATGAGGGGCTGTCCCGCAAATTGTGCCATACTTCTTAACATTACCAATGTACCTTGTGCATATGGGCTTCAGATATTTTATCCTCACGTGTATCCTTCATAGTATCCTTGAAGTTTATCCCATTTGCTTATACATTAAATCTTGCAAAACTTATTCTTTCCTTGTTTTTTTTTTGTCCCCTTAGCAGGGATATTTGCCCTGAATGTTGCAGTAGCATAATTTGAACTCATCGTCACTATATCAAAAATGCCAATTTTACATCATCCTTTTCTGTTCCCGGGTTTTCTAATATTTCTGGACTAGTCACAATGACAGATAAACAAAGTGGTCGTATTGGCGTGGATTTTCCATGTCGTACGATTTTTTCTTTCTGTCCTCGTTTGCTATGGCTTCCACTACTCTGGGAAATGTTTTGGTGATGCATAACTTGTCCTTTTCCTGTTCTCTTCAGTCATGATTGGTATCGGTCGTTTAAAGTCTTACAACGAATAATTAAAAAAACTATCGATGATTTCACTTCAAAAACTTACATGTAGGTCATTTAGAACGGTACAGATCATTAATCTAGCATCTTAACAAACATTTTACAGTACTACACTGCTTGAGTTCAACCTTATTATTGATCTTATATTAGTTCTAACTTTTGTATTACTTTTAGTGTTCTACCAACAGCTCACAAGATCAGAGAAGTATGAAGGCAGTCTGCAAATAAACTTGATCAGAAACCACAATCCACAGAGTTTCAGATCGTCGGTTGCAGGCCGTTCCATCCTACATCAGGCTGATAATTGGCTTCTTATCAGTTATAGAAGGGGTATGAATTCTGACTCCCTGGTATGTTTTTCTCCTTTTGGGCAGCCCATGTGAATATTAGATACATGTGTACAAAGCCTTTGGAGGAAAGGGATATGCAGAAATCTCTTACAATAAGATAGCTTAGGAGTTTGTATAGGTAGGTATGAAACAGTTAGCAGGTAAAGTAGGTAGTTTCTTCCATTTACTTACAAGTAGGAAATGTCTCAATTTCATACAAATGTCTTGGGAAATATTGTTAAAATCTGGATGTTGATAGATATTTTTGGAAAATTTATGAAAAGTAAAAAAAGAAAATCAAGAACATCAACTATGCAGTATTTCAATGAAGGATGATGAAATTGAATCTCCAATTGTAAATTGCAATTTAGTATAGAACAATATGGTTTGATTACATTAAGTTGTCGACAAACTATTGATGTATCAGATGTAATTGAAGTTTTATTTAGAAATAAACTAGTGATACATTGATAACATACTTGGAAGTTATATTATAGATATATTATTGATAA

Coding sequence (CDS)

ATGAACCCGTCTCTTCCATCTTCCAACCTAACCTTCCCCAACTTTCTGCTCTCAAACCCTAACCCTAACCCTAATTTCTCTCTTCCCGAATCTCGTGATCCTCCTCTGGACTTGTCTTCTTCTCTTTCCTCCCTCAGAAACCTCATCCATGTCGCCAACCAGACCCTTCAATCTCTCTCCTACCTCACCCTCATCCAAACCCCATCTGCTAAGCCTGACGATTCCGGTTTCGTTCAGTGCAATTTTGATCGGCGCCACCGTGTTCCGCCGCATTCTCTCTTCCACCATTCCCTTCTTTGTCCTTCCGCTCCTCCGCCTCGTATCGACCCGACTCAACTCCTCCAGTCCTTGCTTTACCCTCGAACGCTTCAATCGTCTCGTGAATTGGTTAACGAAAAGCGCTTCCGTCAAACGATGCCGGATTCTGATGCCGACCTTTGTTTCTCTCTCAATGATTATTCTGATGGAAGTTCCAATTTCTTCTATCTCGATTGCCCTGGCGTGGTTGCCTTGTCTAACCGGGATGAAATGTCTAAAGTCTTCACTCTCCCTCGTGTTTTGGCTGTTGAATGTGCTAATTTTGTTTGTGATGATGATGGTGATCGAGAGATGAAGTGTTCGTTGAAGGGGATTCGAATGCTGCCCTCTGATCTGTGGGCTCTTAGAAGTGAAGTAGAAATGTGGAATGACTACCCCAGTGTGTATTCGCATATTGTTATACGGTCCATATTGGGCTCAGAGATGGCCGTGGACAACCGTTTGAAGACATGGATCATTGCAAATTCTCCTCGGTATGGTGTTATTATTGATGTTGCTATGAGGGATCACATATTTTTGCTGTTTAGGCTGTGTTTTATGGCAATTTCAAAGGAAGCTATGGGATTTCAAATTGCGCTGGAAAATGGAAATGGAATGGAGGGTGGATCAGGGAACCTCAGTTTTAAGTGTCCGATTTTAGTACAAGTACTATTGTGGCTGGCATCTCAGCTTTCCGTATTGTATGGGGAGATGAACGGGAAGTTCTTTGCTATTAACATGCTTAGACAACGTATACTATATGCTGCATCGGGCTTGTCGCTTTTGCCATACGAACAAAAGCGAGTGGAGAGTCCAACTTTGGTAGAGGGCTCTCATAATCTAGTATCTAGTTTTAGTGACACACAGAGTGTGAATTTGAAAGAGTTGGATAAGAAGGTTATAAATAATGTCTATGTTACTGGAGAAGAAACGGTCAACTGCAGGGTGATTTTTGTGTCCCAAGTTGCTGCGGCTGTTGCAGCATTGCATGAACGTTTCCTACTTGAAGAAAAGATCAAAGCTTTACGCTTTTCTCATCTGCAAACTAAACATCAGCTGGTTTCTGAACATAACTATATCTGTCAAAGAGCTTGTGAAGAGCGTAAAAGACGGTGCAATTATAGACCTATAATTGAGCATGATGGACTCCCAAAGCAGCAGTCTCATGATGAGGATAGAGACAAGACCAAAACAAAAGAGGAATTGTTGGCTGAAGAAAGAGATTATAAACGTCGAAGAATGTCATACCGTGGGAAAAAGGCAAAGCGATCAACTTTACAGGTTACAAGAGATATTATTGAAGAATACATGGACGAAATTATGAAAGCTGGAGGGATTGGATACTTTGTGAAGGGAACTGAAGAGAGAGGGATAAAATCTGAACAACCAACTGATCATTACATTACAAGAGATAGTATTGCTGATGGGCACACAAAAGGAAGCAACAGCTCATATGGAGAAACTAGGCATCATACCTCGAGTCATTCCCGGAAGCAGTCTAACTATGATAACAGATATTCAGCATCCGAAAAGCCCCCAAAAGGTAGGTATGGGCACTATGGCTCTCCAGAAGATGAAAGGAAAAGTGTCGGTAAAGACAAATATGATCGAGAACACTATCATAGATTCTCAGATCGAAGCAGTAGTCCTAGCGAGTCACACAAATGGAAGAGATATCCAAGTGATCGAGATGATGAGGAGCCAGCAGAAACCAGGCATCATAAATCTCGAGGACTGTCTTCTAGCAGCTCTAATTATCATGGTTTTAGATCATCTTCATCCTCAAGATCGGGGGGTGATTCTAGTGCAAGGAAGGATGGTCACAAGTTAAGAGCTAGTGATAGTTGGAAAAGGAACACAGCTGATAATCATAGTTCAGATTCTTCGGTGCTTAATTCATTTAATGATAGATATACCCCTTCCAAGTGTCATGACCAATTAGAAGATGATCACTCCACTGGCAGAAGATATGTAAATCCAGACGTTTAG

Protein sequence

MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIMKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPDV
BLAST of Cp4.1LG04g02050 vs. Swiss-Prot
Match: U1148_ARATH (U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Arabidopsis thaliana GN=SNRNP48 PE=2 SV=1)

HSP 1 Score: 416.8 bits (1070), Expect = 5.2e-115
Identity = 304/764 (39.79%), Postives = 421/764 (55.10%), Query Frame = 1

Query: 2   NPSL------PSSNLTFPNFLLSNPNP---NPN-FSLPESRDPPLDLSSSLSSLRNLIHV 61
           NP+L      P+SN   PNF    P P   NPN +S+  S  P  +LS +LSSL++L+  
Sbjct: 14  NPNLFYHYPPPNSN---PNFFFRPPPPPLQNPNNYSIVPSPPPIRELSGTLSSLKSLLSE 73

Query: 62  ANQTLQSLSY-LTLIQTPSAKPDDSG-FVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRID 121
             +TL SLS  L L  +   + D++G FV+C FD  H +PP +LF HSL CP+     +D
Sbjct: 74  CQRTLDSLSQNLALDHSSLLQKDENGCFVRCPFDSNHFMPPEALFLHSLRCPNT----LD 133

Query: 122 PTQLLQSLL-YPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGV 181
              LL+S   Y  TL+   EL         + + D DLC SL+D +D  SNFFY DCPG 
Sbjct: 134 LIHLLESFSSYRNTLELPCEL--------QLNNGDGDLCISLDDLADFGSNFFYRDCPGA 193

Query: 182 VALSNRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMW 241
           V  S  D   +  TLP VL+VEC++FV  D+  +++    K + +LPSDL A+++E++ W
Sbjct: 194 VKFSELDGKKRTLTLPHVLSVECSDFVGSDEKVKKIVLD-KCLGVLPSDLCAMKNEIDQW 253

Query: 242 NDYPSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAI 301
            D+PS YS  V+ SI+GS++   + L+ WI+ NS RYGVIID  MRDHIFLLFRLC  + 
Sbjct: 254 RDFPSSYSSSVLSSIVGSKVVEISALRKWILVNSTRYGVIIDTFMRDHIFLLFRLCLKSA 313

Query: 302 SKEAMGFQI---ALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAIN 361
            KEA GF++   A + G        + +F+CP+ +QVL WLASQL+VLYGE NGKFFA++
Sbjct: 314 VKEACGFRMESDATDVGEQKIMSCKSSTFECPVFIQVLSWLASQLAVLYGEGNGKFFALD 373

Query: 362 MLRQRILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVY- 421
           M +Q I+ +AS + L   E  R +   +VE          D +  N   + +K   N   
Sbjct: 374 MFKQCIVESASQVMLFRLEGTRSKCSGVVE-------DLDDARLRNKDVIMEKPFENSSG 433

Query: 422 -VTGEETVNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRA 481
              G+   + +VI VS+V+AAVAAL+ER LLEEKI+A+R++   T++Q  +E  ++  +A
Sbjct: 434 GECGKTLDSPQVISVSRVSAAVAALYERSLLEEKIRAVRYAQPLTRYQRAAELGFMTAKA 493

Query: 482 CEERKRRCNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRS 541
            EER RRC+YRPII+HDG P+Q+S ++D DK KT+EELLAEERDYKRRRMSYRGKK KR+
Sbjct: 494 DEERNRRCSYRPIIDHDGRPRQRSLNQDMDKMKTREELLAEERDYKRRRMSYRGKKVKRT 553

Query: 542 TLQVTRDIIEEYMDEIMKAGGIGYFVKG---TEERGIKSEQPTDHYITRDSIADGHTKGS 601
             QV  D+IEEY +EI  AGGIG F KG        I ++Q    +       D   KG 
Sbjct: 554 PRQVLHDMIEEYTEEIKLAGGIGCFEKGMPLQSRSPIGNDQKESDFGYSIPSTDKQWKGE 613

Query: 602 NSS---YGETRHHTSSHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDRE 661
           N +   Y       S   ++   YD+  S  ++  +  Y H    +D+ +   KDK+   
Sbjct: 614 NRADIEYPIDNRQNSDKVKRHDEYDSGSSQRQQSHRS-YKHSDRRDDKLRDRRKDKH--- 673

Query: 662 HYHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSG 721
                                 +DR D+E   T+ H   G   S  NY   R  SSS   
Sbjct: 674 ----------------------NDRRDDEFTRTKRHSIEG--ESYQNYRSSREKSSS--- 710

Query: 722 GDSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSK 742
                    +K +  D +     D  S      N F DRY P++
Sbjct: 734 --------DYKTKRDDPY-----DRRSQQPRNQNLFEDRYIPTE 710

BLAST of Cp4.1LG04g02050 vs. TrEMBL
Match: A0A0A0M085_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G662780 PE=4 SV=1)

HSP 1 Score: 757.3 bits (1954), Expect = 1.8e-215
Identity = 405/536 (75.56%), Postives = 445/536 (83.02%), Query Frame = 1

Query: 1   MNPSLPSSNL-TFPNFLLSNPNPNPNF---SLPESRDPPLDLSSSLSSLRNLIHVANQTL 60
           +NPSLP     TFPNFL  NPNPN +    S  +S+ PPLDLSSS SSL NLIH ANQTL
Sbjct: 4   INPSLPFPPYQTFPNFLPPNPNPNSHIHDSSHSQSQHPPLDLSSSFSSLNNLIHFANQTL 63

Query: 61  QSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQS 120
           QSLSYLT    PS   + S  + C+FDRRHRVPPHSLF HSLLCPSA    IDPTQL QS
Sbjct: 64  QSLSYLT----PSDFANHSHLLHCHFDRRHRVPPHSLFRHSLLCPSASLLPIDPTQLFQS 123

Query: 121 LLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDE 180
           LLYP+TL SSR+LVNE RF Q +PDSDADLCFSL DYSD +SNFFY+DCPGVVALSN DE
Sbjct: 124 LLYPQTLHSSRQLVNENRFSQVLPDSDADLCFSLTDYSDATSNFFYVDCPGVVALSNLDE 183

Query: 181 MSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYS 240
           MSKVFTLPRVLAV CANFV +D    EM  +L GIR+LPSDLW LRSEVE+WNDYPS YS
Sbjct: 184 MSKVFTLPRVLAVHCANFVGNDHF--EMNSTLNGIRILPSDLWNLRSEVEIWNDYPSKYS 243

Query: 241 HIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQ 300
            +V+RSILGSEMA+++ L TWII NSPRYGV+IDVA+RDHIFLLFRLCFMAI KEA+GFQ
Sbjct: 244 FVVLRSILGSEMALNSHLMTWIIENSPRYGVVIDVALRDHIFLLFRLCFMAIYKEALGFQ 303

Query: 301 IALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAAS 360
           +ALE GNGMEG SGN  FKCPIL+QVL+WLASQLSVLYGE NG FFA+NMLRQ IL AAS
Sbjct: 304 VALEKGNGMEGESGNSCFKCPILIQVLMWLASQLSVLYGETNGNFFAVNMLRQCILDAAS 363

Query: 361 GLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVI 420
           GL LL  EQK  ES TL EGSH+L  S SDTQSV + ELD+KV+NN +      VNC VI
Sbjct: 364 GLLLLQSEQKSTESLTLGEGSHDLEISCSDTQSVKMNELDQKVVNNGH-----AVNCSVI 423

Query: 421 FVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPI 480
            VSQVAAAVAALHERFLLEEKIKALRF+HLQTK+Q VSE+NYI QRACEERKR CNYRPI
Sbjct: 424 LVSQVAAAVAALHERFLLEEKIKALRFAHLQTKYQRVSEYNYISQRACEERKRCCNYRPI 483

Query: 481 IEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDII 533
           IEHDGLPKQQSH+ED +KTKT+EELLAEERDYKRRRMSYRGKKAKRSTLQVTRDI+
Sbjct: 484 IEHDGLPKQQSHNEDANKTKTREELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIL 528

BLAST of Cp4.1LG04g02050 vs. TrEMBL
Match: M5W6C9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001825mg PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 8.0e-155
Identity = 369/774 (47.67%), Postives = 475/774 (61.37%), Query Frame = 1

Query: 6   PSSNLTFPNFLL--SNPNPNPNF--SLPESRDP--------PLDLSSSLSSLRNLIHVAN 65
           P +    P+F L  SNPNPNPNF  S P++  P        P DLS+++SSL +L+  + 
Sbjct: 4   PPAQFAHPSFTLIPSNPNPNPNFFHSQPQNTQPVISTPPLPPPDLSTTISSLDSLVRDSY 63

Query: 66  QTLQSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQL 125
           QTL SLS L  +Q P+     S  + C F+  HRV PHSLF HSL CPS P P       
Sbjct: 64  QTLDSLSALLPLQNPNYDNPQSSLIPCPFNPHHRVHPHSLFSHSLHCPSHPHP------- 123

Query: 126 LQSLLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDY-SDGSSNFFYLDCPGVVALS 185
           L  L YP+TL+SS +   EK F QT+  S+ADL  SL  Y +D  SNFFY DCPGVV  S
Sbjct: 124 LPHLNYPKTLKSSDQSQTEKSFLQTLHGSEADLRLSLEHYYADFGSNFFYSDCPGVVNFS 183

Query: 186 NRDEMSKVFTLPRVLAVECANFVCDDDGDRE-MKCSLKGIRMLPSDLWALRSEVEMWNDY 245
             D ++++FTLP +L+VECANF+    G+RE M    +  R+LPS+LWA+++EVE WN++
Sbjct: 184 GLDGVNRMFTLPLILSVECANFI--GRGEREIMDFEKEWCRILPSELWAIKTEVEGWNEF 243

Query: 246 PSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKE 305
           P  YS+ V+ +ILG  +  +  + TWIIANSP+YG++IDVAMRDHIFLL RLC  AI +E
Sbjct: 244 PFTYSYRVLCAILGLGVVKEYDVGTWIIANSPQYGIVIDVAMRDHIFLLSRLCLKAILRE 303

Query: 306 AMGFQIALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRI 365
           A+            EG   +  F+CP LVQ L+WLASQLS+LYG  NGK F IN+L++ +
Sbjct: 304 ALS--------KVKEGDPESTHFECPTLVQALMWLASQLSILYGAQNGKLFVINVLKKCL 363

Query: 366 LYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVN-LKELDKKVINNVYVTGEET 425
           L AA G    P EQ+  E P L EG  NL ++ S  +    +K L      N  V  +E 
Sbjct: 364 LDAALGSLTFPLEQQVTEYPALEEGLLNLDANGSGVRDAEVMKPLSTHGGENSMV--KEN 423

Query: 426 VNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRR 485
           +  R +FVSQVAAAVAALHERFLLEEK+KA R S   T++Q + +H Y+ QRA EERK R
Sbjct: 424 IFSREVFVSQVAAAVAALHERFLLEEKLKAQRVSQTFTRYQRMVDHEYVSQRADEERKNR 483

Query: 486 CNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRD 545
             YRPII+HDGLP+QQS +++ +K KT+EELLAEERDYKRRRMSYRGKK KR+TLQV RD
Sbjct: 484 SQYRPIIDHDGLPRQQSCNQETNKPKTREELLAEERDYKRRRMSYRGKKVKRTTLQVMRD 543

Query: 546 IIEEYMDEIMKAGGIGYFVKGTEERG-IKSEQPTDHYITRDSIADGHTKGSNSSYG--ET 605
           IIEEYM+EI +AGGIG F KGTE  G    E P+   IT D  A+  TK +  S G   +
Sbjct: 544 IIEEYMEEIKQAGGIGCFEKGTEGEGSFPFELPSAPEITTD--AEKPTKSNYDSAGCSPS 603

Query: 606 RHHTSSHSR-----KQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRF 665
           R    SHS        ++ D     SEKP +   GH+   ED R S  +D+ D   + R 
Sbjct: 604 RSRKRSHSSYYAIDSVTSRDASAKGSEKPRRSLQGHHHYLEDHR-SDSRDRRDMVKHSRS 663

Query: 666 SDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSA 725
            +   +P  +H   R+  +RDD E  +T+H +    SSS S Y   RSSS S SG +S  
Sbjct: 664 PESRRNPGWAHGQTRHHRERDDLEVRKTKHREISRSSSSISKYRDNRSSSHSNSGENSKV 723

Query: 726 RKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRY 757
           R+D           R T +NH+S+S V N+F DRY P    D  E+D ST R+Y
Sbjct: 724 RRD-----------RYTYENHNSNSVVQNTFEDRYDPLISRDIYEEDLSTDRKY 744

BLAST of Cp4.1LG04g02050 vs. TrEMBL
Match: W9S254_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006471 PE=4 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 7.7e-142
Identity = 350/775 (45.16%), Postives = 469/775 (60.52%), Query Frame = 1

Query: 13  PNFLLSNPNPNPN-FSL--------PESRDP-PLDLSSSLSSLRNLIHVANQTLQSLSYL 72
           P+F    PNPNPN  SL        P++  P PLD S++LSSL  LIH + QTL++L  L
Sbjct: 11  PSFHFLPPNPNPNSVSLNAELQNPQPQNLTPQPLDFSATLSSLNGLIHHSEQTLRALFSL 70

Query: 73  TLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYPRT 132
             +Q P+ +   +G V C F+ +H + P SLF H L C S+P P      LL  L Y  T
Sbjct: 71  LPLQNPN-QAHSNGVVPCPFNSQHLMHPSSLFSHFLHCSSSPCPI--QFDLLPQLNYTET 130

Query: 133 LQSSRELVNEKRFRQTMPDSDADLCFSLND-YSDGSSNFFYLDCPGVVALSNRDEMSKVF 192
           L SS     E+ F QT+  SD++LCFSL+D YS    NFFY DC GVV LS  D +S+ F
Sbjct: 131 LNSSDSSKAERGFLQTLHGSDSELCFSLDDFYSQFGFNFFYNDCHGVVNLSALDGISRTF 190

Query: 193 TLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVIR 252
           TLP  L+VECANFV +++ +R+     K  ++LPS+LWA+R+E+E WN+YP+VYS+ V+ 
Sbjct: 191 TLPVFLSVECANFVSNNEEERK-SFERKNRKILPSELWAIRAEIEAWNEYPNVYSYRVLY 250

Query: 253 SILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALEN 312
           +ILG +      L  W+IANSP+YGV+ID AMRDHIFLL RLC  AI KEA+     + N
Sbjct: 251 AILGLDFISVCDLARWVIANSPQYGVVIDTAMRDHIFLLCRLCLKAILKEALNL---VGN 310

Query: 313 GNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSLL 372
            N ++    +++F CPILVQ L+WLASQLS+LYGEMNGKFFA+N+L+Q +L AASGL   
Sbjct: 311 CNSVKI-LNSMNFSCPILVQALMWLASQLSILYGEMNGKFFALNILKQCVLDAASGLVFF 370

Query: 373 PYEQKRVESPTLVEGSHNLVSSFSD--TQSVNLKELDKKVINNVYVTGEETVNCRVIFVS 432
             E+   E+P L E   +LV S  +    S   K L+ +    V    EE+    VI VS
Sbjct: 371 SLEKSVTETPALEEVPQSLVDSNGNGIKGSEVQKPLEIRRNGEVNSVVEESFTSGVILVS 430

Query: 433 QVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEH 492
           Q+AAA+AALHER LLE KIK LRF      +Q V+EH+Y+  RA EER++R  YRPIIEH
Sbjct: 431 QLAAAIAALHERSLLEGKIKGLRFHQPLNNYQRVAEHDYVSHRADEEREKRPQYRPIIEH 490

Query: 493 DGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEI 552
           DGLP+ +  +E+  KTKT+EELLAE+RDYKRRRMSYR KK KR+ L+V RDIIE++MDEI
Sbjct: 491 DGLPRLKVSNEETSKTKTREELLAEDRDYKRRRMSYRAKKVKRTNLEVMRDIIEDFMDEI 550

Query: 553 MKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSS------ 612
            +AGGIG F     E+G K+E   D  + + S A   T   N S  E R++ SS      
Sbjct: 551 KQAGGIGCF-----EKGAKAE---DTLLLKPSYASEITSDINMS--EKRNYDSSAAGDSP 610

Query: 613 -HSRKQSNYDNRYSAS----------EKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRF 672
              RKQS +D    A+          E+  +G YG +  P+D+++S+ +DK DRE+Y   
Sbjct: 611 DRHRKQSGFDYGARATTFKGYTHKDYEQTKRGLYGDH-EPKDDQRSISRDKRDREYY--- 670

Query: 673 SDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSA 732
             RS     S  W  +  ++++ E + T+ H+S+  SS  S Y+  R S+   +    S 
Sbjct: 671 -SRSPRHDRSSDWTHHRREQNEREGSGTKRHESKHSSSRKSKYYVNRLSTFGLTSEHKSK 730

Query: 733 RKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYV 758
            KD H          +  +N SS   + N+F DRY PS+ H   EDD  T  +YV
Sbjct: 731 SKDRH--------HGDRYENRSSALFLRNTFEDRYDPSESHGTYEDDIPTNSKYV 754

BLAST of Cp4.1LG04g02050 vs. TrEMBL
Match: A0A061F1I4_THECC (U11/U12 small nuclear ribonucleoprotein 48 kDa protein, putative isoform 1 OS=Theobroma cacao GN=TCM_026062 PE=4 SV=1)

HSP 1 Score: 492.7 bits (1267), Expect = 8.3e-136
Identity = 346/772 (44.82%), Postives = 459/772 (59.46%), Query Frame = 1

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNF-SLPESRD--PPLDLSSSLSSLRNLIHVANQTLQ 60
           + PSLPS N          PNPN    SLP++ +   P  LS++LSSL  L+ +++QTL 
Sbjct: 6   IQPSLPSQN----------PNPNSTIPSLPQNSNLNGPSSLSTTLSSLTALLSLSHQTLN 65

Query: 61  SLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSL 120
           S S LT    P+  P       C F+  H + P SLF HSL CPS     + P     +L
Sbjct: 66  SHSTLTKSLNPNLIP-------CPFNPNHLLAPESLFSHSLRCPSPQNLDLYPPNYRNTL 125

Query: 121 LYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDY-SDGSSNFFYLDCPGVVALSNRDE 180
           + P  L +      +  F+       ++LC SL++Y +D  SNFF  DCP  V L + D 
Sbjct: 126 IPPSNLHAQ-----DTHFQGIQC---SELCLSLDEYFADFGSNFFCKDCPAAVNLFDIDN 185

Query: 181 MSKVFTLPRVLAVECANFVCDDDGDREMKCSL-KGIRMLPSDLWALRSEVEMWNDYPSVY 240
             K FTLP  L+VEC NF  +   +RE   S  KG+R+L S LW +R EVE W DYP  Y
Sbjct: 186 SKKTFTLPGFLSVECVNF--EGFNEREGVVSEEKGLRVLASGLWEIRREVERWGDYPGSY 245

Query: 241 SHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGF 300
           S  VI +ILGS+M   + L+ WI+ANSPRYGV+ID  M DHI +L RLC  A+ +EA+G 
Sbjct: 246 SFNVICAILGSKMVKGSNLRKWIVANSPRYGVMIDGCMGDHIVVLVRLCLKAVVREAVGL 305

Query: 301 -QIALENGNGMEGG-SGNLS---FKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQR 360
            ++ +  G   E     NL    F+CPIL+QVL+WL SQLSVLYG++NGKFFAINM++Q 
Sbjct: 306 MEVEMGYGEAKEKEWDVNLQMRMFECPILLQVLVWLGSQLSVLYGDVNGKFFAINMIKQC 365

Query: 361 ILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEET 420
           +L  AS L L P E+K  +S  L + S +L ++    + + L+E  ++  N    T  ET
Sbjct: 366 VLEGASLLLLFPLEEKVTDSHNLGQESQSLDAN--GVKEIKLEETIEQS-NEPVETVNET 425

Query: 421 VNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRR 480
           +   VIFVSQVAAAVAALHER  LEEKIK LR     +++Q ++EH Y+ +RA  ERK+R
Sbjct: 426 IGVGVIFVSQVAAAVAALHERCFLEEKIKHLRGLQQLSRYQRMAEHAYVSERADAERKKR 485

Query: 481 CNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRD 540
            NYRPII+HDGLP+Q S + +   TKT+EE+LAEERDYKRRRMSYRGKK KR+ LQV RD
Sbjct: 486 PNYRPIIDHDGLPRQASSNGETSTTKTREEILAEERDYKRRRMSYRGKKLKRTALQVMRD 545

Query: 541 IIEEYMDEIMKAGGIGYFVKGTEERG-IKSEQPTDHYITRDSIADGHTKGSNSSYGETRH 600
           IIEEY +EI KAG IG FVKG EE G + SE P  +   R   AD H KG+ S   E   
Sbjct: 546 IIEEYTEEIKKAGRIGCFVKGVEEEGLLPSESPVPY--DRAVDADQHKKGT-SDISEAAR 605

Query: 601 HTSSHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSP 660
            + +H R++S+ D    ++      R GH+   ED R S+ K+K+  E++   S R  S 
Sbjct: 606 RSPNHCRRRSHDDQHTRSTRLEDSSRNGHHDLLEDSR-SMSKEKHRDEYHSGISKRYRSH 665

Query: 661 SESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSS-SSSRSGGDSSARKDGHK 720
             S + + +  +RDD E   + H++S G  SS S Y  ++SS S+S S  D   RKD  K
Sbjct: 666 GRSDEQRSHRRERDDAESTRSTHYES-GRRSSISKYKDYKSSYSASNSSDDFHVRKDDQK 725

Query: 721 LRASDSWKRNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 761
           L A D  +R   +NH+  S V N F+DRY PS+  D  EDD     +YV P+
Sbjct: 726 LDARDKNRRTLYENHTPGSWVQNGFDDRYNPSESDDMYEDDVFV--KYVRPE 740

BLAST of Cp4.1LG04g02050 vs. TrEMBL
Match: B9SHL0_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1122770 PE=4 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 7.8e-134
Identity = 342/764 (44.76%), Postives = 447/764 (58.51%), Query Frame = 1

Query: 1   MNPS-------LPSSNLTFPNFLLSN--PNPNPNFSLPESRDPPLDLSSSLSSLRNLIHV 60
           MNPS         +SN   PNF+  +    P P+        P LDLS++LSSL NL+ +
Sbjct: 1   MNPSSAPYPDYFQNSNYPIPNFVFHSLPQPPPPHIPTITPTTPILDLSTTLSSLANLLSL 60

Query: 61  ANQTLQSLSYLTLIQTPSAKPDDS-GFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDP 120
           + QT  SLS L        KP+ +  F+ C ++  H +PP SLF HSL CPS  P   DP
Sbjct: 61  SQQTRNSLSSLI-------KPNKNVKFISCPYNPNHLMPPESLFLHSLRCPS--PSFQDP 120

Query: 121 TQLLQSLLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLND-YSDGSSNFFYLDCPGVV 180
             L+ SL YP+TL S     +   F+ +    +A+LC SL+  Y++ SSNFFY DCPG V
Sbjct: 121 ISLVNSLHYPKTLNSQNP--SNPLFKNS---DNAELCLSLDGFYNEFSSNFFYKDCPGAV 180

Query: 181 ALSNRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWN 240
             S+ D  SK F LP VL+VECANFV   + D +    +   R+LPSDLW ++ EVE W 
Sbjct: 181 QFSDLDSSSKTFLLPAVLSVECANFVARIEEDIK-GFDINEFRILPSDLWVIKREVESWA 240

Query: 241 DYPSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAIS 300
           DYPS+YS+ V  +IL   +   + L+ WII NSPRYGV+IDV MRDHI +LFRLC  AI 
Sbjct: 241 DYPSMYSYAVFCAILRLNVIKGSDLRRWIIFNSPRYGVVIDVYMRDHISVLFRLCLNAIR 300

Query: 301 KEAMGFQIALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQ 360
           +EA  F        G +      SF CP+L QV +W+  QLSVLYGE N K FAI++ RQ
Sbjct: 301 REAFSFM-------GHQMNVKTSSFNCPVLSQVFMWIVPQLSVLYGERNAKCFAIHIFRQ 360

Query: 361 RILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKE-LDKKVINNVYVTGE 420
            IL  ++G+ L P E    E  T + G+       SD + + L+E L+  +        E
Sbjct: 361 CILDVSNGM-LFPLEANVKEISTELNGNG------SDVRDIKLQEPLEGSIKCETDAEVE 420

Query: 421 ETVNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERK 480
           E V+  VIFVSQVAA+VAALHER LLE KI+  R S    ++Q + EH+Y+ +RA E+RK
Sbjct: 421 EHVDKEVIFVSQVAASVAALHERALLEAKIQGTRESQSLPRYQRMIEHDYVSKRADEQRK 480

Query: 481 RRCNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVT 540
            R NYR II+HDGLP++Q  DED  KTKT+EE+LAEERDYKRRRMSYRGKK KR+TLQVT
Sbjct: 481 ERSNYRAIIDHDGLPRRQPIDEDMSKTKTREEILAEERDYKRRRMSYRGKKLKRTTLQVT 540

Query: 541 RDIIEEYMDEIMKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETR 600
           RD+IEEYMDEI +AGGIG F KG EE G+ S+ P     T   I  G  + S+S   E  
Sbjct: 541 RDLIEEYMDEIKQAGGIGCFEKGAEEEGMSSKPPFPSDFT---IGGGELRKSSSKSSEAI 600

Query: 601 HHTSSHSRKQSNYDNRYSAS----------EKPPKGRYGHYGSPEDERKSVGKDKYDREH 660
             T +H +KQS+ DN   ++          E+  K    H+   E +RK   +D++ R++
Sbjct: 601 RATPNHYQKQSHIDNNNRSATCKNASTQDYERWRKVHNRHHEHVEYQRKD-SRDRHGRDY 660

Query: 661 YHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSS-SSRSG 720
           Y      S+SP E HK      +R+D E   ++ H  R  SS  SNY  ++SS   S S 
Sbjct: 661 Y------SASP-ERHKGHGPLHEREDAEFNISKRHDKR--SSGKSNYQNYKSSCFGSDSA 720

Query: 721 GDSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSK 742
            D   +KDG KL   D   RN+   HSS   V N+F DRY P++
Sbjct: 721 NDPGVQKDGDKLDVRDWHLRNSYGTHSSTFLVKNAFEDRYDPAE 722

BLAST of Cp4.1LG04g02050 vs. TAIR10
Match: AT3G04160.2 (AT3G04160.2 unknown protein)

HSP 1 Score: 407.1 bits (1045), Expect = 2.3e-113
Identity = 302/766 (39.43%), Postives = 417/766 (54.44%), Query Frame = 1

Query: 2   NPSL------PSSNLTFPNFLLSNPNP---NPN-FSLPESRDPPLDLSSSLSSLRNLIHV 61
           NP+L      P+SN   PNF    P P   NPN +S+  S  P  +LS +LSSL++L+  
Sbjct: 14  NPNLFYHYPPPNSN---PNFFFRPPPPPLQNPNNYSIVPSPPPIRELSGTLSSLKSLLSE 73

Query: 62  ANQTLQSLSY-LTLIQTPSAKPDDSG-FVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRID 121
             +TL SLS  L L  +   + D++G FV+C FD  H +PP +LF HSL CP+     +D
Sbjct: 74  CQRTLDSLSQNLALDHSSLLQKDENGCFVRCPFDSNHFMPPEALFLHSLRCPNT----LD 133

Query: 122 PTQLLQSLL-YPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGV 181
              LL+S   Y  TL+   EL         + + D DLC SL+D +D  SNFFY DCPG 
Sbjct: 134 LIHLLESFSSYRNTLELPCEL--------QLNNGDGDLCISLDDLADFGSNFFYRDCPGA 193

Query: 182 VALSNRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMW 241
           V  S  D   +  TLP VL+VEC++FV  D+  +++    K + +LPSDL A+++E++ W
Sbjct: 194 VKFSELDGKKRTLTLPHVLSVECSDFVGSDEKVKKIVLD-KCLGVLPSDLCAMKNEIDQW 253

Query: 242 NDYPSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAI 301
            D+PS YS  V+ SI+GS++   + L+ WI+ NS RYGVIID  MRDHIFLLFRLC  + 
Sbjct: 254 RDFPSSYSSSVLSSIVGSKVVEISALRKWILVNSTRYGVIIDTFMRDHIFLLFRLCLKSA 313

Query: 302 SKEAMGFQI---ALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAIN 361
            KEA GF++   A + G        + +F+CP+ +QVL WLASQL+VLYGE NGKFFA++
Sbjct: 314 VKEACGFRMESDATDVGEQKIMSCKSSTFECPVFIQVLSWLASQLAVLYGEGNGKFFALD 373

Query: 362 MLRQRILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVY- 421
           M +Q I+ +AS + L   E  R +   +VE          D +  N   + +K   N   
Sbjct: 374 MFKQCIVESASQVMLFRLEGTRSKCSGVVE-------DLDDARLRNKDVIMEKPFENSSG 433

Query: 422 -VTGEETVNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSE--HNYICQ 481
              G+   + +VI VS+V+AAVAAL+ER LLEEKI+A+R++   T++Q +    H  +  
Sbjct: 434 GECGKTLDSPQVISVSRVSAAVAALYERSLLEEKIRAVRYAQPLTRYQRIISCLHLSLIP 493

Query: 482 RACEERKRRCNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAK 541
               ER RRC+YRPII+HDG P+Q+S ++D DK KT+EELLAEERDYKRRRMSYRGKK K
Sbjct: 494 HDVSERNRRCSYRPIIDHDGRPRQRSLNQDMDKMKTREELLAEERDYKRRRMSYRGKKVK 553

Query: 542 RSTLQVTRDIIEEYMDEIMKAGGIGYFVKG---TEERGIKSEQPTDHYITRDSIADGHTK 601
           R+  QV  D+IEEY +EI  AGGIG F KG        I ++Q    +       D   K
Sbjct: 554 RTPRQVLHDMIEEYTEEIKLAGGIGCFEKGMPLQSRSPIGNDQKESDFGYSIPSTDKQWK 613

Query: 602 GSNSS---YGETRHHTSSHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYD 661
           G N +   Y       S   ++   YD+  S  ++  +  Y H    +D+ +   KDK+ 
Sbjct: 614 GENRADIEYPIDNRQNSDKVKRHDEYDSGSSQRQQSHRS-YKHSDRRDDKLRDRRKDKH- 673

Query: 662 REHYHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSR 721
                                   +DR D+E   T+ H   G   S  NY   R  SSS 
Sbjct: 674 ------------------------NDRRDDEFTRTKRHSIEG--ESYQNYRSSREKSSS- 712

Query: 722 SGGDSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSK 742
                      +K +  D +     D  S      N F DRY P++
Sbjct: 734 ----------DYKTKRDDPY-----DRRSQQPRNQNLFEDRYIPTE 712

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: gi|659086077|ref|XP_008443753.1| (PREDICTED: uncharacterized protein LOC103487266 [Cucumis melo])

HSP 1 Score: 1044.6 bits (2700), Expect = 8.1e-302
Identity = 565/759 (74.44%), Postives = 632/759 (83.27%), Query Frame = 1

Query: 1   MNPSLP-SSNLTFPNFLLSNPNPNPNF---SLPESRDPPLDLSSSLSSLRNLIHVANQTL 60
           +NPSLP   N TFP+FL  NPNPN +    S  ES+ P LDL SS SSL  LIH+ANQTL
Sbjct: 4   VNPSLPFPPNQTFPSFLPPNPNPNSHIHDSSHSESQHPSLDLPSSFSSLNTLIHLANQTL 63

Query: 61  QSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQS 120
           +SLSYLT    PS   + S  + C FDRRHRVPPHSLF HSLLCPSA    IDPTQL QS
Sbjct: 64  ESLSYLT----PSVFANHSRLLHCYFDRRHRVPPHSLFRHSLLCPSASLHPIDPTQLFQS 123

Query: 121 LLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDE 180
           LLYP+TL SS +LVNE RF Q +PDSDADLCFSL DY+D +SNFFY DCPGVVALSN DE
Sbjct: 124 LLYPQTLHSSHQLVNENRFSQVLPDSDADLCFSLTDYTDATSNFFYADCPGVVALSNLDE 183

Query: 181 MSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYS 240
           MSKVFTLPRVLAV CANFV +D    EM  +L GIR+LPSDLW LRSEVE+WNDYP+ YS
Sbjct: 184 MSKVFTLPRVLAVHCANFVGNDH--LEMNSTLNGIRILPSDLWILRSEVEIWNDYPNKYS 243

Query: 241 HIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQ 300
            +V+RSILGSEM +++ L TWII NSPRYGV+IDVA+RDHIFLLFRLCFMAI KEA+GFQ
Sbjct: 244 FVVLRSILGSEMLLNSHLMTWIIENSPRYGVVIDVALRDHIFLLFRLCFMAIYKEALGFQ 303

Query: 301 IALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAAS 360
           +ALE GNGMEGGS N  FKCPIL+QVL+WLASQLSVLYGE NGKFFA+NMLRQ IL AA 
Sbjct: 304 VALEKGNGMEGGSVNSCFKCPILIQVLMWLASQLSVLYGETNGKFFAVNMLRQCILDAAL 363

Query: 361 GLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVI 420
            L LLP EQK  E  TL +G H+L  S SD QSV + ELD+KV+NN  VTG ETVNCRVI
Sbjct: 364 RL-LLPSEQKSTEGLTLGKGCHDLEISCSDIQSVKMNELDQKVVNNGNVTGGETVNCRVI 423

Query: 421 FVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPI 480
            VSQVAAAVAALHERFLLEEKIKALRF+HLQTK+Q VSE+NYI QRACEERKRRCNYRPI
Sbjct: 424 LVSQVAAAVAALHERFLLEEKIKALRFAHLQTKYQRVSEYNYISQRACEERKRRCNYRPI 483

Query: 481 IEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYM 540
           IEHDGLPKQQS++ED +KTKT+EELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYM
Sbjct: 484 IEHDGLPKQQSYNEDANKTKTREELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYM 543

Query: 541 DEIMKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSR 600
           +EIMKAGGIG FVKG EERGIKSEQP+DH ITR+ +AD HT+GSN   G+ R H+S HS+
Sbjct: 544 EEIMKAGGIGCFVKGPEERGIKSEQPSDHNITRNIVADVHTRGSNDPCGDAR-HSSGHSK 603

Query: 601 KQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWK 660
           KQS +D+RY  S+KP KG Y HYGSPEDERK   KDKYDR+HYHRFSD+SS PS+SHKWK
Sbjct: 604 KQSFHDSRYLVSDKPQKGHYEHYGSPEDERKISHKDKYDRDHYHRFSDQSSIPSQSHKWK 663

Query: 661 RYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWK 720
           RYP+DRDDE PAETRHH+++ L+SSSS  HG RSSSSSRSG  SSARKD HKLRASDSWK
Sbjct: 664 RYPNDRDDEVPAETRHHETKKLASSSS--HG-RSSSSSRSGNGSSARKDSHKLRASDSWK 723

Query: 721 RNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRR 756
           RNTADNHSS+  V NSF+DRYTPS+CHD+LED++ST  R
Sbjct: 724 RNTADNHSSEHLVSNSFSDRYTPSECHDELEDEYSTVSR 751

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: gi|778664192|ref|XP_004142553.2| (PREDICTED: U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X2 [Cucumis sativus])

HSP 1 Score: 757.3 bits (1954), Expect = 2.6e-215
Identity = 405/536 (75.56%), Postives = 445/536 (83.02%), Query Frame = 1

Query: 1   MNPSLPSSNL-TFPNFLLSNPNPNPNF---SLPESRDPPLDLSSSLSSLRNLIHVANQTL 60
           +NPSLP     TFPNFL  NPNPN +    S  +S+ PPLDLSSS SSL NLIH ANQTL
Sbjct: 4   INPSLPFPPYQTFPNFLPPNPNPNSHIHDSSHSQSQHPPLDLSSSFSSLNNLIHFANQTL 63

Query: 61  QSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQS 120
           QSLSYLT    PS   + S  + C+FDRRHRVPPHSLF HSLLCPSA    IDPTQL QS
Sbjct: 64  QSLSYLT----PSDFANHSHLLHCHFDRRHRVPPHSLFRHSLLCPSASLLPIDPTQLFQS 123

Query: 121 LLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDE 180
           LLYP+TL SSR+LVNE RF Q +PDSDADLCFSL DYSD +SNFFY+DCPGVVALSN DE
Sbjct: 124 LLYPQTLHSSRQLVNENRFSQVLPDSDADLCFSLTDYSDATSNFFYVDCPGVVALSNLDE 183

Query: 181 MSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYS 240
           MSKVFTLPRVLAV CANFV +D    EM  +L GIR+LPSDLW LRSEVE+WNDYPS YS
Sbjct: 184 MSKVFTLPRVLAVHCANFVGNDHF--EMNSTLNGIRILPSDLWNLRSEVEIWNDYPSKYS 243

Query: 241 HIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQ 300
            +V+RSILGSEMA+++ L TWII NSPRYGV+IDVA+RDHIFLLFRLCFMAI KEA+GFQ
Sbjct: 244 FVVLRSILGSEMALNSHLMTWIIENSPRYGVVIDVALRDHIFLLFRLCFMAIYKEALGFQ 303

Query: 301 IALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAAS 360
           +ALE GNGMEG SGN  FKCPIL+QVL+WLASQLSVLYGE NG FFA+NMLRQ IL AAS
Sbjct: 304 VALEKGNGMEGESGNSCFKCPILIQVLMWLASQLSVLYGETNGNFFAVNMLRQCILDAAS 363

Query: 361 GLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVI 420
           GL LL  EQK  ES TL EGSH+L  S SDTQSV + ELD+KV+NN +      VNC VI
Sbjct: 364 GLLLLQSEQKSTESLTLGEGSHDLEISCSDTQSVKMNELDQKVVNNGH-----AVNCSVI 423

Query: 421 FVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPI 480
            VSQVAAAVAALHERFLLEEKIKALRF+HLQTK+Q VSE+NYI QRACEERKR CNYRPI
Sbjct: 424 LVSQVAAAVAALHERFLLEEKIKALRFAHLQTKYQRVSEYNYISQRACEERKRCCNYRPI 483

Query: 481 IEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDII 533
           IEHDGLPKQQSH+ED +KTKT+EELLAEERDYKRRRMSYRGKKAKRSTLQVTRDI+
Sbjct: 484 IEHDGLPKQQSHNEDANKTKTREELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIL 528

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: gi|778664187|ref|XP_011660240.1| (PREDICTED: U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X1 [Cucumis sativus])

HSP 1 Score: 752.7 bits (1942), Expect = 6.4e-214
Identity = 405/537 (75.42%), Postives = 445/537 (82.87%), Query Frame = 1

Query: 1   MNPSLPSSNL-TFPNFLLSNPNPNPNF---SLPESRDPPLDLSSSLSSLRNLIHVANQTL 60
           +NPSLP     TFPNFL  NPNPN +    S  +S+ PPLDLSSS SSL NLIH ANQTL
Sbjct: 4   INPSLPFPPYQTFPNFLPPNPNPNSHIHDSSHSQSQHPPLDLSSSFSSLNNLIHFANQTL 63

Query: 61  QSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQS 120
           QSLSYLT    PS   + S  + C+FDRRHRVPPHSLF HSLLCPSA    IDPTQL QS
Sbjct: 64  QSLSYLT----PSDFANHSHLLHCHFDRRHRVPPHSLFRHSLLCPSASLLPIDPTQLFQS 123

Query: 121 LLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDE 180
           LLYP+TL SSR+LVNE RF Q +PDSDADLCFSL DYSD +SNFFY+DCPGVVALSN DE
Sbjct: 124 LLYPQTLHSSRQLVNENRFSQVLPDSDADLCFSLTDYSDATSNFFYVDCPGVVALSNLDE 183

Query: 181 MSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYS 240
           MSKVFTLPRVLAV CANFV +D    EM  +L GIR+LPSDLW LRSEVE+WNDYPS YS
Sbjct: 184 MSKVFTLPRVLAVHCANFVGNDHF--EMNSTLNGIRILPSDLWNLRSEVEIWNDYPSKYS 243

Query: 241 HIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQ 300
            +V+RSILGSEMA+++ L TWII NSPRYGV+IDVA+RDHIFLLFRLCFMAI KEA+GFQ
Sbjct: 244 FVVLRSILGSEMALNSHLMTWIIENSPRYGVVIDVALRDHIFLLFRLCFMAIYKEALGFQ 303

Query: 301 IALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAAS 360
           +ALE GNGMEG SGN  FKCPIL+QVL+WLASQLSVLYGE NG FFA+NMLRQ IL AAS
Sbjct: 304 VALEKGNGMEGESGNSCFKCPILIQVLMWLASQLSVLYGETNGNFFAVNMLRQCILDAAS 363

Query: 361 GLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVI 420
           GL LL  EQK  ES TL EGSH+L  S SDTQSV + ELD+KV+NN +      VNC VI
Sbjct: 364 GLLLLQSEQKSTESLTLGEGSHDLEISCSDTQSVKMNELDQKVVNNGH-----AVNCSVI 423

Query: 421 FVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPI 480
            VSQVAAAVAALHERFLLEEKIKALRF+HLQTK+Q VSE+NYI QRACEERKR CNYRPI
Sbjct: 424 LVSQVAAAVAALHERFLLEEKIKALRFAHLQTKYQRVSEYNYISQRACEERKRCCNYRPI 483

Query: 481 IEHDGLPKQQSHDE-DRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDII 533
           IEHDGLPKQQSH+E D +KTKT+EELLAEERDYKRRRMSYRGKKAKRSTLQVTRDI+
Sbjct: 484 IEHDGLPKQQSHNEQDANKTKTREELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIL 529

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: gi|645222904|ref|XP_008218377.1| (PREDICTED: uncharacterized protein LOC103318737 [Prunus mume])

HSP 1 Score: 562.4 bits (1448), Expect = 1.2e-156
Identity = 370/778 (47.56%), Postives = 480/778 (61.70%), Query Frame = 1

Query: 6   PSSNLTFPNFLL--SNPNPNPNF----------SLPESRDPPLDLSSSLSSLRNLIHVAN 65
           P +    P+F L  SNPNPNPNF          ++P    PP DLS+++SSL +L+  + 
Sbjct: 4   PPAQFAHPSFTLIPSNPNPNPNFFHSQPQNTQPAIPTPPLPPPDLSTTISSLDSLVRDSY 63

Query: 66  QTLQSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQL 125
           QTL SLS L  ++ P+     S  + C F+  HRV PHSLF HSL CPS P P       
Sbjct: 64  QTLDSLSALLPLENPNYNNPQSSLIPCPFNPHHRVQPHSLFSHSLHCPSHPHP------- 123

Query: 126 LQSLLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDY-SDGSSNFFYLDCPGVVALS 185
           L  L YP+TL+SS +   EK F QT+  S+ADLC SL  Y +D  SNFFY DCPGVV  S
Sbjct: 124 LPHLNYPKTLKSSDQSQIEKSFLQTLHGSEADLCLSLEHYYADFGSNFFYSDCPGVVNFS 183

Query: 186 NRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKG-IRMLPSDLWALRSEVEMWNDY 245
             D ++++FTLP +L+VECANF+    G+RE+    K   R+LPS+LWA+++EVE WN++
Sbjct: 184 GLDGVNRMFTLPLILSVECANFI--GRGEREITDFEKAWCRILPSELWAIKTEVESWNEF 243

Query: 246 PSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKE 305
           P  YS+ V+ +ILG  +  +  + TWIIANSP+YG++IDVAMRDHIFLL RLC  AI +E
Sbjct: 244 PFTYSYRVLCAILGLGVVKEYDVGTWIIANSPQYGIVIDVAMRDHIFLLSRLCLKAILRE 303

Query: 306 AMGFQIALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRI 365
           A+            EG   +  F+CP LVQ L+WLASQLS+LYG  NGK F IN+L++ +
Sbjct: 304 ALS--------KVKEGDPESTHFECPTLVQALMWLASQLSILYGAQNGKLFVINVLKKCL 363

Query: 366 LYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVN-LKELDKKVINNVYVTGEET 425
           L AA G    P EQ+  E P L EGS NL ++ S  +    +K L      N  V  +E 
Sbjct: 364 LDAALGSLTFPLEQQVTEYPALEEGSLNLDANGSGVRDAEVMKPLSTDGGGNSMV--KEN 423

Query: 426 VNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRR 485
           +  RV+FVSQVAAAVAALHERFLLEEK+KA R S   +++Q + +H+Y+ QRA E+RK R
Sbjct: 424 IISRVVFVSQVAAAVAALHERFLLEEKLKAQRVSQTFSRYQRMVDHDYVSQRADEKRKNR 483

Query: 486 CNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRD 545
             YRPII+HDGLP+QQS +++ +KTKT+EELLAEERDYKRRRMSYRGKK KR+TLQV RD
Sbjct: 484 GQYRPIIDHDGLPRQQSCNQETNKTKTREELLAEERDYKRRRMSYRGKKVKRTTLQVMRD 543

Query: 546 IIEEYMDEIMKAGGIGYFVKGTEERG-IKSEQPTDHYITRDSIADGHTKGSNSSYGETRH 605
           IIEEYM+EI +AGGIG F KGTE  G    E P+   IT D  A+  TK +  S G    
Sbjct: 544 IIEEYMEEIKQAGGIGCFEKGTEGEGSFPFELPSAPEITTD--AEKPTKSNYDSAG---- 603

Query: 606 HTSSHSRKQS-----------NYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREH 665
            + SHSRKQS           + D     S+K  +   GH+   ED R S  +D++D   
Sbjct: 604 CSPSHSRKQSHSSYYAIDSATSKDASAKGSKKLRRSLQGHHHYLEDHR-SDSRDRHDMVK 663

Query: 666 YHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGG 725
           + R  +   +P  +H   R+  +RDD E  +T+H +    SSS S Y   RSSS S S  
Sbjct: 664 HSRSPESRRNPGWAHGQIRHHRERDDLEVRKTKHREISWSSSSISKYRDNRSSSQSNSSE 723

Query: 726 DSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRY 757
           +S  R+D           R T +NHSS+S V N+F DRY P    D  E+D ST R+Y
Sbjct: 724 NSKVRRD-----------RYTYENHSSNSVVQNTFEDRYDPLISRDIYEEDLSTNRKY 744

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: gi|595840682|ref|XP_007208065.1| (hypothetical protein PRUPE_ppa001825mg [Prunus persica])

HSP 1 Score: 555.8 bits (1431), Expect = 1.1e-154
Identity = 369/774 (47.67%), Postives = 475/774 (61.37%), Query Frame = 1

Query: 6   PSSNLTFPNFLL--SNPNPNPNF--SLPESRDP--------PLDLSSSLSSLRNLIHVAN 65
           P +    P+F L  SNPNPNPNF  S P++  P        P DLS+++SSL +L+  + 
Sbjct: 4   PPAQFAHPSFTLIPSNPNPNPNFFHSQPQNTQPVISTPPLPPPDLSTTISSLDSLVRDSY 63

Query: 66  QTLQSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQL 125
           QTL SLS L  +Q P+     S  + C F+  HRV PHSLF HSL CPS P P       
Sbjct: 64  QTLDSLSALLPLQNPNYDNPQSSLIPCPFNPHHRVHPHSLFSHSLHCPSHPHP------- 123

Query: 126 LQSLLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDY-SDGSSNFFYLDCPGVVALS 185
           L  L YP+TL+SS +   EK F QT+  S+ADL  SL  Y +D  SNFFY DCPGVV  S
Sbjct: 124 LPHLNYPKTLKSSDQSQTEKSFLQTLHGSEADLRLSLEHYYADFGSNFFYSDCPGVVNFS 183

Query: 186 NRDEMSKVFTLPRVLAVECANFVCDDDGDRE-MKCSLKGIRMLPSDLWALRSEVEMWNDY 245
             D ++++FTLP +L+VECANF+    G+RE M    +  R+LPS+LWA+++EVE WN++
Sbjct: 184 GLDGVNRMFTLPLILSVECANFI--GRGEREIMDFEKEWCRILPSELWAIKTEVEGWNEF 243

Query: 246 PSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKE 305
           P  YS+ V+ +ILG  +  +  + TWIIANSP+YG++IDVAMRDHIFLL RLC  AI +E
Sbjct: 244 PFTYSYRVLCAILGLGVVKEYDVGTWIIANSPQYGIVIDVAMRDHIFLLSRLCLKAILRE 303

Query: 306 AMGFQIALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRI 365
           A+            EG   +  F+CP LVQ L+WLASQLS+LYG  NGK F IN+L++ +
Sbjct: 304 ALS--------KVKEGDPESTHFECPTLVQALMWLASQLSILYGAQNGKLFVINVLKKCL 363

Query: 366 LYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVN-LKELDKKVINNVYVTGEET 425
           L AA G    P EQ+  E P L EG  NL ++ S  +    +K L      N  V  +E 
Sbjct: 364 LDAALGSLTFPLEQQVTEYPALEEGLLNLDANGSGVRDAEVMKPLSTHGGENSMV--KEN 423

Query: 426 VNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRR 485
           +  R +FVSQVAAAVAALHERFLLEEK+KA R S   T++Q + +H Y+ QRA EERK R
Sbjct: 424 IFSREVFVSQVAAAVAALHERFLLEEKLKAQRVSQTFTRYQRMVDHEYVSQRADEERKNR 483

Query: 486 CNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRD 545
             YRPII+HDGLP+QQS +++ +K KT+EELLAEERDYKRRRMSYRGKK KR+TLQV RD
Sbjct: 484 SQYRPIIDHDGLPRQQSCNQETNKPKTREELLAEERDYKRRRMSYRGKKVKRTTLQVMRD 543

Query: 546 IIEEYMDEIMKAGGIGYFVKGTEERG-IKSEQPTDHYITRDSIADGHTKGSNSSYG--ET 605
           IIEEYM+EI +AGGIG F KGTE  G    E P+   IT D  A+  TK +  S G   +
Sbjct: 544 IIEEYMEEIKQAGGIGCFEKGTEGEGSFPFELPSAPEITTD--AEKPTKSNYDSAGCSPS 603

Query: 606 RHHTSSHSR-----KQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRF 665
           R    SHS        ++ D     SEKP +   GH+   ED R S  +D+ D   + R 
Sbjct: 604 RSRKRSHSSYYAIDSVTSRDASAKGSEKPRRSLQGHHHYLEDHR-SDSRDRRDMVKHSRS 663

Query: 666 SDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSA 725
            +   +P  +H   R+  +RDD E  +T+H +    SSS S Y   RSSS S SG +S  
Sbjct: 664 PESRRNPGWAHGQTRHHRERDDLEVRKTKHREISRSSSSISKYRDNRSSSHSNSGENSKV 723

Query: 726 RKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRY 757
           R+D           R T +NH+S+S V N+F DRY P    D  E+D ST R+Y
Sbjct: 724 RRD-----------RYTYENHNSNSVVQNTFEDRYDPLISRDIYEEDLSTDRKY 744

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
U1148_ARATH5.2e-11539.79U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Arabidopsis thaliana G... [more]
Match NameE-valueIdentityDescription
A0A0A0M085_CUCSA1.8e-21575.56Uncharacterized protein OS=Cucumis sativus GN=Csa_1G662780 PE=4 SV=1[more]
M5W6C9_PRUPE8.0e-15547.67Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001825mg PE=4 SV=1[more]
W9S254_9ROSA7.7e-14245.16Uncharacterized protein OS=Morus notabilis GN=L484_006471 PE=4 SV=1[more]
A0A061F1I4_THECC8.3e-13644.82U11/U12 small nuclear ribonucleoprotein 48 kDa protein, putative isoform 1 OS=Th... [more]
B9SHL0_RICCO7.8e-13444.76Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1122770 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G04160.22.3e-11339.43 unknown protein[more]
Match NameE-valueIdentityDescription
gi|659086077|ref|XP_008443753.1|8.1e-30274.44PREDICTED: uncharacterized protein LOC103487266 [Cucumis melo][more]
gi|778664192|ref|XP_004142553.2|2.6e-21575.56PREDICTED: U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X2 [Cu... [more]
gi|778664187|ref|XP_011660240.1|6.4e-21475.42PREDICTED: U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X1 [Cu... [more]
gi|645222904|ref|XP_008218377.1|1.2e-15647.56PREDICTED: uncharacterized protein LOC103318737 [Prunus mume][more]
gi|595840682|ref|XP_007208065.1|1.1e-15447.67hypothetical protein PRUPE_ppa001825mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g02050.1Cp4.1LG04g02050.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR21402UNCHARACTERIZEDcoord: 19..101
score: 1.6E-91coord: 400..559
score: 1.6E-91coord: 594..739
score: 1.6

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None