Cp4.1LG04g02050 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG04g02050
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: U11/U12 small nuclear ribonucleoprotein 48 kDa protein
Location: Cp4.1LG04: 1031 .. 6634 (+)
RNA-Seq Expression: Cp4.1LG04g02050
Synteny: Cp4.1LG04g02050
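
The Location field can be turned into a genomic span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive (GFF-style), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG04: 1031 .. 6634 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 5604 bases on the plus strand of Cp4.1LG04.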
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGCCGATGTCTCACTCTTCCCTCCATTTCACTCCGAGCAATGAACCCGTCTCTTCCATCTTCCAACCTAACCTTCCCCAACTTTCTGCTCTCAAACCCTAACCCTAACCCTAATTTCTCTCTTCCCGAATCTCGTGATCCTCCTCTGGACTTGTCTTCTTCTCTTTCCTCCCTCAGAAACCTCATCCATGTCGCCAACCAGACCCTTCAATCTCTCTCCTACCTCACCCTCATCCAAACCCCATCTGCTAAGCCTGACGATTCCGGTTTCGTTCAGTGCAATTTTGATCGGCGCCACCGTGTTCCGCCGCATTCTCTCTTCCACCATTCCCTTCTTTGTCCTTCCGCTCCTCCGCCTCGTATCGACCCGACTCAACTCCTCCAGTCCTTGCTTTACCCTCGAACGCTTCAATCGTCTCGTGAATTGGTTAACGAAAAGCGCTTCCGTCAAACGATGCCGGATTCTGATGCCGACCTTTGTTTCTCTCTCAATGATTATTCTGATGGAAGTTCCAATTTCTTCTATCTCGATTGCCCTGGCGTGGTTGCCTTGTCTAACCGGGATGAAATGTCTAAAGTCTTCACTCTCCCTCGTGTTTTGGCTGTTGAATGTGCTAATTTTGTTTGTGATGATGATGGTGATCGAGAGATGAAGTGTTCGTTGAAGGGGATTCGAATGCTGCCCTCTGATCTGTGGGCTCTTAGAAGTGAAGTAGAAATGTGGAATGACTACCCCAGTGTGTATTCGCATATTGTTATACGGTCCATATTGGGCTCAGAGATGGCCGTGGACAACCGTTTGAAGACATGGATCATTGCAAATTCTCCTCGGTATGGTGTTATTATTGATGTTGCTATGAGGGATCACATATTTTTGCTGTTTAGGCTGTGTTTTATGGCAATTTCAAAGGAAGCTATGGGATTTCAAATTGCGCTGGAAAATGGAAATGGAATGGAGGGTGGATCAGGGAACCTCAGTTTTAAGTGTCCGATTTTAGTACAAGTACTATTGTGGCTGGCATCTCAGCTTTCCGTATTGTATGGGGAGATGAACGGGAAGTTCTTTGCTATTAACATGCTTAGACAACGTATACTATATGCTGCATCGGGCTTGTCGCTTTTGCCATACGAACAAAAGCGAGTGGAGAGTCCAACTTTGGTAGAGGGCTCTCATAATCTAGTATCTAGTTTTAGTGACACACAGAGTGTGAATTTGAAAGAGTTGGATAAGAAGGTTATAAATAATGTCTATGTTACTGGAGAAGAAACGGTCAACTGCAGGGTGATTTTTGTGTCCCAAGTTGCTGCGGCTGTTGCAGCATTGCATGAACGTTTCCTACTTGAAGAAAAGATCAAAGCTTTACGCTTTTCTCATCTGCAAACTAAACATCAGCTGTATGTTACTTCTCCATATTCTGTTTTTTGGTTACTAGTTTTGATGAACTTGATGTTTTCCTTATTGCTATGATAGTGATATTCATTGATCCAACAAATTTATGCTTGTATCATTCATGAGGTGGAAAATAATTGTTAGGAGACTTAGAGGAAATGGGTTTAAATCCATGTCTTCCTACCTAGGATATAAAAATTTTACGAGTTTCTTTGGCAACCAAATATTGTAGGGTAGGATGTTTGTCCTGTTATATTAGAAAGTGAACTCAAATTTGTCCGAACTTTCATAGTTATTAAGAAAAAAAAGATGATCGAAGGGATTCTACAGTCTTGTGTCTAAAAGAGATGGCCGAACAGATTCTAGAATCTGTCTTTGGAGCCCTTGTCATAAATATGCTTAAACATTTTAAGGACATCATTTCTATTTATACTAGCTGTGTCTGAAAATTTTGTCAACAGAATTCTTAGCACCTGGGGATTTAAGTTGCCTAGCCACTTAACTGGCTTTTCAAACTCAAATTAATGGTTGAACCTAGCACTATCAGCTCTAGAAAGTGGACTTTAGTTTATGTTTTTTTTACAGCAATAATTCTTGGAATTTAGGGTTTA
AATTCTTGAAGTAATCTAGAAGTAATAGAAGTTTTCCCCTTCTATTGAATACAATTAATCTTCATTTTTCTTTCAGTTGCAATTTTATGGAAAAGGATGTTCTGAGTTTTCATCTGCCTCTTTCATCCAATTTAATCTGCACGTTTTGTTCCACTTGTCTTCATGATTTCATCAAAAGAATTTTCGTACTAAGAAAAGGATCTCACTTTTCAAGTAGGCAACTGTTAGAAACAGACTAATCCTTTAACTCTTTTAGGGTTTCTGAACATAACTATATCTGTCAAAGAGCTTGTGAAGAGCGTAAAAGACGGTGCAATTATAGACCTATAATTGAGCATGATGGACTCCCAAAGCAGCAGTCTCATGATGAGGTATTTCTGAACTCTGTAATGGTCTAGTCATGAATTATGTAGTTGGTATAATCAAGCATGTTATTTTCAGTTATCCTTCCACCTGGATTACTCTTCCTACCAATTTTCTCCATTCATTTGTTGGTGATTTGAAGTCTCGCCGAATCTTCTTATTTACTGCCCTGAATGGTACATCATGTTCCGAAATTTTTTTTAAAAAATATATATAAAAAAAAGCTTTCAATTCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATATATATATATATATATATATATATTTTTTTTTTTTTTTAAAAAAATTAACTTCTTGGAACCAAATCAAGCTGGTGCCTCTATATACTTATTTTTTGAGATGGATGTTTTGCCGTTCTTTCCTTCATTTTGTGCATCTCCTCCCATTTTATTTTATGTTAAAAGAATATGTTTAGCAATAAATGATGAAGCAGGTGCTGATCATCTGCTTATTGTTCACTTACCCATGATGTTTCTTGGCTATATTTTATATCTTATGGGTTTAATTTTAATAAAAGCAGGATAGAGACAAGACCAAAACAAAAGAGGAATTGTTGGCTGAAGAAAGAGATTATAAACGTCGAAGAATGTCATACCGTGGGAAAAAGGCAAAGCGATCAACTTTACAGGTAAGCAAGCAGCTTGTTCAATTTTACGTCAGCCATTGTTTTTGTTATCTCTCGTCTCTATTGGTTATTAGCCTAGGGTCTAACTGGGTATGGCATGATGATTCCTTTTCTTATAGCCTTTTATGGAAAATTGACAAAGTTCTTAATTAAATGATAATGCTTATCAAGAACATTTGGTGGTAATATCTTCTTAGGCGTACATTGGTCACTTTTTAGAATATTATTCACAACTTTCAAGCTACTAAGATTGCAGCAACAATGGCCGAATTATGGAAGAGGTGAATCTGCAATGCTAAATAGACCATGAAACTATGTGATACCCATTATGTATTTTCGGTTCCTGAAATATGTTTTCTGACATCTGTTTTCCAACTTTTTCTATCAAACGTTTTTATGTTAGATGATATCCACACTCACGTGCTTACTAATTTTACCTTTCATGGAAGGTTACAAGAGATATTATTGAAGAATACATGGACGAAATTATGAAAGCTGGAGGGATTGGATACTTTGTGAAGGGAACTGAAGAGAGAGGGATAAAATCTGAACAACCAACTGATCATTACATTACAAGAGATAGTATTGCTGATGGGCACACAAAAGGAAGCAACAGCTCATATGGAGAAACTAGGCATCATACCTCGAGTCATTCCCGGAAGCAGTCTAACTATGATAACAGATATTCAGCATCCGAAAAGCCCCCAAAAGGTAGGTATGGGCACTATGGCTCTCCAGAAGATGAAAGGAAAAGTGTCGGTAAAGACAAATATGATCGAGAACACTATCATAGATTCTCAGATCGAAGCAGTAGTCCTAGCGAGTCACACAAATGGAAGAGATATCCAAGTGATCGAGATGATGAGGAGCCAGCAGAAACCAGGCATCATAAATCTCGAGGACTGTCTTCTAGCAGCTCTAATTATCATGGTTTTAGATCATCTTCATCCTCAAGATCGGGGGGTGATTCTAGTGCAAGGAAGGATG
GTCACAAGTTAAGAGCTAGTGATAGTTGGAAAAGGAACACAGCTGATAATCATAGTTCAGATTCTTCGGTGCTTAATTCATTTAATGATAGATATACCCCTTCCAAGTGTCATGACCAATTAGAAGATGATCACTCCACTGGCAGAAGATATGTAAATCCAGACGTTTAGTTTGCTGATCCTCAAATGAGGGGCTGTCCCGCAAATTGTGCCATACTTCTTAACATTACCAATGTACCTTGTGGTAAGTCTTTCTTTCATGCTACATCCCATGTGCCTTTAACTGTGACTTCTCTGACTTCTTTTTTTAACATGTTTTTATTCACACAACCTTGCAGCATATGGGCTTCAGATATTTTATCCTCACGTGTATCCTTCATAGTATCCTTGAAGTTTATCCCATTTGCTTATACATTAAATCTTGCAAAACTTATTCTTTCCTTGTTTTTTTTTTGTCCCCTTAGCAGGGATATTTGCCCTGAATGTTGCAGTAGCATAATTTGAACTCATCGTCACTATATCAAAAATGCCAATTTTACATCATCCTTTTCTGTTCCCGGGTTTTCTAATATTTCTGGACTAGTCACAATGACAGATAAACAAAGTGGTCGTATTGGCGTGGATTTTCCATGTCGTACGATTTTTTCTTTCTGTCCTCGTTTGCTATGGCTTCCACTACTCTGGGAAATGTTTTGGTGATGCATAACTTGTCCTTTTCCTGTTCTCTTCAGTCATGATTGGTATCGGTCGTTTAAAGTCTTACAACGAATAATTAAAAAAACTATCGATGATTTCACTTCAAAAACTTACATGTAGGTCATTTAGAACGGTACAGATCATTAATCTAGCATCTTAACAAACATTTTACAGTACTACACTGCTTGAGTTCAACCTTATTATTGATCTTATATTAGTTCTAACTTTTGTATTACTTTTAGTGTTCTACCAACAGCTCACAAGATCAGAGAAGTATGAAGGCAGTCTGCAAATAAACTTGATCAGAAACCACAATCCACAGAGTTTCAGATCGTCGGTTGCAGGCCGTTCCATCCTACATCAGGCTGATAATTGGCTTCTTATCAGTTATAGAAGGGGTATGAATTCTGACTCCCTGGTATGTTTTTCTCCTTTTGGGCAGCCCATGTGAATATTAGATACATGTGTACAAAGCCTTTGGAGGAAAGGGATATGCAGAAATCTCTTACAATAAGATAGCTTAGGAGTTTGTATAGGTAGGTATGAAACAGTTAGCAGGTAAAGTAGGTAGTTTCTTCCATTTACTTACAAGTAGGAAATGTCTCAATTTCATACAAATGTCTTGGGAAATATTGTTAAAATCTGGATGTTGATAGATATTTTTGGAAAATTTATGAAAAGTAAAAAAAGAAAATCAAGAACATCAACTATGCAGTATTTCAATGAAGGATGATGAAATTGAATCTCCAATTGTAAATTGCAATTTAGTATAGAACAATATGGTTTGATTACATTAAGTTGTCGACAAACTATTGATGTATCAGATGTAATTGAAGTTTTATTTAGAAATAAACTAGTGATACATTGATAACATACTTGGAAGTTATATTATAGATATATTATTGATAA

mRNA sequence

CGCCGATGTCTCACTCTTCCCTCCATTTCACTCCGAGCAATGAACCCGTCTCTTCCATCTTCCAACCTAACCTTCCCCAACTTTCTGCTCTCAAACCCTAACCCTAACCCTAATTTCTCTCTTCCCGAATCTCGTGATCCTCCTCTGGACTTGTCTTCTTCTCTTTCCTCCCTCAGAAACCTCATCCATGTCGCCAACCAGACCCTTCAATCTCTCTCCTACCTCACCCTCATCCAAACCCCATCTGCTAAGCCTGACGATTCCGGTTTCGTTCAGTGCAATTTTGATCGGCGCCACCGTGTTCCGCCGCATTCTCTCTTCCACCATTCCCTTCTTTGTCCTTCCGCTCCTCCGCCTCGTATCGACCCGACTCAACTCCTCCAGTCCTTGCTTTACCCTCGAACGCTTCAATCGTCTCGTGAATTGGTTAACGAAAAGCGCTTCCGTCAAACGATGCCGGATTCTGATGCCGACCTTTGTTTCTCTCTCAATGATTATTCTGATGGAAGTTCCAATTTCTTCTATCTCGATTGCCCTGGCGTGGTTGCCTTGTCTAACCGGGATGAAATGTCTAAAGTCTTCACTCTCCCTCGTGTTTTGGCTGTTGAATGTGCTAATTTTGTTTGTGATGATGATGGTGATCGAGAGATGAAGTGTTCGTTGAAGGGGATTCGAATGCTGCCCTCTGATCTGTGGGCTCTTAGAAGTGAAGTAGAAATGTGGAATGACTACCCCAGTGTGTATTCGCATATTGTTATACGGTCCATATTGGGCTCAGAGATGGCCGTGGACAACCGTTTGAAGACATGGATCATTGCAAATTCTCCTCGGTATGGTGTTATTATTGATGTTGCTATGAGGGATCACATATTTTTGCTGTTTAGGCTGTGTTTTATGGCAATTTCAAAGGAAGCTATGGGATTTCAAATTGCGCTGGAAAATGGAAATGGAATGGAGGGTGGATCAGGGAACCTCAGTTTTAAGTGTCCGATTTTAGTACAAGTACTATTGTGGCTGGCATCTCAGCTTTCCGTATTGTATGGGGAGATGAACGGGAAGTTCTTTGCTATTAACATGCTTAGACAACGTATACTATATGCTGCATCGGGCTTGTCGCTTTTGCCATACGAACAAAAGCGAGTGGAGAGTCCAACTTTGGTAGAGGGCTCTCATAATCTAGTATCTAGTTTTAGTGACACACAGAGTGTGAATTTGAAAGAGTTGGATAAGAAGGTTATAAATAATGTCTATGTTACTGGAGAAGAAACGGTCAACTGCAGGGTGATTTTTGTGTCCCAAGTTGCTGCGGCTGTTGCAGCATTGCATGAACGTTTCCTACTTGAAGAAAAGATCAAAGCTTTACGCTTTTCTCATCTGCAAACTAAACATCAGCTGGTTTCTGAACATAACTATATCTGTCAAAGAGCTTGTGAAGAGCGTAAAAGACGGTGCAATTATAGACCTATAATTGAGCATGATGGACTCCCAAAGCAGCAGTCTCATGATGAGGATAGAGACAAGACCAAAACAAAAGAGGAATTGTTGGCTGAAGAAAGAGATTATAAACGTCGAAGAATGTCATACCGTGGGAAAAAGGCAAAGCGATCAACTTTACAGGTTACAAGAGATATTATTGAAGAATACATGGACGAAATTATGAAAGCTGGAGGGATTGGATACTTTGTGAAGGGAACTGAAGAGAGAGGGATAAAATCTGAACAACCAACTGATCATTACATTACAAGAGATAGTATTGCTGATGGGCACACAAAAGGAAGCAACAGCTCATATGGAGAAACTAGGCATCATACCTCGAGTCATTCCCGGAAGCAGTCTAACTATGATAACAGATATTCAGCATCCGAAAAGCCCCCAAAAGGTAGGTATGGGCACTATGGCTCTCCAGAAGATGAAAGGAAAAGTGTCGGTAAAGACAAATATGATCGAGAACACTATCATAGATTCTCAGATCGAAGCAGTAGTCCTAGCGAGTCACACAA
ATGGAAGAGATATCCAAGTGATCGAGATGATGAGGAGCCAGCAGAAACCAGGCATCATAAATCTCGAGGACTGTCTTCTAGCAGCTCTAATTATCATGGTTTTAGATCATCTTCATCCTCAAGATCGGGGGGTGATTCTAGTGCAAGGAAGGATGGTCACAAGTTAAGAGCTAGTGATAGTTGGAAAAGGAACACAGCTGATAATCATAGTTCAGATTCTTCGGTGCTTAATTCATTTAATGATAGATATACCCCTTCCAAGTGTCATGACCAATTAGAAGATGATCACTCCACTGGCAGAAGATATGTAAATCCAGACGTTTAGTTTGCTGATCCTCAAATGAGGGGCTGTCCCGCAAATTGTGCCATACTTCTTAACATTACCAATGTACCTTGTGCATATGGGCTTCAGATATTTTATCCTCACGTGTATCCTTCATAGTATCCTTGAAGTTTATCCCATTTGCTTATACATTAAATCTTGCAAAACTTATTCTTTCCTTGTTTTTTTTTTGTCCCCTTAGCAGGGATATTTGCCCTGAATGTTGCAGTAGCATAATTTGAACTCATCGTCACTATATCAAAAATGCCAATTTTACATCATCCTTTTCTGTTCCCGGGTTTTCTAATATTTCTGGACTAGTCACAATGACAGATAAACAAAGTGGTCGTATTGGCGTGGATTTTCCATGTCGTACGATTTTTTCTTTCTGTCCTCGTTTGCTATGGCTTCCACTACTCTGGGAAATGTTTTGGTGATGCATAACTTGTCCTTTTCCTGTTCTCTTCAGTCATGATTGGTATCGGTCGTTTAAAGTCTTACAACGAATAATTAAAAAAACTATCGATGATTTCACTTCAAAAACTTACATGTAGGTCATTTAGAACGGTACAGATCATTAATCTAGCATCTTAACAAACATTTTACAGTACTACACTGCTTGAGTTCAACCTTATTATTGATCTTATATTAGTTCTAACTTTTGTATTACTTTTAGTGTTCTACCAACAGCTCACAAGATCAGAGAAGTATGAAGGCAGTCTGCAAATAAACTTGATCAGAAACCACAATCCACAGAGTTTCAGATCGTCGGTTGCAGGCCGTTCCATCCTACATCAGGCTGATAATTGGCTTCTTATCAGTTATAGAAGGGGTATGAATTCTGACTCCCTGGTATGTTTTTCTCCTTTTGGGCAGCCCATGTGAATATTAGATACATGTGTACAAAGCCTTTGGAGGAAAGGGATATGCAGAAATCTCTTACAATAAGATAGCTTAGGAGTTTGTATAGGTAGGTATGAAACAGTTAGCAGGTAAAGTAGGTAGTTTCTTCCATTTACTTACAAGTAGGAAATGTCTCAATTTCATACAAATGTCTTGGGAAATATTGTTAAAATCTGGATGTTGATAGATATTTTTGGAAAATTTATGAAAAGTAAAAAAAGAAAATCAAGAACATCAACTATGCAGTATTTCAATGAAGGATGATGAAATTGAATCTCCAATTGTAAATTGCAATTTAGTATAGAACAATATGGTTTGATTACATTAAGTTGTCGACAAACTATTGATGTATCAGATGTAATTGAAGTTTTATTTAGAAATAAACTAGTGATACATTGATAACATACTTGGAAGTTATATTATAGATATATTATTGATAA

Coding sequence (CDS)

ATGAACCCGTCTCTTCCATCTTCCAACCTAACCTTCCCCAACTTTCTGCTCTCAAACCCTAACCCTAACCCTAATTTCTCTCTTCCCGAATCTCGTGATCCTCCTCTGGACTTGTCTTCTTCTCTTTCCTCCCTCAGAAACCTCATCCATGTCGCCAACCAGACCCTTCAATCTCTCTCCTACCTCACCCTCATCCAAACCCCATCTGCTAAGCCTGACGATTCCGGTTTCGTTCAGTGCAATTTTGATCGGCGCCACCGTGTTCCGCCGCATTCTCTCTTCCACCATTCCCTTCTTTGTCCTTCCGCTCCTCCGCCTCGTATCGACCCGACTCAACTCCTCCAGTCCTTGCTTTACCCTCGAACGCTTCAATCGTCTCGTGAATTGGTTAACGAAAAGCGCTTCCGTCAAACGATGCCGGATTCTGATGCCGACCTTTGTTTCTCTCTCAATGATTATTCTGATGGAAGTTCCAATTTCTTCTATCTCGATTGCCCTGGCGTGGTTGCCTTGTCTAACCGGGATGAAATGTCTAAAGTCTTCACTCTCCCTCGTGTTTTGGCTGTTGAATGTGCTAATTTTGTTTGTGATGATGATGGTGATCGAGAGATGAAGTGTTCGTTGAAGGGGATTCGAATGCTGCCCTCTGATCTGTGGGCTCTTAGAAGTGAAGTAGAAATGTGGAATGACTACCCCAGTGTGTATTCGCATATTGTTATACGGTCCATATTGGGCTCAGAGATGGCCGTGGACAACCGTTTGAAGACATGGATCATTGCAAATTCTCCTCGGTATGGTGTTATTATTGATGTTGCTATGAGGGATCACATATTTTTGCTGTTTAGGCTGTGTTTTATGGCAATTTCAAAGGAAGCTATGGGATTTCAAATTGCGCTGGAAAATGGAAATGGAATGGAGGGTGGATCAGGGAACCTCAGTTTTAAGTGTCCGATTTTAGTACAAGTACTATTGTGGCTGGCATCTCAGCTTTCCGTATTGTATGGGGAGATGAACGGGAAGTTCTTTGCTATTAACATGCTTAGACAACGTATACTATATGCTGCATCGGGCTTGTCGCTTTTGCCATACGAACAAAAGCGAGTGGAGAGTCCAACTTTGGTAGAGGGCTCTCATAATCTAGTATCTAGTTTTAGTGACACACAGAGTGTGAATTTGAAAGAGTTGGATAAGAAGGTTATAAATAATGTCTATGTTACTGGAGAAGAAACGGTCAACTGCAGGGTGATTTTTGTGTCCCAAGTTGCTGCGGCTGTTGCAGCATTGCATGAACGTTTCCTACTTGAAGAAAAGATCAAAGCTTTACGCTTTTCTCATCTGCAAACTAAACATCAGCTGGTTTCTGAACATAACTATATCTGTCAAAGAGCTTGTGAAGAGCGTAAAAGACGGTGCAATTATAGACCTATAATTGAGCATGATGGACTCCCAAAGCAGCAGTCTCATGATGAGGATAGAGACAAGACCAAAACAAAAGAGGAATTGTTGGCTGAAGAAAGAGATTATAAACGTCGAAGAATGTCATACCGTGGGAAAAAGGCAAAGCGATCAACTTTACAGGTTACAAGAGATATTATTGAAGAATACATGGACGAAATTATGAAAGCTGGAGGGATTGGATACTTTGTGAAGGGAACTGAAGAGAGAGGGATAAAATCTGAACAACCAACTGATCATTACATTACAAGAGATAGTATTGCTGATGGGCACACAAAAGGAAGCAACAGCTCATATGGAGAAACTAGGCATCATACCTCGAGTCATTCCCGGAAGCAGTCTAACTATGATAACAGATATTCAGCATCCGAAAAGCCCCCAAAAGGTAGGTATGGGCACTATGGCTCTCCAGAAGATGAAAGGAAAAGTGTCGGTAAAGACAAATATGATCGAGAACACTATCATAGATTCTCAGATCGAAGCAGTAGTCCTAGCGAGTCACACAAATGGAAGAGATATCCAAGTGATCGAGATGATGAGGAGCC
AGCAGAAACCAGGCATCATAAATCTCGAGGACTGTCTTCTAGCAGCTCTAATTATCATGGTTTTAGATCATCTTCATCCTCAAGATCGGGGGGTGATTCTAGTGCAAGGAAGGATGGTCACAAGTTAAGAGCTAGTGATAGTTGGAAAAGGAACACAGCTGATAATCATAGTTCAGATTCTTCGGTGCTTAATTCATTTAATGATAGATATACCCCTTCCAAGTGTCATGACCAATTAGAAGATGATCACTCCACTGGCAGAAGATATGTAAATCCAGACGTTTAG
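
The CDS is a contiguous stretch of the mRNA, so the UTR lengths follow from where the CDS sits inside it. The sketch below shows one way to check this; `locate_cds` is a hypothetical helper, and the strings are short excerpts copied from the start of the mRNA and CDS above. For the full sequences, the wrapped lines would need to be joined first.

```python
def locate_cds(mrna, cds):
    """Return (5' UTR length, 3' UTR length) for a CDS embedded in an mRNA."""
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)

# Excerpts only: the bases preceding the annotated start codon,
# followed by the first bases of the CDS.
utr5_excerpt = "CGCCGATGTCTCACTCTTCCCTCCATTTCACTCCGAGCA"
cds_excerpt = "ATGAACCCGTCTCTTCCATCTTCC"

five, three = locate_cds(utr5_excerpt + cds_excerpt, cds_excerpt)
print(five, three)
```

Searching for the whole CDS rather than the first ATG matters here: the mRNA excerpt contains an earlier ATG (in `CGCCGATG...`) that is not the annotated start codon.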

Protein sequence

MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIMKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPDV
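
The protein sequence can be reproduced from the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` and the table construction are illustrative and not taken from the database. The example inputs are short excerpts from the start and end of the CDS above.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGAACCCGTCTCTTCCA"))  # first six codons of the CDS
print(translate("AATCCAGACGTTTAG"))     # last four codons plus the stop
```

The two excerpts translate to `MNPSLP` and `NPDV`, matching the first and last residues of the protein sequence listed above.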
Homology
BLAST of Cp4.1LG04g02050 vs. ExPASy Swiss-Prot
Match: Q9M8X2 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Arabidopsis thaliana OX=3702 GN=SNRNP48 PE=3 SV=1)

HSP 1 Score: 416.8 bits (1070), Expect = 5.4e-115
Identity = 304/764 (39.79%), Positives = 421/764 (55.10%), Query Frame = 0

Query: 2   NPSL------PSSNLTFPNFLLSNPNP---NP-NFSLPESRDPPLDLSSSLSSLRNLIHV 61
           NP+L      P+SN   PNF    P P   NP N+S+  S  P  +LS +LSSL++L+  
Sbjct: 14  NPNLFYHYPPPNSN---PNFFFRPPPPPLQNPNNYSIVPSPPPIRELSGTLSSLKSLLSE 73

Query: 62  ANQTLQSLSY-LTLIQTPSAKPDDSG-FVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRID 121
             +TL SLS  L L  +   + D++G FV+C FD  H +PP +LF HSL CP+     +D
Sbjct: 74  CQRTLDSLSQNLALDHSSLLQKDENGCFVRCPFDSNHFMPPEALFLHSLRCPNT----LD 133

Query: 122 PTQLLQSL-LYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGV 181
              LL+S   Y  TL+   EL         + + D DLC SL+D +D  SNFFY DCPG 
Sbjct: 134 LIHLLESFSSYRNTLELPCEL--------QLNNGDGDLCISLDDLADFGSNFFYRDCPGA 193

Query: 182 VALSNRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMW 241
           V  S  D   +  TLP VL+VEC++FV  D+  +++    K + +LPSDL A+++E++ W
Sbjct: 194 VKFSELDGKKRTLTLPHVLSVECSDFVGSDEKVKKIVLD-KCLGVLPSDLCAMKNEIDQW 253

Query: 242 NDYPSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAI 301
            D+PS YS  V+ SI+GS++   + L+ WI+ NS RYGVIID  MRDHIFLLFRLC  + 
Sbjct: 254 RDFPSSYSSSVLSSIVGSKVVEISALRKWILVNSTRYGVIIDTFMRDHIFLLFRLCLKSA 313

Query: 302 SKEAMGFQI---ALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAIN 361
            KEA GF++   A + G        + +F+CP+ +QVL WLASQL+VLYGE NGKFFA++
Sbjct: 314 VKEACGFRMESDATDVGEQKIMSCKSSTFECPVFIQVLSWLASQLAVLYGEGNGKFFALD 373

Query: 362 MLRQRILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVY- 421
           M +Q I+ +AS + L   E  R +   +VE          D +  N   + +K   N   
Sbjct: 374 MFKQCIVESASQVMLFRLEGTRSKCSGVVE-------DLDDARLRNKDVIMEKPFENSSG 433

Query: 422 -VTGEETVNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRA 481
              G+   + +VI VS+V+AAVAAL+ER LLEEKI+A+R++   T++Q  +E  ++  +A
Sbjct: 434 GECGKTLDSPQVISVSRVSAAVAALYERSLLEEKIRAVRYAQPLTRYQRAAELGFMTAKA 493

Query: 482 CEERKRRCNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRS 541
            EER RRC+YRPII+HDG P+Q+S ++D DK KT+EELLAEERDYKRRRMSYRGKK KR+
Sbjct: 494 DEERNRRCSYRPIIDHDGRPRQRSLNQDMDKMKTREELLAEERDYKRRRMSYRGKKVKRT 553

Query: 542 TLQVTRDIIEEYMDEIMKAGGIGYFVKG---TEERGIKSEQPTDHYITRDSIADGHTKGS 601
             QV  D+IEEY +EI  AGGIG F KG        I ++Q    +       D   KG 
Sbjct: 554 PRQVLHDMIEEYTEEIKLAGGIGCFEKGMPLQSRSPIGNDQKESDFGYSIPSTDKQWKGE 613

Query: 602 NSS---YGETRHHTSSHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDRE 661
           N +   Y       S   ++   YD+  S  ++  +  Y H    +D+ +   KDK+   
Sbjct: 614 NRADIEYPIDNRQNSDKVKRHDEYDSGSSQRQQSHRS-YKHSDRRDDKLRDRRKDKH--- 673

Query: 662 HYHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSG 721
                                 +DR D+E   T+ H   G   S  NY   R  SSS   
Sbjct: 674 ----------------------NDRRDDEFTRTKRHSIEG--ESYQNYRSSREKSSS--- 710

Query: 722 GDSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSK 742
                    +K +  D +     D  S      N F DRY P++
Sbjct: 734 --------DYKTKRDDPY-----DRRSQQPRNQNLFEDRYIPTE 710
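
The percentages in an HSP header are the match counts divided by the alignment length. As a sketch of how they can be recomputed, the hypothetical helper `hsp_percentages` below extracts every `name = matches/length` pair from a header line and rounds the ratio to two decimals.

```python
import re

def hsp_percentages(header):
    """Recompute percentages from 'name = matches/length' pairs in an HSP header."""
    return {
        name: round(100 * int(matches) / int(length), 2)
        for name, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header)
    }

# Identity figure from the Swiss-Prot hit Q9M8X2 above.
print(hsp_percentages("Identity = 304/764 (39.79%)"))
```

For this hit, 304 identical positions over 764 alignment columns gives 39.79% identity; the similarity count of 421 over the same 764 columns gives 55.10%.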

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: XP_023530557.1 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein [Cucurbita pepo subsp. pepo] >XP_023530558.1 U11/U12 small nuclear ribonucleoprotein 48 kDa protein [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1486 bits (3848), Expect = 0.0
Identity = 761/762 (99.87%), Positives = 761/762 (99.87%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60
           MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS
Sbjct: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60

Query: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120
           YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP
Sbjct: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120

Query: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180
           RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV
Sbjct: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180

Query: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240
           FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI
Sbjct: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240

Query: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300
           RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE
Sbjct: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300

Query: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360
           NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL
Sbjct: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360

Query: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420
           LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ
Sbjct: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420

Query: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480
           VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD
Sbjct: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480

Query: 481 GLPKQQSHDE-DRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEI 540
           GLPKQQSHDE DRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEI
Sbjct: 481 GLPKQQSHDEQDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEI 540

Query: 541 MKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQS 600
           MKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQS
Sbjct: 541 MKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQS 600

Query: 601 NYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYP 660
           NYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYP
Sbjct: 601 NYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYP 660

Query: 661 SDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNT 720
           SDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNT
Sbjct: 661 SDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNT 720

Query: 721 ADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPDV 761
           ADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPDV
Sbjct: 721 ADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPDV 762
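
The 761/762 identity for this hit comes from a single gap: the subject carries one extra residue (`DEQDRDK` against `DE-DRDK` in the query), so the alignment has 762 columns, of which 761 are identical. A minimal sketch of that bookkeeping follows; `alignment_stats` is a hypothetical helper and the input is a short excerpt of the aligned rows above.

```python
def alignment_stats(query, sbjct):
    """Return (identical columns, total columns, gap columns) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    identical = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    gaps = sum(q == "-" or s == "-" for q, s in zip(query, sbjct))
    return identical, len(query), gaps

# Excerpt around the only gap in the alignment.
print(alignment_stats("SHDE-DRDK", "SHDEQDRDK"))
```

Gap columns count toward the alignment length but never toward the identities, which is why a single insertion lowers the figure from 100% to 99.87%.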

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: XP_022927551.1 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein [Cucurbita moschata])

HSP 1 Score: 1442 bits (3733), Expect = 0.0
Identity = 740/760 (97.37%), Positives = 746/760 (98.16%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60
           MNPSLPSSNLT+P F  SNPNPNPNFSLPESR+PPLDLSSSL SLRNLIHVANQTLQSLS
Sbjct: 25  MNPSLPSSNLTYPKFQPSNPNPNPNFSLPESREPPLDLSSSLYSLRNLIHVANQTLQSLS 84

Query: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120
           YLTLIQTPS KPDDSGFVQCNFD RHRVPPHSLFHHSLLC SAPPPRIDPTQLLQSLLYP
Sbjct: 85  YLTLIQTPSPKPDDSGFVQCNFDLRHRVPPHSLFHHSLLCTSAPPPRIDPTQLLQSLLYP 144

Query: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180
           RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV
Sbjct: 145 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 204

Query: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240
           FTLPRVLAVECANFVCDDDGDREMK SLKGI MLPSDLWALRSEVEMWNDYPSVYSHIVI
Sbjct: 205 FTLPRVLAVECANFVCDDDGDREMKSSLKGIIMLPSDLWALRSEVEMWNDYPSVYSHIVI 264

Query: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300
           RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE
Sbjct: 265 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 324

Query: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360
           NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL
Sbjct: 325 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 384

Query: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420
           LPYEQKRVESPTLVEGSHNLVSS SDT+SVNLKELDKKVINNVYV+GEETVNCRVIFVSQ
Sbjct: 385 LPYEQKRVESPTLVEGSHNLVSSCSDTRSVNLKELDKKVINNVYVSGEETVNCRVIFVSQ 444

Query: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480
           VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYI QRACEERKRRCNYRPIIEHD
Sbjct: 445 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYISQRACEERKRRCNYRPIIEHD 504

Query: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540
           GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM
Sbjct: 505 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 564

Query: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN 600
           KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHH SSHSRKQSN
Sbjct: 565 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHISSHSRKQSN 624

Query: 601 YDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYPS 660
           YDNRYSASEKPPKGRYGHYGSPEDERKSVG DKYDREHYHRFSDRSSSPS+SHKWKRYPS
Sbjct: 625 YDNRYSASEKPPKGRYGHYGSPEDERKSVGIDKYDREHYHRFSDRSSSPSQSHKWKRYPS 684

Query: 661 DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA 720
           DRDDE+PAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA
Sbjct: 685 DRDDEKPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA 744

Query: 721 DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760
           DNHSSDSSVLN FNDRYTPSKCHDQLEDDHSTGRRYVNPD
Sbjct: 745 DNHSSDSSVLNPFNDRYTPSKCHDQLEDDHSTGRRYVNPD 784

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: KAG6588294.1 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1438 bits (3723), Expect = 0.0
Identity = 740/760 (97.37%), Positives = 745/760 (98.03%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60
           MNPSLPSSNLT+P F  SNPNPNPNFSL ESRDPPLDLSSSL SLRNLIHVANQTLQSLS
Sbjct: 1   MNPSLPSSNLTYPKFQPSNPNPNPNFSLLESRDPPLDLSSSLYSLRNLIHVANQTLQSLS 60

Query: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120
           YLTLIQTPS KPDDSGFVQCNFDRRHRVP HSLFHHSLLCPSAPPPRIDPTQLLQSLLYP
Sbjct: 61  YLTLIQTPSPKPDDSGFVQCNFDRRHRVPTHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120

Query: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180
           RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYL CPGVVALSNRDEMSKV
Sbjct: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLGCPGVVALSNRDEMSKV 180

Query: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240
           F+LPRVLAVECANFVCDDDGDR MK SLKGI MLPSDLWALRSEVEMWNDYPSVYSHIVI
Sbjct: 181 FSLPRVLAVECANFVCDDDGDRVMKSSLKGIIMLPSDLWALRSEVEMWNDYPSVYSHIVI 240

Query: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300
           RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE
Sbjct: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300

Query: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360
           NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL
Sbjct: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360

Query: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420
           LPYEQKRVESPTLVEGSHNLVSS SDT+SVNLKELDKKVINNVYVTGEETVNCRVIFVSQ
Sbjct: 361 LPYEQKRVESPTLVEGSHNLVSSCSDTRSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420

Query: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480
           VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYI QRACEERKRRCNYRPIIEHD
Sbjct: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYISQRACEERKRRCNYRPIIEHD 480

Query: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540
           GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM
Sbjct: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540

Query: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN 600
           KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN
Sbjct: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN 600

Query: 601 YDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYPS 660
           YDNRYSASEKPPKGRYGHYGSPEDERKSVG DKYDREHYHRFSDRSSSPS+SHKWKRYPS
Sbjct: 601 YDNRYSASEKPPKGRYGHYGSPEDERKSVGIDKYDREHYHRFSDRSSSPSQSHKWKRYPS 660

Query: 661 DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA 720
           DRDDE+PAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARK GHKLRASDSWKRNTA
Sbjct: 661 DRDDEKPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKAGHKLRASDSWKRNTA 720

Query: 721 DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760
           DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD
Sbjct: 721 DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: KAG7020854.1 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1438 bits (3722), Expect = 0.0
Identity = 739/760 (97.24%), Positives = 744/760 (97.89%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60
           MNPSLPSSNLT+P F  SNPNPNPNFSLPESRDPPLDLSSSL SLRNLIHVANQTLQSLS
Sbjct: 1   MNPSLPSSNLTYPKFQPSNPNPNPNFSLPESRDPPLDLSSSLYSLRNLIHVANQTLQSLS 60

Query: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120
           YLTLIQ PS KPDDSGFVQCNFDRRHRVP HSLFHHSLLCPSAPPPRIDPTQLLQSLLYP
Sbjct: 61  YLTLIQIPSPKPDDSGFVQCNFDRRHRVPTHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120

Query: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180
           RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYL CPGVVALSNRDEMSKV
Sbjct: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLGCPGVVALSNRDEMSKV 180

Query: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240
           F+LPRVLAVECANFVCDDDGDR MK SLKGI MLPSDLWALRSEVEMWNDYPSVYSHIVI
Sbjct: 181 FSLPRVLAVECANFVCDDDGDRVMKSSLKGIIMLPSDLWALRSEVEMWNDYPSVYSHIVI 240

Query: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300
           RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE
Sbjct: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300

Query: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360
           NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQ ILYAASGLSL
Sbjct: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQHILYAASGLSL 360

Query: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420
           LPYEQKRVESPTLVEGSHNLVSS SDT+SVNLKELDKKVINNVYVTGEETVNCRVIFVSQ
Sbjct: 361 LPYEQKRVESPTLVEGSHNLVSSCSDTRSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420

Query: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480
           VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYI QRACEERKRRCNYRPIIEHD
Sbjct: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYISQRACEERKRRCNYRPIIEHD 480

Query: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540
           GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM
Sbjct: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540

Query: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN 600
           KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN
Sbjct: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN 600

Query: 601 YDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYPS 660
           YDNRYSASEKPPKGRYGHYGSPEDERKSVG DKYDREHYHRFSDRSSSPS+SHKWKRYPS
Sbjct: 601 YDNRYSASEKPPKGRYGHYGSPEDERKSVGIDKYDREHYHRFSDRSSSPSQSHKWKRYPS 660

Query: 661 DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA 720
           DRDDE+PAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARK GHKLRASDSWKRNTA
Sbjct: 661 DRDDEKPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKAGHKLRASDSWKRNTA 720

Query: 721 DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760
           DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD
Sbjct: 721 DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760

BLAST of Cp4.1LG04g02050 vs. NCBI nr
Match: XP_023004252.1 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X2 [Cucurbita maxima])

HSP 1 Score: 1429 bits (3699), Expect = 0.0
Identity = 735/760 (96.71%), Positives = 744/760 (97.89%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60
           MNPSLPSSNLTFP F  SNPNPN  FSLP+SR+PPLDLSSSLSSLRNLIHVANQTLQSLS
Sbjct: 1   MNPSLPSSNLTFPKFQPSNPNPN--FSLPDSREPPLDLSSSLSSLRNLIHVANQTLQSLS 60

Query: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120
           YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP
Sbjct: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120

Query: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180
           RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV
Sbjct: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180

Query: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240
           FTLPRVLAVECANFVCDDDGDREMK SLKGIRMLPSDLWALR EVEMWNDYPSVYSHIVI
Sbjct: 181 FTLPRVLAVECANFVCDDDGDREMKSSLKGIRMLPSDLWALRGEVEMWNDYPSVYSHIVI 240

Query: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300
           RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE
Sbjct: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300

Query: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360
           NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEM+GKFFAINMLRQRILYAASGLSL
Sbjct: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMSGKFFAINMLRQRILYAASGLSL 360

Query: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420
           LPYEQKRVESPTLVEGSHNLVSS SDTQSVNLK+LD KVINNVYVTGEETVNCRVIFVSQ
Sbjct: 361 LPYEQKRVESPTLVEGSHNLVSSCSDTQSVNLKKLDMKVINNVYVTGEETVNCRVIFVSQ 420

Query: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480
           VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYI QRACEERKRRC+YRPIIEHD
Sbjct: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYISQRACEERKRRCSYRPIIEHD 480

Query: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540
           GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM
Sbjct: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540

Query: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN 600
           KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHH+SSHSRKQSN
Sbjct: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHSSSHSRKQSN 600

Query: 601 YDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYPS 660
           YDNRYSASEKPPKGR GHYGSPEDERK V KDKYDREHYHRFS+RSSSPS+SHKWKRYPS
Sbjct: 601 YDNRYSASEKPPKGRCGHYGSPEDERKGVSKDKYDREHYHRFSNRSSSPSQSHKWKRYPS 660

Query: 661 DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA 720
           DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGG+SSARKD HKLRASDSWKRNTA
Sbjct: 661 DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGNSSARKDDHKLRASDSWKRNTA 720

Query: 721 DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760
           DNHSSDS VLNSFNDRYTPSKCHDQLEDDHS GRRYVNPD
Sbjct: 721 DNHSSDSLVLNSFNDRYTPSKCHDQLEDDHSAGRRYVNPD 758

BLAST of Cp4.1LG04g02050 vs. ExPASy TrEMBL
Match: A0A6J1EI02 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Cucurbita moschata OX=3662 GN=LOC111434345 PE=4 SV=1)

HSP 1 Score: 1442 bits (3733), Expect = 0.0
Identity = 740/760 (97.37%), Positives = 746/760 (98.16%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60
           MNPSLPSSNLT+P F  SNPNPNPNFSLPESR+PPLDLSSSL SLRNLIHVANQTLQSLS
Sbjct: 25  MNPSLPSSNLTYPKFQPSNPNPNPNFSLPESREPPLDLSSSLYSLRNLIHVANQTLQSLS 84

Query: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120
           YLTLIQTPS KPDDSGFVQCNFD RHRVPPHSLFHHSLLC SAPPPRIDPTQLLQSLLYP
Sbjct: 85  YLTLIQTPSPKPDDSGFVQCNFDLRHRVPPHSLFHHSLLCTSAPPPRIDPTQLLQSLLYP 144

Query: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180
           RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV
Sbjct: 145 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 204

Query: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240
           FTLPRVLAVECANFVCDDDGDREMK SLKGI MLPSDLWALRSEVEMWNDYPSVYSHIVI
Sbjct: 205 FTLPRVLAVECANFVCDDDGDREMKSSLKGIIMLPSDLWALRSEVEMWNDYPSVYSHIVI 264

Query: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300
           RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE
Sbjct: 265 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 324

Query: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360
           NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL
Sbjct: 325 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 384

Query: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420
           LPYEQKRVESPTLVEGSHNLVSS SDT+SVNLKELDKKVINNVYV+GEETVNCRVIFVSQ
Sbjct: 385 LPYEQKRVESPTLVEGSHNLVSSCSDTRSVNLKELDKKVINNVYVSGEETVNCRVIFVSQ 444

Query: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480
           VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYI QRACEERKRRCNYRPIIEHD
Sbjct: 445 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYISQRACEERKRRCNYRPIIEHD 504

Query: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540
           GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM
Sbjct: 505 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 564

Query: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN 600
           KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHH SSHSRKQSN
Sbjct: 565 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHISSHSRKQSN 624

Query: 601 YDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYPS 660
           YDNRYSASEKPPKGRYGHYGSPEDERKSVG DKYDREHYHRFSDRSSSPS+SHKWKRYPS
Sbjct: 625 YDNRYSASEKPPKGRYGHYGSPEDERKSVGIDKYDREHYHRFSDRSSSPSQSHKWKRYPS 684

Query: 661 DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA 720
           DRDDE+PAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA
Sbjct: 685 DRDDEKPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA 744

Query: 721 DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760
           DNHSSDSSVLN FNDRYTPSKCHDQLEDDHSTGRRYVNPD
Sbjct: 745 DNHSSDSSVLNPFNDRYTPSKCHDQLEDDHSTGRRYVNPD 784
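
The Identity and Positives figures in each HSP header can be recomputed from the middle row of the alignment, assuming the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch under that assumption. The function name `hsp_stats` is hypothetical and is not part of BLAST or of this database.

```python
def hsp_stats(query: str, midline: str) -> dict:
    """Recompute identity/positive counts for one aligned block.

    `query` is the aligned query row (gap characters included) and `midline`
    is the row printed beneath it. Trailing spaces are often lost from the
    midline, so it is padded back to the query length before counting.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # "+" = conservative substitution
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


# A 12-residue stretch taken from the alignment above (query positions 661-672).
stats = hsp_stats("DRDDEEPAETRH", "DRDDE+PAETRH")
print(stats)
```

Summed over every block of an HSP, the counts should reproduce the header line; for example, 740 identities over 760 aligned positions gives 97.37%.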

BLAST of Cp4.1LG04g02050 vs. ExPASy TrEMBL
Match: A0A6J1KPX5 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111497626 PE=4 SV=1)

HSP 1 Score: 1429 bits (3699), Expect = 0.0
Identity = 735/760 (96.71%), Positives = 744/760 (97.89%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60
           MNPSLPSSNLTFP F  SNPNPN  FSLP+SR+PPLDLSSSLSSLRNLIHVANQTLQSLS
Sbjct: 1   MNPSLPSSNLTFPKFQPSNPNPN--FSLPDSREPPLDLSSSLSSLRNLIHVANQTLQSLS 60

Query: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120
           YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP
Sbjct: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120

Query: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180
           RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV
Sbjct: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180

Query: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240
           FTLPRVLAVECANFVCDDDGDREMK SLKGIRMLPSDLWALR EVEMWNDYPSVYSHIVI
Sbjct: 181 FTLPRVLAVECANFVCDDDGDREMKSSLKGIRMLPSDLWALRGEVEMWNDYPSVYSHIVI 240

Query: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300
           RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE
Sbjct: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300

Query: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360
           NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEM+GKFFAINMLRQRILYAASGLSL
Sbjct: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMSGKFFAINMLRQRILYAASGLSL 360

Query: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420
           LPYEQKRVESPTLVEGSHNLVSS SDTQSVNLK+LD KVINNVYVTGEETVNCRVIFVSQ
Sbjct: 361 LPYEQKRVESPTLVEGSHNLVSSCSDTQSVNLKKLDMKVINNVYVTGEETVNCRVIFVSQ 420

Query: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480
           VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYI QRACEERKRRC+YRPIIEHD
Sbjct: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYISQRACEERKRRCSYRPIIEHD 480

Query: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540
           GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM
Sbjct: 481 GLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEIM 540

Query: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQSN 600
           KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHH+SSHSRKQSN
Sbjct: 541 KAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHSSSHSRKQSN 600

Query: 601 YDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYPS 660
           YDNRYSASEKPPKGR GHYGSPEDERK V KDKYDREHYHRFS+RSSSPS+SHKWKRYPS
Sbjct: 601 YDNRYSASEKPPKGRCGHYGSPEDERKGVSKDKYDREHYHRFSNRSSSPSQSHKWKRYPS 660

Query: 661 DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNTA 720
           DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGG+SSARKD HKLRASDSWKRNTA
Sbjct: 661 DRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGNSSARKDDHKLRASDSWKRNTA 720

Query: 721 DNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760
           DNHSSDS VLNSFNDRYTPSKCHDQLEDDHS GRRYVNPD
Sbjct: 721 DNHSSDSLVLNSFNDRYTPSKCHDQLEDDHSAGRRYVNPD 758

BLAST of Cp4.1LG04g02050 vs. ExPASy TrEMBL
Match: A0A6J1KU07 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111497626 PE=4 SV=1)

HSP 1 Score: 1424 bits (3687), Expect = 0.0
Identity = 735/761 (96.58%), Positives = 744/761 (97.77%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPNPNFSLPESRDPPLDLSSSLSSLRNLIHVANQTLQSLS 60
           MNPSLPSSNLTFP F  SNPNPN  FSLP+SR+PPLDLSSSLSSLRNLIHVANQTLQSLS
Sbjct: 1   MNPSLPSSNLTFPKFQPSNPNPN--FSLPDSREPPLDLSSSLSSLRNLIHVANQTLQSLS 60

Query: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120
           YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP
Sbjct: 61  YLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQSLLYP 120

Query: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180
           RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV
Sbjct: 121 RTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDEMSKV 180

Query: 181 FTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYSHIVI 240
           FTLPRVLAVECANFVCDDDGDREMK SLKGIRMLPSDLWALR EVEMWNDYPSVYSHIVI
Sbjct: 181 FTLPRVLAVECANFVCDDDGDREMKSSLKGIRMLPSDLWALRGEVEMWNDYPSVYSHIVI 240

Query: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300
           RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE
Sbjct: 241 RSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQIALE 300

Query: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAASGLSL 360
           NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEM+GKFFAINMLRQRILYAASGLSL
Sbjct: 301 NGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMSGKFFAINMLRQRILYAASGLSL 360

Query: 361 LPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVIFVSQ 420
           LPYEQKRVESPTLVEGSHNLVSS SDTQSVNLK+LD KVINNVYVTGEETVNCRVIFVSQ
Sbjct: 361 LPYEQKRVESPTLVEGSHNLVSSCSDTQSVNLKKLDMKVINNVYVTGEETVNCRVIFVSQ 420

Query: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPIIEHD 480
           VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYI QRACEERKRRC+YRPIIEHD
Sbjct: 421 VAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYISQRACEERKRRCSYRPIIEHD 480

Query: 481 GLPKQQSHDE-DRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEI 540
           GLPKQQSHDE DRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEI
Sbjct: 481 GLPKQQSHDEQDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYMDEI 540

Query: 541 MKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSRKQS 600
           MKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHH+SSHSRKQS
Sbjct: 541 MKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHSSSHSRKQS 600

Query: 601 NYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWKRYP 660
           NYDNRYSASEKPPKGR GHYGSPEDERK V KDKYDREHYHRFS+RSSSPS+SHKWKRYP
Sbjct: 601 NYDNRYSASEKPPKGRCGHYGSPEDERKGVSKDKYDREHYHRFSNRSSSPSQSHKWKRYP 660

Query: 661 SDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWKRNT 720
           SDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGG+SSARKD HKLRASDSWKRNT
Sbjct: 661 SDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGNSSARKDDHKLRASDSWKRNT 720

Query: 721 ADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760
           ADNHSSDS VLNSFNDRYTPSKCHDQLEDDHS GRRYVNPD
Sbjct: 721 ADNHSSDSLVLNSFNDRYTPSKCHDQLEDDHSAGRRYVNPD 759

BLAST of Cp4.1LG04g02050 vs. ExPASy TrEMBL
Match: A0A6J1DFF2 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Momordica charantia OX=3673 GN=LOC111019602 PE=4 SV=1)

HSP 1 Score: 1073 bits (2776), Expect = 0.0
Identity = 573/768 (74.61%), Positives = 646/768 (84.11%), Query Frame = 0

Query: 1   MNPSLPSSNLTFPNFLLSNPNPN----PNFSLPESRDPPLDLSSSLSSLRNLIHVANQTL 60
           MNP+LP  N +FPNFL  NPNPN    P FS  ES+D PLDLSSSLSSL+NLIHVANQTL
Sbjct: 128 MNPALPFPNQSFPNFLPQNPNPNLFVSPEFSHSESQDLPLDLSSSLSSLKNLIHVANQTL 187

Query: 61  QSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQS 120
           QSLSYL+L + PS   DDS FVQC FDRRHR+PPHSLF HSLLCPSAP PRIDPT LL S
Sbjct: 188 QSLSYLSLTRNPSDNGDDSDFVQCAFDRRHRLPPHSLFRHSLLCPSAPQPRIDPTHLLHS 247

Query: 121 LLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDE 180
           LLYP +LQSSR+L+NEKRF Q +PDSDADLCFSLND++D SSNFFY+DCPGVV L ++DE
Sbjct: 248 LLYPHSLQSSRDLINEKRFHQALPDSDADLCFSLNDFADSSSNFFYVDCPGVVTLCDQDE 307

Query: 181 MSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYS 240
           MSKVFTLPRVLAVECANFV  D   REM+ + K  R+LPS+LWALR+EVEMWNDYP++YS
Sbjct: 308 MSKVFTLPRVLAVECANFVSKDG--REMESAWKETRILPSELWALRNEVEMWNDYPTMYS 367

Query: 241 HIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQ 300
           HIV+RS+LG+++ +D+ L TWIIANSPRYGV+IDVAMRDHI  LFRLCFMAI KEA+GFQ
Sbjct: 368 HIVLRSLLGTKLVIDDHLMTWIIANSPRYGVVIDVAMRDHILPLFRLCFMAILKEAVGFQ 427

Query: 301 IALENGNGMEGGS----GNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRIL 360
           IALENGN ME GS    G+ +FKCP+L+QVL+WLASQLSVLYGE NGKFFAI+MLRQ IL
Sbjct: 428 IALENGNEMEDGSLIISGSHNFKCPVLIQVLMWLASQLSVLYGETNGKFFAIHMLRQCIL 487

Query: 361 YAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVN 420
            AASGL LLP EQK         GSHN  S+ SDTQ+V  K L ++V N   VTGEETV 
Sbjct: 488 DAASGLLLLPLEQKPGH------GSHNQESTCSDTQNVKSKRLVQEVRNYDNVTGEETVT 547

Query: 421 CRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCN 480
           C VIFVSQVAAA+AALHERFLLEEKIKA+RFSHLQTK+QLVSEH YI QRACEERKRRCN
Sbjct: 548 CTVIFVSQVAAAIAALHERFLLEEKIKAVRFSHLQTKYQLVSEHKYISQRACEERKRRCN 607

Query: 481 YRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDII 540
           YRPIIEHDGLPKQQ+HDE   KTKT+EELLAEERDYKRRRMSYRGKKAKRSTLQVTRDII
Sbjct: 608 YRPIIEHDGLPKQQAHDEGTGKTKTREELLAEERDYKRRRMSYRGKKAKRSTLQVTRDII 667

Query: 541 EEYMDEIMKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTS 600
           EEYM+EIMKAGGIG FVK +EERG+K EQP +H I RD IAD +TKGSN +YG  +H +S
Sbjct: 668 EEYMEEIMKAGGIGCFVKRSEERGVKYEQPAEHNIKRDIIADEYTKGSNDTYGAAKH-SS 727

Query: 601 SHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSES 660
           SHS+KQS YD+RYSAS KP KG+YGHYGS EDE KSVGKDKYD+  Y+R  DRSSSPS S
Sbjct: 728 SHSKKQSYYDDRYSASNKPQKGQYGHYGSLEDEGKSVGKDKYDQGRYYRSLDRSSSPSLS 787

Query: 661 HKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRAS 720
           HK  RY SDRDDE P +TRHH++R ++SSSSN+HGFRS SSSRS G SSA KDGHK RA 
Sbjct: 788 HKRNRYLSDRDDEVPTKTRHHEAREITSSSSNFHGFRSPSSSRSAGGSSAMKDGHKSRAG 847

Query: 721 DSWKRNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRRYVNPD 760
           D+WKR+TA+NH S SS+LNSFNDRYTP++ HDQLEDD+STG RYVNPD
Sbjct: 848 DNWKRSTANNHGSKSSMLNSFNDRYTPTEWHDQLEDDYSTGSRYVNPD 886

BLAST of Cp4.1LG04g02050 vs. ExPASy TrEMBL
Match: A0A5D3BCH1 (U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold506G00550 PE=4 SV=1)

HSP 1 Score: 1040 bits (2690), Expect = 0.0
Identity = 565/759 (74.44%), Positives = 632/759 (83.27%), Query Frame = 0

Query: 1   MNPSLP-SSNLTFPNFLLSNPNPNPNF---SLPESRDPPLDLSSSLSSLRNLIHVANQTL 60
           +NPSLP   N TFP+FL  NPNPN +    S  ES+ P LDL SS SSL  LIH+ANQTL
Sbjct: 4   VNPSLPFPPNQTFPSFLPPNPNPNSHIHDSSHSESQHPSLDLPSSFSSLNTLIHLANQTL 63

Query: 61  QSLSYLTLIQTPSAKPDDSGFVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRIDPTQLLQS 120
           +SLSYLT    PS   + S  + C FDRRHRVPPHSLF HSLLCPSA    IDPTQL QS
Sbjct: 64  ESLSYLT----PSVFANHSRLLHCYFDRRHRVPPHSLFRHSLLCPSASLHPIDPTQLFQS 123

Query: 121 LLYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGVVALSNRDE 180
           LLYP+TL SS +LVNE RF Q +PDSDADLCFSL DY+D +SNFFY DCPGVVALSN DE
Sbjct: 124 LLYPQTLHSSHQLVNENRFSQVLPDSDADLCFSLTDYTDATSNFFYADCPGVVALSNLDE 183

Query: 181 MSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMWNDYPSVYS 240
           MSKVFTLPRVLAV CANFV +D    EM  +L GIR+LPSDLW LRSEVE+WNDYP+ YS
Sbjct: 184 MSKVFTLPRVLAVHCANFVGNDH--LEMNSTLNGIRILPSDLWILRSEVEIWNDYPNKYS 243

Query: 241 HIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAISKEAMGFQ 300
            +V+RSILGSEM +++ L TWII NSPRYGV+IDVA+RDHIFLLFRLCFMAI KEA+GFQ
Sbjct: 244 FVVLRSILGSEMLLNSHLMTWIIENSPRYGVVIDVALRDHIFLLFRLCFMAIYKEALGFQ 303

Query: 301 IALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAINMLRQRILYAAS 360
           +ALE GNGMEGGS N  FKCPIL+QVL+WLASQLSVLYGE NGKFFA+NMLRQ IL AA 
Sbjct: 304 VALEKGNGMEGGSVNSCFKCPILIQVLMWLASQLSVLYGETNGKFFAVNMLRQCILDAAL 363

Query: 361 GLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVYVTGEETVNCRVI 420
            L LLP EQK  E  TL +G H+L  S SD QSV + ELD+KV+NN  VTG ETVNCRVI
Sbjct: 364 RL-LLPSEQKSTEGLTLGKGCHDLEISCSDIQSVKMNELDQKVVNNGNVTGGETVNCRVI 423

Query: 421 FVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRACEERKRRCNYRPI 480
            VSQVAAAVAALHERFLLEEKIKALRF+HLQTK+Q VSE+NYI QRACEERKRRCNYRPI
Sbjct: 424 LVSQVAAAVAALHERFLLEEKIKALRFAHLQTKYQRVSEYNYISQRACEERKRRCNYRPI 483

Query: 481 IEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYM 540
           IEHDGLPKQQS++ED +KTKT+EELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYM
Sbjct: 484 IEHDGLPKQQSYNEDANKTKTREELLAEERDYKRRRMSYRGKKAKRSTLQVTRDIIEEYM 543

Query: 541 DEIMKAGGIGYFVKGTEERGIKSEQPTDHYITRDSIADGHTKGSNSSYGETRHHTSSHSR 600
           +EIMKAGGIG FVKG EERGIKSEQP+DH ITR+ +AD HT+GSN   G+ RH +S HS+
Sbjct: 544 EEIMKAGGIGCFVKGPEERGIKSEQPSDHNITRNIVADVHTRGSNDPCGDARH-SSGHSK 603

Query: 601 KQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDREHYHRFSDRSSSPSESHKWK 660
           KQS +D+RY  S+KP KG Y HYGSPEDERK   KDKYDR+HYHRFSD+SS PS+SHKWK
Sbjct: 604 KQSFHDSRYLVSDKPQKGHYEHYGSPEDERKISHKDKYDRDHYHRFSDQSSIPSQSHKWK 663

Query: 661 RYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSGGDSSARKDGHKLRASDSWK 720
           RYP+DRDDE PAETRHH+++ L+SSSS  HG RSSSSSRSG  SSARKD HKLRASDSWK
Sbjct: 664 RYPNDRDDEVPAETRHHETKKLASSSS--HG-RSSSSSRSGNGSSARKDSHKLRASDSWK 723

Query: 721 RNTADNHSSDSSVLNSFNDRYTPSKCHDQLEDDHSTGRR 755
           RNTADNHSS+  V NSF+DRYTPS+CHD+LED++ST  R
Sbjct: 724 RNTADNHSSEHLVSNSFSDRYTPSECHDELEDEYSTVSR 751

BLAST of Cp4.1LG04g02050 vs. TAIR 10
Match: AT3G04160.1 (unknown protein; Has 1711 Blast hits to 1353 proteins in 195 species: Archae - 0; Bacteria - 64; Metazoa - 693; Fungi - 201; Plants - 207; Viruses - 0; Other Eukaryotes - 546 (source: NCBI BLink). )

HSP 1 Score: 416.8 bits (1070), Expect = 3.8e-116
Identity = 304/764 (39.79%), Positives = 421/764 (55.10%), Query Frame = 0

Query: 2   NPSL------PSSNLTFPNFLLSNPNP---NP-NFSLPESRDPPLDLSSSLSSLRNLIHV 61
           NP+L      P+SN   PNF    P P   NP N+S+  S  P  +LS +LSSL++L+  
Sbjct: 14  NPNLFYHYPPPNSN---PNFFFRPPPPPLQNPNNYSIVPSPPPIRELSGTLSSLKSLLSE 73

Query: 62  ANQTLQSLSY-LTLIQTPSAKPDDSG-FVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRID 121
             +TL SLS  L L  +   + D++G FV+C FD  H +PP +LF HSL CP+     +D
Sbjct: 74  CQRTLDSLSQNLALDHSSLLQKDENGCFVRCPFDSNHFMPPEALFLHSLRCPNT----LD 133

Query: 122 PTQLLQSL-LYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGV 181
              LL+S   Y  TL+   EL         + + D DLC SL+D +D  SNFFY DCPG 
Sbjct: 134 LIHLLESFSSYRNTLELPCEL--------QLNNGDGDLCISLDDLADFGSNFFYRDCPGA 193

Query: 182 VALSNRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMW 241
           V  S  D   +  TLP VL+VEC++FV  D+  +++    K + +LPSDL A+++E++ W
Sbjct: 194 VKFSELDGKKRTLTLPHVLSVECSDFVGSDEKVKKIVLD-KCLGVLPSDLCAMKNEIDQW 253

Query: 242 NDYPSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAI 301
            D+PS YS  V+ SI+GS++   + L+ WI+ NS RYGVIID  MRDHIFLLFRLC  + 
Sbjct: 254 RDFPSSYSSSVLSSIVGSKVVEISALRKWILVNSTRYGVIIDTFMRDHIFLLFRLCLKSA 313

Query: 302 SKEAMGFQI---ALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAIN 361
            KEA GF++   A + G        + +F+CP+ +QVL WLASQL+VLYGE NGKFFA++
Sbjct: 314 VKEACGFRMESDATDVGEQKIMSCKSSTFECPVFIQVLSWLASQLAVLYGEGNGKFFALD 373

Query: 362 MLRQRILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVY- 421
           M +Q I+ +AS + L   E  R +   +VE          D +  N   + +K   N   
Sbjct: 374 MFKQCIVESASQVMLFRLEGTRSKCSGVVE-------DLDDARLRNKDVIMEKPFENSSG 433

Query: 422 -VTGEETVNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSEHNYICQRA 481
              G+   + +VI VS+V+AAVAAL+ER LLEEKI+A+R++   T++Q  +E  ++  +A
Sbjct: 434 GECGKTLDSPQVISVSRVSAAVAALYERSLLEEKIRAVRYAQPLTRYQRAAELGFMTAKA 493

Query: 482 CEERKRRCNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAKRS 541
            EER RRC+YRPII+HDG P+Q+S ++D DK KT+EELLAEERDYKRRRMSYRGKK KR+
Sbjct: 494 DEERNRRCSYRPIIDHDGRPRQRSLNQDMDKMKTREELLAEERDYKRRRMSYRGKKVKRT 553

Query: 542 TLQVTRDIIEEYMDEIMKAGGIGYFVKG---TEERGIKSEQPTDHYITRDSIADGHTKGS 601
             QV  D+IEEY +EI  AGGIG F KG        I ++Q    +       D   KG 
Sbjct: 554 PRQVLHDMIEEYTEEIKLAGGIGCFEKGMPLQSRSPIGNDQKESDFGYSIPSTDKQWKGE 613

Query: 602 NSS---YGETRHHTSSHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYDRE 661
           N +   Y       S   ++   YD+  S  ++  +  Y H    +D+ +   KDK+   
Sbjct: 614 NRADIEYPIDNRQNSDKVKRHDEYDSGSSQRQQSHRS-YKHSDRRDDKLRDRRKDKH--- 673

Query: 662 HYHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSRSG 721
                                 +DR D+E   T+ H   G   S  NY   R  SSS   
Sbjct: 674 ----------------------NDRRDDEFTRTKRHSIEG--ESYQNYRSSREKSSS--- 710

Query: 722 GDSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSK 742
                    +K +  D +     D  S      N F DRY P++
Sbjct: 734 --------DYKTKRDDPY-----DRRSQQPRNQNLFEDRYIPTE 710

BLAST of Cp4.1LG04g02050 vs. TAIR 10
Match: AT3G04160.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 7 plant structures; EXPRESSED DURING: F mature embryo stage, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage. )

HSP 1 Score: 407.1 bits (1045), Expect = 3.0e-113
Identity = 302/766 (39.43%), Positives = 417/766 (54.44%), Query Frame = 0

Query: 2   NPSL------PSSNLTFPNFLLSNPNP---NP-NFSLPESRDPPLDLSSSLSSLRNLIHV 61
           NP+L      P+SN   PNF    P P   NP N+S+  S  P  +LS +LSSL++L+  
Sbjct: 14  NPNLFYHYPPPNSN---PNFFFRPPPPPLQNPNNYSIVPSPPPIRELSGTLSSLKSLLSE 73

Query: 62  ANQTLQSLSY-LTLIQTPSAKPDDSG-FVQCNFDRRHRVPPHSLFHHSLLCPSAPPPRID 121
             +TL SLS  L L  +   + D++G FV+C FD  H +PP +LF HSL CP+     +D
Sbjct: 74  CQRTLDSLSQNLALDHSSLLQKDENGCFVRCPFDSNHFMPPEALFLHSLRCPNT----LD 133

Query: 122 PTQLLQSL-LYPRTLQSSRELVNEKRFRQTMPDSDADLCFSLNDYSDGSSNFFYLDCPGV 181
              LL+S   Y  TL+   EL         + + D DLC SL+D +D  SNFFY DCPG 
Sbjct: 134 LIHLLESFSSYRNTLELPCEL--------QLNNGDGDLCISLDDLADFGSNFFYRDCPGA 193

Query: 182 VALSNRDEMSKVFTLPRVLAVECANFVCDDDGDREMKCSLKGIRMLPSDLWALRSEVEMW 241
           V  S  D   +  TLP VL+VEC++FV  D+  +++    K + +LPSDL A+++E++ W
Sbjct: 194 VKFSELDGKKRTLTLPHVLSVECSDFVGSDEKVKKIVLD-KCLGVLPSDLCAMKNEIDQW 253

Query: 242 NDYPSVYSHIVIRSILGSEMAVDNRLKTWIIANSPRYGVIIDVAMRDHIFLLFRLCFMAI 301
            D+PS YS  V+ SI+GS++   + L+ WI+ NS RYGVIID  MRDHIFLLFRLC  + 
Sbjct: 254 RDFPSSYSSSVLSSIVGSKVVEISALRKWILVNSTRYGVIIDTFMRDHIFLLFRLCLKSA 313

Query: 302 SKEAMGFQI---ALENGNGMEGGSGNLSFKCPILVQVLLWLASQLSVLYGEMNGKFFAIN 361
            KEA GF++   A + G        + +F+CP+ +QVL WLASQL+VLYGE NGKFFA++
Sbjct: 314 VKEACGFRMESDATDVGEQKIMSCKSSTFECPVFIQVLSWLASQLAVLYGEGNGKFFALD 373

Query: 362 MLRQRILYAASGLSLLPYEQKRVESPTLVEGSHNLVSSFSDTQSVNLKELDKKVINNVY- 421
           M +Q I+ +AS + L   E  R +   +VE          D +  N   + +K   N   
Sbjct: 374 MFKQCIVESASQVMLFRLEGTRSKCSGVVE-------DLDDARLRNKDVIMEKPFENSSG 433

Query: 422 -VTGEETVNCRVIFVSQVAAAVAALHERFLLEEKIKALRFSHLQTKHQLVSE--HNYICQ 481
              G+   + +VI VS+V+AAVAAL+ER LLEEKI+A+R++   T++Q +    H  +  
Sbjct: 434 GECGKTLDSPQVISVSRVSAAVAALYERSLLEEKIRAVRYAQPLTRYQRIISCLHLSLIP 493

Query: 482 RACEERKRRCNYRPIIEHDGLPKQQSHDEDRDKTKTKEELLAEERDYKRRRMSYRGKKAK 541
               ER RRC+YRPII+HDG P+Q+S ++D DK KT+EELLAEERDYKRRRMSYRGKK K
Sbjct: 494 HDVSERNRRCSYRPIIDHDGRPRQRSLNQDMDKMKTREELLAEERDYKRRRMSYRGKKVK 553

Query: 542 RSTLQVTRDIIEEYMDEIMKAGGIGYFVKG---TEERGIKSEQPTDHYITRDSIADGHTK 601
           R+  QV  D+IEEY +EI  AGGIG F KG        I ++Q    +       D   K
Sbjct: 554 RTPRQVLHDMIEEYTEEIKLAGGIGCFEKGMPLQSRSPIGNDQKESDFGYSIPSTDKQWK 613

Query: 602 GSNSS---YGETRHHTSSHSRKQSNYDNRYSASEKPPKGRYGHYGSPEDERKSVGKDKYD 661
           G N +   Y       S   ++   YD+  S  ++  +  Y H    +D+ +   KDK+ 
Sbjct: 614 GENRADIEYPIDNRQNSDKVKRHDEYDSGSSQRQQSHRS-YKHSDRRDDKLRDRRKDKH- 673

Query: 662 REHYHRFSDRSSSPSESHKWKRYPSDRDDEEPAETRHHKSRGLSSSSSNYHGFRSSSSSR 721
                                   +DR D+E   T+ H   G   S  NY   R  SSS 
Sbjct: 674 ------------------------NDRRDDEFTRTKRHSIEG--ESYQNYRSSREKSSS- 712

Query: 722 SGGDSSARKDGHKLRASDSWKRNTADNHSSDSSVLNSFNDRYTPSK 742
                      +K +  D +     D  S      N F DRY P++
Sbjct: 734 ----------DYKTKRDDPY-----DRRSQQPRNQNLFEDRYIPTE 712

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9M8X2 | 5.4e-115 | 39.79 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Arabidopsis thaliana O...

Match Name | E-value | Identity | Description
XP_023530557.1 | 0.0 | 99.87 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein [Cucurbita pepo subsp. pe...
XP_022927551.1 | 0.0 | 97.37 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein [Cucurbita moschata]
KAG6588294.1 | 0.0 | 97.37 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein, partial [Cucurbita argyr...
KAG7020854.1 | 0.0 | 97.24 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein [Cucurbita argyrosperma s...
XP_023004252.1 | 0.0 | 96.71 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X2 [Cucurbita max...

Match Name | E-value | Identity | Description
A0A6J1EI02 | 0.0 | 97.37 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Cucurbita moschata OX=...
A0A6J1KPX5 | 0.0 | 96.71 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X2 OS=Cucurbita m...
A0A6J1KU07 | 0.0 | 96.58 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein isoform X1 OS=Cucurbita m...
A0A6J1DFF2 | 0.0 | 74.61 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Momordica charantia OX...
A0A5D3BCH1 | 0.0 | 74.44 | U11/U12 small nuclear ribonucleoprotein 48 kDa protein OS=Cucumis melo var. maku...

Match Name | E-value | Identity | Description
AT3G04160.1 | 3.8e-116 | 39.79 | unknown protein; Has 1711 Blast hits to 1353 proteins in 195 species: Archae - 0...
AT3G04160.2 | 3.0e-113 | 39.43 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 608..676
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 574..588
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 700..720
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 556..761
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 721..739
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 740..761
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 556..572
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 478..499
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 677..699
None | No IPR available | PANTHER | PTHR21402 | UNCHARACTERIZED | coord: 19..741
IPR032845 | U11/U12 small nuclear ribonucleoprotein 48kDa protein | PANTHER | PTHR21402:SF10 | U11/U12 SMALL NUCLEAR RIBONUCLEOPROTEIN 48 KDA PROTEIN | coord: 19..741
IPR022776 | TRM13/UPF0224 family, U11-48K-like CHHC zinc finger domain | PROSITE | PS51800 | ZF_CHHC_U11_48K | coord: 77..104; score: 12.30901
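
Several of the mobidb-lite rows in the InterPro table overlap or sit end to end, so it can help to collapse them into distinct predicted-disorder regions. The sketch below is one way to do that in Python. The helper names `parse_coord` and `merge_ranges` are hypothetical, and treating directly adjacent ranges as one region is an assumption made here for illustration, not something the annotation pipeline states.

```python
import re


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from a string such as 'coord: 608..676'."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def merge_ranges(ranges):
    """Merge overlapping or directly adjacent 1-based inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# The nine mobidb-lite alignment entries listed in the InterPro table.
mobidb = [
    "coord: 608..676", "coord: 574..588", "coord: 700..720",
    "coord: 556..761", "coord: 721..739", "coord: 740..761",
    "coord: 556..572", "coord: 478..499", "coord: 677..699",
]
regions = merge_ranges(parse_coord(c) for c in mobidb)
print(regions)  # [(478, 499), (556, 761)]
```

Under these assumptions the nine rows reduce to two regions, 478..499 and 556..761, both of which lie downstream of the CHHC zinc finger hit at 77..104.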

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG04g02050.1 | Cp4.1LG04g02050.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0046872 | metal ion binding