Cp4.1LG03g13150 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG03g13150
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog
Location: Cp4.1LG03: 9715263 .. 9721210 (-)
RNA-Seq Expression: Cp4.1LG03g13150
Synteny: Cp4.1LG03g13150
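
As a minimal sketch, the Location field of the Overview can be split into scaffold, coordinates, and strand. It assumes the `scaffold: start .. end (strand)` layout shown in this record and 1-based inclusive coordinates; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'scaffold: start .. end (strand)' into a dict (assumed layout)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive at both ends.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG03: 9715263 .. 9721210 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene span works out to 5,948 bp on the minus strand of Cp4.1LG03.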
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTAAATGATATTTCTTAAAATGAATTAATACCAGATAAAAAAAAATAAAAAAATTTGGACAATATAAAAATATAAAAATTTATTGAATATTTTTGAAAGTGAGAAGGGTAAATTAGTAATTTAACATTTCATAAATTTCGAATTAAATAGTCGTAGGTGGGGTCTGAATTTGGGTTCTTATGGAGAGAAAATTAAAACCCTCGGAATGAACTGCTGAAGTTAGAAACGTTTTCAATTTTTGCTTCTCCTCACGGTATCGTGTTCGTCGTCCTCTCCGGCGGCGATCGGCTCGGCGGCAGTTGCTTCGGAGGAAAGTTTAACCACTGCCAGCGTCCAATCCCTGCAGGTCTGATAGCTCATCTTCCTATTTACTGTTTTTGCTTCCCGCAGTATTTCTCTCTCCCTCGTTCTGTTGAAATCTCTTGAATGTCACTGGCATGTTCTTCTCTTGAGGTGCCCTATGCGAAGTCTTCATTGTCAACATCTGCTGTTACTTACGTAACTCTCCTCCTTGTCTCGGTTTAGATTGTCGGTCTGGTCGGACGATTCGAGTGAAGAAAGAACTTGTGGTTTTGTATTGGTGCTGGTCTGAGATATTTTGAATATGCTCTTGAAATTCTATATATGTCGTAATATACGAAGTATAATTTCATGGAATTGTATGAGAGTATAGATATATTTTGACTTGCAAGTCTAAGTGTGTTGTCTTTACTAGTATTTTTTGTAAAATAGTGTGTGGCCTTGTCCGTGCCCGTGCATTTTATTCTGCAGGTATTGGTTATGTTCATATAGGGCAAAAAACTAATCAAAGTGTACATGGAACGGTCTTTAGCAGCTATATGGATAGAAATTATCAAGTGTCTTAATGGCACCTATCCTTGTTGTTTGACTGTTTTATACAATGTCAAGACGTTTAATCTTTATCCAGCGCCGATGTATGTTAAATTATCATTTCACTTACAGGCAAGTGAAGTCATATTGAGGAATTTTCCCAATGGCAAAGAATCAGACTATTTTGATCAAAGACACAGTCTATAAATTGCAGCTTGCACTCCTTGACGGAATTCATAATGAGAACCAGCTATTTGCAGCTGGATCTCTGATGTCTCGCAGTGACTATGAAGATGTGGTGACTGAGCGGTCTATTGCAAACCTCTGTGGATATCCATTATGCCAATCTAATTTGCCATCTGATAACACTAGGAAAGGCCGGTACAGAATTTCCTTAAAAGAGCATAAGGTGTATGATTTAGAAGAGACATATAAGTACTGCTCTTCCACTTGTCTCATTAACAGTCGTGCCTTTTCTGGGAGATTGCAAGATGAGAGATGTTCGGTTATGAATCCCGGGAAACTTAAAGAAATTCTTAGATTGTTTGAGAATTTGAGTTTGGATTCTAAGGAAAATACGAGGAACAGTTGTGATTTGGGGCTTGAAATTCAGGAGAAGATAGTAAGCAGTATAGGAGAAGTTCCCATTGAAGAGTGGATGGGTCCATCAAATGCAATTGAAGGCTATGTTCCACACAGAAATCATAACATCATGACTTTGCCCAGCAAGGATGGAAAAGAACTCAAGGATGGTAACCATTCTTTTTTTTTTTTTTTTTTCCCTGTATGCCAAAGTTTTATTGNTTTTTCCTCTGTATGCCAAAGTTTTATTGTGTTTGGTTGTTGAATTCCATATTAGAGACGCTTATGCATTCAAAAGAAGTTCCTAGAAAAAGAAAAACATTCTTAAAATGCTAATATGGAGAAATGTAACTATAGATTTATTTGTATAGGTGGTTAAAAGTGAGTTTAAGTACTCTCCCATTAGGTCTAATGCTAAAATTAAGGAGTTGAACCTTTTATAGCATGTCTGCAACTTACTGTGTTTGATTGTTAAATAATATATAGAAGCTAATGCATTCTTGGGAGTTGGGATGTCTTCATGAGAAAAGAGGAGAAAGAAAAAAAAGAAAAAAAGAGTGAACAGTGTATTTTTTTGCTCCTGT
TTGTTTAATTGAGTTTTGGTGCACTATTTTAGGTTCTAAAGCTAAAATTAAGCAATTGGGTGTTGGAAAGGATTTCTTCAGTGACTTCTCTTTCGCAACTACTGTAATCACAGATGAAGAGTATAGTGTTTCGAAGATATCATCTGGTTTGAAAGAGATGACTTTTGATACAAAGTCAAAAGAACAAACAGGAGAGTTCTGTGGTAAACAATCAAATGAACAATTTACCATTTTGGAAACCCCGCACAGTCCAGCTCCCACAAAAAACAGTGTTGGACGGAAGGCAAGAGGATCAAAAGAAAGGACTAATGTATCAGCCACCGCAGAAAGTAATAATAATTTGTCTGATGCTCCCTCAACTTCAAATCACTGCAGTACTAATTGCAATATAACGACTGAAGAACCAAATGGTGGATCCAATGATCTTAACGAAACTCAGATCAAGTCCTCCCTTAAACAACCAGGCAAGAAAAACCTGCGCCGCTCTGTAACCTGGGCAGATGCAAAAACTGATGAGACCAGTATTATAAACCTTCCAGAGGACAGAGAAATGGGGAAGACAAAGGAATGTTCCAGAATGACAAGCAATTTGGTAAATGCTGACAATGGTAATGAGGACATATTACGCGTTGAATCTGCTGAAGCCTGTGCAATGGCACTGAGCCAAGCAGCTGAAGCAATTACTTCTGGGCAAAATGAGGTCTCTGATGCAGGTATTCTTCTCTATGCATGATGATTGAGGTTAGTTATTGTACTAAGTGCTATAAAGATAATATTACTTGTTTGGCCCTTAGATGTGCTCTACGGCTGAAACTAGTTCTAGTTATCCCTATTTTGATGTATAAACCAACCAGAGGTCTTTTTTGTTGTGATTTCATCGATATTAGCAAATATGTTATCTTCCCCTTCTCTGCATTCTTTTTCCTTGAAGTATTGTAACTTCAGCCAACAAAAAACAAAAACTGGCTCTTAGTGGTTTAAAATTGCTGTATGAATATGAACTAGATGGTTACTCGTATAGTTATTCCTTCTGTGCTCTGCTCTTTGTTATACCTCACAAATAAAAGACACTTTATACAGTTTTTCTTTGTGTTTGGTTGTTTGTAATATTCAGTGTCTGAAGCTGGAATTATTATATTGCCACGTCCAAGTGATGCTAATGAAGAAGCATCTACAAATGGCGAGAACATATCTGAACCACATTCATCCTCAGAGAAGTCAAACAAACCTGGGATATTACGTTCTGATCTGTTTGATCCCGATGACTCTTGGTATGATTCGCCTCCAGAGGGTTTCAGCCTAACTGTAAGCTCCTTTTCTTTTTTTGGATGTACCTTCTCTTCTTAACTTAAAAATGTTCTTTTTTCCCACCACATTATCTTGTTACTTTCAGTTATCTTCTTTTGCAACCATGTGGATGGCAATCTTTGCATGGATGACATCATCTTCCCTGGCCTACATTTATGGAAAAGATGAAAAGTTCCACGAGGAATTTCAATATATTGATGGGAGGGAGTATCCGAGGAAAATTGTTTCTGCTGATGGCCGATCTTCTGAAATCAAGCAAACACTTGCTGGATGTCTAACACGGTCAATACCTGGACTTGCCTCTGAACTTAAGCTATCAACCCCAATATCAAGTTTGGAGCACGGGATGGTATTGTTGAAATATTCATGACGCCTTATTTTTCTCTCAATAATTTTTCTTGGTGATAATTGAGAACTTACATTAATTTGCGTAGATCCAGAGCTTATTATGTTACTATTTATTTTGTGTTGATATATACATACATGCATACATATATATAAATATATATACACACAAAAATTATTTAGTATCTGCAGTACCATTCCCCAAAAGAGTCGCATTCACTATATAGTAATTCTTAGTTATACGTCAAGATTATAACATCTTGATCTAGTTTGGAAAATGCTGATGGGAGGAAGGAGGGGGTGTTGAATTGCATTCTCTTGTGATGTTATGGCTTTGTCAAATACTA
TTTTCAATGTACCTTGGTAGTCTGAGGATCGGATTCGTTCCTCAAATTTTCTTATCTCCTGTCCTGATGGACCTTTTGTATTACTGGTAGTATTATTAGTACAAATTTTCATTATCCTATTAAGTATTCTATCATTGTGCTACTGCTAGTCCAACTTTGGCATGCGAGTGACGTTGATTTATATTCTTGTGCAGGGGTGTTTGTTAGACACTATGACTTTCCTTGATGCACTTCCAGCATTCAGAACGAAGCAGTGGCAAGTAATTGTTCTCCTGTTTATAGAAGCTTTATCTGTTTGCCGGATTCCGTCACTTGACTCCCAAGTGTCACATAGTAGAAGTCTGTTTCACAAGGTTTGTCTCCGTATAAATCTTCTCTATTGCTTAGAAACGAATGCTTTGTTTTGCACGGAGTATTAAAGAAGCAGTTTAAGAGTTAAAGGAATGATAATCTTGATGATCTGAAACATGCTTTATTTATCAGCTATCGCTAATGCCAAGTTCTAGAACTGTATACTAGGATATAATGAATTGTATGGGTGCACTTGATATAATCAACTTACTTGTGCTCTTTTTCTAGTGCTAAAAAATATTCATGTATTTAATTTGAAAATTGGTTGTTGACATGTGACCTTATTCCTCTCCTTGACTGGAACTTTTCGTTTTGAGGAAATCGTAATCAGATGTTGAAATTGTTGCATATTTGATTAATATTCTACAAAAATGTTGCTTGATTGATTATCCTCAAACAATTAAAGCCTGAATTATAGCACTTCTGTAGTCTATTTTTTTCCCCATCTTTTGCGTCTGTATTATTTTTTCAGCTTCTTTTGTGGCTGCTTGTATATTCATATTTTTCAGTTGAACTCAAGTGTTCAAGAATCTTTCAGAACGAAGTAAGATCCCCAGAGGATTGATTAGTTTCTTTACTGGCGTACAAATGTTCTACTTTTTAGCAATTATAGGTTAAATGCTCTTTTATCTAATTTGAAATATTTTTTGTAACCCTTTTAGATGGGACCGCCCCGTAGTATATTTCATTTAATCAATAAAATTGTTTTGTATCTATATATACACACACACACATGAACACTGAAAAGAGAAATATTCTAATGGTGGATTTTCCTCTTTCTATCCAAAATTTGAATTTTGTACCACTATATAGTTTTATTGAGTGGACCTCTGAATTTCATGTCGTGCTCTTGTTTTGGTGATTGACCATGAAGAGTATTCTTATATTTTAAGTTTAGCGAAAATATTGATGAAATTCAATTCGTTGATTCTTACATTGGTATATACTTTAACATCCTGCTGTGCCTCTCATTTGCATAGGTGCTTGATCGTGCTCAGATACGGTCCGACGAATATGAGACTTTGAAAGATCATATACTACCGCTTGGTCGAACAGCTCAGTTTCCAGGCGAGAATGGTGCTTAAAAGATTAGAACGAGCTTCTCCAACACAAATATCCTCACACATGAGGCTGATTCAAATTTCTGGGGTGATAGTGTGAGCTCCAGATCGTATTAATGGCAGTTGCAAGCGAATCCAATTTTCAGTATGCAATAGTTGGTAATAATCGACATATTTCGTACTGTTTTTTCAGGTAGATTCTTTTGGTATTTCTTTCTGGGAGTATTGGTGTGCTTTAAGGACTGCAAATTTCTAGGGTGATAGTGTGATTTTGGATGAACTCAATTCAAAATAACACTTTTCTTTTTGTTTGACTCTTTAGTGTATAAACTACTCCCCCCCAAATTTAGAGGGTTAAAAAAATTAAAAAAAAAAATACCTTATTGTTAGTATACATTTACCTATACTTTATATTACTAATCTAGTTTAACATTTTTTCATTCTTTTTTTAATAAAGTCAATCTTCTGTGCCGATTCCAAATTCTAACTCATAATCTAAATTCTAGACATGGATCCGTTGCATCACTTGTGGCAGC

mRNA sequence

TTTTTAAATGATATTTCTTAAAATGAATTAATACCAGATAAAAAAAAATAAAAAAATTTGGACAATATAAAAATATAAAAATTTATTGAATATTTTTGAAAGTGAGAAGGGTAAATTAGTAATTTAACATTTCATAAATTTCGAATTAAATAGTCGTAGGTGGGGTCTGAATTTGGGTTCTTATGGAGAGAAAATTAAAACCCTCGGAATGAACTGCTGAAGTTAGAAACGTTTTCAATTTTTGCTTCTCCTCACGGTATCGTGTTCGTCGTCCTCTCCGGCGGCGATCGGCTCGGCGGCAGTTGCTTCGGAGGAAAGTTTAACCACTGCCAGCGTCCAATCCCTGCAGGCAAGTGAAGTCATATTGAGGAATTTTCCCAATGGCAAAGAATCAGACTATTTTGATCAAAGACACAGTCTATAAATTGCAGCTTGCACTCCTTGACGGAATTCATAATGAGAACCAGCTATTTGCAGCTGGATCTCTGATGTCTCGCAGTGACTATGAAGATGTGGTGACTGAGCGGTCTATTGCAAACCTCTGTGGATATCCATTATGCCAATCTAATTTGCCATCTGATAACACTAGGAAAGGCCGGTACAGAATTTCCTTAAAAGAGCATAAGGTGTATGATTTAGAAGAGACATATAAGTACTGCTCTTCCACTTGTCTCATTAACAGTCGTGCCTTTTCTGGGAGATTGCAAGATGAGAGATGTTCGGTTATGAATCCCGGGAAACTTAAAGAAATTCTTAGATTGTTTGAGAATTTGAGTTTGGATTCTAAGGAAAATACGAGGAACAGTTGTGATTTGGGGCTTGAAATTCAGGAGAAGATAGTAAGCAGTATAGGAGAAGTTCCCATTGAAGAGTGGATGGGTCCATCAAATGCAATTGAAGGCTATGTTCCACACAGAAATCATAACATCATGACTTTGCCCAGCAAGGATGGAAAAGAACTCAAGGATGGTTCTAAAGCTAAAATTAAGCAATTGGGTGTTGGAAAGGATTTCTTCAGTGACTTCTCTTTCGCAACTACTGTAATCACAGATGAAGAGTATAGTGTTTCGAAGATATCATCTGGTTTGAAAGAGATGACTTTTGATACAAAGTCAAAAGAACAAACAGGAGAGTTCTGTGGTAAACAATCAAATGAACAATTTACCATTTTGGAAACCCCGCACAGTCCAGCTCCCACAAAAAACAGTGTTGGACGGAAGGCAAGAGGATCAAAAGAAAGGACTAATGTATCAGCCACCGCAGAAAGTAATAATAATTTGTCTGATGCTCCCTCAACTTCAAATCACTGCAGTACTAATTGCAATATAACGACTGAAGAACCAAATGGTGGATCCAATGATCTTAACGAAACTCAGATCAAGTCCTCCCTTAAACAACCAGGCAAGAAAAACCTGCGCCGCTCTGTAACCTGGGCAGATGCAAAAACTGATGAGACCAGTATTATAAACCTTCCAGAGGACAGAGAAATGGGGAAGACAAAGGAATGTTCCAGAATGACAAGCAATTTGGTAAATGCTGACAATGGTAATGAGGACATATTACGCGTTGAATCTGCTGAAGCCTGTGCAATGGCACTGAGCCAAGCAGCTGAAGCAATTACTTCTGGGCAAAATGAGGTCTCTGATGCAGTGTCTGAAGCTGGAATTATTATATTGCCACGTCCAAGTGATGCTAATGAAGAAGCATCTACAAATGGCGAGAACATATCTGAACCACATTCATCCTCAGAGAAGTCAAACAAACCTGGGATATTACGTTCTGATCTGTTTGATCCCGATGACTCTTGGTATGATTCGCCTCCAGAGGGTTTCAGCCTAACTTTATCTTCTTTTGCAACCATGTGGATGGCAATCTTTGCATGGATGACATCATCTTCCCTGGCCTACATTTATGGAAAAGATGAAAAGTTCCACGAGGAATTTCAATATATTGATGGGAGGGAGTATCCGAGGAAAATTGTTTCTGCTGATGGCCGATCT
TCTGAAATCAAGCAAACACTTGCTGGATGTCTAACACGGTCAATACCTGGACTTGCCTCTGAACTTAAGCTATCAACCCCAATATCAAGTTTGGAGCACGGGATGTCTGAGGATCGGATTCGTTCCTCAAATTTTCTTATCTCCTGTCCTGATGGACCTTTTGTATTACTGGGGTGTTTGTTAGACACTATGACTTTCCTTGATGCACTTCCAGCATTCAGAACGAAGCAGTGGCAAGTAATTGTTCTCCTGTTTATAGAAGCTTTATCTGTTTGCCGGATTCCGTCACTTGACTCCCAAGTGTCACATAGTAGAAGTCTGTTTCACAAGGTGCTTGATCGTGCTCAGATACGGTCCGACGAATATGAGACTTTGAAAGATCATATACTACCGCTTGGTCGAACAGCTCAGTTTCCAGGCGAGAATGGTGCTTAAAAGATTAGAACGAGCTTCTCCAACACAAATATCCTCACACATGAGGCTGATTCAAATTTCTGGGGTGATAGTGTGAGCTCCAGATCGTATTAATGGCAGTTGCAAGCGAATCCAATTTTCAGTATGCAATAGTTGGTAATAATCGACATATTTCGTACTGTTTTTTCAGGTAGATTCTTTTGGTATTTCTTTCTGGGAGTATTGGTGTGCTTTAAGGACTGCAAATTTCTAGGGTGATAGTGTGATTTTGGATGAACTCAATTCAAAATAACACTTTTCTTTTTGTTTGACTCTTTAGTGTATAAACTACTCCCCCCCAAATTTAGAGGGTTAAAAAAATTAAAAAAAAAAATACCTTATTGTTAGTATACATTTACCTATACTTTATATTACTAATCTAGTTTAACATTTTTTCATTCTTTTTTTAATAAAGTCAATCTTCTGTGCCGATTCCAAATTCTAACTCATAATCTAAATTCTAGACATGGATCCGTTGCATCACTTGTGGCAGC

Coding sequence (CDS)

ATGGCAAAGAATCAGACTATTTTGATCAAAGACACAGTCTATAAATTGCAGCTTGCACTCCTTGACGGAATTCATAATGAGAACCAGCTATTTGCAGCTGGATCTCTGATGTCTCGCAGTGACTATGAAGATGTGGTGACTGAGCGGTCTATTGCAAACCTCTGTGGATATCCATTATGCCAATCTAATTTGCCATCTGATAACACTAGGAAAGGCCGGTACAGAATTTCCTTAAAAGAGCATAAGGTGTATGATTTAGAAGAGACATATAAGTACTGCTCTTCCACTTGTCTCATTAACAGTCGTGCCTTTTCTGGGAGATTGCAAGATGAGAGATGTTCGGTTATGAATCCCGGGAAACTTAAAGAAATTCTTAGATTGTTTGAGAATTTGAGTTTGGATTCTAAGGAAAATACGAGGAACAGTTGTGATTTGGGGCTTGAAATTCAGGAGAAGATAGTAAGCAGTATAGGAGAAGTTCCCATTGAAGAGTGGATGGGTCCATCAAATGCAATTGAAGGCTATGTTCCACACAGAAATCATAACATCATGACTTTGCCCAGCAAGGATGGAAAAGAACTCAAGGATGGTTCTAAAGCTAAAATTAAGCAATTGGGTGTTGGAAAGGATTTCTTCAGTGACTTCTCTTTCGCAACTACTGTAATCACAGATGAAGAGTATAGTGTTTCGAAGATATCATCTGGTTTGAAAGAGATGACTTTTGATACAAAGTCAAAAGAACAAACAGGAGAGTTCTGTGGTAAACAATCAAATGAACAATTTACCATTTTGGAAACCCCGCACAGTCCAGCTCCCACAAAAAACAGTGTTGGACGGAAGGCAAGAGGATCAAAAGAAAGGACTAATGTATCAGCCACCGCAGAAAGTAATAATAATTTGTCTGATGCTCCCTCAACTTCAAATCACTGCAGTACTAATTGCAATATAACGACTGAAGAACCAAATGGTGGATCCAATGATCTTAACGAAACTCAGATCAAGTCCTCCCTTAAACAACCAGGCAAGAAAAACCTGCGCCGCTCTGTAACCTGGGCAGATGCAAAAACTGATGAGACCAGTATTATAAACCTTCCAGAGGACAGAGAAATGGGGAAGACAAAGGAATGTTCCAGAATGACAAGCAATTTGGTAAATGCTGACAATGGTAATGAGGACATATTACGCGTTGAATCTGCTGAAGCCTGTGCAATGGCACTGAGCCAAGCAGCTGAAGCAATTACTTCTGGGCAAAATGAGGTCTCTGATGCAGTGTCTGAAGCTGGAATTATTATATTGCCACGTCCAAGTGATGCTAATGAAGAAGCATCTACAAATGGCGAGAACATATCTGAACCACATTCATCCTCAGAGAAGTCAAACAAACCTGGGATATTACGTTCTGATCTGTTTGATCCCGATGACTCTTGGTATGATTCGCCTCCAGAGGGTTTCAGCCTAACTTTATCTTCTTTTGCAACCATGTGGATGGCAATCTTTGCATGGATGACATCATCTTCCCTGGCCTACATTTATGGAAAAGATGAAAAGTTCCACGAGGAATTTCAATATATTGATGGGAGGGAGTATCCGAGGAAAATTGTTTCTGCTGATGGCCGATCTTCTGAAATCAAGCAAACACTTGCTGGATGTCTAACACGGTCAATACCTGGACTTGCCTCTGAACTTAAGCTATCAACCCCAATATCAAGTTTGGAGCACGGGATGTCTGAGGATCGGATTCGTTCCTCAAATTTTCTTATCTCCTGTCCTGATGGACCTTTTGTATTACTGGGGTGTTTGTTAGACACTATGACTTTCCTTGATGCACTTCCAGCATTCAGAACGAAGCAGTGGCAAGTAATTGTTCTCCTGTTTATAGAAGCTTTATCTGTTTGCCGGATTCCGTCACTTGACTCCCAAGTGTCACATAGTAGAAGTCTGTTTCACAAGGTGCTTGATCGTGCTCAGATACGGTCCGACGAATATGAGACTTTGAAAGA
TCATATACTACCGCTTGGTCGAACAGCTCAGTTTCCAGGCGAGAATGGTGCTTAA

Protein sequence

MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLCQSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGKLKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRNHNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMTFDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNLSDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETSIINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSPPEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRSSEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCLLDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSDEYETLKDHILPLGRTAQFPGENGA
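
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names, not part of any library.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last five codons of the CDS listed above.
print(translate("ATGGCAAAGAATCAGACTATT"))
print(translate("GAGAATGGTGCTTAA"))
```

The two fragments translate to `MAKNQTI` and `ENGA`, which match the start and end of the listed protein sequence.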
Homology
BLAST of Cp4.1LG03g13150 vs. ExPASy Swiss-Prot
Match: F4K1B1 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Arabidopsis thaliana OX=3702 GN=At5g26760 PE=2 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 4.4e-116
Identity = 306/791 (38.69%), Positives = 411/791 (51.96%), Query Frame = 0

Query: 1   MAK-NQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPL 60
           MAK N+ I I D V+KLQL +L+   ++NQLFAA  LMSRSDYEDVVTER+IA LCGY L
Sbjct: 1   MAKDNEAIAINDAVHKLQLYMLENTTDQNQLFAARKLMSRSDYEDVVTERAIAKLCGYTL 60

Query: 61  CQSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPG 120
           CQ  LPSD +R+G+YRISLK+HKVYDL+ET K+CS+ CLI+S+ FSG LQ+ R    +  
Sbjct: 61  CQRFLPSDVSRRGKYRISLKDHKVYDLQETSKFCSAGCLIDSKTFSGSLQEARTLEFDSV 120

Query: 121 KLKEILRLF-ENLSLDSKENTRNSCDLG-LEIQEKIVSSIGEVPIEEWMGPSNAIEGYVP 180
           KL EIL LF ++L +    +     DL  L I+E       E+ +E+WMGPSNA+EGYVP
Sbjct: 121 KLNEILDLFGDSLEVKGSLDVNKDLDLSKLMIKENFGVRGEELSLEKWMGPSNAVEGYVP 180

Query: 181 HRNHNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLK 240
                  +  S D K     ++ K            +  F +TVI  +  SVSK+    K
Sbjct: 181 FDR----SKSSNDSKATTQSNQEK-----------HEMDFTSTVIMPDVNSVSKLPPQTK 240

Query: 241 EMTFDTKS---------KEQT--------GEFCGKQSNEQFTILETPHSPAPTKNSV--- 300
           + +   +S         KEQT          F  ++  E+ T        A  K +V   
Sbjct: 241 QASTVVESVDGKGKTVLKEQTVVPPTKKVSRFRREKEKEKKTFGVDGMGCAQEKTTVLPR 300

Query: 301 -----------GRKARGSKERTNVSATAESNN-----NLSDAPSTSNHCSTNCNI----- 360
                        K  G  E    S+   S+      ++S  P  S   S +C +     
Sbjct: 301 KILSFCNEIEKDFKNFGFDEMGLASSAMMSDGYGVEYSVSKQPQCSMEDSLSCKLKGDLQ 360

Query: 361 ------TTEEPNGGSN-------------------------------------------- 420
                 T    + GSN                                            
Sbjct: 361 TLDGKNTLSGSSSGSNTKGSKTKPEKSRKKIISVEYHANSYEDGEEILAAESYERHKAQD 420

Query: 421 --DLNETQIKSSLKQPGKKNLRRSVTWADAKTDETSIINL-PEDREMGKTKECSRMTSNL 480
               +E   KS LK  G K L RSVTWAD       +  +   D   G +     ++SN 
Sbjct: 421 VCSSSEIVTKSCLKISGSKKLSRSVTWADQNDGRGDLCEVRNNDNAAGPS-----LSSND 480

Query: 481 VNADNGNEDILRVESAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILPRPSDANEEAS 540
           +   N    + R+  AEA A ALSQAAEA++SG ++ SDA ++AGII+LP     +EE  
Sbjct: 481 IEDVN---SLSRLALAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQLDEE-- 540

Query: 541 TNGENISEPHSSSEKS----------NKPGILRSDLFDPDDSWYDSPPEGFSLTLSSFAT 600
                ++E HS  E +          NKPGI  SDLFD D SW+D PPEGF+LTLS+FA 
Sbjct: 541 -----VTEEHSEEEMTEEEPTLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAV 600

Query: 601 MWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRSSEIKQTLAGCLTR 660
           MW ++F W++SSSLAYIYGK+E  HEEF  ++G+EYPR+I+  DG SSEIKQT+AGCL R
Sbjct: 601 MWDSLFGWVSSSSLAYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLAR 660

Query: 661 SIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCLLDTMTFLDALPAF 685
           ++P + + L+L   IS LE G                      LG LL+TM+   A+P+F
Sbjct: 661 ALPRVVTHLRLPIAISELEKG----------------------LGSLLETMSLTGAVPSF 720
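
The percentages in an HSP summary line follow directly from the residue counts, and a small parser can recompute them. This is a sketch that assumes the `Identity = a/b (...), Positives = c/d (...)` layout used in this report; `hsp_percentages` is a hypothetical helper name.

```python
import re

# Matches the identity and positives counts of one HSP summary line.
# "Posi?tives" also tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no identity/positives counts found")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

print(hsp_percentages(
    "Identity = 306/791 (38.69%), Positives = 411/791 (51.96%), Query Frame = 0"
))
```

For the Arabidopsis hit F4K1B1 this reproduces the reported 38.69% identity and 51.96% positives over the 791 aligned columns.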

BLAST of Cp4.1LG03g13150 vs. ExPASy Swiss-Prot
Match: A2Y040 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sativa subsp. indica OX=39946 GN=OsI_18345 PE=3 SV=1)

HSP 1 Score: 357.8 bits (917), Expect = 2.7e-97
Identity = 276/771 (35.80%), Positives = 388/771 (50.32%), Query Frame = 0

Query: 2   AKNQTILIKDTVYKLQLALLDG--IHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPL 61
           A+ +   +   V+++Q+AL DG     E  L AA SL+S  DY DVVTERSIA+ CGYP 
Sbjct: 11  ARMKPTTVASAVHRVQMALYDGAAASREPLLRAAASLLSGPDYADVVTERSIADACGYPA 70

Query: 62  CQSNLPSDNTR---KGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVM 121
           C + LPS++ R     R+RISL+EH+VYDLEE  K+CS  CL+ S AF   L  +R   +
Sbjct: 71  CPNPLPSEDARGKAAPRFRISLREHRVYDLEEARKFCSERCLVASAAFGASLPPDRPFGV 130

Query: 122 NPGKLKEILRLFE---------------NLSLDSKENTRNSCDLGLEIQEKIVSSIGEVP 181
           +P +L  ++ LFE                 S D KE         +EI EK  +  GEV 
Sbjct: 131 SPDRLDALVALFEGGGGGGGDGGLALGFGASGDGKEVEEGR---KVEIMEKEAAGTGEVT 190

Query: 182 IEEWMGPSNAIEGYVPHRNHNIMTLPSKDGKE---------------------------L 241
           ++EW+GPS+AIEGYVP R+  ++  P K+ K+                           L
Sbjct: 191 LQEWIGPSDAIEGYVPRRD-RVVGGPKKEAKQNDACSAEQSSNINVDSRNASSGESGMVL 250

Query: 242 KDGSKAKIKQLGVG--KDFFSDFSFATTVITDEEYSVSKISSGLKEMTFDTKSKEQTGEF 301
            + +KAK K+      K F  D         D +   S IS  + +   D   +E+    
Sbjct: 251 TENTKAKKKEATKTPLKMFKQD--------EDNDMLSSCISDSIVKQLEDVVLEEKK--- 310

Query: 302 CGKQSNEQFTILETPHSPAPTKNSVGRKA--------------------RGSKERTNVSA 361
             K+ N+            P K  VGR                       G+  + N S+
Sbjct: 311 -DKKKNKAAKGTSRVGKSKPAKRPVGRDGHEVDFTSTIIMGDHGSEMMDHGALGQYNFSS 370

Query: 362 TAESNNNLSDAPSTSNHC----------------STNCNITTEEPNGGSNDLNETQIKSS 421
           +  +N    + PS+S +                 S   NI  +E    S+D     ++SS
Sbjct: 371 SILAN----EQPSSSQYAAIDSVQAYTEELDELFSNAVNIAKDET---SDDSGRCTLRSS 430

Query: 422 LKQPGKKNLRRSVTWADAKTDETSIINLPEDREMGKTKECSR-MTSNLVNADNGNEDILR 481
           LK  G KN  RSV WAD               E G   E SR   S+   +    +  +R
Sbjct: 431 LKAVGSKNAGRSVKWAD---------------ENGSVLETSRAFVSHSSKSQESMDSSVR 490

Query: 482 VESAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILP---------RPSDANEEASTNG 541
            ESAEACA AL +AAEAI+SG +EV DAVS+AGIIILP            D +++A  N 
Sbjct: 491 RESAEACAAALIEAAEAISSGTSEVEDAVSKAGIIILPDMVNQQQYNNDYDNDKDAGEN- 550

Query: 542 ENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSPPEGFSLTLSSFATMWMAIFAWMTSSS 601
           E         +   K  +L +D+FD DDSW+D+PPEGFSLTLSSFATMW A+F W++ SS
Sbjct: 551 EIFEIDRGVVKWPKKTVLLDTDMFDVDDSWHDTPPEGFSLTLSSFATMWAALFGWVSRSS 610

Query: 602 LAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRSSEIKQTLAGCLTRSIPGLASELKLST 661
           LAY+YG DE   E+     GRE P+K V  DG SSEI++ L  C+  ++P L S L++  
Sbjct: 611 LAYVYGLDESSMEDLLIAGGRECPQKRVLNDGHSSEIRRALDTCVCNALPVLVSNLRMQI 670

Query: 662 PISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCLLDTMTFLDALPAFRTKQWQVIVLLFI 678
           P+S LE                      + LG LLDTM+F+DALP+ R++QWQ++VL+ +
Sbjct: 671 PVSKLE----------------------ITLGYLLDTMSFVDALPSLRSRQWQLMVLVLL 719

BLAST of Cp4.1LG03g13150 vs. ExPASy Swiss-Prot
Match: Q6AVZ9 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g0134300 PE=3 SV=1)

HSP 1 Score: 356.7 bits (914), Expect = 5.9e-97
Identity = 278/768 (36.20%), Positives = 388/768 (50.52%), Query Frame = 0

Query: 2   AKNQTILIKDTVYKLQLALLDG--IHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPL 61
           A+ +   +   V+++Q+AL DG     E  L AA SL+S  DY DVVTERSIA+ CGYP 
Sbjct: 11  ARMKPTTVASAVHRVQMALYDGAAASREPLLRAAASLLSGPDYADVVTERSIADACGYPA 70

Query: 62  CQSNLPSDNTR---KGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVM 121
           C + LPS++ R     R+RISL+EH+VYDLEE  K+CS  CL+ S AF   L  +R   +
Sbjct: 71  CPNPLPSEDARGKAAPRFRISLREHRVYDLEEARKFCSERCLVASAAFGASLPPDRPFGV 130

Query: 122 NPGKLKEILRLFE---------------NLSLDSKENTRNSCDLGLEIQEKIVSSIGEVP 181
           +P +L  ++ LFE                 S D KE         +EI EK  +  GEV 
Sbjct: 131 SPDRLDALVALFEGGGGGGDDGGLALGFGASGDGKEVEEGR---KVEIMEKEAAGTGEVT 190

Query: 182 IEEWMGPSNAIEGYVPHRNHNIMTLPSKDGKE---------------------------L 241
           ++EW+GPS+AIEGYVP R+  ++  P K+ K+                           L
Sbjct: 191 LQEWIGPSDAIEGYVPRRD-RVVGGPKKEAKQNDACSAEQSSNINVDSRNASSGESGMVL 250

Query: 242 KDGSKAKIKQLGVG--KDFFSDFSFATTVITDEEYSVSKISSGLKEMTFDTKSKEQTGEF 301
            + +KAK K+      K F  D         D +   S IS  + +   D   +E+    
Sbjct: 251 TENTKAKKKEATKTPLKMFKQD--------EDNDMLSSCISDSIVKQLEDVVLEEKK--- 310

Query: 302 CGKQSNEQFTILETPHSPAPTKNSVGRKA-------------RGSKERTNVSATAESNNN 361
             K+ N+            P K  VGR               RGS E  +  A  + N +
Sbjct: 311 -DKKKNKAAKGTSRVGKSKPAKRPVGRDGHEVDFTSTIIMGDRGS-EMMDHGALGQYNFS 370

Query: 362 LS----DAPSTSNHC----------------STNCNITTEEPNGGSNDLNETQIKSSLKQ 421
            S    + PS+S +                 S   NI  +E    S+D     ++SSLK 
Sbjct: 371 SSILANEQPSSSQYAAIDSVQAYTEELDELFSNAVNIAKDET---SDDSGRCTLRSSLKA 430

Query: 422 PGKKNLRRSVTWADAKTDETSIINLPEDREMGKTKECSR-MTSNLVNADNGNEDILRVES 481
            G KN   SV WAD               E G   E SR   S+   +    +  +R ES
Sbjct: 431 VGSKNAGHSVKWAD---------------ENGSVLETSRAFVSHSSKSQESMDSSVRRES 490

Query: 482 AEACAMALSQAAEAITSGQNEVSDAVSEAGIIILP---------RPSDANEEASTNGENI 541
           AEACA AL +AAEAI+SG +EV DAVS+AGIIILP            D +++A  N E  
Sbjct: 491 AEACAAALIEAAEAISSGTSEVEDAVSKAGIIILPDMVNQQQYNNDYDNDKDAGEN-EIF 550

Query: 542 SEPHSSSEKSNKPGILRSDLFDPDDSWYDSPPEGFSLTLSSFATMWMAIFAWMTSSSLAY 601
                  +   K  +L +D+FD DDSW+D+PPEGFSLTLSSFATMW A+F W++ SSLAY
Sbjct: 551 EIDRGVVKWPKKTVLLDTDMFDVDDSWHDTPPEGFSLTLSSFATMWAALFGWVSRSSLAY 610

Query: 602 IYGKDEKFHEEFQYIDGREYPRKIVSADGRSSEIKQTLAGCLTRSIPGLASELKLSTPIS 661
           +YG DE   E+     GRE P+K V  DG SSEI++ L  C+  ++P L S L++  P+S
Sbjct: 611 VYGLDESSMEDLLIAGGRECPQKRVLNDGHSSEIRRALDTCVCNALPVLVSNLRMQIPVS 670

Query: 662 SLEHGMSEDRIRSSNFLISCPDGPFVLLGCLLDTMTFLDALPAFRTKQWQVIVLLFIEAL 678
            LE                      + LG LLDTM+F+DALP+ R++QWQ++VL+ ++AL
Sbjct: 671 KLE----------------------ITLGYLLDTMSFVDALPSLRSRQWQLMVLVLLDAL 719

BLAST of Cp4.1LG03g13150 vs. ExPASy Swiss-Prot
Match: Q8IXW5 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Homo sapiens OX=9606 GN=RPAP2 PE=1 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 1.5e-07
Identity = 106/448 (23.66%), Positives = 178/448 (39.73%), Query Frame = 0

Query: 20  LLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLCQSNLPSDNTRKGRYRISLK 79
           LL+    E  L   G  ++ + Y DVV ERSI  LCGYPLCQ  L      K +Y+IS K
Sbjct: 65  LLEENITEEFLMECGRFITPAHYSDVVDERSIVKLCGYPLCQKKL--GIVPKQKYKISTK 124

Query: 80  EHKVYDLEETYKYCSSTCLINSRAFSGRL--------QDERCSVMNPGKLKEILRLFENL 139
            +KVYD+ E   +CS+ C   S+ F  ++        ++ER       K ++     E +
Sbjct: 125 TNKVYDITERKSFCSNFCYQASKFFEAQIPKTPVWVREEERHPDFQLLKEEQSGHSGEEV 184

Query: 140 SLDSKENTRNSCDLGLEIQEKIVSSIGEVPIE-----EWMGPSNAIEGYVPHRNHNIMTL 199
            L SK    +  D     +++  SS      +     E    S+ + G  P+  +    L
Sbjct: 185 QLCSKAIKTSDIDNPSHFEKQYESSSSSTHSDSSSDNEQDFVSSILPGNRPNSTNIRPQL 244

Query: 200 PSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMTFDTKSK 259
             K   + K G KA  K                    D+E +V  ++  L +   D++ K
Sbjct: 245 HQKSIMKKKAGHKANSKH------------------KDKEQTVVDVTEQLGDCKLDSQEK 304

Query: 260 EQTGEFCGKQSNEQFTILET-PHSPAPTKNSVGRKARGSKERTNVSATAESNNNLSDAPS 319
           + T E   ++ N Q +   T P     ++NS    +R   E T V  + +S  +     +
Sbjct: 305 DATCELPLQKVNTQSSSNSTLPERLKASENSESEYSR--SEITLVGISKKSAEHFKRKFA 364

Query: 320 TSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSV--TWADAKTDETSIIN 379
            SN  S + + + +                   + GK+NL + +  T  + KT+ET    
Sbjct: 365 KSNQVSRSVSSSVQ----------------VCPEVGKRNLLKVLKETLIEWKTEETLRFL 424

Query: 380 LPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEVSDA 439
             ++        C +  ++LV  +   +DI+    +   A   S         QN + ++
Sbjct: 425 YGQN----YASVCLKPEASLVKEELDEDDIISDPDSHFPAWRES---------QNSLDES 461

Query: 440 V--SEAGIIILPRPSDANEEASTNGENI 450
           +    +G  I P PS  N +  T   N+
Sbjct: 485 LPFRGSGTAIKPLPSYENLKKETEKLNL 461

BLAST of Cp4.1LG03g13150 vs. ExPASy Swiss-Prot
Match: Q5RA37 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Pongo abelii OX=9601 GN=RPAP2 PE=2 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 2.0e-07
Identity = 106/448 (23.66%), Positives = 177/448 (39.51%), Query Frame = 0

Query: 20  LLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLCQSNLPSDNTRKGRYRISLK 79
           LL+    E  L   G  ++ + Y DVV ERSI  LCGYPLCQ  L      K +Y+IS K
Sbjct: 65  LLEENITEEFLMECGKFITPAHYSDVVDERSIVKLCGYPLCQKKL--GIVPKQKYKISTK 124

Query: 80  EHKVYDLEETYKYCSSTCLINSRAFSGRL--------QDERCSVMNPGKLKEILRLFENL 139
            +KVYD+ E   +CS+ C   S+ F  ++        ++ER       K ++     E +
Sbjct: 125 TNKVYDITERKSFCSNFCYQASKFFEAQIPKTPVWVREEERHPDFQLLKEQQSGHSGEEV 184

Query: 140 SLDSKENTRNSCDLGLEIQEKIVSSIGEVPIE-----EWMGPSNAIEGYVPHRNHNIMTL 199
            L SK    +  D     +++  SS      +     E    S+ + G  P+       L
Sbjct: 185 QLCSKAIKTSDIDNPSHFEKQYESSSSSTHSDSSSDNEQDFVSSILPGNRPNSTSIRPQL 244

Query: 200 PSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMTFDTKSK 259
             K   + K G KA  K                    D+E +V  ++  L +   D++ K
Sbjct: 245 HQKSIMKKKAGHKANSKH------------------KDKEQTVIDVTEQLGDCKLDSQEK 304

Query: 260 EQTGEFCGKQSNEQFTILET-PHSPAPTKNSVGRKARGSKERTNVSATAESNNNLSDAPS 319
           + T E   ++ N Q +   T P     ++NS    +R   E T V  + +S  +     +
Sbjct: 305 DATCELPLQKVNTQSSSNSTLPERLKASENSESEYSR--SEITLVGISKKSAEHFKRKFA 364

Query: 320 TSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSV--TWADAKTDETSIIN 379
            SN  S + + + +                   + GK+NL + +  T  + KT+ET    
Sbjct: 365 KSNQVSRSVSSSVQ----------------VCPEVGKRNLLKILKETLIEWKTEETLRFL 424

Query: 380 LPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEVSDA 439
             ++        C +  ++LV  +   +DI+    +   A   S         QN + ++
Sbjct: 425 YGQN----YASVCLKPEASLVKEELDEDDIISDPDSHFPAWRES---------QNSLDES 461

Query: 440 V--SEAGIIILPRPSDANEEASTNGENI 450
           +    +G  I P PS  N +  T   N+
Sbjct: 485 LPFRGSGTAIKPLPSYENLKKETEKLNL 461

BLAST of Cp4.1LG03g13150 vs. NCBI nr
Match: XP_023528028.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1287 bits (3330), Expect = 0.0
Identity = 662/684 (96.78%), Positives = 662/684 (96.78%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL
Sbjct: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD
Sbjct: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQFPGENGA
Sbjct: 661 EYETLKDHILPLGRTAQFPGENGA 662

BLAST of Cp4.1LG03g13150 vs. NCBI nr
Match: KAG6581990.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1277 bits (3304), Expect = 0.0
Identity = 657/684 (96.05%), Positives = 658/684 (96.20%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQT LIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTTLIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKI SSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSKEQTGEFCGKQSNEQFTILETPH PAPTKNSVGRKARGSKERTNVSATAESNNNL
Sbjct: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQI+SD
Sbjct: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIQSD 660

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQFPGEN A
Sbjct: 661 EYETLKDHILPLGRTAQFPGENDA 662

BLAST of Cp4.1LG03g13150 vs. NCBI nr
Match: KAG7018411.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1275 bits (3299), Expect = 0.0
Identity = 656/684 (95.91%), Positives = 657/684 (96.05%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQT LIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTTLIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKI SSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSKEQTGEFCGKQSNEQFTILETPH PAPTKNSVGRKARGS ERTNVSATAESNNNL
Sbjct: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSNERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQI+SD
Sbjct: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIQSD 660

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQFPGEN A
Sbjct: 661 EYETLKDHILPLGRTAQFPGENDA 662
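
The percentages on each HSP statistics line are the match counts divided by the alignment length. As a minimal sketch, assuming the line layout shown on this page, the figures can be re-derived as follows. The helper name `parse_hsp_stats` and its return keys are hypothetical and not part of any BLAST tool.

```python
import re

# Matches the statistics line printed under each HSP header on this page.
# "Posi?tives" tolerates the "Postives" spelling that some report exports use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages (hypothetical helper)."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, ident_len, ident_pct, positives, pos_len, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "aligned_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Recompute from the raw counts instead of trusting the printed value.
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "positives_pct": round(100 * int(positives) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 656/684 (95.91%), Postives = 657/684 (96.05%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed value with the reported one is a quick consistency check when these lines are copied between reports.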

BLAST of Cp4.1LG03g13150 vs. NCBI nr
Match: XP_022955995.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Cucurbita moschata])

HSP 1 Score: 1270 bits (3287), Expect = 0.0
Identity = 654/684 (95.61%), Positives = 656/684 (95.91%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKI SSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMT P KDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTSPRKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSK QTGEFCGKQSNEQFTILETPH PAPTKNSVGRKARGSKERTNVSATAESNNNL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSG+NEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGKNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDP+DSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPNDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD
Sbjct: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQFPGEN A
Sbjct: 661 EYETLKDHILPLGRTAQFPGENDA 662

BLAST of Cp4.1LG03g13150 vs. NCBI nr
Match: XP_022980004.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Cucurbita maxima])

HSP 1 Score: 1246 bits (3224), Expect = 0.0
Identity = 641/684 (93.71%), Positives = 652/684 (95.32%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKI SSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMTLPSKDGKELKDGSKAKIKQLGV KDFFSDFSFA+TVITDEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVEKDFFSDFSFASTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSK QTGEFCGKQSNEQFTILETPH PAPTKNSVGRKARG+KERTNVSATAESNNNL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGTKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SD+PSTSNHC+TNCNITTEEP GGSN+LNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDSPSTSNHCNTNCNITTEEPKGGSNELNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNED+LRVESAEACAMALSQAAEAITSGQNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDMLRVESAEACAMALSQAAEAITSGQNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEE STNG+NISEP+SSSEKSNKPGIL SDLFDP+DSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEVSTNGKNISEPYSSSEKSNKPGILHSDLFDPEDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFR KQWQVIVLLFIEALSVCRIPSLDSQVS+SRSLFHKVLDRAQIRS+
Sbjct: 601 LDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPSLDSQVSNSRSLFHKVLDRAQIRSN 660

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQF GEN A
Sbjct: 661 EYETLKDHILPLGRTAQFSGENDA 662

BLAST of Cp4.1LG03g13150 vs. ExPASy TrEMBL
Match: A0A6J1GWL9 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita moschata OX=3662 GN=LOC111457827 PE=3 SV=1)

HSP 1 Score: 1270 bits (3287), Expect = 0.0
Identity = 654/684 (95.61%), Positives = 656/684 (95.91%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKI SSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMT P KDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTSPRKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSK QTGEFCGKQSNEQFTILETPH PAPTKNSVGRKARGSKERTNVSATAESNNNL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSG+NEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGKNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDP+DSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPNDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD
Sbjct: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQFPGEN A
Sbjct: 661 EYETLKDHILPLGRTAQFPGENDA 662

BLAST of Cp4.1LG03g13150 vs. ExPASy TrEMBL
Match: A0A6J1IY57 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita maxima OX=3661 GN=LOC111479539 PE=3 SV=1)

HSP 1 Score: 1246 bits (3224), Expect = 0.0
Identity = 641/684 (93.71%), Positives = 652/684 (95.32%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKI SSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMTLPSKDGKELKDGSKAKIKQLGV KDFFSDFSFA+TVITDEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVEKDFFSDFSFASTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSK QTGEFCGKQSNEQFTILETPH PAPTKNSVGRKARG+KERTNVSATAESNNNL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGTKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SD+PSTSNHC+TNCNITTEEP GGSN+LNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDSPSTSNHCNTNCNITTEEPKGGSNELNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNED+LRVESAEACAMALSQAAEAITSGQNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDMLRVESAEACAMALSQAAEAITSGQNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEE STNG+NISEP+SSSEKSNKPGIL SDLFDP+DSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEVSTNGKNISEPYSSSEKSNKPGILHSDLFDPEDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFR KQWQVIVLLFIEALSVCRIPSLDSQVS+SRSLFHKVLDRAQIRS+
Sbjct: 601 LDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPSLDSQVSNSRSLFHKVLDRAQIRSN 660

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQF GEN A
Sbjct: 661 EYETLKDHILPLGRTAQFSGENDA 662

BLAST of Cp4.1LG03g13150 vs. ExPASy TrEMBL
Match: A0A6J1GVD1 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita moschata OX=3662 GN=LOC111457827 PE=3 SV=1)

HSP 1 Score: 1204 bits (3116), Expect = 0.0
Identity = 627/684 (91.67%), Positives = 629/684 (91.96%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKI SSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMT P KDGKELKD                           DEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTSPRKDGKELKD---------------------------DEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSK QTGEFCGKQSNEQFTILETPH PAPTKNSVGRKARGSKERTNVSATAESNNNL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSG+NEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGKNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDP+DSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPNDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD
Sbjct: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 635

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQFPGEN A
Sbjct: 661 EYETLKDHILPLGRTAQFPGENDA 635

BLAST of Cp4.1LG03g13150 vs. ExPASy TrEMBL
Match: A0A6J1IUY3 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita maxima OX=3661 GN=LOC111479539 PE=3 SV=1)

HSP 1 Score: 1185 bits (3065), Expect = 0.0
Identity = 616/684 (90.06%), Positives = 626/684 (91.52%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
           QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEILRLFENLSLDSKENTRNSCDLGLEIQEKI SSIGEVPIEEWMGPSNAIEGYVPHRN
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           HNIMTLPSKDGKELKD                           DEEYSVSKISSGLKEMT
Sbjct: 181 HNIMTLPSKDGKELKD---------------------------DEEYSVSKISSGLKEMT 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
           FDTKSK QTGEFCGKQSNEQFTILETPH PAPTKNSVGRKARG+KERTNVSATAESNNNL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGTKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SD+PSTSNHC+TNCNITTEEP GGSN+LNETQIKSSLKQPGKKNLRRSVTWADAKTDETS
Sbjct: 301 SDSPSTSNHCNTNCNITTEEPKGGSNELNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           IINLPEDREMGKTKECSRMTSNLVNADNGNED+LRVESAEACAMALSQAAEAITSGQNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDMLRVESAEACAMALSQAAEAITSGQNEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILPRPSDANEE STNG+NISEP+SSSEKSNKPGIL SDLFDP+DSWYDSP
Sbjct: 421 SDAVSEAGIIILPRPSDANEEVSTNGKNISEPYSSSEKSNKPGILHSDLFDPEDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM                      GCL
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM----------------------GCL 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFR KQWQVIVLLFIEALSVCRIPSLDSQVS+SRSLFHKVLDRAQIRS+
Sbjct: 601 LDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPSLDSQVSNSRSLFHKVLDRAQIRSN 635

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYETLKDHILPLGRTAQF GEN A
Sbjct: 661 EYETLKDHILPLGRTAQFSGENDA 635

BLAST of Cp4.1LG03g13150 vs. ExPASy TrEMBL
Match: A0A0A0KVU3 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis sativus OX=3659 GN=Csa_4G009360 PE=3 SV=1)

HSP 1 Score: 1078 bits (2787), Expect = 0.0
Identity = 560/684 (81.87%), Positives = 600/684 (87.72%), Query Frame = 0

Query: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
           MAKNQ++LIKDTVYKLQLAL +GI NENQLFAAGSLMSRSDYEDVVTERSIA+LCGYPLC
Sbjct: 1   MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
            SNLPSDNTR+GRYRISLKEHKVYDLEETYKYCSS CLINSRAFSGRLQDERCSVMNP K
Sbjct: 61  HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIVSSIGEVPIEEWMGPSNAIEGYVPHRN 180
           LKEIL+LFEN+SLDSKEN  N+CD GLEIQEKI S+IGEVPIEEWMGPSNAIEGYVPHR+
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRD 180

Query: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
           H +MTL SKDGKE KDGSKAKIK LG GKDFFSDFS  +T+ITDEEYSVSKISSGLKEM 
Sbjct: 181 HKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEMA 240

Query: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
            DT SK QTGEFCGK+SN+QF ILETPH+PAP KNSVGRKARGSKERT VSAT ES +NL
Sbjct: 241 LDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTDNL 300

Query: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
           SDAPSTS + STN N+ TEEP GG NDL+ T++KSSLK+PGKKNL RSVTWAD KTD+ S
Sbjct: 301 SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWADEKTDDAS 360

Query: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420
           I+NLPE  EMGKTKECSR TSNLVN DN NEDILRVESAEACAMALSQAAEAITSGQ+EV
Sbjct: 361 IMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEAITSGQSEV 420

Query: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480
           SDAVSEAGIIILP PSDANEEAST+  N SEPHS SEKSNK G+LRSDLFDP DSWYD+P
Sbjct: 421 SDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDSWYDAP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKD+KFHEEF YIDG+EYP KIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCL 600
           SEIKQTLAGCLTR+IPGLASEL LSTPIS LE+GM+                       L
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAH----------------------L 600

Query: 601 LDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSD 660
           LDTMTFLDALPAFR KQWQVIVLLFIEALSV RIPSL S +S SR+L+HKVLDRAQIRSD
Sbjct: 601 LDTMTFLDALPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSD 660

Query: 661 EYETLKDHILPLGRTAQFPGENGA 684
           EYE ++DHILPLGRTAQ   EN A
Sbjct: 661 EYEIMRDHILPLGRTAQLSDENDA 662

BLAST of Cp4.1LG03g13150 vs. TAIR 10
Match: AT5G26760.2 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF408 (InterPro:IPR007308); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 420.2 bits (1079), Expect = 3.1e-117
Identity = 306/791 (38.69%), Positives = 411/791 (51.96%), Query Frame = 0

Query: 1   MAK-NQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPL 60
           MAK N+ I I D V+KLQL +L+   ++NQLFAA  LMSRSDYEDVVTER+IA LCGY L
Sbjct: 1   MAKDNEAIAINDAVHKLQLYMLENTTDQNQLFAARKLMSRSDYEDVVTERAIAKLCGYTL 60

Query: 61  CQSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPG 120
           CQ  LPSD +R+G+YRISLK+HKVYDL+ET K+CS+ CLI+S+ FSG LQ+ R    +  
Sbjct: 61  CQRFLPSDVSRRGKYRISLKDHKVYDLQETSKFCSAGCLIDSKTFSGSLQEARTLEFDSV 120

Query: 121 KLKEILRLF-ENLSLDSKENTRNSCDLG-LEIQEKIVSSIGEVPIEEWMGPSNAIEGYVP 180
           KL EIL LF ++L +    +     DL  L I+E       E+ +E+WMGPSNA+EGYVP
Sbjct: 121 KLNEILDLFGDSLEVKGSLDVNKDLDLSKLMIKENFGVRGEELSLEKWMGPSNAVEGYVP 180

Query: 181 HRNHNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLK 240
                  +  S D K     ++ K            +  F +TVI  +  SVSK+    K
Sbjct: 181 FDR----SKSSNDSKATTQSNQEK-----------HEMDFTSTVIMPDVNSVSKLPPQTK 240

Query: 241 EMTFDTKS---------KEQT--------GEFCGKQSNEQFTILETPHSPAPTKNSV--- 300
           + +   +S         KEQT          F  ++  E+ T        A  K +V   
Sbjct: 241 QASTVVESVDGKGKTVLKEQTVVPPTKKVSRFRREKEKEKKTFGVDGMGCAQEKTTVLPR 300

Query: 301 -----------GRKARGSKERTNVSATAESNN-----NLSDAPSTSNHCSTNCNI----- 360
                        K  G  E    S+   S+      ++S  P  S   S +C +     
Sbjct: 301 KILSFCNEIEKDFKNFGFDEMGLASSAMMSDGYGVEYSVSKQPQCSMEDSLSCKLKGDLQ 360

Query: 361 ------TTEEPNGGSN-------------------------------------------- 420
                 T    + GSN                                            
Sbjct: 361 TLDGKNTLSGSSSGSNTKGSKTKPEKSRKKIISVEYHANSYEDGEEILAAESYERHKAQD 420

Query: 421 --DLNETQIKSSLKQPGKKNLRRSVTWADAKTDETSIINL-PEDREMGKTKECSRMTSNL 480
               +E   KS LK  G K L RSVTWAD       +  +   D   G +     ++SN 
Sbjct: 421 VCSSSEIVTKSCLKISGSKKLSRSVTWADQNDGRGDLCEVRNNDNAAGPS-----LSSND 480

Query: 481 VNADNGNEDILRVESAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILPRPSDANEEAS 540
           +   N    + R+  AEA A ALSQAAEA++SG ++ SDA ++AGII+LP     +EE  
Sbjct: 481 IEDVN---SLSRLALAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQLDEE-- 540

Query: 541 TNGENISEPHSSSEKS----------NKPGILRSDLFDPDDSWYDSPPEGFSLTLSSFAT 600
                ++E HS  E +          NKPGI  SDLFD D SW+D PPEGF+LTLS+FA 
Sbjct: 541 -----VTEEHSEEEMTEEEPTLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAV 600

Query: 601 MWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRSSEIKQTLAGCLTR 660
           MW ++F W++SSSLAYIYGK+E  HEEF  ++G+EYPR+I+  DG SSEIKQT+AGCL R
Sbjct: 601 MWDSLFGWVSSSSLAYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLAR 660

Query: 661 SIPGLASELKLSTPISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCLLDTMTFLDALPAF 685
           ++P + + L+L   IS LE G                      LG LL+TM+   A+P+F
Sbjct: 661 ALPRVVTHLRLPIAISELEKG----------------------LGSLLETMSLTGAVPSF 720

BLAST of Cp4.1LG03g13150 vs. TAIR 10
Match: AT5G26760.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 266.5 bits (680), Expect = 5.7e-71
Identity = 189/477 (39.62%), Positives = 256/477 (53.67%), Query Frame = 0

Query: 226 EYSVSKISSGLKEMTFDTKSKEQTGEFCGKQSNEQFTILETPHSPAPTKNSVGRKARGSK 285
           EYSVSK      E +   K K       GK          T    +   N+ G K +  K
Sbjct: 16  EYSVSKQPQCSMEDSLSCKLKGDLQTLDGK---------NTLSGSSSGSNTKGSKTKPEK 75

Query: 286 ERTNVSATAESNNNLSD------APSTSNHCSTN-CNITTEEPNGGSNDLNETQIKSSLK 345
            R  + +     N+  D      A S   H + + C+             +E   KS LK
Sbjct: 76  SRKKIISVEYHANSYEDGEEILAAESYERHKAQDVCS------------SSEIVTKSCLK 135

Query: 346 QPGKKNLRRSVTWADAKTDETSIINL-PEDREMGKTKECSRMTSNLVNADNGNEDILRVE 405
             G K L RSVTWAD       +  +   D   G +     ++SN +   N    + R+ 
Sbjct: 136 ISGSKKLSRSVTWADQNDGRGDLCEVRNNDNAAGPS-----LSSNDIEDVN---SLSRLA 195

Query: 406 SAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSE 465
            AEA A ALSQAAEA++SG ++ SDA ++AGII+LP     +EE       ++E HS  E
Sbjct: 196 LAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQLDEE-------VTEEHSEEE 255

Query: 466 KS----------NKPGILRSDLFDPDDSWYDSPPEGFSLTLSSFATMWMAIFAWMTSSSL 525
            +          NKPGI  SDLFD D SW+D PPEGF+LTLS+FA MW ++F W++SSSL
Sbjct: 256 MTEEEPTLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAVMWDSLFGWVSSSSL 315

Query: 526 AYIYGKDEKFHEEFQYIDGREYPRKIVSADGRSSEIKQTLAGCLTRSIPGLASELKLSTP 585
           AYIYGK+E  HEEF  ++G+EYPR+I+  DG SSEIKQT+AGCL R++P + + L+L   
Sbjct: 316 AYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLARALPRVVTHLRLPIA 375

Query: 586 ISSLEHGMSEDRIRSSNFLISCPDGPFVLLGCLLDTMTFLDALPAFRTKQWQVIVLLFIE 645
           IS LE G                      LG LL+TM+   A+P+FR K+W VIVLLF++
Sbjct: 376 ISELEKG----------------------LGSLLETMSLTGAVPSFRVKEWLVIVLLFLD 430

Query: 646 ALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSDEYETLKDHILPLGRTAQFPGENGA 685
           ALSV RIP +   +S+      K+L+ + I ++EYET+KD +LPLGR  QF   +GA
Sbjct: 436 ALSVSRIPRIAPYISNR----DKILEGSGIGNEEYETMKDILLPLGRVPQFATRSGA 430

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
F4K1B1          4.4e-116  38.69     Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Arabidops...
A2Y040          2.7e-97   35.80     Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sat...
Q6AVZ9          5.9e-97   36.20     Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sat...
Q8IXW5          1.5e-07   23.66     Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Homo sapiens OX=9...
Q5RA37          2.0e-07   23.66     Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Pongo abelii OX=9...

Match Name      E-value   Identity  Description
XP_023528028.1  0.0       96.78     putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [...
KAG6581990.1    0.0       96.05     putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein, partia...
KAG7018411.1    0.0       95.91     putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein [Cucurb...
XP_022955995.1  0.0       95.61     putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [...
XP_022980004.1  0.0       93.71     putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [...

Match Name      E-value   Identity  Description
A0A6J1GWL9      0.0       95.61     RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita moschata...
A0A6J1IY57      0.0       93.71     RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita maxima O...
A0A6J1GVD1      0.0       91.67     RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita moschata...
A0A6J1IUY3      0.0       90.06     RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita maxima O...
A0A0A0KVU3      0.0       81.87     RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis sativus OX...

Match Name      E-value   Identity  Description
AT5G26760.2     3.1e-117  38.69     unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF408 ...
AT5G26760.1     5.7e-71   39.62     unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                Source       Source Term  Source Description   Alignment
IPR007308  Rtr1/RPAP2 domain              PFAM         PF04181      RPAP2_Rtr1           coord: 36..108; e-value: 7.9E-23; score: 80.6
IPR007308  Rtr1/RPAP2 domain              PROSITE      PS51479      ZF_RTR1              coord: 32..117; score: 20.946203
IPR038534  Rtr1/RPAP2 domain superfamily  GENE3D       1.25.40.820                       coord: 2..145; e-value: 7.8E-30; score: 105.8
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 263..330
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 438..462
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 433..465
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 287..330
IPR039693  Rtr1/RPAP2                     PANTHER      PTHR14732    UNCHARACTERIZED      coord: 4..678
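
Several of the MobiDB-lite disorder predictions listed above overlap (for example 263..330 and 287..330). A minimal sketch of collapsing them into non-redundant regions is given below, assuming 1-based inclusive residue coordinates. The function names are hypothetical and are not part of InterProScan or MobiDB-lite.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # Extend the previous region when the new one overlaps or touches it.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(intervals):
    """Count residues covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


# Coordinates copied from the four MOBIDB_LITE rows of the InterPro table.
disorder = [(263, 330), (438, 462), (433, 465), (287, 330)]
print(merge_intervals(disorder))
print(covered_residues(disorder))
```

Sorting first means each region only has to be compared with the last merged one, so a single pass is enough.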

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG03g13150.1  Cp4.1LG03g13150.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0070940 dephosphorylation of RNA polymerase II C-terminal domain
cellular_component GO:0005634 nucleus
molecular_function GO:0046872 metal ion binding
molecular_function GO:0043175 RNA polymerase core enzyme binding
molecular_function GO:0008420 RNA polymerase II CTD heptapeptide repeat phosphatase activity
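
For downstream use, the GO assignments above can be grouped by category. The following is a minimal sketch, assuming each row is laid out as category, accession, then term name separated by single spaces, as on this page. The name `group_go_terms` is hypothetical.

```python
from collections import defaultdict

# Rows copied verbatim from the GO Assignments table of this record.
GO_ROWS = """\
biological_process GO:0070940 dephosphorylation of RNA polymerase II C-terminal domain
cellular_component GO:0005634 nucleus
molecular_function GO:0046872 metal ion binding
molecular_function GO:0043175 RNA polymerase core enzyme binding
molecular_function GO:0008420 RNA polymerase II CTD heptapeptide repeat phosphatase activity
"""


def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split only twice so that spaces inside the term name are preserved.
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_terms = group_go_terms(GO_ROWS)
for category, terms in go_terms.items():
    print(category, [accession for accession, _ in terms])
```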