CmoCh05G006050 (gene) Cucurbita moschata (Rifu)

Name: CmoCh05G006050
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Poly(A) polymerase (EC 2.7.7.19)
Location: Cmo_Chr05 : 3021298 .. 3026828 (+)
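
As a quick sanity check on the location field, the span it covers can be computed from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The helper names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re

# Pattern for a location value such as "Cmo_Chr05 : 3021298 .. 3026828 (+)".
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location value into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


seqid, start, end, strand = parse_location("Cmo_Chr05 : 3021298 .. 3026828 (+)")
print(seqid, strand, span_length(start, end))  # Cmo_Chr05 + 5531
```

Under that assumption the feature spans 5531 bp on the plus strand of Cmo_Chr05.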
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGGGTTCAGACAGTTCAACTGCTTTTCACCCGGCACCTCAGCCGGCGAGTAATTATGGTGTTACGAAACCGATTTCTCTTGCTGGACCGACGGATGCTGATATTCAGCGCAATATCGAACTGGAGAAGGTGTTGTTACTTTTATTTATTGGTTAAATCTCAATTAATTTGGTTGTTTTTTTTGTGCCGTGTGAAATTGCAGTAAACGAAAAGAATTTCAATTTTGATTGTGTTCTACATATTGGTAGTTCTTGATTGACTCGGGGCTTTACGAAAGTAAAGAAGAAGCTGCGAAAAGAGAAGAGGTTTTGGGCCGCATTGACCAGGTGCAGACCTTCTTCACTTCAATAGTCTTTTACTTAGGGGATTGTTCTCAGTAGCCAGCTCGAACAAAACAGTGTTCTAATCGGTGACGAAAGCGAGATTTTAGTTGATTAATTATTGTTTAATGTATTCAAGCGTTTGACTTCCTTTTATGACATAAACTTCTTGTCCTTAGCCATTTTCTTTCATTCTAGATTGTTAAAAATTGGGTCAAGCAATTAACTCTCCTAAGAGGATATACAGAGCAGATGGTGGAGGATGCAAATGCAGTAATTTTTACTTTTGGGTCTTATCGTCTTGGGGTAAGCATAATATTTGATTTAGTTTTTGGGGAAAAAAGCTTATTACTTTATGCAACTCTTCGTAATTTTTGTTGGAGTCACGGACTGGTTGGATGATTGCTTAGAGCAATGATCTTTTATTTCTTGGAAAGTTATGTTCTTCCTGTATTTTTCCCTTATTGGCCTGTGCATTATACTCAAATATATGACTTCCAAGTATTCAAAGATGTGACTTAAAATTAACCATATGACAGGTACATGGACCAGGAAGTGACATAGACACATTGTGTGTTGGACCATCTTACGTGAATAGAGAAGTGAGTAGTATAAATTGCTGATACTTCAACAACATTTTTTGCCTTCTCATATGGGCTTAAATTTCAGCTTATTCTTGATCTGACAATTTGTTTTTTTTTTTCTAATGGAGGACTTCTTCATCATCTTGCATAATATTTTAGCTGAAATGGAAGAGGTTACTGATCTTCAACCGGTTCCCGATGCTCATGTTCCAGTAATGAGATTCAAATTTATGGGGATATCAATTGATCTTCTGTATGCGAGCATTTCCGTACGGGTTGTTCCTGAAGTAAGTTCTAGCACTTTCTTTGAAGTTTTATTATATTTATTTTATGTTTTCAGATTGTTGTTACTTTATTTTATGTTTTCACAAAACTGTAACCATATTTCTTTTTGTTACATTATTTATTGTAGTTGATCATATGAGATTTCGAACTATTGTATGTTATATGTTACTTCTCTCTCTACTTCTATATTCAGCTTTATGGCGAATATTTCCATAATTCATGCAGAACTACCTGTCTGATAAAAGTCTAAATGCCAAACATTGTAATGTGAATGGACATGAATGCCAATTGTGATTACTTTTATGGATTATTTTTGTAGTTTTATTTATCGTTTGAACCGTTTCTACCAGCTTTCATGTACCCTCCAGTGCAGTCACCATCTTTTCTTGTTTGCATATCCTGTGGCTGAAGCTGTATAATTTGTTCCCTCCTTTCCCGACAAATGTGGTTTTGATTGAATTTCAGGGATTTCTTTCTTCTATCTTCTTGTTTTATTTTATTTTATCTTTTACATGGCTAGCGGCTGGATAGAGAAATTTCTCTCCTGAAAGAGGACTATTATATACAACAGGAATTGCTCTTTTCTCCCTCACACCATTCCGAACATTTTTTATTTTAATTATCTGATAAGTTCGTTTTGTTACCTTGAGTTTGTGTAGGTGCACTAATTGTTCACTGTGTGTGTGTGTGTGTTAACACTGAACAGGTGATACATTTTTGCATAACCTTATCTGCTGTAGATTTAACTGTGATTGGGTGGTATCACTTACTAATTGTTTTACTACTAATGTTTCAATCCTCTTTTTCC
CTGATCTTTTGCACTGTTGCTATTTCCCATAAACTTTTTGCAATTTACTTAATCATCAGGAGCTGAACATCTCTCATGGATCTGTGCTGTGCAATGTTGATGAACAAACTGTTCGAAGTCTTAATGGATGCCGAGTTGCAGATCAAATTCTAAGACTTGTTCCAAACGTTGAGGTGAAACACATTTTAAACCTGAGTTATTTTTCCATAAATCTATTTCGTGATTTTCTCTGTAACCTATAATTGCCAATTGCAGCACTTTCGCACGACACTCAGATGTTTAAAGTTTTGGGCCAAAAGACGGGGTGTATACTCCAATGTAAGTTGTAAGATCATGGAGTTTTTCCTCTTAGCTTCACTTATATTTTTTCATAATTCAGATTTCACTTTTGTTTTTTTTTTCATGTATAGGTCACTGGGTTCCTTGGAGGTGTAAATTGGGCACTTTTGGTTGCTCAGGTATGCCAGCTTTACCCCAATGCAATTCCAAGCATGCTAGTTTCTCGTTTCTTCAGAGTGTACACCCAGTGGCGTTGGCCAAATCCTGTAATGCTATGTTCAATAGAAGAGAATGAACTTGGATTTCCTGTTTGGGATCCTCGTAAAAATCCGCGTGATCGATTTCATCTTATGCCAATAATAACTCCTGCTTACCCTTGCATGAACTCTAGCTATAATGTCTCAACTAGTACTCTCCGTGTTATGATGGAACAGTTCCGCTGTGGTAATAATATCTGTGAGGTGAGGTTATTTTTATTTGGATTCAATATCTTCTAAATTTTAACCTGCTCTTAATAGTATTTCTCTTACATCTCTGAGTGCCTTAACAGGAGATTGACATGAGCAAAGCCCAGTGGAGTGCTTTATTTGAGCCATATTTATTTTTCGAGATATACAAAAACTATCTACAGGTGGACATAATTGCAGCAGATGCTGATGATTTGCTAGCTTGGAAAGGATGGGTAGAATCTCGCTTTAGACAGCTTACCTTGAAGGTAAATCTATTATTCTTTCCTTATTATCTATCTAGAGTTGGAGGGTCTTTCAAGAGCCTGATAGATTCTACTTTTCCTTCCTGCAGATAGAGAGAGACACCAGGGGGATGCTGCAGTGCCATCCCTATCCTATTGAGTATTCTGACACGTCCAAGCCATGTTCTCACTGTGCTTTTTTCATGGGCTTGCAGAGAAAAGAGGGATTAAGAGGCCAAGGAGGCCAGCAGTTTGACATACGAGGAACAGTTGATGAGTTCAGGCAAGAGATAAACACGTATGCTTTTTGGAAGCCAGGGATGGATATCCATGTTTCTCATGTGCGGAGGAAGCAACTGCCCACCTTTGTTTTTCCTGATGGACACAAACGGGCAAAGCCAGTCAGGCATGAAGGGCAGCAAGCTGACCCAGTTTGTGCTGACATGCTCCAAGATCAGTCTGGGATCACTGAGAAAGGGAAAAAAAGGAAAAGTGACCATGAAGAAACAGAGATGGAAAAGAAACAAGCCTTTGCCAGTCAACCAGCAGAAGAATCTCCCATGCTTGAAACTCATGGTGGTGGATCAGATGGGAAATTGCCGAGTTTAAAATCTGCAAATGCAGATTGTTACTCTGAGGTTTGGCCATCTTTTGAACAGCTTGATAGCAGAATGGATACTGATGGAAATGGCATGGATATTCCATCGTTGACCAAGGAGACTGGTCCTACGATGGATCAGGCAGAGCTAGCTAAAGTAATGGAAGGATCTTCCTCAACGAAGGAGGTGCCTGATCTATATCAAGGCGGTCTTTCGAAATCCGAGGAAGCTTTACAAATTGAGATGAACCAGGAAAAAATTGAGGGATTAGCATCTAATATGAGTGGAAGTGCACAGACAGTAGCCATCAGAAATTTCCTCCATTGGACAAAGGATGTTGTGAGGATTGACTCTGAGTCAGCAAATCCATTTGGTCAAACGACCGGGGAAGAGAGTACCCAAGTTGACTTTCAACCAAATTGCAATGCACA
TAACCTTAGTTGCAAGGTGAGTGACTGAGTTTGACTACATTGGATTACAAATCATAGCCAATGGCAGTAAGAGAAGTGGCATTCTGGAGATAGTGAGTCGGGTATTTGCAGGGCAATGATAGCAGGACGGATCCAGAGTTGGCGTTGGAAAATGGTTCCGTTGTGACTGGAAGAGTGTTTCAAAATGGTCAACCTAAAGAGTTGGAGGTTTGAATCTCTAATTGCTTAAATTACTTACATTTTATATATAAGTACACAGCACATTATAACTGATGACGTTGCTTCCACCAAGGAATCTAAGATGAGAGGAGGCTGAGATTTTAATTGATATCAATCAGAAGAATAGAAGATATAAACATATTTATGGTTTGATACCTTAAGTTTTTATTCTTTTTGCCAAAAAGCACTCGTTGTTTCATTCCATATCTCTTCTCCATGCCTTCTCTCCTTTTCTCTCTTTCCGACCCATAATTGTTGGTCTATTTTCCTGTTGCATTTTAATTATCATTTTGTAATTCCAAATTTTCGCACAGGCAATCTTCTGACTGGTGCACATGTGGTGGCAGGTCCTGCGCCACTATACGTGTTATATTTTCAGTAACTGTGCTAAAGAACCCATTTCTGCTCTCAAGTTTTTCCCGCATTACCTTTTTTCCCTCATGAAAATCTTTGCCTAATTAGTTCTTTTCTTTTTTAAACTTCATTTTATAGCCGAAGTTCAGCGATCAGGCAGTCCCAGAATGGAGCTAAATAAGACTCTATGCAAAATTCTGGTGAAAAAGAGACGGGAGGATCTGGCAAGAAGGCATGTGTATTTAGCCTCGATGCAATTTCAATTAGAAAAAGGATTATTTTTCTGAATCTGGCAAAAGGATTAGTACTTAATTTGCCTGATTATTATGTTTATTTTGATCATCTTCTCTTTATGTTGCTTGTTTGGCTGTTTAGCATCCTTTGAACAGACTGAATCTGAACCTCAAAGACATGAACATGTGGAGGGGAATTTCTGGATGGCAGATGGGTTGGGAGAGCGGGAGCTAGCCATATTTAGTAAGTATCACCTCAATAACTGGCTTTGATATTGATGTAGATAGTGTTCATCATGCATGAACCTGCGGACATGGGGAGGGGTCGCGTGGCTTGGTGGCGTGGCATACAATTCAGCTCCAATATGCTGTTTGTAGTCCTTGAACATGGGAATCCGGAGAAGCAAGCACTGTTGGACCTTAAATAAATGCGCTTGTGAGTTATGCTTTGAAGATTGTGGTGCATTTCATGTTAGATGCCTGTAACTGGAACTGTCACAGCTTCTTCCCATGTTTAGAAGACGGCAGCTTCTTTGTGAAAAGGAGGAACTGCTCGTGAGGTCGTTTCTAAGGAGAGAATCTAATCTACACTTATTGAGAGAGACCTTCAGTCTTGCAGATTAACCATATTGAGTTTCAATTGAACATTGTGCGGAGAAAGTTTCCATGTTTTCTTCCATTTCATCTCTGTTTTTCTTTTCCTCCTCTGTTCTTGGGGGCTGGGG

mRNA sequence

ATGGTGGGTTCAGACAGTTCAACTGCTTTTCACCCGGCACCTCAGCCGGCGAGTAATTATGGTGTTACGAAACCGATTTCTCTTGCTGGACCGACGGATGCTGATATTCAGCGCAATATCGAACTGGAGAAGTTCTTGATTGACTCGGGGCTTTACGAAAGTAAAGAAGAAGCTGCGAAAAGAGAAGAGGTTTTGGGCCGCATTGACCAGGTGCAGACCTTCTTCACTTCAATAGGGATTGTTCTCAGTAGCCAGCTCGAACAAAACAGTGTTCTAATCGGTGACGAAAGCGAGATTTTAATTGTTAAAAATTGGGTCAAGCAATTAACTCTCCTAAGAGGATATACAGAGCAGATGGTGGAGGATGCAAATGCAGTAATTTTTACTTTTGGGTCTTATCGTCTTGGGGTAAGCATAATATTTGATTTAGTTTTTGGGGAAAAAAGCTTATTACTTTATGCAACTCTTCGTAATTTTTGTTGGAGTCACGGACTGGTACATGGACCAGGAAGTGACATAGACACATTGTGTGTTGGACCATCTTACGTGAATAGAGAAGACTTCTTCATCATCTTGCATAATATTTTAGCTGAAATGGAAGAGGTTACTGATCTTCAACCGGTTCCCGATGCTCATGTTCCAGTAATGAGATTCAAATTTATGGGGATATCAATTGATCTTCTGTATGCGAGCATTTCCGTACGGGTTGTTCCTGAAGAGCTGAACATCTCTCATGGATCTGTGCTGTGCAATGTTGATGAACAAACTGTTCGAAGTCTTAATGGATGCCGAGTTGCAGATCAAATTCTAAGACTTGTTCCAAACGTTGAGCACTTTCGCACGACACTCAGATGTTTAAAGTTTTGGGCCAAAAGACGGGGTGTATACTCCAATGTCACTGGGTTCCTTGGAGGTGTATGCCAGCTTTACCCCAATGCAATTCCAAGCATGCTAGTTTCTCGTTTCTTCAGAGTGTACACCCAGTGGCGTTGGCCAAATCCTGTAATGCTATGTTCAATAGAAGAGAATGAACTTGGATTTCCTGTTTGGGATCCTCGTAAAAATCCGCGTGATCGATTTCATCTTATGCCAATAATAACTCCTGCTTACCCTTGCATGAACTCTAGCTATAATGTCTCAACTAGTACTCTCCGTGTTATGATGGAACAGTTCCGCTGTGGTAATAATATCTGTGAGGAGATTGACATGAGCAAAGCCCAGTGGAGTGCTTTATTTGAGCCATATTTATTTTTCGAGATATACAAAAACTATCTACAGGTGGACATAATTGCAGCAGATGCTGATGATTTGCTAGCTTGGAAAGGATGGGTAGAATCTCGCTTTAGACAGCTTACCTTGAAGAGTTGGAGGGTCTTTCAAGAGCCTGATAGATTCTACTTTTCCTTCCTGCAGATAGAGAGAGACACCAGGGGGATGCTGCAGTGCCATCCCTATCCTATTGAGTATTCTGACACGTCCAAGCCATGTTCTCACTGTGCTTTTTTCATGGGCTTGCAGAGAAAAGAGGGATTAAGAGGCCAAGGAGGCCAGCAGTTTGACATACGAGGAACAGTTGATGAGTTCAGGCAAGAGATAAACACGTATGCTTTTTGGAAGCCAGGGATGGATATCCATGTTTCTCATGTGCGGAGGAAGCAACTGCCCACCTTTGTTTTTCCTGATGGACACAAACGGGCAAAGCCAGTCAGGCATGAAGGGCAGCAAGCTGACCCAGTTTGTGCTGACATGCTCCAAGATCAGTCTGGGATCACTGAGAAAGGGAAAAAAAGGAAAAGTGACCATGAAGAAACAGAGATGGAAAAGAAACAAGCCTTTGCCAGTCAACCAGCAGAAGAATCTCCCATGCTTGAAACTCATGGTGGTGGATCAGATGGGAAATTGCCGAGTTTAAAATCTGCAAATGCAGATTGTTACTCTGAGGTTTGGCCATCTTTTGAACAGCTTGATAGCAGAATGGATACTGATGGAAATGGCATGGA
TATTCCATCGTTGACCAAGGAGACTGGTCCTACGATGGATCAGGCAGAGCTAGCTAAAGTAATGGAAGGATCTTCCTCAACGAAGGAGGTGCCTGATCTATATCAAGGCGGTCTTTCGAAATCCGAGGAAGCTTTACAAATTGAGATGAACCAGGAAAAAATTGAGGGATTAGCATCTAATATGAGTGGAAGTGCACAGACAGTAGCCATCAGAAATTTCCTCCATTGGACAAAGGATGTTGTGAGGATTGACTCTGAGTCAGCAAATCCATTTGGTCAAACGACCGGGGAAGAGAGTACCCAAGTTGACTTTCAACCAAATTGCAATGCACATAACCTTAGTTGCAAGTTTGACTACATTGGATTACAAATCATAGCCAATGGCAGTAAGAGAAGTGGCATTCTGGAGATAGTGAGTCGGGGCAATGATAGCAGGACGGATCCAGAGTTGGCGTTGGAAAATGGTTCCGTTGTGACTGGAAGAGTGTTTCAAAATGGTCAACCTAAAGAGCAATCTTCTGACTGGTGCACATGTGGTGGCAGGTCCTGCGCCACTATACCCGAAGTTCAGCGATCAGGCAGTCCCAGAATGGAGCTAAATAAGACTCTATGCAAAATTCTGGTGAAAAAGAGACGGGAGGATCTGGCAAGAAGGCATCATCCTTTGAACAGACTGAATCTGAACCTCAAAGACATGAACATGTGGAGGGGAATTTCTGGATGGCAGATGGGTTGGGAGAGCGGGAGCTAGCCATATTTAGTAAGTATCACCTCAATAACTGGCTTTGATATTGATGTAGATAGTGTTCATCATGCATGAACCTGCGGACATGGGGAGGGGTCGCGTGGCTTGGTGGCGTGGCATACAATTCAGCTCCAATATGCTGTTTGTAGTCCTTGAACATGGGAATCCGGAGAAGCAAGCACTGTTGGACCTTAAATAAATGCGCTTGTGAGTTATGCTTTGAAGATTGTGGTGCATTTCATGTTAGATGCCTGTAACTGGAACTGTCACAGCTTCTTCCCATGTTTAGAAGACGGCAGCTTCTTTGTGAAAAGGAGGAACTGCTCGTGAGGTCGTTTCTAAGGAGAGAATCTAATCTACACTTATTGAGAGAGACCTTCAGTCTTGCAGATTAACCATATTGAGTTTCAATTGAACATTGTGCGGAGAAAGTTTCCATGTTTTCTTCCATTTCATCTCTGTTTTTCTTTTCCTCCTCTGTTCTTGGGGGCTGGGG

Coding sequence (CDS)

ATGGTGGGTTCAGACAGTTCAACTGCTTTTCACCCGGCACCTCAGCCGGCGAGTAATTATGGTGTTACGAAACCGATTTCTCTTGCTGGACCGACGGATGCTGATATTCAGCGCAATATCGAACTGGAGAAGTTCTTGATTGACTCGGGGCTTTACGAAAGTAAAGAAGAAGCTGCGAAAAGAGAAGAGGTTTTGGGCCGCATTGACCAGGTGCAGACCTTCTTCACTTCAATAGGGATTGTTCTCAGTAGCCAGCTCGAACAAAACAGTGTTCTAATCGGTGACGAAAGCGAGATTTTAATTGTTAAAAATTGGGTCAAGCAATTAACTCTCCTAAGAGGATATACAGAGCAGATGGTGGAGGATGCAAATGCAGTAATTTTTACTTTTGGGTCTTATCGTCTTGGGGTAAGCATAATATTTGATTTAGTTTTTGGGGAAAAAAGCTTATTACTTTATGCAACTCTTCGTAATTTTTGTTGGAGTCACGGACTGGTACATGGACCAGGAAGTGACATAGACACATTGTGTGTTGGACCATCTTACGTGAATAGAGAAGACTTCTTCATCATCTTGCATAATATTTTAGCTGAAATGGAAGAGGTTACTGATCTTCAACCGGTTCCCGATGCTCATGTTCCAGTAATGAGATTCAAATTTATGGGGATATCAATTGATCTTCTGTATGCGAGCATTTCCGTACGGGTTGTTCCTGAAGAGCTGAACATCTCTCATGGATCTGTGCTGTGCAATGTTGATGAACAAACTGTTCGAAGTCTTAATGGATGCCGAGTTGCAGATCAAATTCTAAGACTTGTTCCAAACGTTGAGCACTTTCGCACGACACTCAGATGTTTAAAGTTTTGGGCCAAAAGACGGGGTGTATACTCCAATGTCACTGGGTTCCTTGGAGGTGTATGCCAGCTTTACCCCAATGCAATTCCAAGCATGCTAGTTTCTCGTTTCTTCAGAGTGTACACCCAGTGGCGTTGGCCAAATCCTGTAATGCTATGTTCAATAGAAGAGAATGAACTTGGATTTCCTGTTTGGGATCCTCGTAAAAATCCGCGTGATCGATTTCATCTTATGCCAATAATAACTCCTGCTTACCCTTGCATGAACTCTAGCTATAATGTCTCAACTAGTACTCTCCGTGTTATGATGGAACAGTTCCGCTGTGGTAATAATATCTGTGAGGAGATTGACATGAGCAAAGCCCAGTGGAGTGCTTTATTTGAGCCATATTTATTTTTCGAGATATACAAAAACTATCTACAGGTGGACATAATTGCAGCAGATGCTGATGATTTGCTAGCTTGGAAAGGATGGGTAGAATCTCGCTTTAGACAGCTTACCTTGAAGAGTTGGAGGGTCTTTCAAGAGCCTGATAGATTCTACTTTTCCTTCCTGCAGATAGAGAGAGACACCAGGGGGATGCTGCAGTGCCATCCCTATCCTATTGAGTATTCTGACACGTCCAAGCCATGTTCTCACTGTGCTTTTTTCATGGGCTTGCAGAGAAAAGAGGGATTAAGAGGCCAAGGAGGCCAGCAGTTTGACATACGAGGAACAGTTGATGAGTTCAGGCAAGAGATAAACACGTATGCTTTTTGGAAGCCAGGGATGGATATCCATGTTTCTCATGTGCGGAGGAAGCAACTGCCCACCTTTGTTTTTCCTGATGGACACAAACGGGCAAAGCCAGTCAGGCATGAAGGGCAGCAAGCTGACCCAGTTTGTGCTGACATGCTCCAAGATCAGTCTGGGATCACTGAGAAAGGGAAAAAAAGGAAAAGTGACCATGAAGAAACAGAGATGGAAAAGAAACAAGCCTTTGCCAGTCAACCAGCAGAAGAATCTCCCATGCTTGAAACTCATGGTGGTGGATCAGATGGGAAATTGCCGAGTTTAAAATCTGCAAATGCAGATTGTTACTCTGAGGTTTGGCCATCTTTTGAACAGCTTGATAGCAGAATGGATACTGATGGAAATGGCATGGA
TATTCCATCGTTGACCAAGGAGACTGGTCCTACGATGGATCAGGCAGAGCTAGCTAAAGTAATGGAAGGATCTTCCTCAACGAAGGAGGTGCCTGATCTATATCAAGGCGGTCTTTCGAAATCCGAGGAAGCTTTACAAATTGAGATGAACCAGGAAAAAATTGAGGGATTAGCATCTAATATGAGTGGAAGTGCACAGACAGTAGCCATCAGAAATTTCCTCCATTGGACAAAGGATGTTGTGAGGATTGACTCTGAGTCAGCAAATCCATTTGGTCAAACGACCGGGGAAGAGAGTACCCAAGTTGACTTTCAACCAAATTGCAATGCACATAACCTTAGTTGCAAGTTTGACTACATTGGATTACAAATCATAGCCAATGGCAGTAAGAGAAGTGGCATTCTGGAGATAGTGAGTCGGGGCAATGATAGCAGGACGGATCCAGAGTTGGCGTTGGAAAATGGTTCCGTTGTGACTGGAAGAGTGTTTCAAAATGGTCAACCTAAAGAGCAATCTTCTGACTGGTGCACATGTGGTGGCAGGTCCTGCGCCACTATACCCGAAGTTCAGCGATCAGGCAGTCCCAGAATGGAGCTAAATAAGACTCTATGCAAAATTCTGGTGAAAAAGAGACGGGAGGATCTGGCAAGAAGGCATCATCCTTTGAACAGACTGAATCTGAACCTCAAAGACATGAACATGTGGAGGGGAATTTCTGGATGGCAGATGGGTTGGGAGAGCGGGAGCTAG
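
A coding sequence like the one above can be checked for basic consistency before it is translated or aligned. The sketch below is a minimal check and assumes the standard stop codons (TAA, TAG, TGA) and an ATG start. `check_cds` is a hypothetical helper name, and whitespace is stripped first because the sequence is wrapped across lines on this page.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(seq):
    """Return a list of problems found in a CDS string (empty list means none)."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    seq = "".join(seq.split()).upper()
    problems = []

    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    if not seq.startswith("ATG"):
        problems.append("does not start with ATG")

    # Split into complete in-frame codons; a trailing partial codon is ignored.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]

    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("does not end with a stop codon")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        problems.append("contains an in-frame stop codon before the end")

    return problems


# Toy example built from the first codons of the CDS plus its terminal TAG.
print(check_cds("ATGGTGGGT\nTAG"))  # []
```

Passing the full CDS text (both wrapped lines joined) to `check_cds` would run the same checks on the real sequence.
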
BLAST of CmoCh05G006050 vs. Swiss-Prot
Match: PAPS4_ARATH (Nuclear poly(A) polymerase 4 OS=Arabidopsis thaliana GN=PAPS4 PE=1 SV=1)

HSP 1 Score: 786.2 bits (2029), Expect = 3.9e-226
Identity = 404/627 (64.43%), Positives = 458/627 (73.05%), Query Frame = 1

Query: 1   MVGSDSSTAFHPAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAK 60
           MVG+ +     P      +YG+TKP+SLAGP+ ADI+RN+ELEK+L+D GLYESK++  +
Sbjct: 2   MVGTQNLGGSLPPLNSPKSYGITKPLSLAGPSSADIKRNVELEKYLVDEGLYESKDDTMR 61

Query: 61  REEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMV 120
           REEVLGRIDQ                              IVK+WVKQLT  RGYT+QMV
Sbjct: 62  REEVLGRIDQ------------------------------IVKHWVKQLTQQRGYTDQMV 121

Query: 121 EDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGP 180
           EDANAVIFTFGSYRLGV                             HGPG+DIDTLCVGP
Sbjct: 122 EDANAVIFTFGSYRLGV-----------------------------HGPGADIDTLCVGP 181

Query: 181 SYVNRE-DFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPE 240
           SYVNRE DFFIILH+ILAEMEEVT+L PVPDAHVPVM+FKF GI IDLLYASIS+ VVP+
Sbjct: 182 SYVNREEDFFIILHDILAEMEEVTELHPVPDAHVPVMKFKFQGIPIDLLYASISLLVVPQ 241

Query: 241 ELNISHGSVLCNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNV 300
           +L+IS  SVLC VDE TVRSLNGCRVADQIL+LVPN EHFRTTLRCLK+WAK+RGVYSNV
Sbjct: 242 DLDISSSSVLCEVDEPTVRSLNGCRVADQILKLVPNFEHFRTTLRCLKYWAKKRGVYSNV 301

Query: 301 TGFLGG---------VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVW 360
           TGFLGG         VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLC+IEE+ELGFPVW
Sbjct: 302 TGFLGGVNWALLVARVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCAIEEDELGFPVW 361

Query: 361 DPRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSA 420
           D RKN RDR+HLMPIITPAYPCMNSSYNVS STLRVM EQF+ GNNI +EI+++K  WS+
Sbjct: 362 DRRKNHRDRYHLMPIITPAYPCMNSSYNVSQSTLRVMTEQFQFGNNILQEIELNKQHWSS 421

Query: 421 LFEPYLFFEIYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFL 480
           LFE Y+FFE YKNYLQVDI+AADA+DLLAWKGWVESRFRQLTLK                
Sbjct: 422 LFEQYMFFEAYKNYLQVDIVAADAEDLLAWKGWVESRFRQLTLK---------------- 481

Query: 481 QIERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQ 540
            IERDT GML CHP P EY DT++   HCAFFMGLQR EG+ GQ  QQFDIRGTVDEFRQ
Sbjct: 482 -IERDTNGMLMCHPQPNEYVDTARQFLHCAFFMGLQRAEGVGGQECQQFDIRGTVDEFRQ 541

Query: 541 EINTYAFWKPGMDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSG- 600
           E+N Y FWKPGMD+ VSHVRR+QLP FVFP+G++R +  RH+         D     SG 
Sbjct: 542 EVNMYMFWKPGMDVFVSHVRRRQLPPFVFPNGYRRPRQSRHQNLPGGKSGEDGSVSHSGS 552

Query: 601 ITEKGKKRKSDHE--ETEMEKKQAFAS 615
           + E+  KRK+D E  +   EK +  AS
Sbjct: 602 VVERHAKRKNDSEMMDVRPEKPEKRAS 552
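
The percentages in each HSP header are the matched-position counts divided by the alignment length. The sketch below recomputes them from a header line. `hsp_percentages` is an illustrative helper name, and the pattern for the second field deliberately accepts spelling variants of the label.

```python
import re

# Label patterns; the second one matches any word beginning with "Pos".
FIELDS = (
    ("identity", r"Identity\s*=\s*(\d+)/(\d+)"),
    ("positives", r"Pos\w*\s*=\s*(\d+)/(\d+)"),
)


def hsp_percentages(header):
    """Recompute identity and positives percentages from an HSP header line."""
    result = {}
    for key, pattern in FIELDS:
        match = re.search(pattern, header)
        if match is None:
            raise ValueError(f"field {key!r} not found in header")
        matched, length = int(match.group(1)), int(match.group(2))
        result[key] = round(100.0 * matched / length, 2)
    return result


line = "Identity = 404/627 (64.43%), Positives = 458/627 (73.05%), Query Frame = 1"
print(hsp_percentages(line))
```

For the PAPS4_ARATH hit this gives 64.43 and 73.05, which agrees with the values printed in the report.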

BLAST of CmoCh05G006050 vs. Swiss-Prot
Match: PAPS2_ARATH (Nuclear poly(A) polymerase 2 OS=Arabidopsis thaliana GN=PAPS2 PE=1 SV=2)

HSP 1 Score: 780.8 bits (2015), Expect = 1.6e-224
Identity = 394/615 (64.07%), Positives = 455/615 (73.98%), Query Frame = 1

Query: 12  PAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQV 71
           P      +YG+T+P+S+AGP+ AD++RN+ELEKFL+D GLYESKEE  +REEV+ RIDQ 
Sbjct: 15  PVKASLKSYGITEPLSIAGPSAADVKRNLELEKFLVDEGLYESKEETMRREEVVVRIDQ- 74

Query: 72  QTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFG 131
                                        IVK+WVKQLT  RGYT+QMVEDANAVIFTFG
Sbjct: 75  -----------------------------IVKHWVKQLTRQRGYTDQMVEDANAVIFTFG 134

Query: 132 SYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNR-EDFFI 191
           SYRLG                             VHGP +DIDTLCVGPSYVNR EDFFI
Sbjct: 135 SYRLG-----------------------------VHGPMADIDTLCVGPSYVNREEDFFI 194

Query: 192 ILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLC 251
              +ILAEMEEVT+LQPV DAHVPVM+FKF GISIDLLYASIS+ V+P++L+IS+ SVLC
Sbjct: 195 FFRDILAEMEEVTELQPVTDAHVPVMKFKFQGISIDLLYASISLLVIPQDLDISNSSVLC 254

Query: 252 NVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGV---- 311
           +VDEQTVRSLNGCRVADQIL+LVPN EHFRTTLRCLK+WAK+RGVYSNVTGFLGGV    
Sbjct: 255 DVDEQTVRSLNGCRVADQILKLVPNSEHFRTTLRCLKYWAKKRGVYSNVTGFLGGVNWAL 314

Query: 312 -----CQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFH 371
                CQ YPNAIPSMLVSRFFRVYTQWRWPNPVMLC+IEE++L FPVWDPRKN RDR+H
Sbjct: 315 LVARLCQFYPNAIPSMLVSRFFRVYTQWRWPNPVMLCAIEEDDLSFPVWDPRKNHRDRYH 374

Query: 372 LMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIY 431
           LMPIITPAYPCMNSSYNVS STLRVM EQF+ GN IC+EI+++K  WS+LF+ Y+FFE Y
Sbjct: 375 LMPIITPAYPCMNSSYNVSQSTLRVMTEQFQFGNTICQEIELNKQHWSSLFQQYMFFEAY 434

Query: 432 KNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQ 491
           KNYLQVD++AADA+DLLAWKGWVESRFRQLTLK                 IERDT GML 
Sbjct: 435 KNYLQVDVLAADAEDLLAWKGWVESRFRQLTLK-----------------IERDTNGMLM 494

Query: 492 CHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPG 551
           CHP P EY DTSK   HCAFFMGLQR +G  GQ  QQFDIRGTVDEFRQE+N Y FW+PG
Sbjct: 495 CHPQPNEYVDTSKQFRHCAFFMGLQRADGFGGQECQQFDIRGTVDEFRQEVNMYMFWRPG 553

Query: 552 MDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGITEKGKKRKSDH 611
           MD+HVSHVRR+QLP+FVFP+G+KR++  RH+ QQ      + +   S   E+  KRK+D 
Sbjct: 555 MDVHVSHVRRRQLPSFVFPNGYKRSRQSRHQSQQCREPGDEGVGSLSDSVERYAKRKNDD 553

Query: 612 E--ETEMEKKQAFAS 615
           E   +  EK++  AS
Sbjct: 615 EIMNSRPEKREKRAS 553

BLAST of CmoCh05G006050 vs. Swiss-Prot
Match: PAPS1_ARATH (Nuclear poly(A) polymerase 1 OS=Arabidopsis thaliana GN=PAPS1 PE=1 SV=1)

HSP 1 Score: 618.2 bits (1593), Expect = 1.4e-175
Identity = 319/654 (48.78%), Positives = 405/654 (61.93%), Query Frame = 1

Query: 15  QPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTF 74
           Q    +GV++PIS+ GPT+ D+ +  ELEK L D GLYESKEEA +REEVLG +DQ    
Sbjct: 6   QNGQRFGVSEPISMGGPTEFDVIKTRELEKHLQDVGLYESKEEAVRREEVLGILDQ---- 65

Query: 75  FTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYR 134
                                     IVK W+K ++  +G  +Q++ +ANA IFTFGSYR
Sbjct: 66  --------------------------IVKTWIKTISRAKGLNDQLLHEANAKIFTFGSYR 125

Query: 135 LGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNRE-DFFIILH 194
           LG                             VHGPG+DIDTLCVGP +  RE DFF  L 
Sbjct: 126 LG-----------------------------VHGPGADIDTLCVGPRHATREGDFFGELQ 185

Query: 195 NILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLCNVD 254
            +L+EM EVT+L PVPDAHVP+M FK  G+SIDLLYA + + V+PE+L++S  S+L N D
Sbjct: 186 RMLSEMPEVTELHPVPDAHVPLMGFKLNGVSIDLLYAQLPLWVIPEDLDLSQDSILQNAD 245

Query: 255 EQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGG-------- 314
           EQTVRSLNGCRV DQILRLVPN+++FRTTLRC++FWAKRRGVYSNV+GFLGG        
Sbjct: 246 EQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGFLGGINWALLVA 305

Query: 315 -VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFHLMP 374
            +CQLYPNA+P++LVSRFFRV+ QW WPN + LCS +E  LG  VWDPR NP+DR H+MP
Sbjct: 306 RICQLYPNALPNILVSRFFRVFYQWNWPNAIFLCSPDEGSLGLQVWDPRINPKDRLHIMP 365

Query: 375 IITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNY 434
           IITPAYPCMNSSYNVS STLR+M  +F+ GN ICE ++ +KA W  LFEP+ FFE YKNY
Sbjct: 366 IITPAYPCMNSSYNVSESTLRIMKGEFQRGNEICEAMESNKADWDTLFEPFAFFEAYKNY 425

Query: 435 LQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHP 494
           LQ+DI AA+ DDL  WKGWVESR RQLTLK  R F+                  ML CHP
Sbjct: 426 LQIDISAANVDDLRKWKGWVESRLRQLTLKIERHFK------------------MLHCHP 485

Query: 495 YPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPGMDI 554
           +P ++ DTS+P  HC++FMGLQRK+G+    G+QFDIR TV+EF+  +N Y  W PGM+I
Sbjct: 486 HPHDFQDTSRPL-HCSYFMGLQRKQGVPAAEGEQFDIRRTVEEFKHTVNAYTLWIPGMEI 545

Query: 555 HVSHVRRKQLPTFVFPDG------------HKRAKPVRHEGQQADPVCADMLQDQSGITE 614
            V H++R+ LP FVFP G              R    R+    + P       + S  ++
Sbjct: 546 SVGHIKRRSLPNFVFPGGVRPSHTSKGTWDSNRRSEHRNSSTSSAPAATTTTTEMSSESK 581

Query: 615 KGKKRKSDHEETEMEKKQAFASQP------AEESPMLETHGGGSDGKLPSLKSA 641
            G     D ++ +    +    QP      A   P+    GG  +  + S+ S+
Sbjct: 606 AGSNSPVDGKKRKWGDSETLTDQPRNSKHIAVSVPVENCEGGSPNPSVGSICSS 581

BLAST of CmoCh05G006050 vs. Swiss-Prot
Match: PAP_DICDI (Poly(A) polymerase OS=Dictyostelium discoideum GN=papA PE=3 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 6.1e-110
Identity = 239/625 (38.24%), Positives = 331/625 (52.96%), Query Frame = 1

Query: 21  GVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTFFTSIGI 80
           GVT+PIS A P+  D + + ELE  LI   L+ES EE+ KREE+LG+++Q          
Sbjct: 54  GVTEPISTAPPSSIDFKLSTELENTLISFNLFESPEESRKREEILGKLNQ---------- 113

Query: 81  VLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYRLGVSII 140
                               IV+ W KQ++L +GY EQ   +  A IFTFGSYRLG    
Sbjct: 114 --------------------IVREWAKQVSLKKGYPEQTASEVVAKIFTFGSYRLG---- 173

Query: 141 FDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNREDFFIILHNILAEME 200
                                    VHGPGSDIDTLCVGP ++ R DFF  L +IL    
Sbjct: 174 -------------------------VHGPGSDIDTLCVGPKHIMRSDFFDDLSDILKVHP 233

Query: 201 EVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELN-ISHGSVLCNVDEQTVRS 260
           E+T+   V DA VPV+   F GI IDL+YA +++  +PEELN +   S L N+DE+++ S
Sbjct: 234 EITEFTTVKDAFVPVITMVFSGIPIDLIYAKLALTAIPEELNDLIDESFLKNIDEKSILS 293

Query: 261 LNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGV---------CQLY 320
           LNGCRV DQIL+LVPN+ +FR  LRC+K WA RRG+YSN+ GFLGGV         CQLY
Sbjct: 294 LNGCRVTDQILKLVPNIPNFRMALRCIKLWAIRRGIYSNILGFLGGVSWALLTARICQLY 353

Query: 321 PNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENE-LGFPVWDPRKNPRDRFHLMPIITPA 380
           PN+ PS ++ RFF+VY  W+WP P++LC I+E   LG  VW+P+   RD+ HLMPIITPA
Sbjct: 354 PNSAPSTIIHRFFKVYEIWKWPAPILLCHIQEGGILGPKVWNPK---RDKAHLMPIITPA 413

Query: 381 YPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNYLQVDI 440
           YP MNS+YNVS STL++M  +F  G  I  +I+  +  W  L E   FF  Y  Y+++D 
Sbjct: 414 YPSMNSTYNVSKSTLQLMKSEFVRGAEITRKIETGECTWKNLLEKCDFFTRYSFYIEIDC 473

Query: 441 IAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHPYPIEY 500
            + + +D   W+GW+ES+                RF  S L+    T  M    PYP  +
Sbjct: 474 YSMNEEDSRKWEGWIESKL---------------RFLISNLE---STPKMKFAVPYPKGF 533

Query: 501 SD----TSKPCSHC-AFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPG--- 560
           ++     + P   C +FFMGL           +  D+   V EF   I  +   +P    
Sbjct: 534 TNNLHKANNPDQICTSFFMGLSFNFSNTPGADKSVDLTKAVTEFTGIIKDWLRTQPNPDT 586

Query: 561 MDIHVSHVRRKQLPTFVFPDG-HKRAKPVRHEGQQADPVCADMLQDQSGITEKGKKRKSD 620
           MDI V ++++KQLP FV  +G  +  K  +      +P            +   KK KS+
Sbjct: 594 MDIKVQYIKKKQLPAFVKDEGPEEPVKTTKKRSSTGEP------------SATRKKLKSE 586

Query: 621 HEETEMEKKQAFASQPAEESPMLET 626
           + + ++   ++  +     +P   T
Sbjct: 654 NSDNKLNSPKSPITTNINSTPTTST 586

BLAST of CmoCh05G006050 vs. Swiss-Prot
Match: PAPOA_HUMAN (Poly(A) polymerase alpha OS=Homo sapiens GN=PAPOLA PE=1 SV=4)

HSP 1 Score: 375.6 bits (963), Expect = 1.6e-102
Identity = 215/552 (38.95%), Positives = 301/552 (54.53%), Query Frame = 1

Query: 16  PASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTFF 75
           P  +YG+T PISLA P + D     +L + L   G++E +EE  +R  +LG+++      
Sbjct: 16  PQKHYGITSPISLAAPKETDCVLTQKLIETLKPFGVFEEEEELQRRILILGKLNN----- 75

Query: 76  TSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYRL 135
                                    +VK W+++++  +   + ++E+    IFTFGSYRL
Sbjct: 76  -------------------------LVKEWIREISESKNLPQSVIENVGGKIFTFGSYRL 135

Query: 136 GVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNREDFFIILHNI 195
           GV                             H  G+DID LCV P +V+R DFF   ++ 
Sbjct: 136 GV-----------------------------HTKGADIDALCVAPRHVDRSDFFTSFYDK 195

Query: 196 LAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLCNVDEQ 255
           L   EEV DL+ V +A VPV++  F GI ID+L+A ++++ +PE+L++   S+L N+D +
Sbjct: 196 LKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIPEDLDLRDDSLLKNLDIR 255

Query: 256 TVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGV--------- 315
            +RSLNGCRV D+IL LVPN+++FR TLR +K WAKR  +YSN+ GFLGGV         
Sbjct: 256 CIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKRHNIYSNILGFLGGVSWAMLVART 315

Query: 316 CQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFHLMPII 375
           CQLYPNAI S LV +FF V+++W WPNPV+L   EE  L  PVWDPR NP DR+HLMPII
Sbjct: 316 CQLYPNAIASTLVHKFFLVFSKWEWPNPVLLKQPEECNLNLPVWDPRVNPSDRYHLMPII 375

Query: 376 TPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNYLQ 435
           TPAYP  NS+YNVS ST  VM+E+F+ G  I +EI +SKA+WS LFE   FF+ YK+Y+ 
Sbjct: 376 TPAYPQQNSTYNVSVSTRMVMVEEFKQGLAITDEILLSKAEWSKLFEAPNFFQKYKHYIV 435

Query: 436 VDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHPYP 495
           +   A      L W G VES+ R L                S  + E  T   +    +P
Sbjct: 436 LLASAPTEKQRLEWVGLVESKIRILV--------------GSLEKNEFITLAHVNPQSFP 493

Query: 496 IEYSDTSKPCSHCAFFMGLQRKEGLRGQG---GQQFDIRGTVDE-FRQEINTYAFWKPGM 555
               +  K      + +GL  K+    +       +DI+   D  +RQ IN+  F +  M
Sbjct: 496 APKENPDKEEFRTMWVIGLVFKKTENSENLSVDLTYDIQSFTDTVYRQAINSKMF-EVDM 493

BLAST of CmoCh05G006050 vs. TrEMBL
Match: A0A0A0LP30_CUCSA (Poly(A) polymerase beta OS=Cucumis sativus GN=Csa_2G373380 PE=4 SV=1)

HSP 1 Score: 1246.1 bits (3223), Expect = 0.0e+00
Identity = 642/846 (75.89%), Positives = 674/846 (79.67%), Query Frame = 1

Query: 1   MVGSDSSTAFHPAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAK 60
           MVGSD ST       P +NYGVTKPISLAGP DADI RNIELEKFL+DS LYESKEEAAK
Sbjct: 1   MVGSDCSTGLPSVSHPVTNYGVTKPISLAGPMDADIHRNIELEKFLVDSELYESKEEAAK 60

Query: 61  REEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMV 120
           REEVLGRIDQ                              IVK+WVKQLTLLRGYTEQMV
Sbjct: 61  REEVLGRIDQ------------------------------IVKSWVKQLTLLRGYTEQMV 120

Query: 121 EDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGP 180
           EDANAVIFTFGSYRLG                             VHGPGSDIDTLCVGP
Sbjct: 121 EDANAVIFTFGSYRLG-----------------------------VHGPGSDIDTLCVGP 180

Query: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEE 240
           SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKF+GISIDLLYASIS+ VVPE+
Sbjct: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFLGISIDLLYASISLLVVPED 240

Query: 241 LNISHGSVLCNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300
           L+ISHGSVL NVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT
Sbjct: 241 LDISHGSVLYNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300

Query: 301 GFLGG---------VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360
           GFLGG         VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD
Sbjct: 301 GFLGGVNWALLVAQVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360

Query: 361 PRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSAL 420
           PR+NPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFR GN+ICEEID+SKAQWSAL
Sbjct: 361 PRRNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRYGNSICEEIDLSKAQWSAL 420

Query: 421 FEPYLFFEIYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQ 480
           FEPYLFFE YKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK                 
Sbjct: 421 FEPYLFFETYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK----------------- 480

Query: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540
           IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE
Sbjct: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540

Query: 541 INTYAFWKPGMDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGIT 600
           IN YAFWKPGMDI+VSHVRRKQLPTFVFPDGHKRAKP+RHEGQQ D VCADMLQDQSGIT
Sbjct: 541 INMYAFWKPGMDIYVSHVRRKQLPTFVFPDGHKRAKPLRHEGQQVDTVCADMLQDQSGIT 600

Query: 601 EKGKKRKSDHEETEMEKKQAFASQPAEESPMLETHGGGSDGKLPSLKSANADCYSEVWPS 660
           EKGKKRKSDHEE E EKKQA  S PAE+SPM E  GG  DGK PS K ANADC+ +VW S
Sbjct: 601 EKGKKRKSDHEEEEKEKKQALISPPAEQSPMPEFFGGDPDGKWPSSKFANADCHLKVWSS 660

Query: 661 FEQLDSRMDTDGNGMDIPSLTKETGPTMDQAEL-AKVMEGSSSTKEVPDLYQGGLSKSEE 720
           FEQ  SR DT+GNG DI +LTKETG T DQ  L AK +E SSS KEVPDLY+G +S S+E
Sbjct: 661 FEQPVSRTDTNGNGTDIATLTKETGSTGDQLGLPAKEIEESSSRKEVPDLYKGSISTSKE 720

Query: 721 ALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESANPFGQTTGEESTQVD 780
           ALQI  ++E ++GLA NM+GS QTV+IR  LHWTKDVVRIDSES N +G+ TG ESTQV+
Sbjct: 721 ALQIGTDRENVDGLAPNMNGSVQTVSIRTLLHWTKDVVRIDSESGNTYGEMTGGESTQVE 746

Query: 781 FQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDPELALENGSVVTGRVF 837
           FQPNCN HNLSCK                        GNDSRTDP+LAL+NGSVVTGRV 
Sbjct: 781 FQPNCNTHNLSCK------------------------GNDSRTDPDLALDNGSVVTGRVS 746

BLAST of CmoCh05G006050 vs. TrEMBL
Match: A0A061GKW6_THECC (Poly(A) polymerase 1 isoform 2 OS=Theobroma cacao GN=TCM_037229 PE=4 SV=1)

HSP 1 Score: 876.3 bits (2263), Expect = 3.2e-251
Identity = 491/862 (56.96%), Positives = 566/862 (65.66%), Query Frame = 1

Query: 15  QPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTF 74
           Q    YG+TKPISLAGP++AD+QRN ELEKFLI+SGLYESKEEA KREEVLG I++    
Sbjct: 15  QSLKKYGITKPISLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINE---- 74

Query: 75  FTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYR 134
                                     IVK+WVKQLT  RGYT+QMVE+ANAVIFTFGSY 
Sbjct: 75  --------------------------IVKSWVKQLTRQRGYTDQMVEEANAVIFTFGSYC 134

Query: 135 LGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNR-EDFFIILH 194
           LG                             VHGPG+DIDTLC+GPSYVNR EDFFIILH
Sbjct: 135 LG-----------------------------VHGPGADIDTLCIGPSYVNREEDFFIILH 194

Query: 195 NILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLCNVD 254
           +ILAEMEEVT+LQPVPDAHVPVM+FKF GISIDLLYASIS+ VVP+ L+ISHGSVL NVD
Sbjct: 195 DILAEMEEVTELQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVD 254

Query: 255 EQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGG-------- 314
           EQTVRSLNGCRVADQIL+LVPNVEHFR TLRCLKFWAKRRGVYSNVTGFLGG        
Sbjct: 255 EQTVRSLNGCRVADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVA 314

Query: 315 -VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFHLMP 374
            VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEE+ELGFPVWDPRKNPRDRFH MP
Sbjct: 315 RVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMP 374

Query: 375 IITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNY 434
           IITPAYPCMNSSYNVS STLRVMMEQF+CGN ICEEI+++K+QW+ALFEPYLFFE YKNY
Sbjct: 375 IITPAYPCMNSSYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNY 434

Query: 435 LQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHP 494
           LQVDI++A+ADDLLAWKGWVESR RQLTLK                 IERDT GMLQCHP
Sbjct: 435 LQVDIVSAEADDLLAWKGWVESRLRQLTLK-----------------IERDTNGMLQCHP 494

Query: 495 YPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPGMDI 554
           YP EY DTSK   HCAFFMGLQRKEG+ GQ GQQFDIRGTVDEFRQEI+ Y +WKPGMDI
Sbjct: 495 YPNEYVDTSKQFPHCAFFMGLQRKEGVSGQEGQQFDIRGTVDEFRQEISMYMYWKPGMDI 554

Query: 555 HVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGITEKGKKRKSDHE-- 614
           +VSHVRR+QLP FVFPDG+KR +  RH GQQ   +C D+ + QSG  E+  KRK + E  
Sbjct: 555 YVSHVRRRQLPAFVFPDGYKRPRSSRHPGQQTGKICEDITRSQSGSVERQIKRKHEDEAF 614

Query: 615 ETEMEK--KQAFASQPAEESPMLE---THGGG----SDGKLPSLK---SANADCYSEVWP 674
           + +M+K  K++  S    ES   E   +  GG    SDG++ +L+   + + D  S +  
Sbjct: 615 DEKMDKPDKRSSISPQRLESVSPESSASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQ 674

Query: 675 SFEQLDSRMDTDG------NGMDIPSLTKETGPTMDQAELAKVMEGSSSTKEV--PDLYQ 734
           S   LDS     G        +D  SLT     ++D      V+    S +++  P L Q
Sbjct: 675 SSGLLDSEKRNVGISIQQARTVDQGSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQ 734

Query: 735 GGLSKSE-------EALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESA 794
              S  E       E  +  +NQEK    +S     A+T + R  L+W    V +D E  
Sbjct: 735 ESHSPCEVPDSELRETCKTGVNQEKTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVV 776

Query: 795 NPFGQTTGEESTQVDFQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDP 838
            P  QT   E  +  F  + NA NL+C+                        G     D 
Sbjct: 795 KPCNQTAVVEIAESVFGSSSNAQNLNCE------------------------GVVCSADL 776

BLAST of CmoCh05G006050 vs. TrEMBL
Match: A0A061GK78_THECC (Poly(A) polymerase 1 isoform 3 OS=Theobroma cacao GN=TCM_037229 PE=4 SV=1)

HSP 1 Score: 876.3 bits (2263), Expect = 3.2e-251
Identity = 491/862 (56.96%), Positives = 566/862 (65.66%), Query Frame = 1

Query: 15  QPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTF 74
           Q    YG+TKPISLAGP++AD+QRN ELEKFLI+SGLYESKEEA KREEVLG I++    
Sbjct: 15  QSLKKYGITKPISLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINE---- 74

Query: 75  FTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYR 134
                                     IVK+WVKQLT  RGYT+QMVE+ANAVIFTFGSY 
Sbjct: 75  --------------------------IVKSWVKQLTRQRGYTDQMVEEANAVIFTFGSYC 134

Query: 135 LGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNR-EDFFIILH 194
           LG                             VHGPG+DIDTLC+GPSYVNR EDFFIILH
Sbjct: 135 LG-----------------------------VHGPGADIDTLCIGPSYVNREEDFFIILH 194

Query: 195 NILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLCNVD 254
           +ILAEMEEVT+LQPVPDAHVPVM+FKF GISIDLLYASIS+ VVP+ L+ISHGSVL NVD
Sbjct: 195 DILAEMEEVTELQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVD 254

Query: 255 EQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGG-------- 314
           EQTVRSLNGCRVADQIL+LVPNVEHFR TLRCLKFWAKRRGVYSNVTGFLGG        
Sbjct: 255 EQTVRSLNGCRVADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVA 314

Query: 315 -VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFHLMP 374
            VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEE+ELGFPVWDPRKNPRDRFH MP
Sbjct: 315 RVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMP 374

Query: 375 IITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNY 434
           IITPAYPCMNSSYNVS STLRVMMEQF+CGN ICEEI+++K+QW+ALFEPYLFFE YKNY
Sbjct: 375 IITPAYPCMNSSYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNY 434

Query: 435 LQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHP 494
           LQVDI++A+ADDLLAWKGWVESR RQLTLK                 IERDT GMLQCHP
Sbjct: 435 LQVDIVSAEADDLLAWKGWVESRLRQLTLK-----------------IERDTNGMLQCHP 494

Query: 495 YPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPGMDI 554
           YP EY DTSK   HCAFFMGLQRKEG+ GQ GQQFDIRGTVDEFRQEI+ Y +WKPGMDI
Sbjct: 495 YPNEYVDTSKQFPHCAFFMGLQRKEGVSGQEGQQFDIRGTVDEFRQEISMYMYWKPGMDI 554

Query: 555 HVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGITEKGKKRKSDHE-- 614
           +VSHVRR+QLP FVFPDG+KR +  RH GQQ   +C D+ + QSG  E+  KRK + E  
Sbjct: 555 YVSHVRRRQLPAFVFPDGYKRPRSSRHPGQQTGKICEDITRSQSGSVERQIKRKHEDEAF 614

Query: 615 ETEMEK--KQAFASQPAEESPMLE---THGGG----SDGKLPSLK---SANADCYSEVWP 674
           + +M+K  K++  S    ES   E   +  GG    SDG++ +L+   + + D  S +  
Sbjct: 615 DEKMDKPDKRSSISPQRLESVSPESSASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQ 674

Query: 675 SFEQLDSRMDTDG------NGMDIPSLTKETGPTMDQAELAKVMEGSSSTKEV--PDLYQ 734
           S   LDS     G        +D  SLT     ++D      V+    S +++  P L Q
Sbjct: 675 SSGLLDSEKRNVGISIQQARTVDQGSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQ 734

Query: 735 GGLSKSE-------EALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESA 794
              S  E       E  +  +NQEK    +S     A+T + R  L+W    V +D E  
Sbjct: 735 ESHSPCEVPDSELRETCKTGVNQEKTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVV 776

Query: 795 NPFGQTTGEESTQVDFQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDP 838
            P  QT   E  +  F  + NA NL+C+                        G     D 
Sbjct: 795 KPCNQTAVVEIAESVFGSSSNAQNLNCE------------------------GVVCSADL 776

BLAST of CmoCh05G006050 vs. TrEMBL
Match: A0A061GIW2_THECC (Poly(A) polymerase 1 isoform 5 OS=Theobroma cacao GN=TCM_037229 PE=4 SV=1)

HSP 1 Score: 876.3 bits (2263), Expect = 3.2e-251
Identity = 491/862 (56.96%), Positives = 566/862 (65.66%), Query Frame = 1

Query: 15  QPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTF 74
           Q    YG+TKPISLAGP++AD+QRN ELEKFLI+SGLYESKEEA KREEVLG I++    
Sbjct: 15  QSLKKYGITKPISLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINE---- 74

Query: 75  FTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYR 134
                                     IVK+WVKQLT  RGYT+QMVE+ANAVIFTFGSY 
Sbjct: 75  --------------------------IVKSWVKQLTRQRGYTDQMVEEANAVIFTFGSYC 134

Query: 135 LGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNR-EDFFIILH 194
           LG                             VHGPG+DIDTLC+GPSYVNR EDFFIILH
Sbjct: 135 LG-----------------------------VHGPGADIDTLCIGPSYVNREEDFFIILH 194

Query: 195 NILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLCNVD 254
           +ILAEMEEVT+LQPVPDAHVPVM+FKF GISIDLLYASIS+ VVP+ L+ISHGSVL NVD
Sbjct: 195 DILAEMEEVTELQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVD 254

Query: 255 EQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGG-------- 314
           EQTVRSLNGCRVADQIL+LVPNVEHFR TLRCLKFWAKRRGVYSNVTGFLGG        
Sbjct: 255 EQTVRSLNGCRVADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVA 314

Query: 315 -VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFHLMP 374
            VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEE+ELGFPVWDPRKNPRDRFH MP
Sbjct: 315 RVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMP 374

Query: 375 IITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNY 434
           IITPAYPCMNSSYNVS STLRVMMEQF+CGN ICEEI+++K+QW+ALFEPYLFFE YKNY
Sbjct: 375 IITPAYPCMNSSYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNY 434

Query: 435 LQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHP 494
           LQVDI++A+ADDLLAWKGWVESR RQLTLK                 IERDT GMLQCHP
Sbjct: 435 LQVDIVSAEADDLLAWKGWVESRLRQLTLK-----------------IERDTNGMLQCHP 494

Query: 495 YPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPGMDI 554
           YP EY DTSK   HCAFFMGLQRKEG+ GQ GQQFDIRGTVDEFRQEI+ Y +WKPGMDI
Sbjct: 495 YPNEYVDTSKQFPHCAFFMGLQRKEGVSGQEGQQFDIRGTVDEFRQEISMYMYWKPGMDI 554

Query: 555 HVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGITEKGKKRKSDHE-- 614
           +VSHVRR+QLP FVFPDG+KR +  RH GQQ   +C D+ + QSG  E+  KRK + E  
Sbjct: 555 YVSHVRRRQLPAFVFPDGYKRPRSSRHPGQQTGKICEDITRSQSGSVERQIKRKHEDEAF 614

Query: 615 ETEMEK--KQAFASQPAEESPMLE---THGGG----SDGKLPSLK---SANADCYSEVWP 674
           + +M+K  K++  S    ES   E   +  GG    SDG++ +L+   + + D  S +  
Sbjct: 615 DEKMDKPDKRSSISPQRLESVSPESSASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQ 674

Query: 675 SFEQLDSRMDTDG------NGMDIPSLTKETGPTMDQAELAKVMEGSSSTKEV--PDLYQ 734
           S   LDS     G        +D  SLT     ++D      V+    S +++  P L Q
Sbjct: 675 SSGLLDSEKRNVGISIQQARTVDQGSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQ 734

Query: 735 GGLSKSE-------EALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESA 794
              S  E       E  +  +NQEK    +S     A+T + R  L+W    V +D E  
Sbjct: 735 ESHSPCEVPDSELRETCKTGVNQEKTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVV 776

Query: 795 NPFGQTTGEESTQVDFQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDP 838
            P  QT   E  +  F  + NA NL+C+                        G     D 
Sbjct: 795 KPCNQTAVVEIAESVFGSSSNAQNLNCE------------------------GVVCSADL 776

BLAST of CmoCh05G006050 vs. TrEMBL
Match: A0A061GRQ3_THECC (Poly(A) polymerase 1 isoform 4 OS=Theobroma cacao GN=TCM_037229 PE=4 SV=1)

HSP 1 Score: 876.3 bits (2263), Expect = 3.2e-251
Identity = 491/862 (56.96%), Positives = 566/862 (65.66%), Query Frame = 1

Query: 15  QPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTF 74
           Q    YG+TKPISLAGP++AD+QRN ELEKFLI+SGLYESKEEA KREEVLG I++    
Sbjct: 15  QSLKKYGITKPISLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINE---- 74

Query: 75  FTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYR 134
                                     IVK+WVKQLT  RGYT+QMVE+ANAVIFTFGSY 
Sbjct: 75  --------------------------IVKSWVKQLTRQRGYTDQMVEEANAVIFTFGSYC 134

Query: 135 LGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNR-EDFFIILH 194
           LG                             VHGPG+DIDTLC+GPSYVNR EDFFIILH
Sbjct: 135 LG-----------------------------VHGPGADIDTLCIGPSYVNREEDFFIILH 194

Query: 195 NILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLCNVD 254
           +ILAEMEEVT+LQPVPDAHVPVM+FKF GISIDLLYASIS+ VVP+ L+ISHGSVL NVD
Sbjct: 195 DILAEMEEVTELQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVD 254

Query: 255 EQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGG-------- 314
           EQTVRSLNGCRVADQIL+LVPNVEHFR TLRCLKFWAKRRGVYSNVTGFLGG        
Sbjct: 255 EQTVRSLNGCRVADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVA 314

Query: 315 -VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFHLMP 374
            VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEE+ELGFPVWDPRKNPRDRFH MP
Sbjct: 315 RVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMP 374

Query: 375 IITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNY 434
           IITPAYPCMNSSYNVS STLRVMMEQF+CGN ICEEI+++K+QW+ALFEPYLFFE YKNY
Sbjct: 375 IITPAYPCMNSSYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNY 434

Query: 435 LQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHP 494
           LQVDI++A+ADDLLAWKGWVESR RQLTLK                 IERDT GMLQCHP
Sbjct: 435 LQVDIVSAEADDLLAWKGWVESRLRQLTLK-----------------IERDTNGMLQCHP 494

Query: 495 YPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPGMDI 554
           YP EY DTSK   HCAFFMGLQRKEG+ GQ GQQFDIRGTVDEFRQEI+ Y +WKPGMDI
Sbjct: 495 YPNEYVDTSKQFPHCAFFMGLQRKEGVSGQEGQQFDIRGTVDEFRQEISMYMYWKPGMDI 554

Query: 555 HVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGITEKGKKRKSDHE-- 614
           +VSHVRR+QLP FVFPDG+KR +  RH GQQ   +C D+ + QSG  E+  KRK + E  
Sbjct: 555 YVSHVRRRQLPAFVFPDGYKRPRSSRHPGQQTGKICEDITRSQSGSVERQIKRKHEDEAF 614

Query: 615 ETEMEK--KQAFASQPAEESPMLE---THGGG----SDGKLPSLK---SANADCYSEVWP 674
           + +M+K  K++  S    ES   E   +  GG    SDG++ +L+   + + D  S +  
Sbjct: 615 DEKMDKPDKRSSISPQRLESVSPESSASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQ 674

Query: 675 SFEQLDSRMDTDG------NGMDIPSLTKETGPTMDQAELAKVMEGSSSTKEV--PDLYQ 734
           S   LDS     G        +D  SLT     ++D      V+    S +++  P L Q
Sbjct: 675 SSGLLDSEKRNVGISIQQARTVDQGSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQ 734

Query: 735 GGLSKSE-------EALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESA 794
              S  E       E  +  +NQEK    +S     A+T + R  L+W    V +D E  
Sbjct: 735 ESHSPCEVPDSELRETCKTGVNQEKTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVV 776

Query: 795 NPFGQTTGEESTQVDFQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDP 838
            P  QT   E  +  F  + NA NL+C+                        G     D 
Sbjct: 795 KPCNQTAVVEIAESVFGSSSNAQNLNCE------------------------GVVCSADL 776

BLAST of CmoCh05G006050 vs. TAIR10
Match: AT4G32850.8 (AT4G32850.8 nuclear poly(a) polymerase)

HSP 1 Score: 786.2 bits (2029), Expect = 2.2e-227
Identity = 404/627 (64.43%), Positives = 458/627 (73.05%), Query Frame = 1

Query: 1   MVGSDSSTAFHPAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAK 60
           MVG+ +     P      +YG+TKP+SLAGP+ ADI+RN+ELEK+L+D GLYESK++  +
Sbjct: 2   MVGTQNLGGSLPPLNSPKSYGITKPLSLAGPSSADIKRNVELEKYLVDEGLYESKDDTMR 61

Query: 61  REEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMV 120
           REEVLGRIDQ                              IVK+WVKQLT  RGYT+QMV
Sbjct: 62  REEVLGRIDQ------------------------------IVKHWVKQLTQQRGYTDQMV 121

Query: 121 EDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGP 180
           EDANAVIFTFGSYRLGV                             HGPG+DIDTLCVGP
Sbjct: 122 EDANAVIFTFGSYRLGV-----------------------------HGPGADIDTLCVGP 181

Query: 181 SYVNRE-DFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPE 240
           SYVNRE DFFIILH+ILAEMEEVT+L PVPDAHVPVM+FKF GI IDLLYASIS+ VVP+
Sbjct: 182 SYVNREEDFFIILHDILAEMEEVTELHPVPDAHVPVMKFKFQGIPIDLLYASISLLVVPQ 241

Query: 241 ELNISHGSVLCNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNV 300
           +L+IS  SVLC VDE TVRSLNGCRVADQIL+LVPN EHFRTTLRCLK+WAK+RGVYSNV
Sbjct: 242 DLDISSSSVLCEVDEPTVRSLNGCRVADQILKLVPNFEHFRTTLRCLKYWAKKRGVYSNV 301

Query: 301 TGFLGG---------VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVW 360
           TGFLGG         VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLC+IEE+ELGFPVW
Sbjct: 302 TGFLGGVNWALLVARVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCAIEEDELGFPVW 361

Query: 361 DPRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSA 420
           D RKN RDR+HLMPIITPAYPCMNSSYNVS STLRVM EQF+ GNNI +EI+++K  WS+
Sbjct: 362 DRRKNHRDRYHLMPIITPAYPCMNSSYNVSQSTLRVMTEQFQFGNNILQEIELNKQHWSS 421

Query: 421 LFEPYLFFEIYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFL 480
           LFE Y+FFE YKNYLQVDI+AADA+DLLAWKGWVESRFRQLTLK                
Sbjct: 422 LFEQYMFFEAYKNYLQVDIVAADAEDLLAWKGWVESRFRQLTLK---------------- 481

Query: 481 QIERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQ 540
            IERDT GML CHP P EY DT++   HCAFFMGLQR EG+ GQ  QQFDIRGTVDEFRQ
Sbjct: 482 -IERDTNGMLMCHPQPNEYVDTARQFLHCAFFMGLQRAEGVGGQECQQFDIRGTVDEFRQ 541

Query: 541 EINTYAFWKPGMDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSG- 600
           E+N Y FWKPGMD+ VSHVRR+QLP FVFP+G++R +  RH+         D     SG 
Sbjct: 542 EVNMYMFWKPGMDVFVSHVRRRQLPPFVFPNGYRRPRQSRHQNLPGGKSGEDGSVSHSGS 552

Query: 601 ITEKGKKRKSDHE--ETEMEKKQAFAS 615
           + E+  KRK+D E  +   EK +  AS
Sbjct: 602 VVERHAKRKNDSEMMDVRPEKPEKRAS 552

BLAST of CmoCh05G006050 vs. TAIR10
Match: AT2G25850.2 (AT2G25850.2 poly(A) polymerase 2)

HSP 1 Score: 780.8 bits (2015), Expect = 9.3e-226
Identity = 394/615 (64.07%), Positives = 455/615 (73.98%), Query Frame = 1

Query: 12  PAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQV 71
           P      +YG+T+P+S+AGP+ AD++RN+ELEKFL+D GLYESKEE  +REEV+ RIDQ 
Sbjct: 15  PVKASLKSYGITEPLSIAGPSAADVKRNLELEKFLVDEGLYESKEETMRREEVVVRIDQ- 74

Query: 72  QTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFG 131
                                        IVK+WVKQLT  RGYT+QMVEDANAVIFTFG
Sbjct: 75  -----------------------------IVKHWVKQLTRQRGYTDQMVEDANAVIFTFG 134

Query: 132 SYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNR-EDFFI 191
           SYRLG                             VHGP +DIDTLCVGPSYVNR EDFFI
Sbjct: 135 SYRLG-----------------------------VHGPMADIDTLCVGPSYVNREEDFFI 194

Query: 192 ILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLC 251
              +ILAEMEEVT+LQPV DAHVPVM+FKF GISIDLLYASIS+ V+P++L+IS+ SVLC
Sbjct: 195 FFRDILAEMEEVTELQPVTDAHVPVMKFKFQGISIDLLYASISLLVIPQDLDISNSSVLC 254

Query: 252 NVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGV---- 311
           +VDEQTVRSLNGCRVADQIL+LVPN EHFRTTLRCLK+WAK+RGVYSNVTGFLGGV    
Sbjct: 255 DVDEQTVRSLNGCRVADQILKLVPNSEHFRTTLRCLKYWAKKRGVYSNVTGFLGGVNWAL 314

Query: 312 -----CQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFH 371
                CQ YPNAIPSMLVSRFFRVYTQWRWPNPVMLC+IEE++L FPVWDPRKN RDR+H
Sbjct: 315 LVARLCQFYPNAIPSMLVSRFFRVYTQWRWPNPVMLCAIEEDDLSFPVWDPRKNHRDRYH 374

Query: 372 LMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIY 431
           LMPIITPAYPCMNSSYNVS STLRVM EQF+ GN IC+EI+++K  WS+LF+ Y+FFE Y
Sbjct: 375 LMPIITPAYPCMNSSYNVSQSTLRVMTEQFQFGNTICQEIELNKQHWSSLFQQYMFFEAY 434

Query: 432 KNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQ 491
           KNYLQVD++AADA+DLLAWKGWVESRFRQLTLK                 IERDT GML 
Sbjct: 435 KNYLQVDVLAADAEDLLAWKGWVESRFRQLTLK-----------------IERDTNGMLM 494

Query: 492 CHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPG 551
           CHP P EY DTSK   HCAFFMGLQR +G  GQ  QQFDIRGTVDEFRQE+N Y FW+PG
Sbjct: 495 CHPQPNEYVDTSKQFRHCAFFMGLQRADGFGGQECQQFDIRGTVDEFRQEVNMYMFWRPG 553

Query: 552 MDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGITEKGKKRKSDH 611
           MD+HVSHVRR+QLP+FVFP+G+KR++  RH+ QQ      + +   S   E+  KRK+D 
Sbjct: 555 MDVHVSHVRRRQLPSFVFPNGYKRSRQSRHQSQQCREPGDEGVGSLSDSVERYAKRKNDD 553

Query: 612 E--ETEMEKKQAFAS 615
           E   +  EK++  AS
Sbjct: 615 EIMNSRPEKREKRAS 553

BLAST of CmoCh05G006050 vs. TAIR10
Match: AT1G17980.1 (AT1G17980.1 poly(A) polymerase 1)

HSP 1 Score: 618.2 bits (1593), Expect = 8.0e-177
Identity = 319/654 (48.78%), Positives = 405/654 (61.93%), Query Frame = 1

Query: 15  QPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTF 74
           Q    +GV++PIS+ GPT+ D+ +  ELEK L D GLYESKEEA +REEVLG +DQ    
Sbjct: 6   QNGQRFGVSEPISMGGPTEFDVIKTRELEKHLQDVGLYESKEEAVRREEVLGILDQ---- 65

Query: 75  FTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYR 134
                                     IVK W+K ++  +G  +Q++ +ANA IFTFGSYR
Sbjct: 66  --------------------------IVKTWIKTISRAKGLNDQLLHEANAKIFTFGSYR 125

Query: 135 LGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGPSYVNRE-DFFIILH 194
           LG                             VHGPG+DIDTLCVGP +  RE DFF  L 
Sbjct: 126 LG-----------------------------VHGPGADIDTLCVGPRHATREGDFFGELQ 185

Query: 195 NILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLCNVD 254
            +L+EM EVT+L PVPDAHVP+M FK  G+SIDLLYA + + V+PE+L++S  S+L N D
Sbjct: 186 RMLSEMPEVTELHPVPDAHVPLMGFKLNGVSIDLLYAQLPLWVIPEDLDLSQDSILQNAD 245

Query: 255 EQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGG-------- 314
           EQTVRSLNGCRV DQILRLVPN+++FRTTLRC++FWAKRRGVYSNV+GFLGG        
Sbjct: 246 EQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGFLGGINWALLVA 305

Query: 315 -VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFHLMP 374
            +CQLYPNA+P++LVSRFFRV+ QW WPN + LCS +E  LG  VWDPR NP+DR H+MP
Sbjct: 306 RICQLYPNALPNILVSRFFRVFYQWNWPNAIFLCSPDEGSLGLQVWDPRINPKDRLHIMP 365

Query: 375 IITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNY 434
           IITPAYPCMNSSYNVS STLR+M  +F+ GN ICE ++ +KA W  LFEP+ FFE YKNY
Sbjct: 366 IITPAYPCMNSSYNVSESTLRIMKGEFQRGNEICEAMESNKADWDTLFEPFAFFEAYKNY 425

Query: 435 LQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHP 494
           LQ+DI AA+ DDL  WKGWVESR RQLTLK  R F+                  ML CHP
Sbjct: 426 LQIDISAANVDDLRKWKGWVESRLRQLTLKIERHFK------------------MLHCHP 485

Query: 495 YPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPGMDI 554
           +P ++ DTS+P  HC++FMGLQRK+G+    G+QFDIR TV+EF+  +N Y  W PGM+I
Sbjct: 486 HPHDFQDTSRPL-HCSYFMGLQRKQGVPAAEGEQFDIRRTVEEFKHTVNAYTLWIPGMEI 545

Query: 555 HVSHVRRKQLPTFVFPDG------------HKRAKPVRHEGQQADPVCADMLQDQSGITE 614
            V H++R+ LP FVFP G              R    R+    + P       + S  ++
Sbjct: 546 SVGHIKRRSLPNFVFPGGVRPSHTSKGTWDSNRRSEHRNSSTSSAPAATTTTTEMSSESK 581

Query: 615 KGKKRKSDHEETEMEKKQAFASQP------AEESPMLETHGGGSDGKLPSLKSA 641
            G     D ++ +    +    QP      A   P+    GG  +  + S+ S+
Sbjct: 606 AGSNSPVDGKKRKWGDSETLTDQPRNSKHIAVSVPVENCEGGSPNPSVGSICSS 581

BLAST of CmoCh05G006050 vs. TAIR10
Match: AT3G06560.1 (AT3G06560.1 poly(A) polymerase 3)

HSP 1 Score: 242.3 bits (617), Expect = 1.2e-63
Identity = 163/545 (29.91%), Positives = 248/545 (45.50%), Query Frame = 1

Query: 35  DIQRNIELEKFLIDSGLYESKEEAAKREEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIG 94
           D + +I L + +++ GL  S E+  KR  V+ ++ +                        
Sbjct: 14  DDESSISLRQLMVNEGLIPSLEDEVKRRGVINQLRK------------------------ 73

Query: 95  DESEILIVKNWVKQLTLLRGYTEQMVEDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYA 154
                 IV  WVK +       +  ++  NA I  +GSY LGV                 
Sbjct: 74  ------IVVRWVKNVAWQHRLPQNQIDATNATILPYGSYGLGV----------------- 133

Query: 155 TLRNFCWSHGLVHGPGSDIDTLCVGPSYVN-REDFFIILHNILAEMEEVTDLQPVPDAHV 214
                       +G  SDID LC+GP + +  EDFFI L ++L    EV++L  V DA V
Sbjct: 134 ------------YGSESDIDALCIGPFFASIAEDFFISLRDMLKSRREVSELHCVKDAKV 193

Query: 215 PVMRFKFMGISIDLLYASISVRVVPEELNISHGSVLCNVDEQTVRSLNGCRVADQILRLV 274
           P++RFKF GI +DL YA + V  +P  +++ +   L ++DE + + L+G R    IL+LV
Sbjct: 194 PLIRFKFDGILVDLPYAQLRVLSIPNNVDVLNPFFLRDIDETSWKILSGVRANKCILQLV 253

Query: 275 PNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGG---------VCQLYPNAIPSMLVSRFFR 334
           P++E F++ LRC+K WAKRRGVY N+ GFLGG         VC   PNA  S L++ FF 
Sbjct: 254 PSLELFQSLLRCVKLWAKRRGVYGNLNGFLGGVHMAILAAFVCGYQPNATLSSLLANFFY 313

Query: 335 VYTQWRWPNPVMLCSIEENELGFPVWDPRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTL 394
            +  W+WP PV+L        G P             LMPI  P       +  ++ ST 
Sbjct: 314 TFAHWQWPTPVVLLEDTYPSTGAPP-----------GLMPIQLPCGSHQYCNSTITRSTF 373

Query: 395 RVMMEQFRCGNNICEEIDMSKAQWSALFEPYLFFEIYKNYLQVDIIAADADDLLAWKGWV 454
             ++ +F  G+N+ ++       W  LFE Y +   Y  + ++ + AA+ +DL  W GWV
Sbjct: 374 YKIVAEFLLGHNLTKDYLKLNFSWKDLFELYPYANTYTWFTKIHLSAANQEDLSDWVGWV 433

Query: 455 ESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMG 514
           +SRFR L +K   V+                  G+  C P P EY +T     +  F+ G
Sbjct: 434 KSRFRCLLIKIEEVY------------------GI--CDPNPTEYVETYTKQPNIVFYWG 462

Query: 515 LQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPGMDIHVSHVRRKQLPTFVFPDGHK 570
           LQ +           DI     +F + +N+ +F      I ++ V+  QLP       + 
Sbjct: 494 LQLRT------INVSDIESVKIDFLKNVNSGSFRGTVGRIQLTLVKASQLPKNGECGSNN 462

BLAST of CmoCh05G006050 vs. NCBI nr
Match: gi|778671979|ref|XP_011649721.1| (PREDICTED: nuclear poly(A) polymerase 4 isoform X2 [Cucumis sativus])

HSP 1 Score: 1246.1 bits (3223), Expect = 0.0e+00
Identity = 642/846 (75.89%), Positives = 674/846 (79.67%), Query Frame = 1

Query: 1   MVGSDSSTAFHPAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAK 60
           MVGSD ST       P +NYGVTKPISLAGP DADI RNIELEKFL+DS LYESKEEAAK
Sbjct: 1   MVGSDCSTGLPSVSHPVTNYGVTKPISLAGPMDADIHRNIELEKFLVDSELYESKEEAAK 60

Query: 61  REEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMV 120
           REEVLGRIDQ                              IVK+WVKQLTLLRGYTEQMV
Sbjct: 61  REEVLGRIDQ------------------------------IVKSWVKQLTLLRGYTEQMV 120

Query: 121 EDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGP 180
           EDANAVIFTFGSYRLG                             VHGPGSDIDTLCVGP
Sbjct: 121 EDANAVIFTFGSYRLG-----------------------------VHGPGSDIDTLCVGP 180

Query: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEE 240
           SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKF+GISIDLLYASIS+ VVPE+
Sbjct: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFLGISIDLLYASISLLVVPED 240

Query: 241 LNISHGSVLCNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300
           L+ISHGSVL NVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT
Sbjct: 241 LDISHGSVLYNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300

Query: 301 GFLGG---------VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360
           GFLGG         VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD
Sbjct: 301 GFLGGVNWALLVAQVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360

Query: 361 PRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSAL 420
           PR+NPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFR GN+ICEEID+SKAQWSAL
Sbjct: 361 PRRNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRYGNSICEEIDLSKAQWSAL 420

Query: 421 FEPYLFFEIYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQ 480
           FEPYLFFE YKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK                 
Sbjct: 421 FEPYLFFETYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK----------------- 480

Query: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540
           IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE
Sbjct: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540

Query: 541 INTYAFWKPGMDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGIT 600
           IN YAFWKPGMDI+VSHVRRKQLPTFVFPDGHKRAKP+RHEGQQ D VCADMLQDQSGIT
Sbjct: 541 INMYAFWKPGMDIYVSHVRRKQLPTFVFPDGHKRAKPLRHEGQQVDTVCADMLQDQSGIT 600

Query: 601 EKGKKRKSDHEETEMEKKQAFASQPAEESPMLETHGGGSDGKLPSLKSANADCYSEVWPS 660
           EKGKKRKSDHEE E EKKQA  S PAE+SPM E  GG  DGK PS K ANADC+ +VW S
Sbjct: 601 EKGKKRKSDHEEEEKEKKQALISPPAEQSPMPEFFGGDPDGKWPSSKFANADCHLKVWSS 660

Query: 661 FEQLDSRMDTDGNGMDIPSLTKETGPTMDQAEL-AKVMEGSSSTKEVPDLYQGGLSKSEE 720
           FEQ  SR DT+GNG DI +LTKETG T DQ  L AK +E SSS KEVPDLY+G +S S+E
Sbjct: 661 FEQPVSRTDTNGNGTDIATLTKETGSTGDQLGLPAKEIEESSSRKEVPDLYKGSISTSKE 720

Query: 721 ALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESANPFGQTTGEESTQVD 780
           ALQI  ++E ++GLA NM+GS QTV+IR  LHWTKDVVRIDSES N +G+ TG ESTQV+
Sbjct: 721 ALQIGTDRENVDGLAPNMNGSVQTVSIRTLLHWTKDVVRIDSESGNTYGEMTGGESTQVE 746

Query: 781 FQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDPELALENGSVVTGRVF 837
           FQPNCN HNLSCK                        GNDSRTDP+LAL+NGSVVTGRV 
Sbjct: 781 FQPNCNTHNLSCK------------------------GNDSRTDPDLALDNGSVVTGRVS 746

BLAST of CmoCh05G006050 vs. NCBI nr
Match: gi|778671968|ref|XP_011649717.1| (PREDICTED: nuclear poly(A) polymerase 4 isoform X1 [Cucumis sativus])

HSP 1 Score: 1246.1 bits (3223), Expect = 0.0e+00
Identity = 642/846 (75.89%), Positives = 674/846 (79.67%), Query Frame = 1

Query: 1   MVGSDSSTAFHPAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAK 60
           MVGSD ST       P +NYGVTKPISLAGP DADI RNIELEKFL+DS LYESKEEAAK
Sbjct: 1   MVGSDCSTGLPSVSHPVTNYGVTKPISLAGPMDADIHRNIELEKFLVDSELYESKEEAAK 60

Query: 61  REEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMV 120
           REEVLGRIDQ                              IVK+WVKQLTLLRGYTEQMV
Sbjct: 61  REEVLGRIDQ------------------------------IVKSWVKQLTLLRGYTEQMV 120

Query: 121 EDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGP 180
           EDANAVIFTFGSYRLG                             VHGPGSDIDTLCVGP
Sbjct: 121 EDANAVIFTFGSYRLG-----------------------------VHGPGSDIDTLCVGP 180

Query: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEE 240
           SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKF+GISIDLLYASIS+ VVPE+
Sbjct: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFLGISIDLLYASISLLVVPED 240

Query: 241 LNISHGSVLCNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300
           L+ISHGSVL NVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT
Sbjct: 241 LDISHGSVLYNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300

Query: 301 GFLGG---------VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360
           GFLGG         VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD
Sbjct: 301 GFLGGVNWALLVAQVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360

Query: 361 PRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSAL 420
           PR+NPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFR GN+ICEEID+SKAQWSAL
Sbjct: 361 PRRNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRYGNSICEEIDLSKAQWSAL 420

Query: 421 FEPYLFFEIYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQ 480
           FEPYLFFE YKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK                 
Sbjct: 421 FEPYLFFETYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK----------------- 480

Query: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540
           IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE
Sbjct: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540

Query: 541 INTYAFWKPGMDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGIT 600
           IN YAFWKPGMDI+VSHVRRKQLPTFVFPDGHKRAKP+RHEGQQ D VCADMLQDQSGIT
Sbjct: 541 INMYAFWKPGMDIYVSHVRRKQLPTFVFPDGHKRAKPLRHEGQQVDTVCADMLQDQSGIT 600

Query: 601 EKGKKRKSDHEETEMEKKQAFASQPAEESPMLETHGGGSDGKLPSLKSANADCYSEVWPS 660
           EKGKKRKSDHEE E EKKQA  S PAE+SPM E  GG  DGK PS K ANADC+ +VW S
Sbjct: 601 EKGKKRKSDHEEEEKEKKQALISPPAEQSPMPEFFGGDPDGKWPSSKFANADCHLKVWSS 660

Query: 661 FEQLDSRMDTDGNGMDIPSLTKETGPTMDQAEL-AKVMEGSSSTKEVPDLYQGGLSKSEE 720
           FEQ  SR DT+GNG DI +LTKETG T DQ  L AK +E SSS KEVPDLY+G +S S+E
Sbjct: 661 FEQPVSRTDTNGNGTDIATLTKETGSTGDQLGLPAKEIEESSSRKEVPDLYKGSISTSKE 720

Query: 721 ALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESANPFGQTTGEESTQVD 780
           ALQI  ++E ++GLA NM+GS QTV+IR  LHWTKDVVRIDSES N +G+ TG ESTQV+
Sbjct: 721 ALQIGTDRENVDGLAPNMNGSVQTVSIRTLLHWTKDVVRIDSESGNTYGEMTGGESTQVE 746

Query: 781 FQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDPELALENGSVVTGRVF 837
           FQPNCN HNLSCK                        GNDSRTDP+LAL+NGSVVTGRV 
Sbjct: 781 FQPNCNTHNLSCK------------------------GNDSRTDPDLALDNGSVVTGRVS 746

BLAST of CmoCh05G006050 vs. NCBI nr
Match: gi|778671986|ref|XP_011649723.1| (PREDICTED: nuclear poly(A) polymerase 4 isoform X3 [Cucumis sativus])

HSP 1 Score: 1246.1 bits (3223), Expect = 0.0e+00
Identity = 642/846 (75.89%), Positives = 674/846 (79.67%), Query Frame = 1

Query: 1   MVGSDSSTAFHPAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAK 60
           MVGSD ST       P +NYGVTKPISLAGP DADI RNIELEKFL+DS LYESKEEAAK
Sbjct: 1   MVGSDCSTGLPSVSHPVTNYGVTKPISLAGPMDADIHRNIELEKFLVDSELYESKEEAAK 60

Query: 61  REEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMV 120
           REEVLGRIDQ                              IVK+WVKQLTLLRGYTEQMV
Sbjct: 61  REEVLGRIDQ------------------------------IVKSWVKQLTLLRGYTEQMV 120

Query: 121 EDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGP 180
           EDANAVIFTFGSYRLG                             VHGPGSDIDTLCVGP
Sbjct: 121 EDANAVIFTFGSYRLG-----------------------------VHGPGSDIDTLCVGP 180

Query: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEE 240
           SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKF+GISIDLLYASIS+ VVPE+
Sbjct: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFLGISIDLLYASISLLVVPED 240

Query: 241 LNISHGSVLCNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300
           L+ISHGSVL NVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT
Sbjct: 241 LDISHGSVLYNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300

Query: 301 GFLGG---------VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360
           GFLGG         VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD
Sbjct: 301 GFLGGVNWALLVAQVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360

Query: 361 PRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSAL 420
           PR+NPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFR GN+ICEEID+SKAQWSAL
Sbjct: 361 PRRNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRYGNSICEEIDLSKAQWSAL 420

Query: 421 FEPYLFFEIYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQ 480
           FEPYLFFE YKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK                 
Sbjct: 421 FEPYLFFETYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK----------------- 480

Query: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540
           IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE
Sbjct: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540

Query: 541 INTYAFWKPGMDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGIT 600
           IN YAFWKPGMDI+VSHVRRKQLPTFVFPDGHKRAKP+RHEGQQ D VCADMLQDQSGIT
Sbjct: 541 INMYAFWKPGMDIYVSHVRRKQLPTFVFPDGHKRAKPLRHEGQQVDTVCADMLQDQSGIT 600

Query: 601 EKGKKRKSDHEETEMEKKQAFASQPAEESPMLETHGGGSDGKLPSLKSANADCYSEVWPS 660
           EKGKKRKSDHEE E EKKQA  S PAE+SPM E  GG  DGK PS K ANADC+ +VW S
Sbjct: 601 EKGKKRKSDHEEEEKEKKQALISPPAEQSPMPEFFGGDPDGKWPSSKFANADCHLKVWSS 660

Query: 661 FEQLDSRMDTDGNGMDIPSLTKETGPTMDQAEL-AKVMEGSSSTKEVPDLYQGGLSKSEE 720
           FEQ  SR DT+GNG DI +LTKETG T DQ  L AK +E SSS KEVPDLY+G +S S+E
Sbjct: 661 FEQPVSRTDTNGNGTDIATLTKETGSTGDQLGLPAKEIEESSSRKEVPDLYKGSISTSKE 720

Query: 721 ALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESANPFGQTTGEESTQVD 780
           ALQI  ++E ++GLA NM+GS QTV+IR  LHWTKDVVRIDSES N +G+ TG ESTQV+
Sbjct: 721 ALQIGTDRENVDGLAPNMNGSVQTVSIRTLLHWTKDVVRIDSESGNTYGEMTGGESTQVE 746

Query: 781 FQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDPELALENGSVVTGRVF 837
           FQPNCN HNLSCK                        GNDSRTDP+LAL+NGSVVTGRV 
Sbjct: 781 FQPNCNTHNLSCK------------------------GNDSRTDPDLALDNGSVVTGRVS 746

BLAST of CmoCh05G006050 vs. NCBI nr
Match: gi|659088469|ref|XP_008444997.1| (PREDICTED: poly(A) polymerase pla1 isoform X3 [Cucumis melo])

HSP 1 Score: 1245.7 bits (3222), Expect = 0.0e+00
Identity = 648/846 (76.60%), Positives = 671/846 (79.31%), Query Frame = 1

Query: 1   MVGSDSSTAFHPAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAK 60
           MVGSD ST       PA+NYGVTKPISLAGPTDADI RNIELEKFL+DSGLYESKEEAAK
Sbjct: 1   MVGSDCSTGLPSVSHPATNYGVTKPISLAGPTDADIHRNIELEKFLVDSGLYESKEEAAK 60

Query: 61  REEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMV 120
           REEVLGRIDQ                              IVKNWVKQLTLLRGYTEQMV
Sbjct: 61  REEVLGRIDQ------------------------------IVKNWVKQLTLLRGYTEQMV 120

Query: 121 EDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGP 180
           EDANAVIFTFGSYRLG                             VHGPGSDIDTLCVGP
Sbjct: 121 EDANAVIFTFGSYRLG-----------------------------VHGPGSDIDTLCVGP 180

Query: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEE 240
           SYVNREDFFIILH+ILAEMEEVTDLQPVPDAHVPVMRFKF+GISIDLLYASIS+ VVPE+
Sbjct: 181 SYVNREDFFIILHDILAEMEEVTDLQPVPDAHVPVMRFKFLGISIDLLYASISLLVVPED 240

Query: 241 LNISHGSVLCNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300
           L+ISHGSVL NVDEQTVRSLNGCRVADQILRLVPNV HFRTTLRCLKFWAKRRGVYSNVT
Sbjct: 241 LDISHGSVLYNVDEQTVRSLNGCRVADQILRLVPNVGHFRTTLRCLKFWAKRRGVYSNVT 300

Query: 301 GFLGG---------VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360
           GFLGG         VCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD
Sbjct: 301 GFLGGVNWALLVAQVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360

Query: 361 PRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICEEIDMSKAQWSAL 420
           PR+NPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQF  GNNICEEID+SKAQWSAL
Sbjct: 361 PRRNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFCYGNNICEEIDLSKAQWSAL 420

Query: 421 FEPYLFFEIYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLKSWRVFQEPDRFYFSFLQ 480
           FEPYLFFE YKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK                 
Sbjct: 421 FEPYLFFETYKNYLQVDIIAADADDLLAWKGWVESRFRQLTLK----------------- 480

Query: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540
           IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE
Sbjct: 481 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFMGLQRKEGLRGQGGQQFDIRGTVDEFRQE 540

Query: 541 INTYAFWKPGMDIHVSHVRRKQLPTFVFPDGHKRAKPVRHEGQQADPVCADMLQDQSGIT 600
           IN YAFWKPGMDI+VSHVRRKQLPTFVFPDGHKRAKP RHEGQQAD VCADMLQDQSGIT
Sbjct: 541 INMYAFWKPGMDIYVSHVRRKQLPTFVFPDGHKRAKPSRHEGQQADTVCADMLQDQSGIT 600

Query: 601 EKGKKRKSDHEETEMEKKQAFASQPAEESPMLETHGGGSDGKLPSLKSANADCYSEVWPS 660
           EKGKKRKSDHEE E EKKQAF SQPAE+SPM E  GG  DGKLPS K ANADC+ E W S
Sbjct: 601 EKGKKRKSDHEEEEKEKKQAFISQPAEQSPMPEFIGGEPDGKLPSSKFANADCHLEAWSS 660

Query: 661 FEQLDSRMDTDGNGMDIPSLTKETGPTMDQAE-LAKVMEGSSSTKEVPDLYQGGLSKSEE 720
           FEQ DSR DT+GNG DI +LTKETG T D+   LAK M GSS  K VPDLY+     SEE
Sbjct: 661 FEQPDSRTDTNGNGTDISTLTKETGSTGDEVGLLAKEMGGSSLRKGVPDLYKA----SEE 720

Query: 721 ALQIEMNQEKIEGLASNMSGSAQTVAIRNFLHWTKDVVRIDSESANPFGQTTGEESTQVD 780
           AL I  N E +EGLA +M+GSAQTVAIR  L WTKDVVRIDS+S N FG+ TG ESTQVD
Sbjct: 721 ALHIRTNGENVEGLAPSMNGSAQTVAIRTLLDWTKDVVRIDSKSGNTFGEMTGGESTQVD 742

Query: 781 FQPNCNAHNLSCKFDYIGLQIIANGSKRSGILEIVSRGNDSRTDPELALENGSVVTGRVF 837
           FQPNCN HNLSCK                        GNDSRTDP+LAL+NGSVVTGRV 
Sbjct: 781 FQPNCNTHNLSCK------------------------GNDSRTDPDLALDNGSVVTGRVS 742

BLAST of CmoCh05G006050 vs. NCBI nr
Match: gi|659088467|ref|XP_008444996.1| (PREDICTED: poly(A) polymerase beta isoform X2 [Cucumis melo])

HSP 1 Score: 1230.3 bits (3182), Expect = 0.0e+00
Identity = 648/874 (74.14%), Positives = 671/874 (76.77%), Query Frame = 1

Query: 1   MVGSDSSTAFHPAPQPASNYGVTKPISLAGPTDADIQRNIELEKFLIDSGLYESKEEAAK 60
           MVGSD ST       PA+NYGVTKPISLAGPTDADI RNIELEKFL+DSGLYESKEEAAK
Sbjct: 1   MVGSDCSTGLPSVSHPATNYGVTKPISLAGPTDADIHRNIELEKFLVDSGLYESKEEAAK 60

Query: 61  REEVLGRIDQVQTFFTSIGIVLSSQLEQNSVLIGDESEILIVKNWVKQLTLLRGYTEQMV 120
           REEVLGRIDQ                              IVKNWVKQLTLLRGYTEQMV
Sbjct: 61  REEVLGRIDQ------------------------------IVKNWVKQLTLLRGYTEQMV 120

Query: 121 EDANAVIFTFGSYRLGVSIIFDLVFGEKSLLLYATLRNFCWSHGLVHGPGSDIDTLCVGP 180
           EDANAVIFTFGSYRLGV                             HGPGSDIDTLCVGP
Sbjct: 121 EDANAVIFTFGSYRLGV-----------------------------HGPGSDIDTLCVGP 180

Query: 181 SYVNREDFFIILHNILAEMEEVTDLQPVPDAHVPVMRFKFMGISIDLLYASISVRVVPEE 240
           SYVNREDFFIILH+ILAEMEEVTDLQPVPDAHVPVMRFKF+GISIDLLYASIS+ VVPE+
Sbjct: 181 SYVNREDFFIILHDILAEMEEVTDLQPVPDAHVPVMRFKFLGISIDLLYASISLLVVPED 240

Query: 241 LNISHGSVLCNVDEQTVRSLNGCRVADQILRLVPNVEHFRTTLRCLKFWAKRRGVYSNVT 300
           L+ISHGSVL NVDEQTVRSLNGCRVADQILRLVPNV HFRTTLRCLKFWAKRRGVYSNVT
Sbjct: 241 LDISHGSVLYNVDEQTVRSLNGCRVADQILRLVPNVGHFRTTLRCLKFWAKRRGVYSNVT 300

Query: 301 GFLGGV---------CQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360
           GFLGGV         CQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD
Sbjct: 301 GFLGGVNWALLVAQVCQLYPNAIPSMLVSRFFRVYTQWRWPNPVMLCSIEENELGFPVWD 360

Query: 361 PRKNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFRCGNNICE------------ 420
           PR+NPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQF  GNNICE            
Sbjct: 361 PRRNPRDRFHLMPIITPAYPCMNSSYNVSTSTLRVMMEQFCYGNNICEVRFFFIHHILNI 420

Query: 421 ----------------EIDMSKAQWSALFEPYLFFEIYKNYLQVDIIAADADDLLAWKGW 480
                           EID+SKAQWSALFEPYLFFE YKNYLQVDIIAADADDLLAWKGW
Sbjct: 421 SLLLIIFSLTSVSALQEIDLSKAQWSALFEPYLFFETYKNYLQVDIIAADADDLLAWKGW 480

Query: 481 VESRFRQLTLKSWRVFQEPDRFYFSFLQIERDTRGMLQCHPYPIEYSDTSKPCSHCAFFM 540
           VESRFRQLTLK                 IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFM
Sbjct: 481 VESRFRQLTLK-----------------IERDTRGMLQCHPYPIEYSDTSKPCSHCAFFM 540

Query: 541 GLQRKEGLRGQGGQQFDIRGTVDEFRQEINTYAFWKPGMDIHVSHVRRKQLPTFVFPDGH 600
           GLQRKEGLRGQGGQQFDIRGTVDEFRQEIN YAFWKPGMDI+VSHVRRKQLPTFVFPDGH
Sbjct: 541 GLQRKEGLRGQGGQQFDIRGTVDEFRQEINMYAFWKPGMDIYVSHVRRKQLPTFVFPDGH 600

Query: 601 KRAKPVRHEGQQADPVCADMLQDQSGITEKGKKRKSDHEETEMEKKQAFASQPAEESPML 660
           KRAKP RHEGQQAD VCADMLQDQSGITEKGKKRKSDHEE E EKKQAF SQPAE+SPM 
Sbjct: 601 KRAKPSRHEGQQADTVCADMLQDQSGITEKGKKRKSDHEEEEKEKKQAFISQPAEQSPMP 660

Query: 661 ETHGGGSDGKLPSLKSANADCYSEVWPSFEQLDSRMDTDGNGMDIPSLTKETGPTMDQAE 720
           E  GG  DGKLPS K ANADC+ E W SFEQ DSR DT+GNG DI +LTKETG T D+  
Sbjct: 661 EFIGGEPDGKLPSSKFANADCHLEAWSSFEQPDSRTDTNGNGTDISTLTKETGSTGDEVG 720

Query: 721 L-AKVMEGSSSTKEVPDLYQGGLSKSEEALQIEMNQEKIEGLASNMSGSAQTVAIRNFLH 780
           L AK M GSS  K VPDLY+     SEEAL I  N E +EGLA +M+GSAQTVAIR  L 
Sbjct: 721 LLAKEMGGSSLRKGVPDLYKA----SEEALHIRTNGENVEGLAPSMNGSAQTVAIRTLLD 770

Query: 781 WTKDVVRIDSESANPFGQTTGEESTQVDFQPNCNAHNLSCKFDYIGLQIIANGSKRSGIL 837
           WTKDVVRIDS+S N FG+ TG ESTQVDFQPNCN HNLSCK                   
Sbjct: 781 WTKDVVRIDSKSGNTFGEMTGGESTQVDFQPNCNTHNLSCK------------------- 770
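
The identity and positives percentages in the HSP summary lines above are the match counts divided by the alignment length (for example, 648/874 = 74.14%). The following is a minimal Python sketch, not part of the original report, that parses such a summary line and recomputes both percentages. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions; the pattern also accepts the "Postives" spelling that appears in some reports.

```python
import re

# Hypothetical pattern for an HSP summary line such as:
#   Identity = 648/874 (74.14%), Positives = 671/874 (76.77%), Query Frame = 1
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse an HSP summary line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed values: match count divided by alignment length.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 648/874 (74.14%), Positives = 671/874 (76.77%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when collecting hits from many such reports.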

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PAPS4_ARATH  3.9e-226  64.43         Nuclear poly(A) polymerase 4 OS=Arabidopsis thaliana GN=PAPS4 PE=1 SV=1
PAPS2_ARATH  1.6e-224  64.07         Nuclear poly(A) polymerase 2 OS=Arabidopsis thaliana GN=PAPS2 PE=1 SV=2
PAPS1_ARATH  1.4e-175  48.78         Nuclear poly(A) polymerase 1 OS=Arabidopsis thaliana GN=PAPS1 PE=1 SV=1
PAP_DICDI    6.1e-110  38.24         Poly(A) polymerase OS=Dictyostelium discoideum GN=papA PE=3 SV=1
PAPOA_HUMAN  1.6e-102  38.95         Poly(A) polymerase alpha OS=Homo sapiens GN=PAPOLA PE=1 SV=4

Match Name        E-value   Identity (%)  Description
A0A0A0LP30_CUCSA  0.0e+00   75.89         Poly(A) polymerase beta OS=Cucumis sativus GN=Csa_2G373380 PE=4 SV=1
A0A061GKW6_THECC  3.2e-251  56.96         Poly(A) polymerase 1 isoform 2 OS=Theobroma cacao GN=TCM_037229 PE=4 SV=1
A0A061GK78_THECC  3.2e-251  56.96         Poly(A) polymerase 1 isoform 3 OS=Theobroma cacao GN=TCM_037229 PE=4 SV=1
A0A061GIW2_THECC  3.2e-251  56.96         Poly(A) polymerase 1 isoform 5 OS=Theobroma cacao GN=TCM_037229 PE=4 SV=1
A0A061GRQ3_THECC  3.2e-251  56.96         Poly(A) polymerase 1 isoform 4 OS=Theobroma cacao GN=TCM_037229 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT4G32850.8  2.2e-227  64.43         nuclear poly(a) polymerase
AT2G25850.2  9.3e-226  64.07         poly(A) polymerase 2
AT1G17980.1  8.0e-177  48.78         poly(A) polymerase 1
AT3G06560.1  1.2e-63   29.91         poly(A) polymerase 3

Match Name                        E-value  Identity (%)  Description
gi|778671979|ref|XP_011649721.1|  0.0e+00  75.89         PREDICTED: nuclear poly(A) polymerase 4 isoform X2 [Cucumis sativus]
gi|778671968|ref|XP_011649717.1|  0.0e+00  75.89         PREDICTED: nuclear poly(A) polymerase 4 isoform X1 [Cucumis sativus]
gi|778671986|ref|XP_011649723.1|  0.0e+00  75.89         PREDICTED: nuclear poly(A) polymerase 4 isoform X3 [Cucumis sativus]
gi|659088469|ref|XP_008444997.1|  0.0e+00  76.60         PREDICTED: poly(A) polymerase pla1 isoform X3 [Cucumis melo]
gi|659088467|ref|XP_008444996.1|  0.0e+00  74.14         PREDICTED: poly(A) polymerase beta isoform X2 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR007010   PolA_pol_RNA-bd_dom
IPR007012   PolA_pol_cen_dom
IPR011068   NuclTrfase_I-like_C
IPR014492   PolyA_polymerase
Vocabulary: Molecular Function
Term        Definition
GO:0003723  RNA binding
GO:0004652  polynucleotide adenylyltransferase activity
GO:0016779  nucleotidyltransferase activity
Vocabulary: Biological Process
Term        Definition
GO:0043631  RNA polyadenylation
GO:0031123  RNA 3'-end processing
Vocabulary: Cellular Component
Term        Definition
GO:0005634  nucleus
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0031123 RNA 3'-end processing
biological_process GO:0043631 RNA polyadenylation
cellular_component GO:0005634 nucleus
molecular_function GO:0004652 polynucleotide adenylyltransferase activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0016779 nucleotidyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh05G006050.1  CmoCh05G006050.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                                   Source   Source Term         Source Description                                 Alignment
IPR007010  Poly(A) polymerase, RNA-binding domain            GENE3D   G3DSA:3.30.70.590   -                                                  coord: 417..559, score: 2.6
IPR007010  Poly(A) polymerase, RNA-binding domain            PFAM     PF04926             PAP_RNA-bind                                       coord: 417..453, score: 2.1E-7; coord: 493..566, score: 4.
IPR007012  Poly(A) polymerase, central domain                PFAM     PF04928             PAP_central                                        coord: 21..414, score: 1.6
IPR011068  Nucleotidyltransferase, class I, C-terminal-like  unknown  SSF55003            PAP/Archaeal CCA-adding enzyme, C-terminal domain  coord: 417..559, score: 2.35
IPR014492  Poly(A) polymerase                                PANTHER  PTHR10682           POLY A POLYMERASE                                  coord: 101..136, score: 0.0; coord: 1..70, score: 0.0; coord: 166..452, score: 0.0; coord: 470..610, score:
None       No IPR available                                  GENE3D   G3DSA:1.10.1410.10  -                                                  coord: 219..415, score: 1.0
None       No IPR available                                  PANTHER  PTHR10682:SF22      NUCLEAR POLY(A) POLYMERASE                         coord: 1..70, score: 0.0; coord: 470..610, score: 0.0; coord: 166..452, score: 0.0; coord: 101..136, score:
None       No IPR available                                  unknown  SSF81301            Nucleotidyltransferase                             coord: 166..271, score: 8.99
None       No IPR available                                  unknown  SSF81631            PAP/OAS1 substrate-binding domain                  coord: 274..414, score: 2.88

The following gene(s) are paralogous to this gene:

None