CmoCh06G001410 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh06G001410
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Poly(A) polymerase I
Location: Cmo_Chr06: 755710 .. 762548 (-)
RNA-Seq Expression: CmoCh06G001410
Synteny: CmoCh06G001410
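The Location field can be read mechanically. The sketch below is only illustrative: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state. Under that assumption the feature would span 6,839 bp on the minus strand.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Cmo_Chr06: 755710 .. 762548 (-)")
length = span_length(start, end)
print(chrom, strand, length)
```

If the database actually uses a 0-based or half-open convention, `span_length` would need to drop the `+ 1`.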
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACGGCGTGCGGGAGCAACCAAAGCACAGTAGAATTGAGAAGCTTGCGCCTCCATGGCGGAATAACAAAGCGCGCCATGCTACAATACTCCTCTTCTTATGGCGTTTTTTTCTCTGAGACCCAACAACGGCTTCATTTCTCACATCCTTGACCTAATCAAGCTTCAGGTTAATCCCTTTCTCTTGTCGGCTTCGTTTTCGTCTTCGTTCCCTTTCTTTTTCTCGATTCTTTGAACAGTTTCAGCACTCGCTCTGATTTGTTTCCATAACGACGTTGGAAATTTTGTATTTCAGAGGCTCACTCACGCTTTTACTAACGGAGAGCCTATTCGGGCTCCTATTTACTCAGCAATGGATTTTCATTCAATGGGTCGTCAAGGTGTAGTTTTATCCCCCTCGAGTTCGAGTTTGGATTTTGCTTTCGTTACCAATTGAAGGAAGTTACCAATGGAGTAGTATCTTTTCTAATGTGGATTTCTCTGCCATTTATGCTGTTAATATAACTCTATTTCTTGCGTTATGCAATTTTTGGTTGTTAAATGGCTTATGGTACCGTGTTTCATTCATGTAGATGCATCCAACATTGACATGCCCAAATGGAATAAAGTTGATGGACGGGCTTTTGGGATCAGTCGTTCAATGATCCCATCTTCATCATGGATGGTCTTGAAAATTCTTCACAATAAAGGTAGCAAGTTCATTGTTTGTACACCAACAAATCTTTGCAGCCATCACACTGACGTTGTCCATGTCGTTAACTCAAAATTTGATCCTGCTGCAGTTAATATTAATGTGGCTGGGATGAATTATCCTTGTTTTGTTATGGTAGCTAAATATGTTGCTATAGTTTAAAACCTTCTGCTTCATGAACATCCAAGAAACCATTCATATTCTTAATCAGTTATATATTGAGCTTGTAAGGTCTTGACATGTAGTTGGCTGCCTATTCTTCAATTATATCGCATTTTGTTTTATTAATTTGAACTTGATACATTACGTAATGGACTTCATTTCTTTGTTATGCTTAGGGTTTGAGGCCTATTTGGTTGGTGGATGTGTGAGAGACTTGCTCCTAAATAGAGCACCGAAAGACTTTGATGTGATTACCACTGCTGGACTAAGACAGGTCTTTATAAGCTTTGTGTAACTTCTTGTAGTTTTGTTTTTACTTTCCCTTTACTATAAAGAGTGAGATCAATGTATTTAATAATGACATGGTTTGTTGTGTCTTATGCAGATCCACAAACTTTTTCATCATGCTCAAATTGTTGGACGCCGTTTTCCTATTTGTATGGTTAATATCAAAGGCTCTGTCATTGAGGTAGCGAAGCATTTCTATATTCATGGGTCTTCTTTTACATATGAATTTGGTTTTTCTTGATCTAGTAAGTAATGTTCTGAAAGGCATTTTTTTTTTAGTTCAACAATCTATGGGTAGGGATTGGAACAGCTCACCTTGAAGATAGATTTCTTATTTCTTAGTTACTAGATATATGTTCAAGTTAGTAAAGAATTTTGGAGTAACATCGTTGTTCTCACCTAGATTTCCATCTCATAGGACTTCTTTAGGGCTATAAATTAATCTAATTAGCATGTCAGTATTATTTTTATATTGATATATTTCTTTTTTTAAATAGTACTGGGTACAAATTTATAGTAGGTTCCATAATTAATTCAATCAAATGAGGCGCTGTTGGCTTAATGGGTATGACATTTTCTGTTGGAGCCGAGTTCTTTGTGGCACAAGGTTATTGTGAGCAAGTAGAATCTCATCCATTGGAATGTGTCTTGGGTCATGGGTCTGGAGGCACAACCAAGAACTCATGGATCGCGGTTGGTGTAGACCTTCCTTTTTCGCCGAAGGGACTAGTTTTATATAATTATTATGTTTATGCCTGATTTGCATTTTTATGATTTTTCTTCCTCTTTTTCAATTAAAAAAAAAATATATAAATCAAGGTTTACACCTCATACCCCCTTTAGGCTTACTTCTCACTTTTTA
AGCGGGAAAATCTCAAATCCCCAAATAGTCTTTGTAATACATCGCTGGCAAGTAATTTACAATTATTGATCTGCTATGTGCTCCGGTGATTTTCTATCAGGTTTCAAGTTTTGAAACAGTTGCAAAACATTCTAAAGGAAAAGAAACAGTAACGTCTTCTCCAACACCAAGAAAGTGTGATGAAAAGGACTTAATCCGGTGGAGGAACTCTCTGCATAGGGATTTCACAATTAACAGGTCAGTTCTATTTGTGTTTATTGTATAGTCCATTTCTGTACACGTTGTCTATGTTCTTTATGAAATATATCTTTTGCAATGTGAGGTTCACCTTCTTCCCTTGTTAGTAAGTGTTATATCAGATCTTTTACATTATATAATTTTTAAATGTGCTGAGATATTTAGATCTTCAACTTAAGTTTGGCATTGTTGGAAATCTGCTTTCTATCAAAAAACCCTGTATTAACTACTGAATACTTATTTACTAATCATGTTAAATTTGTGTTTGATTCGTGGTTGTCTTGATATTTGTTGAAGCAGTTTATTCTTTGACCCTTTTCTGAACGTAATCTATGACTATGCCGAAGGGATAGCAGACTTAAGGTCCTTGAAGGTGAAGATCTTTCCCCTTTTTAAAAATACGCATTTGTGTCAGTGCCCGATACTTTCTCTTAAGAATAGTCCAGGCCTTTTAACCTTTTTATAACAACCTAATTGGTCATTATGTCTGAATTGCTCATTAATTACTCATGAAATGTTATATGGTATTCATTTAGTTTCTACAAAATGAACTCGTTCAAGCAATTTCCATTTCTTGTATTCCAATTCCTTTTTAACTCTAAATTATGTGGGCAATTGGCATTAGTACTAATAATTGTTCTCTTGGATGTTGATTGCGTCATTGCAAAGAGCATAGAAACATTTTGTTTGATGAAAAATAAACATTTATTTCCTATATATAAATATATATTTTATGCCATTCTTTCTTATTTTTGGGACGGGCCTAATAGATACAGTACAAAGGAAAGTTTGCATATTACCTGTATTCTGTTGTCCATCCCTCTTCCTATATCAAGTATTTTTTTTCCCCATTTGTTGCATGACCTGGACTCAGCTGCGGACGCTAATTCCTGCATCATTGTCATTCAAAGAGGACTGTGGTAGGTTTTACTTTTCCTTGGGTACTCTGTAGGCGTGATAACATTTGGGACTAGTGGCACCTGTGGTGTCCAACAATACTAACTAGGCTACTGCTATTGCATGTTTAATTTTTCACTGTAGCTAGAATTCTGCGTGGCTTAAGAATTGCAGCTCGTTTGGGTTTGTCACTCTCGAAGGATACAGAGACTGCAATGCGTAAACTTTCATCTTCTATCGCAAGCTTGGATAAGGTAACCTTTCGGAAGTTGAACAATACTAATCTCCTATGCTTTACCTCGTGTTCTTCTCCTGCTTTCACTTTTTTTCCGGAGTACACTTCAGCTTTAATAGGAAGGCTTTCCTCTAGTATATGCTCTTAAAGTACAGTACATACTTTCTGAGGTTTTACTCTTTTCTATTTGGTTGTCCTAGTTGTGGTAACTTAGAGTTGGCAGCTACTGTTAGATAGGTTACTTAGATAACATTTTTATATTTTCAGTCCAGGTTGATGATGGAATTTAACTACATGCTATCTTATGGAGCTGCTGTTCCTTCTCTCTATTTGCTCCAGAGGTTCAACCTGCTTGAAATTCTGCTGCCATTTCATGTATGCTTTTCTGATGATTCCTTCACTACGAACGTGATCTTGACCTAATCCTAGTATGTATCTGAGTTGCAGTTATGATTGCAGGCTGCATATCTCGATAAACAGGACATTAAGAAATCTTCTCTCAATTCCACAATGTTGATGGTGAGTGCAGTAATTTGTCACTTCTGGTTTTACCTAACAAGTGTGCTTCTGTACGTTTCTTACTTTTACAATAAATTTGAGAGAATTGTCTATCCCATTCGGGAGTCAAAGA
ATCTGATTTCAGATAATCTGAAATACCGAGGAGGTTCTCTCTGAAATATCCATTCTTAAGCTAAGCTACCATATCCATATCTGACCTTTGGACACTCCAACACTTGATGGACATGCATTAGACACTTATTAATACAATAGACATGTGTTAGTAGTTGTACAAAGTGAATATTAGTCCATCATTTGTTAGGCACGTTTCAAAAACTTGTTAAAGATACTAAATAGACACAATAGATATATACAACAAAAATAATAAATTTGGAGAGAGAAGTATGTCAAACTCATTTTATTTATATAAATTCATAAACTCAGACTTTGAATTTCTCGGTTCTATAAAAATGTGTCCTTATCATGTACACATCTCAGGTTTTTTTTAAAAATAACGTCTCATCGTGTCTTGTGTCATGTTATATTCATGTCTCATATAACATACCTGTATCCGTATTCATGCTTCTTAGATGATGGTTTTTTTTTTCTTTTTCAAGTGAGTCTAAGCTGTCTCCATTAATATAGGATTTAGGCAACAAATTAGAATAGAAAATTATTGAAGACTAATATTCCCTTGCTTTTGTGTAGGCTAATATGTCCTATTTGCTTGCTTTCAAAATTTGTGAATTTGGCTTTCTTATACATATCAGGGTTCTCTAATATGCTCCTCCACTCCTTGAACACGATTTCAATTTTCAACACTACTTGTTCGAAGTCAAACTAGCTAAACATGTCATTTTCTGCAATTCTATGTAGGATTCTTGTATTGATTGCTTCTATTCTTTATTCTATGCAGAAACTGTTCTCCAATTTGGATAAATTGGTTTCATGTGATCGGCCTTCAGACTGCAATATATGGTTAGCTCATTATTTAGTTTTCTTTCACACTTGGTTGTTAGATTTTCCCGTTCACTTTTAAAGTCAGATTTTCTATATCTAATGTCATGCTAGCCATCATATTAATATTTTCTTTTGAATTGTTTCAGTTTTATTTAAGTGGTATTTTGTAACCAATAAATGATGAGCATTCGTCAGCTTTAATAAGTAACTTACGTTTGATATTTCTTCCAAGACTCACTGCGGGTTTAAGTCTAAGAGTTTTGATATTGAAGCTAGGTAAAAATAAAGGAACTGCATGTATGGCTAGATTTTAAGCCACATATCTATCCAGGCCCCAATATTGAATTAAGAAACTTGATAGAACAACCTTGCGGAGATGTTGAAGCTCCTTTTCTTTTCTATAAATGCCAGTTTGTCAATATGGATATCTCATCCATGTTTCCGTCCTAGGTAAATATAGACCATTGGCATATTAGTGAAAAAGTATCTCTGATGAGATTGAAAATCGTGCAGGGTCGCATTGTTGGCGTTTCACATGGCGTTAGTTAACAACCCTCAGAACTCTCTTATAGTTCTGGCTTTTGCTGCCACCTTGTACCATGGAGAGTGGAATGAAGGTGTGAATTATGCAAGAGAAAACTCTCTTCTGCAGATCAATTTAAGACCGGAGATTACAAGATCGGCTCAATTTAAATCAGCGGAAGAGCTTGCCGAAAGGGTTACTCACTTTGCTTTAAAAGTACAGGGTTGCATTGCTGCTTTAACTTCAGCAGACTGTCTCTTAGAAGCCATGTCAACGTTTCCAGCCTCTCCACACTCTAGTTTGGTAAATAATACTTATATATTAAGTTAACAATCATCATTTATATTGCTTACCAATGTCTATATTAATTCTAATGAGAACGTCAACTACGTTTGATCTTTTATCCGTGCCTGTAATAATTAGTGCTTCTTTTTAGGTGTTTGTGTCCAAGAAAGCTGCCAAAGATGTTGCTAAAATTATTGAAGTGCTGGTGAATGATGTTGAATCCTACAAAAATACGAGAAAAAATTTTGAAATTGACTATCAGCTGCTTAAGAAGGGTATTTTGAATGAGAGTAGATTTGTCTTGGGAAAAGTTATCTTGGAAACCCTGAAAGAGGCAATTGTGCAGGAAGATGGAATCATTCTTG
ACGTGAAACAAAATCTTTGTGTTGATGCAACTACTGAGGAAAATTATAGTTCACCCGTCTCTGATTCGGTGAAGGACCAATTGGTGGTTAAAAGAAATAAGAAAGTTCGAAAACTGCTATCTAGTTCTGAAGTTGAGTGGGAAGGGAATAAGAAAAATAAGCTTGATGGGAAGGAGGGAAGTATTTTTGACAGGGTAGTTGAGGATGGAAGGTGTGTTAACATTGCAGAACCCTACAAAAAGGGAGTAGAGGCATCTCAATTACCCCATGCTGGATTAAATTCGATGGAGGATTCATTGTTGGAGTCAAGCAAATGTCATCACTTTGAAGTGAGGGAGAGTGAGGTTATGCAAGAAAATCTTGAAACCATGGACAATCAGGTTAAGAAGATGACCCCATCCCAGGAAACACGTGATAAGGTTACCAAGGAGCTACTTCATGCTGTAGAGGCCAACCCATGGAAGATGGACAAAGTAAATGGGAAAGAAGGAAAGCCAGAGAAGAAAGAGCATGGTTTGCAGCCTCAGGGAAAGGAGAATATTGAAAAGAAACGTAGACATGTAACAGATATCGAGCAGCACAAACGTCCGCTGTCTAGCCTTTTTAAGTAAATTCCTAAAGGGTATGAATCAGAAGCTGTTCTTTGAATTGCTCAAAACCGGCGCACCATCGTGGAGGTGTATGATTCAAGTACCATACTTGTTTCTACCCAAGATTTGATCATTACAGTGCCATACTTCAACACGACCGCCCCTGTTCGTTTCTGTTCAATAGTAGAAGCAGTTCAGATTAGAACTCCCGGTGATCTAGAAATCCAACCGGAGTTTTTCCTCAACTAA

mRNA sequence

ACGGCGTGCGGGAGCAACCAAAGCACAGTAGAATTGAGAAGCTTGCGCCTCCATGGCGGAATAACAAAGCGCGCCATGCTACAATACTCCTCTTCTTATGGCGTTTTTTTCTCTGAGACCCAACAACGGCTTCATTTCTCACATCCTTGACCTAATCAAGCTTCAGAGGCTCACTCACGCTTTTACTAACGGAGAGCCTATTCGGGCTCCTATTTACTCAGCAATGGATTTTCATTCAATGGGTCGTCAAGATGCATCCAACATTGACATGCCCAAATGGAATAAAGTTGATGGACGGGCTTTTGGGATCAGTCGTTCAATGATCCCATCTTCATCATGGATGGTCTTGAAAATTCTTCACAATAAAGGGTTTGAGGCCTATTTGGTTGGTGGATGTGTGAGAGACTTGCTCCTAAATAGAGCACCGAAAGACTTTGATGTGATTACCACTGCTGGACTAAGACAGATCCACAAACTTTTTCATCATGCTCAAATTGTTGGACGCCGTTTTCCTATTTGTATGGTTAATATCAAAGGCTCTGTCATTGAGGTTTCAAGTTTTGAAACAGTTGCAAAACATTCTAAAGGAAAAGAAACAGTAACGTCTTCTCCAACACCAAGAAAGTGTGATGAAAAGGACTTAATCCGGTGGAGGAACTCTCTGCATAGGGATTTCACAATTAACAGTTTATTCTTTGACCCTTTTCTGAACGTAATCTATGACTATGCCGAAGGGATAGCAGACTTAAGGTCCTTGAAGCTGCGGACGCTAATTCCTGCATCATTGTCATTCAAAGAGGACTGTGCTAGAATTCTGCGTGGCTTAAGAATTGCAGCTCGTTTGGGTTTGTCACTCTCGAAGGATACAGAGACTGCAATGCGTAAACTTTCATCTTCTATCGCAAGCTTGGATAAGTCCAGGTTGATGATGGAATTTAACTACATGCTATCTTATGGAGCTGCTGTTCCTTCTCTCTATTTGCTCCAGAGGTTCAACCTGCTTGAAATTCTGCTGCCATTTCATGCTGCATATCTCGATAAACAGGACATTAAGAAATCTTCTCTCAATTCCACAATGTTGATGAAACTGTTCTCCAATTTGGATAAATTGGTTTCATGTGATCGGCCTTCAGACTGCAATATATGGGTCGCATTGTTGGCGTTTCACATGGCGTTAGTTAACAACCCTCAGAACTCTCTTATAGTTCTGGCTTTTGCTGCCACCTTGTACCATGGAGAGTGGAATGAAGGTGTGAATTATGCAAGAGAAAACTCTCTTCTGCAGATCAATTTAAGACCGGAGATTACAAGATCGGCTCAATTTAAATCAGCGGAAGAGCTTGCCGAAAGGGTTACTCACTTTGCTTTAAAAGTACAGGGTTGCATTGCTGCTTTAACTTCAGCAGACTGTCTCTTAGAAGCCATGTCAACGTTTCCAGCCTCTCCACACTCTAGTTTGGTGTTTGTGTCCAAGAAAGCTGCCAAAGATGTTGCTAAAATTATTGAAGTGCTGGTGAATGATGTTGAATCCTACAAAAATACGAGAAAAAATTTTGAAATTGACTATCAGCTGCTTAAGAAGGGTATTTTGAATGAGAGTAGATTTGTCTTGGGAAAAGTTATCTTGGAAACCCTGAAAGAGGCAATTGTGCAGGAAGATGGAATCATTCTTGACGTGAAACAAAATCTTTGTGTTGATGCAACTACTGAGGAAAATTATAGTTCACCCGTCTCTGATTCGGTGAAGGACCAATTGGTGGTTAAAAGAAATAAGAAAGTTCGAAAACTGCTATCTAGTTCTGAAGTTGAGTGGGAAGGGAATAAGAAAAATAAGCTTGATGGGAAGGAGGGAAGTATTTTTGACAGGGTAGTTGAGGATGGAAGGTGTGTTAACATTGCAGAACCCTACAAAAAGGGAGTAGAGGCATCTCAATTACCCCATGCTGGATTAAATTCGATGGAGGATTCATTGTTGGAGTCAAGCAAATGTCATCACTTTG
AAGTGAGGGAGAGTGAGGTTATGCAAGAAAATCTTGAAACCATGGACAATCAGGTTAAGAAGATGACCCCATCCCAGGAAACACGTGATAAGGTTACCAAGGAGCTACTTCATGCTGTAGAGGCCAACCCATGGAAGATGGACAAAGTAAATGGGAAAGAAGGAAAGCCAGAGAAGAAAGAGCATGGTTTGCAGCCTCAGGGAAAGGAGAATATTGAAAAGAAACGTAGACATGTAACAGATATCGAGCAGCACAAACGTCCGCTGTCTAGCCTTTTTAATACCATACTTGTTTCTACCCAAGATTTGATCATTACAGTGCCATACTTCAACACGACCGCCCCTGTTCGTTTCTGTTCAATAGTAGAAGCAGTTCAGATTAGAACTCCCGGTGATCTAGAAATCCAACCGGAGTTTTTCCTCAACTAA

Coding sequence (CDS)

ATGGCGTTTTTTTCTCTGAGACCCAACAACGGCTTCATTTCTCACATCCTTGACCTAATCAAGCTTCAGAGGCTCACTCACGCTTTTACTAACGGAGAGCCTATTCGGGCTCCTATTTACTCAGCAATGGATTTTCATTCAATGGGTCGTCAAGATGCATCCAACATTGACATGCCCAAATGGAATAAAGTTGATGGACGGGCTTTTGGGATCAGTCGTTCAATGATCCCATCTTCATCATGGATGGTCTTGAAAATTCTTCACAATAAAGGGTTTGAGGCCTATTTGGTTGGTGGATGTGTGAGAGACTTGCTCCTAAATAGAGCACCGAAAGACTTTGATGTGATTACCACTGCTGGACTAAGACAGATCCACAAACTTTTTCATCATGCTCAAATTGTTGGACGCCGTTTTCCTATTTGTATGGTTAATATCAAAGGCTCTGTCATTGAGGTTTCAAGTTTTGAAACAGTTGCAAAACATTCTAAAGGAAAAGAAACAGTAACGTCTTCTCCAACACCAAGAAAGTGTGATGAAAAGGACTTAATCCGGTGGAGGAACTCTCTGCATAGGGATTTCACAATTAACAGTTTATTCTTTGACCCTTTTCTGAACGTAATCTATGACTATGCCGAAGGGATAGCAGACTTAAGGTCCTTGAAGCTGCGGACGCTAATTCCTGCATCATTGTCATTCAAAGAGGACTGTGCTAGAATTCTGCGTGGCTTAAGAATTGCAGCTCGTTTGGGTTTGTCACTCTCGAAGGATACAGAGACTGCAATGCGTAAACTTTCATCTTCTATCGCAAGCTTGGATAAGTCCAGGTTGATGATGGAATTTAACTACATGCTATCTTATGGAGCTGCTGTTCCTTCTCTCTATTTGCTCCAGAGGTTCAACCTGCTTGAAATTCTGCTGCCATTTCATGCTGCATATCTCGATAAACAGGACATTAAGAAATCTTCTCTCAATTCCACAATGTTGATGAAACTGTTCTCCAATTTGGATAAATTGGTTTCATGTGATCGGCCTTCAGACTGCAATATATGGGTCGCATTGTTGGCGTTTCACATGGCGTTAGTTAACAACCCTCAGAACTCTCTTATAGTTCTGGCTTTTGCTGCCACCTTGTACCATGGAGAGTGGAATGAAGGTGTGAATTATGCAAGAGAAAACTCTCTTCTGCAGATCAATTTAAGACCGGAGATTACAAGATCGGCTCAATTTAAATCAGCGGAAGAGCTTGCCGAAAGGGTTACTCACTTTGCTTTAAAAGTACAGGGTTGCATTGCTGCTTTAACTTCAGCAGACTGTCTCTTAGAAGCCATGTCAACGTTTCCAGCCTCTCCACACTCTAGTTTGGTGTTTGTGTCCAAGAAAGCTGCCAAAGATGTTGCTAAAATTATTGAAGTGCTGGTGAATGATGTTGAATCCTACAAAAATACGAGAAAAAATTTTGAAATTGACTATCAGCTGCTTAAGAAGGGTATTTTGAATGAGAGTAGATTTGTCTTGGGAAAAGTTATCTTGGAAACCCTGAAAGAGGCAATTGTGCAGGAAGATGGAATCATTCTTGACGTGAAACAAAATCTTTGTGTTGATGCAACTACTGAGGAAAATTATAGTTCACCCGTCTCTGATTCGGTGAAGGACCAATTGGTGGTTAAAAGAAATAAGAAAGTTCGAAAACTGCTATCTAGTTCTGAAGTTGAGTGGGAAGGGAATAAGAAAAATAAGCTTGATGGGAAGGAGGGAAGTATTTTTGACAGGGTAGTTGAGGATGGAAGGTGTGTTAACATTGCAGAACCCTACAAAAAGGGAGTAGAGGCATCTCAATTACCCCATGCTGGATTAAATTCGATGGAGGATTCATTGTTGGAGTCAAGCAAATGTCATCACTTTGAAGTGAGGGAGAGTGAGGTTATGCAAGAAAATCTTGAAACCATGGACAATCAGGTTAAGAAGATGACCCCATCCCAGGAAACACGTGATAAGGTTAC
CAAGGAGCTACTTCATGCTGTAGAGGCCAACCCATGGAAGATGGACAAAGTAAATGGGAAAGAAGGAAAGCCAGAGAAGAAAGAGCATGGTTTGCAGCCTCAGGGAAAGGAGAATATTGAAAAGAAACGTAGACATGTAACAGATATCGAGCAGCACAAACGTCCGCTGTCTAGCCTTTTTAATACCATACTTGTTTCTACCCAAGATTTGATCATTACAGTGCCATACTTCAACACGACCGCCCCTGTTCGTTTCTGTTCAATAGTAGAAGCAGTTCAGATTAGAACTCCCGGTGATCTAGAAATCCAACCGGAGTTTTTCCTCAACTAA

Protein sequence

MAFFSLRPNNGFISHILDLIKLQRLTHAFTNGEPIRAPIYSAMDFHSMGRQDASNIDMPKWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAGLRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEKDLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAERVTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVESYKNTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQEDGIILDVKQNLCVDATTEENYSSPVSDSVKDQLVVKRNKKVRKLLSSSEVEWEGNKKNKLDGKEGSIFDRVVEDGRCVNIAEPYKKGVEASQLPHAGLNSMEDSLLESSKCHHFEVRESEVMQENLETMDNQVKKMTPSQETRDKVTKELLHAVEANPWKMDKVNGKEGKPEKKEHGLQPQGKENIEKKRRHVTDIEQHKRPLSSLFNTILVSTQDLIITVPYFNTTAPVRFCSIVEAVQIRTPGDLEIQPEFFLN
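The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the function name `translate` is hypothetical and not part of any database tooling. The example input is the first 33 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons of the CDS in this record.
print(translate("ATGGCGTTTTTTTCTCTGAGACCCAACAACGGC"))
```

Applied to the full CDS (with the line breaks removed), the same routine would be expected to reproduce the protein sequence, ending just before the terminal TAA codon.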
Homology
BLAST of CmoCh06G001410 vs. ExPASy Swiss-Prot
Match: P44439 (Poly(A) polymerase I OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=pcnB PE=3 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 7.3e-35
Identity = 96/287 (33.45%), Positives = 159/287 (55.40%), Query Frame = 0

Query: 62  NKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAGL 121
           N +    F IS      ++  V++ L  +GFEAY+VGGC+RDLLL + PKDFDV T A  
Sbjct: 18  NVIKAAQFDISPRDFSRNALNVVEKLQRQGFEAYIVGGCIRDLLLGKKPKDFDVATNARP 77

Query: 122 RQIHKLF-HHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK 181
            QI  +F    ++VGRRF +  +     +IEV++F   A HS  +    +    ++ +E 
Sbjct: 78  EQIQNIFQRQCRLVGRRFRLAHIMFGRDIIEVATFR--ANHSDAR----NENQAKQSNEG 137

Query: 182 DLIR-------WRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFK 241
            L+R        +++  RDFT+N+L+++P  N + DY EGI DL++ KLR +      ++
Sbjct: 138 MLLRDNVYGTIEQDAARRDFTVNALYYNPQDNTLRDYFEGIKDLKAGKLRLIGDPVTRYQ 197

Query: 242 EDCARILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSL 301
           ED  R+LR +R  A+L + L K +E  +R+L+  + ++  +RL  E   +L  G  V + 
Sbjct: 198 EDPVRMLRSIRFMAKLDMFLEKPSEQPIRELAPLLKNIPPARLFDESLKLLQAGQGVKTY 257

Query: 302 YLLQRFNLLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVS 341
            LL+++ L E L P  +AY  +   K+ S    M++   ++ D+ V+
Sbjct: 258 RLLRQYGLFEQLFPALSAYFTE---KEDSFAERMIVTALTSTDERVA 295
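The percentages on each statistics line are simply the match count divided by the alignment length. The sketch below recomputes them from a line in the format used in this record; the helper name `hsp_stats` is hypothetical, and the pattern accepts both the "Positives" and "Postives" spellings so that it also works on unedited output.

```python
import re

# Matches e.g. "Identity = 96/287 (33.45%)"; tolerates the "Postives" spelling.
STAT = re.compile(r"(Identity|Positives|Postives) = (\d+)/(\d+) \(([\d.]+)%\)")

def hsp_stats(line):
    """Return {name: (count, length, reported_pct, recomputed_pct)} for one line."""
    stats = {}
    for name, count, length, reported in STAT.findall(line):
        count, length = int(count), int(length)
        recomputed = round(100 * count / length, 2)
        key = "Positives" if name.startswith("Pos") else name
        stats[key] = (count, length, float(reported), recomputed)
    return stats

line = "Identity = 96/287 (33.45%), Positives = 159/287 (55.40%), Query Frame = 0"
for name, (count, length, reported, recomputed) in hsp_stats(line).items():
    print(name, f"{count}/{length}", reported, recomputed)
```

For the first hit, 96 identical positions over a 287-column alignment gives 33.45%, and 159 conservative-or-identical positions gives 55.40%, in agreement with the reported values.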

BLAST of CmoCh06G001410 vs. ExPASy Swiss-Prot
Match: P0ABF3 (Poly(A) polymerase I OS=Escherichia coli O157:H7 OX=83334 GN=pcnB PE=3 SV=2)

HSP 1 Score: 131.7 bits (330), Expect = 3.5e-29
Identity = 86/261 (32.95%), Positives = 137/261 (52.49%), Query Frame = 0

Query: 59  PKWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITT 118
           P+   +      ISR  I  ++  V+  L+  G+EA+LVGG VRDLLL + PKDFDV T 
Sbjct: 27  PQVTVIPREQHAISRKDISENALKVMYRLNKAGYEAWLVGGGVRDLLLGKKPKDFDVTTN 86

Query: 119 AGLRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCD 178
           A   Q+ KLF + ++VGRRF +  V     +IEV++F     H +G   V+   T ++  
Sbjct: 87  ATPEQVRKLFRNCRLVGRRFRLAHVMFGPEIIEVATFR---GHHEG--NVSDRTTSQRGQ 146

Query: 179 EKDLIR-------WRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLS 238
              L+R         ++  RDFTINSL++      + DY  G+ DL+   +R +      
Sbjct: 147 NGMLLRDNIFGSIEEDAQRRDFTINSLYYSVADFTVRDYVGGMKDLKDGVIRLIGNPETR 206

Query: 239 FKEDCARILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVP 298
           ++ED  R+LR +R AA+LG+ +S +T   + +L++ +  +  +RL  E   +L  G    
Sbjct: 207 YREDPVRMLRAVRFAAKLGMRISPETAEPIPRLATLLNDIPPARLFEESLKLLQAGYGYE 266

Query: 299 SLYLLQRFNLLEILLPFHAAY 313
           +  LL  ++L + L P    Y
Sbjct: 267 TYKLLCEYHLFQPLFPTITRY 282

BLAST of CmoCh06G001410 vs. ExPASy Swiss-Prot
Match: P0ABF2 (Poly(A) polymerase I OS=Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UPEC) OX=199310 GN=pcnB PE=3 SV=2)

HSP 1 Score: 131.7 bits (330), Expect = 3.5e-29
Identity = 86/261 (32.95%), Positives = 137/261 (52.49%), Query Frame = 0

Query: 59  PKWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITT 118
           P+   +      ISR  I  ++  V+  L+  G+EA+LVGG VRDLLL + PKDFDV T 
Sbjct: 27  PQVTVIPREQHAISRKDISENALKVMYRLNKAGYEAWLVGGGVRDLLLGKKPKDFDVTTN 86

Query: 119 AGLRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCD 178
           A   Q+ KLF + ++VGRRF +  V     +IEV++F     H +G   V+   T ++  
Sbjct: 87  ATPEQVRKLFRNCRLVGRRFRLAHVMFGPEIIEVATFR---GHHEG--NVSDRTTSQRGQ 146

Query: 179 EKDLIR-------WRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLS 238
              L+R         ++  RDFTINSL++      + DY  G+ DL+   +R +      
Sbjct: 147 NGMLLRDNIFGSIEEDAQRRDFTINSLYYSVADFTVRDYVGGMKDLKDGVIRLIGNPETR 206

Query: 239 FKEDCARILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVP 298
           ++ED  R+LR +R AA+LG+ +S +T   + +L++ +  +  +RL  E   +L  G    
Sbjct: 207 YREDPVRMLRAVRFAAKLGMRISPETAEPIPRLATLLNDIPPARLFEESLKLLQAGYGYE 266

Query: 299 SLYLLQRFNLLEILLPFHAAY 313
           +  LL  ++L + L P    Y
Sbjct: 267 TYKLLCEYHLFQPLFPTITRY 282

BLAST of CmoCh06G001410 vs. ExPASy Swiss-Prot
Match: P0ABF1 (Poly(A) polymerase I OS=Escherichia coli (strain K12) OX=83333 GN=pcnB PE=1 SV=2)

HSP 1 Score: 131.7 bits (330), Expect = 3.5e-29
Identity = 86/261 (32.95%), Positives = 137/261 (52.49%), Query Frame = 0

Query: 59  PKWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITT 118
           P+   +      ISR  I  ++  V+  L+  G+EA+LVGG VRDLLL + PKDFDV T 
Sbjct: 27  PQVTVIPREQHAISRKDISENALKVMYRLNKAGYEAWLVGGGVRDLLLGKKPKDFDVTTN 86

Query: 119 AGLRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCD 178
           A   Q+ KLF + ++VGRRF +  V     +IEV++F     H +G   V+   T ++  
Sbjct: 87  ATPEQVRKLFRNCRLVGRRFRLAHVMFGPEIIEVATFR---GHHEG--NVSDRTTSQRGQ 146

Query: 179 EKDLIR-------WRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLS 238
              L+R         ++  RDFTINSL++      + DY  G+ DL+   +R +      
Sbjct: 147 NGMLLRDNIFGSIEEDAQRRDFTINSLYYSVADFTVRDYVGGMKDLKDGVIRLIGNPETR 206

Query: 239 FKEDCARILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVP 298
           ++ED  R+LR +R AA+LG+ +S +T   + +L++ +  +  +RL  E   +L  G    
Sbjct: 207 YREDPVRMLRAVRFAAKLGMRISPETAEPIPRLATLLNDIPPARLFEESLKLLQAGYGYE 266

Query: 299 SLYLLQRFNLLEILLPFHAAY 313
           +  LL  ++L + L P    Y
Sbjct: 267 TYKLLCEYHLFQPLFPTITRY 282

BLAST of CmoCh06G001410 vs. ExPASy Swiss-Prot
Match: Q8Z9C3 (Poly(A) polymerase I OS=Salmonella typhi OX=90370 GN=pcnB PE=3 SV=2)

HSP 1 Score: 130.2 bits (326), Expect = 1.0e-28
Identity = 88/276 (31.88%), Positives = 143/276 (51.81%), Query Frame = 0

Query: 71  ISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAGLRQIHKLFHH 130
           ISR  I  ++  VL  L+  G+EAYLVGG VRDLLL + PKDFDV T A   Q+ KLF +
Sbjct: 39  ISRKDISENALKVLYRLNKAGYEAYLVGGGVRDLLLGKKPKDFDVTTNATPDQVRKLFRN 98

Query: 131 AQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEKDLIR------ 190
            ++VGRRF +  V     +IEV++F     H++G E  +   T ++     L+R      
Sbjct: 99  CRLVGRRFRLAHVMFGPEIIEVATFR---GHNEGSE--SDRTTSQRGQNGMLLRDNIFGS 158

Query: 191 -WRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARILRGL 250
              ++  RDFTINSL++      + DY  G+ DL+   +R +      ++ED  R+LR +
Sbjct: 159 IEEDAQRRDFTINSLYYSVADFTVRDYVGGMQDLQEGVIRLIGNPETRYREDPVRMLRAV 218

Query: 251 RIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFNLLE 310
           R AA+L + +S +T   + +L++ +  +  +RL  E   +L  G    +   L+ ++L +
Sbjct: 219 RFAAKLNMRISPETAEPIPRLATLLNDIPPARLFEESLKLLQAGNGFETYQQLREYHLFQ 278

Query: 311 ILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLV 340
            L P    Y  +      S    ++ ++  N D  +
Sbjct: 279 PLFPTITRYFTENG---DSAMERIIAQVLKNTDNRI 306

BLAST of CmoCh06G001410 vs. ExPASy TrEMBL
Match: A0A6J1G6Y5 (uncharacterized protein LOC111451426 OS=Cucurbita moschata OX=3662 GN=LOC111451426 PE=3 SV=1)

HSP 1 Score: 1418.3 bits (3670), Expect = 0.0e+00
Identity = 727/727 (100.00%), Positives = 727/727 (100.00%), Query Frame = 0

Query: 1   MAFFSLRPNNGFISHILDLIKLQRLTHAFTNGEPIRAPIYSAMDFHSMGRQDASNIDMPK 60
           MAFFSLRPNNGFISHILDLIKLQRLTHAFTNGEPIRAPIYSAMDFHSMGRQDASNIDMPK
Sbjct: 1   MAFFSLRPNNGFISHILDLIKLQRLTHAFTNGEPIRAPIYSAMDFHSMGRQDASNIDMPK 60

Query: 61  WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAG 120
           WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAG
Sbjct: 61  WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAG 120

Query: 121 LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK 180
           LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK
Sbjct: 121 LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK 180

Query: 181 DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL 240
           DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL
Sbjct: 181 DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL 240

Query: 241 RGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN 300
           RGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN
Sbjct: 241 RGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN 300

Query: 301 LLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL 360
           LLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL
Sbjct: 301 LLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL 360

Query: 361 VNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAERVT 420
           VNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAERVT
Sbjct: 361 VNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAERVT 420

Query: 421 HFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVESYK 480
           HFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVESYK
Sbjct: 421 HFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVESYK 480

Query: 481 NTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQEDGIILDVKQNLCVDATTEEN 540
           NTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQEDGIILDVKQNLCVDATTEEN
Sbjct: 481 NTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQEDGIILDVKQNLCVDATTEEN 540

Query: 541 YSSPVSDSVKDQLVVKRNKKVRKLLSSSEVEWEGNKKNKLDGKEGSIFDRVVEDGRCVNI 600
           YSSPVSDSVKDQLVVKRNKKVRKLLSSSEVEWEGNKKNKLDGKEGSIFDRVVEDGRCVNI
Sbjct: 541 YSSPVSDSVKDQLVVKRNKKVRKLLSSSEVEWEGNKKNKLDGKEGSIFDRVVEDGRCVNI 600

Query: 601 AEPYKKGVEASQLPHAGLNSMEDSLLESSKCHHFEVRESEVMQENLETMDNQVKKMTPSQ 660
           AEPYKKGVEASQLPHAGLNSMEDSLLESSKCHHFEVRESEVMQENLETMDNQVKKMTPSQ
Sbjct: 601 AEPYKKGVEASQLPHAGLNSMEDSLLESSKCHHFEVRESEVMQENLETMDNQVKKMTPSQ 660

Query: 661 ETRDKVTKELLHAVEANPWKMDKVNGKEGKPEKKEHGLQPQGKENIEKKRRHVTDIEQHK 720
           ETRDKVTKELLHAVEANPWKMDKVNGKEGKPEKKEHGLQPQGKENIEKKRRHVTDIEQHK
Sbjct: 661 ETRDKVTKELLHAVEANPWKMDKVNGKEGKPEKKEHGLQPQGKENIEKKRRHVTDIEQHK 720

Query: 721 RPLSSLF 728
           RPLSSLF
Sbjct: 721 RPLSSLF 727

BLAST of CmoCh06G001410 vs. ExPASy TrEMBL
Match: A0A6J1I3F4 (uncharacterized protein LOC111470184 OS=Cucurbita maxima OX=3661 GN=LOC111470184 PE=3 SV=1)

HSP 1 Score: 1379.4 bits (3569), Expect = 0.0e+00
Identity = 710/727 (97.66%), Positives = 717/727 (98.62%), Query Frame = 0

Query: 1   MAFFSLRPNNGFISHILDLIKLQRLTHAFTNGEPIRAPIYSAMDFHSMGRQDASNIDMPK 60
           MAFFSLRPNNGFISHILDLIKLQRLTHAFTNGE IRAPI+SAMDFHS+GRQDASNIDMPK
Sbjct: 1   MAFFSLRPNNGFISHILDLIKLQRLTHAFTNGELIRAPIFSAMDFHSVGRQDASNIDMPK 60

Query: 61  WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAG 120
           WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFE YLVGGCVRDLLLNRAPKDFDVITTAG
Sbjct: 61  WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFETYLVGGCVRDLLLNRAPKDFDVITTAG 120

Query: 121 LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK 180
           LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK
Sbjct: 121 LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK 180

Query: 181 DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL 240
           DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL
Sbjct: 181 DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL 240

Query: 241 RGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN 300
           RGLRIAARLGLSLSKDTETAM KLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN
Sbjct: 241 RGLRIAARLGLSLSKDTETAMHKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN 300

Query: 301 LLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL 360
           LLEILLPFHAAYL+KQDIKKS LNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL
Sbjct: 301 LLEILLPFHAAYLNKQDIKKSPLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL 360

Query: 361 VNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAERVT 420
           VNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAERVT
Sbjct: 361 VNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAERVT 420

Query: 421 HFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVESYK 480
           HFALKVQGCIAALTSADCLLEAMSTFPASP+SSLVFVSKKAAKDVAKIIEVLVNDVESYK
Sbjct: 421 HFALKVQGCIAALTSADCLLEAMSTFPASPYSSLVFVSKKAAKDVAKIIEVLVNDVESYK 480

Query: 481 NTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQEDGIILDVKQNLCVDATTEEN 540
           NTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQ DGIILDVKQNLCVDATTEEN
Sbjct: 481 NTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQGDGIILDVKQNLCVDATTEEN 540

Query: 541 YSSPVSDSVKDQLVVKRNKKVRKLLSSSEVEWEGNKKNKLDGKEGSIFDRVVEDGRCVNI 600
           YSSPVSDSVKDQL+VKRNKKVRKLLSSSEV+WEGNKKNKLDGKEG+IFDRVVEDGRCVNI
Sbjct: 541 YSSPVSDSVKDQLMVKRNKKVRKLLSSSEVKWEGNKKNKLDGKEGNIFDRVVEDGRCVNI 600

Query: 601 AEPYKKGVEASQLPHAGLNSMEDSLLESSKCHHFEVRESEVMQENLETMDNQVKKMTPSQ 660
           AEPYKKGVEASQLPHAGLNSME SLLESSKCHHFEVRESE  QENLETMDNQVKKMTPSQ
Sbjct: 601 AEPYKKGVEASQLPHAGLNSMEHSLLESSKCHHFEVRESENRQENLETMDNQVKKMTPSQ 660

Query: 661 ETRDKVTKELLHAVEANPWKMDKVNGKEGKPEKKEHGLQPQGKENIEKKRRHVTDIEQHK 720
           ETRDKVTKELLHAVEANP KMDKVNGKEGKPEKKEHGL PQGKENIEKKRRHVTDIEQHK
Sbjct: 661 ETRDKVTKELLHAVEANPRKMDKVNGKEGKPEKKEHGLLPQGKENIEKKRRHVTDIEQHK 720

Query: 721 RPLSSLF 728
           RPLSSLF
Sbjct: 721 RPLSSLF 727

BLAST of CmoCh06G001410 vs. ExPASy TrEMBL
Match: A0A6J1CVT5 (uncharacterized protein LOC111014843 isoform X4 OS=Momordica charantia OX=3673 GN=LOC111014843 PE=3 SV=1)

HSP 1 Score: 989.6 bits (2557), Expect = 7.6e-285
Identity = 539/736 (73.23%), Positives = 588/736 (79.89%), Query Frame = 0

Query: 1   MAFFSLRPN-NGFISHILDLIKLQRLTHAFTNGEPIRAPIYSAMDFHSMGRQD-ASNIDM 60
           MAFFSLR N NGF+  + DLIKLQ L H F NG   R+P+   MDFHS GR+D A+NI M
Sbjct: 1   MAFFSLRSNYNGFLHRLNDLIKLQSLRHGFANGGVFRSPMNPEMDFHSAGRRDEAANIGM 60

Query: 61  PKWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITT 120
            KWNKVD RAFGI RSMIP SSWMVL+IL  KGFE YLVGGCVRDL+LNR PKDFDVITT
Sbjct: 61  SKWNKVDARAFGIKRSMIPPSSWMVLQILQKKGFETYLVGGCVRDLILNRVPKDFDVITT 120

Query: 121 AGLRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCD 180
           AGL+QI KLFH AQIVGRRFPICMVNIKGSVIEVSSFETVAKHS+GK TV SS  PRKC 
Sbjct: 121 AGLQQIRKLFHRAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSEGKGTVVSSQIPRKCV 180

Query: 181 EKDLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCAR 240
           ++DLIRWRNS+HRDFTINSLFFDPF N+IYDYAEGI DLRSLKLRTLIPASLSFKEDCAR
Sbjct: 181 KEDLIRWRNSMHRDFTINSLFFDPFRNMIYDYAEGITDLRSLKLRTLIPASLSFKEDCAR 240

Query: 241 ILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQR 300
           ILRGLRIAARLGLSLSKDTETA+RKLS SI SLDK+R+MME NYMLSYGAAVPSLYLLQR
Sbjct: 241 ILRGLRIAARLGLSLSKDTETAIRKLSPSIMSLDKTRIMMELNYMLSYGAAVPSLYLLQR 300

Query: 301 FNLLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHM 360
           FNLL+ILLPFHAAYLDKQDIK+SSLNS MLMKLFSNLDKLVSCDRPSDCNIWV LLAFHM
Sbjct: 301 FNLLQILLPFHAAYLDKQDIKESSLNSIMLMKLFSNLDKLVSCDRPSDCNIWVGLLAFHM 360

Query: 361 ALVNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAER 420
           ALV NPQNSLIVLAFA TLYHG+WNEGVNYARENSL+QINLRPEITRSAQFKS EELAE 
Sbjct: 361 ALVKNPQNSLIVLAFAGTLYHGDWNEGVNYARENSLVQINLRPEITRSAQFKSKEELAEG 420

Query: 421 VTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVES 480
           V+HFA KVQGCIAA T ADCL EA  T P SP S+LVFVSKK AKDVAKI EVLVNDVES
Sbjct: 421 VSHFASKVQGCIAAFTGADCLFEATPTLPTSPCSNLVFVSKKTAKDVAKIFEVLVNDVES 480

Query: 481 YKNTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQEDGIILDVKQNLCVDATTE 540
           +KN R+NFEIDYQLL KGIL+ESR+V+GK+I ETL  AIVQ D  ILD KQNLCVD TT+
Sbjct: 481 FKNKRENFEIDYQLLGKGILSESRYVMGKIIFETLNGAIVQGDENILDKKQNLCVDTTTK 540

Query: 541 ENYSSPVSDSVKDQLVVKRNKKVRKLLSSSEVEWEGNKKNKLDGKEGSIFDRVVEDGRCV 600
           ENY+SPVSD VKDQLVV +  KV+KL S+SEV    NKK KL  KEG   +   ED   +
Sbjct: 541 ENYNSPVSDIVKDQLVVMKEMKVKKLPSTSEVRLGANKKRKLVQKEGRQTEHKCEDLEIM 600

Query: 601 NIAEPYKKGVEASQLPHAGLNS---MEDSLLESSKCHHFEVRE--SEVMQENLETMDNQV 660
            + +      E S+     +       +  L ++K H    +E   E  Q +LET  NQV
Sbjct: 601 GMMDEVIGQEEKSEKRERKIKKSPPSSEGKLRANKKHKLVKKEGREENNQTDLETAGNQV 660

Query: 661 KKMTPSQETRDKVTKELLHAVEANPWKMD--KVNGKEGKPEKKEHGLQPQGKENIEKKRR 720
           K M   QE  DKVTKELLHAV+ NP  M+  +V G+EGK EKKE  L  QGKEN  KK R
Sbjct: 661 KDMILPQEAHDKVTKELLHAVDVNPRNMNGVEVIGQEGKSEKKERYLLSQGKENTNKKHR 720

Query: 721 HVTDIEQHKRPLSSLF 728
           HVT   Q K PLSSLF
Sbjct: 721 HVTGTAQPKGPLSSLF 736

BLAST of CmoCh06G001410 vs. ExPASy TrEMBL
Match: A0A6J1CWE8 (uncharacterized protein LOC111014843 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111014843 PE=3 SV=1)

HSP 1 Score: 984.6 bits (2544), Expect = 2.4e-283
Identity = 539/738 (73.04%), Positives = 588/738 (79.67%), Query Frame = 0

Query: 1   MAFFSLRPN-NGFISHILDLIKLQRLTHAFTNGEPIRAPIYSAMDFHSMGRQD-ASNIDM 60
           MAFFSLR N NGF+  + DLIKLQ L H F NG   R+P+   MDFHS GR+D A+NI M
Sbjct: 1   MAFFSLRSNYNGFLHRLNDLIKLQSLRHGFANGGVFRSPMNPEMDFHSAGRRDEAANIGM 60

Query: 61  PKWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITT 120
            KWNKVD RAFGI RSMIP SSWMVL+IL  KGFE YLVGGCVRDL+LNR PKDFDVITT
Sbjct: 61  SKWNKVDARAFGIKRSMIPPSSWMVLQILQKKGFETYLVGGCVRDLILNRVPKDFDVITT 120

Query: 121 AGLRQ--IHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRK 180
           AGL+Q  I KLFH AQIVGRRFPICMVNIKGSVIEVSSFETVAKHS+GK TV SS  PRK
Sbjct: 121 AGLQQVFIRKLFHRAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSEGKGTVVSSQIPRK 180

Query: 181 CDEKDLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDC 240
           C ++DLIRWRNS+HRDFTINSLFFDPF N+IYDYAEGI DLRSLKLRTLIPASLSFKEDC
Sbjct: 181 CVKEDLIRWRNSMHRDFTINSLFFDPFRNMIYDYAEGITDLRSLKLRTLIPASLSFKEDC 240

Query: 241 ARILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLL 300
           ARILRGLRIAARLGLSLSKDTETA+RKLS SI SLDK+R+MME NYMLSYGAAVPSLYLL
Sbjct: 241 ARILRGLRIAARLGLSLSKDTETAIRKLSPSIMSLDKTRIMMELNYMLSYGAAVPSLYLL 300

Query: 301 QRFNLLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAF 360
           QRFNLL+ILLPFHAAYLDKQDIK+SSLNS MLMKLFSNLDKLVSCDRPSDCNIWV LLAF
Sbjct: 301 QRFNLLQILLPFHAAYLDKQDIKESSLNSIMLMKLFSNLDKLVSCDRPSDCNIWVGLLAF 360

Query: 361 HMALVNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELA 420
           HMALV NPQNSLIVLAFA TLYHG+WNEGVNYARENSL+QINLRPEITRSAQFKS EELA
Sbjct: 361 HMALVKNPQNSLIVLAFAGTLYHGDWNEGVNYARENSLVQINLRPEITRSAQFKSKEELA 420

Query: 421 ERVTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDV 480
           E V+HFA KVQGCIAA T ADCL EA  T P SP S+LVFVSKK AKDVAKI EVLVNDV
Sbjct: 421 EGVSHFASKVQGCIAAFTGADCLFEATPTLPTSPCSNLVFVSKKTAKDVAKIFEVLVNDV 480

Query: 481 ESYKNTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQEDGIILDVKQNLCVDAT 540
           ES+KN R+NFEIDYQLL KGIL+ESR+V+GK+I ETL  AIVQ D  ILD KQNLCVD T
Sbjct: 481 ESFKNKRENFEIDYQLLGKGILSESRYVMGKIIFETLNGAIVQGDENILDKKQNLCVDTT 540

Query: 541 TEENYSSPVSDSVKDQLVVKRNKKVRKLLSSSEVEWEGNKKNKLDGKEGSIFDRVVEDGR 600
           T+ENY+SPVSD VKDQLVV +  KV+KL S+SEV    NKK KL  KEG   +   ED  
Sbjct: 541 TKENYNSPVSDIVKDQLVVMKEMKVKKLPSTSEVRLGANKKRKLVQKEGRQTEHKCEDLE 600

Query: 601 CVNIAEPYKKGVEASQLPHAGLNS---MEDSLLESSKCHHFEVRE--SEVMQENLETMDN 660
            + + +      E S+     +       +  L ++K H    +E   E  Q +LET  N
Sbjct: 601 IMGMMDEVIGQEEKSEKRERKIKKSPPSSEGKLRANKKHKLVKKEGREENNQTDLETAGN 660

Query: 661 QVKKMTPSQETRDKVTKELLHAVEANPWKMD--KVNGKEGKPEKKEHGLQPQGKENIEKK 720
           QVK M   QE  DKVTKELLHAV+ NP  M+  +V G+EGK EKKE  L  QGKEN  KK
Sbjct: 661 QVKDMILPQEAHDKVTKELLHAVDVNPRNMNGVEVIGQEGKSEKKERYLLSQGKENTNKK 720

Query: 721 RRHVTDIEQHKRPLSSLF 728
            RHVT   Q K PLSSLF
Sbjct: 721 HRHVTGTAQPKGPLSSLF 738

BLAST of CmoCh06G001410 vs. ExPASy TrEMBL
Match: A0A6J1CW57 (uncharacterized protein LOC111014843 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111014843 PE=3 SV=1)

HSP 1 Score: 981.1 bits (2535), Expect = 2.7e-282
Identity = 539/747 (72.16%), Positives = 588/747 (78.71%), Query Frame = 0

Query: 1   MAFFSLRPN-NGFISHILDLIKLQRLTHAFTNGEPIRAPIYSAMDFHSMGRQD-ASNIDM 60
           MAFFSLR N NGF+  + DLIKLQ L H F NG   R+P+   MDFHS GR+D A+NI M
Sbjct: 1   MAFFSLRSNYNGFLHRLNDLIKLQSLRHGFANGGVFRSPMNPEMDFHSAGRRDEAANIGM 60

Query: 61  PKWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITT 120
            KWNKVD RAFGI RSMIP SSWMVL+IL  KGFE YLVGGCVRDL+LNR PKDFDVITT
Sbjct: 61  SKWNKVDARAFGIKRSMIPPSSWMVLQILQKKGFETYLVGGCVRDLILNRVPKDFDVITT 120

Query: 121 AGLRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCD 180
           AGL+QI KLFH AQIVGRRFPICMVNIKGSVIEVSSFETVAKHS+GK TV SS  PRKC 
Sbjct: 121 AGLQQIRKLFHRAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSEGKGTVVSSQIPRKCV 180

Query: 181 EKDLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLK-----------LRTLIP 240
           ++DLIRWRNS+HRDFTINSLFFDPF N+IYDYAEGI DLRSLK           LRTLIP
Sbjct: 181 KEDLIRWRNSMHRDFTINSLFFDPFRNMIYDYAEGITDLRSLKFPPFCWMAWTQLRTLIP 240

Query: 241 ASLSFKEDCARILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYG 300
           ASLSFKEDCARILRGLRIAARLGLSLSKDTETA+RKLS SI SLDK+R+MME NYMLSYG
Sbjct: 241 ASLSFKEDCARILRGLRIAARLGLSLSKDTETAIRKLSPSIMSLDKTRIMMELNYMLSYG 300

Query: 301 AAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDC 360
           AAVPSLYLLQRFNLL+ILLPFHAAYLDKQDIK+SSLNS MLMKLFSNLDKLVSCDRPSDC
Sbjct: 301 AAVPSLYLLQRFNLLQILLPFHAAYLDKQDIKESSLNSIMLMKLFSNLDKLVSCDRPSDC 360

Query: 361 NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGVNYARENSLLQINLRPEITRSA 420
           NIWV LLAFHMALV NPQNSLIVLAFA TLYHG+WNEGVNYARENSL+QINLRPEITRSA
Sbjct: 361 NIWVGLLAFHMALVKNPQNSLIVLAFAGTLYHGDWNEGVNYARENSLVQINLRPEITRSA 420

Query: 421 QFKSAEELAERVTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAK 480
           QFKS EELAE V+HFA KVQGCIAA T ADCL EA  T P SP S+LVFVSKK AKDVAK
Sbjct: 421 QFKSKEELAEGVSHFASKVQGCIAAFTGADCLFEATPTLPTSPCSNLVFVSKKTAKDVAK 480

Query: 481 IIEVLVNDVESYKNTRKNFEIDYQLLKKGILNESRFVLGKVILETLKEAIVQEDGIILDV 540
           I EVLVNDVES+KN R+NFEIDYQLL KGIL+ESR+V+GK+I ETL  AIVQ D  ILD 
Sbjct: 481 IFEVLVNDVESFKNKRENFEIDYQLLGKGILSESRYVMGKIIFETLNGAIVQGDENILDK 540

Query: 541 KQNLCVDATTEENYSSPVSDSVKDQLVVKRNKKVRKLLSSSEVEWEGNKKNKLDGKEGSI 600
           KQNLCVD TT+ENY+SPVSD VKDQLVV +  KV+KL S+SEV    NKK KL  KEG  
Sbjct: 541 KQNLCVDTTTKENYNSPVSDIVKDQLVVMKEMKVKKLPSTSEVRLGANKKRKLVQKEGRQ 600

Query: 601 FDRVVEDGRCVNIAEPYKKGVEASQLPHAGLNS---MEDSLLESSKCHHFEVRE--SEVM 660
            +   ED   + + +      E S+     +       +  L ++K H    +E   E  
Sbjct: 601 TEHKCEDLEIMGMMDEVIGQEEKSEKRERKIKKSPPSSEGKLRANKKHKLVKKEGREENN 660

Query: 661 QENLETMDNQVKKMTPSQETRDKVTKELLHAVEANPWKMD--KVNGKEGKPEKKEHGLQP 720
           Q +LET  NQVK M   QE  DKVTKELLHAV+ NP  M+  +V G+EGK EKKE  L  
Sbjct: 661 QTDLETAGNQVKDMILPQEAHDKVTKELLHAVDVNPRNMNGVEVIGQEGKSEKKERYLLS 720

Query: 721 QGKENIEKKRRHVTDIEQHKRPLSSLF 728
           QGKEN  KK RHVT   Q K PLSSLF
Sbjct: 721 QGKENTNKKHRHVTGTAQPKGPLSSLF 747

BLAST of CmoCh06G001410 vs. TAIR 10
Match: AT2G17580.1 (Polynucleotide adenylyltransferase family protein )

HSP 1 Score: 497.7 bits (1280), Expect = 1.7e-140
Identity = 282/557 (50.63%), Positives = 372/557 (66.79%), Query Frame = 0

Query: 51  QDASNIDMPKWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAP 110
           +D +++D  KW KV     GI  SMIP SS  VL++L  +GF+AYLVGGCVRDL+LNR P
Sbjct: 49  EDINSVDTSKWKKVRASDAGIKNSMIPESSMNVLRLLRRQGFDAYLVGGCVRDLILNRVP 108

Query: 111 KDFDVITTAGLRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVA------KHSKG 170
           KD+DVITTA L+QI +LFH AQ++G+RFPIC V + GS+IEVSSF+TVA      + SK 
Sbjct: 109 KDYDVITTADLKQIRRLFHRAQVIGKRFPICHVWMGGSIIEVSSFDTVAHSDSDLEKSKE 168

Query: 171 KETVTSSPTPRK----------CDEKDLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGI 230
           K  V+      K           D KD  RWRNSL RDFTINSLF++PF   IYDYA G+
Sbjct: 169 KSGVSLDTKANKNNSLFKMYSGWDIKDCKRWRNSLQRDFTINSLFYNPFDFTIYDYANGM 228

Query: 231 ADLRSLKLRTLIPASLSFKEDCARILRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKS 290
            DL  LKLRTL+PA LSFKEDCARILRGLRIAARLGLSLSKD +TA+ +  SS+A+LD+ 
Sbjct: 229 EDLTDLKLRTLVPAHLSFKEDCARILRGLRIAARLGLSLSKDVKTAIPEFVSSVANLDQF 288

Query: 291 RLMMEFNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSN 350
           RL+ME NYML+YGAA PS+ LL +F LL +LLPF AAYLD Q  K S  +S ML++LFSN
Sbjct: 289 RLIMEMNYMLAYGAAAPSILLLMKFKLLHVLLPFQAAYLD-QASKTSLSSSLMLVRLFSN 348

Query: 351 LDKLVSCDRPSDCNIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGVNYARENSL 410
           +DKLVSCD+P+D  +W+A+LAFH+ALV NPQ +++V AFAA LYHG W++ V +ARE+  
Sbjct: 349 MDKLVSCDQPADPKLWIAVLAFHIALVRNPQEAIVVRAFAALLYHGNWSKAVEFAREHET 408

Query: 411 LQINLRPEITRSAQFKSAEELAERVTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSL 470
             I   PE+++S++ +S E+LAE V+ F   ++     LT  + L EA+  +P    S L
Sbjct: 409 SVIGYAPEVSKSSRKRSDEDLAEAVSEFTCLLKDTQYVLTDKEALREALYLYPDFKFSGL 468

Query: 471 VFVSKKAAKDVAKIIEVLVNDVESYKNTRKNFEIDYQLLKKGILNESRFVLGKVILETLK 530
           VF+ KK  +DVA+   + ++DVESY++ ++ F IDY LL KG   E RFVLGK+IL+T+ 
Sbjct: 469 VFIPKKKGRDVAEGF-MRLSDVESYESQKEGFSIDYVLLGKGNPCEVRFVLGKIILDTIT 528

Query: 531 EAIVQE----------------DGIILDVKQNLCVDATTEE--NYSSPVSDSVKDQL--V 572
           E  V E                    L+ K  L V  +++E  N  +PV DS    +  +
Sbjct: 529 EGTVIEPLNSVKKKQSTRNHIVPAACLEKKDELFVSKSSKEDNNNQTPVHDSNASSVLKI 588

BLAST of CmoCh06G001410 vs. TAIR 10
Match: AT1G28090.1 (Polynucleotide adenylyltransferase family protein )

HSP 1 Score: 373.2 bits (957), Expect = 4.9e-103
Identity = 208/458 (45.41%), Positives = 290/458 (63.32%), Query Frame = 0

Query: 61  WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAG 120
           W K+D   FGI RSMIP S+ MVL  L  KGF+ YLVGGCVRDL+L+R PKDFDVITTA 
Sbjct: 82  WKKLDANEFGIQRSMIPDSTRMVLNKLKKKGFQVYLVGGCVRDLILDRIPKDFDVITTAE 141

Query: 121 LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK 180
           L+++ K+F   QIVGRRFPIC V +   +IEVSSF T A+   GK    S   P  CDE+
Sbjct: 142 LKEVRKVFPGCQIVGRRFPICHVYVDDIIIEVSSFSTSAR--TGKAPNKSFRRPAGCDER 201

Query: 181 DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL 240
           D IRW+N L RDFT+N L FDP  NV+YDY  G+ DLR+ K+RT+  A+LSF ED ARIL
Sbjct: 202 DYIRWKNCLQRDFTVNGLMFDPSENVVYDYIGGVEDLRNSKVRTVSAANLSFVEDTARIL 261

Query: 241 RGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN 300
           R +RIAARLG SL+KD   ++++LSSS+  LD SR+ ME NYML+YG+A  SL LL RF 
Sbjct: 262 RAIRIAARLGFSLTKDVAISVKELSSSLLRLDPSRIRMEINYMLAYGSAEASLRLLWRFG 321

Query: 301 LLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL 360
           L+EILLP  A+YL  Q  ++    S ML+ LF NLD+LV+ DRP    +W+ +LAFH AL
Sbjct: 322 LMEILLPIQASYLVSQGFRRRDGRSNMLLSLFRNLDRLVAPDRPCSEFLWIGILAFHKAL 381

Query: 361 VNNPQNSLIVLAFAATLY-HGEWNEGVNYARENSLLQINLRPEITRSAQ--FKSAEELAE 420
           V+ P++  +V +F   +Y     +E +  AR NS    +   E++   +    S  ++++
Sbjct: 382 VDQPRDPTVVASFCLAIYSEVSLSEAIAIARSNSKQHNSHFQELSSPEKDTADSESKISQ 441

Query: 421 RVTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVE 480
           +V   A  ++     L + D +  AMS +P +P S +VF+S+   + V K+   +     
Sbjct: 442 QVIKLAESIRSAARKLNNRDYIANAMSKYPQAPGSDMVFLSRLMLERVEKMFGNVRRKGN 501

Query: 481 SYKNTRKNFE--IDYQLLKKGILNESRFVLGKVILETL 514
             ++   + E  I+Y+ L  G  +E+R V  +++ +T+
Sbjct: 502 QERDDVPSLERRINYKSLALGDFHETRRVFARIVFDTI 537

BLAST of CmoCh06G001410 vs. TAIR 10
Match: AT1G28090.2 (Polynucleotide adenylyltransferase family protein )

HSP 1 Score: 373.2 bits (957), Expect = 4.9e-103
Identity = 208/458 (45.41%), Positives = 290/458 (63.32%), Query Frame = 0

Query: 61  WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAG 120
           W K+D   FGI RSMIP S+ MVL  L  KGF+ YLVGGCVRDL+L+R PKDFDVITTA 
Sbjct: 46  WKKLDANEFGIQRSMIPDSTRMVLNKLKKKGFQVYLVGGCVRDLILDRIPKDFDVITTAE 105

Query: 121 LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK 180
           L+++ K+F   QIVGRRFPIC V +   +IEVSSF T A+   GK    S   P  CDE+
Sbjct: 106 LKEVRKVFPGCQIVGRRFPICHVYVDDIIIEVSSFSTSAR--TGKAPNKSFRRPAGCDER 165

Query: 181 DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL 240
           D IRW+N L RDFT+N L FDP  NV+YDY  G+ DLR+ K+RT+  A+LSF ED ARIL
Sbjct: 166 DYIRWKNCLQRDFTVNGLMFDPSENVVYDYIGGVEDLRNSKVRTVSAANLSFVEDTARIL 225

Query: 241 RGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN 300
           R +RIAARLG SL+KD   ++++LSSS+  LD SR+ ME NYML+YG+A  SL LL RF 
Sbjct: 226 RAIRIAARLGFSLTKDVAISVKELSSSLLRLDPSRIRMEINYMLAYGSAEASLRLLWRFG 285

Query: 301 LLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL 360
           L+EILLP  A+YL  Q  ++    S ML+ LF NLD+LV+ DRP    +W+ +LAFH AL
Sbjct: 286 LMEILLPIQASYLVSQGFRRRDGRSNMLLSLFRNLDRLVAPDRPCSEFLWIGILAFHKAL 345

Query: 361 VNNPQNSLIVLAFAATLY-HGEWNEGVNYARENSLLQINLRPEITRSAQ--FKSAEELAE 420
           V+ P++  +V +F   +Y     +E +  AR NS    +   E++   +    S  ++++
Sbjct: 346 VDQPRDPTVVASFCLAIYSEVSLSEAIAIARSNSKQHNSHFQELSSPEKDTADSESKISQ 405

Query: 421 RVTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVE 480
           +V   A  ++     L + D +  AMS +P +P S +VF+S+   + V K+   +     
Sbjct: 406 QVIKLAESIRSAARKLNNRDYIANAMSKYPQAPGSDMVFLSRLMLERVEKMFGNVRRKGN 465

Query: 481 SYKNTRKNFE--IDYQLLKKGILNESRFVLGKVILETL 514
             ++   + E  I+Y+ L  G  +E+R V  +++ +T+
Sbjct: 466 QERDDVPSLERRINYKSLALGDFHETRRVFARIVFDTI 501

BLAST of CmoCh06G001410 vs. TAIR 10
Match: AT1G28090.3 (Polynucleotide adenylyltransferase family protein )

HSP 1 Score: 373.2 bits (957), Expect = 4.9e-103
Identity = 208/458 (45.41%), Positives = 290/458 (63.32%), Query Frame = 0

Query: 61  WNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTAG 120
           W K+D   FGI RSMIP S+ MVL  L  KGF+ YLVGGCVRDL+L+R PKDFDVITTA 
Sbjct: 19  WKKLDANEFGIQRSMIPDSTRMVLNKLKKKGFQVYLVGGCVRDLILDRIPKDFDVITTAE 78

Query: 121 LRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDEK 180
           L+++ K+F   QIVGRRFPIC V +   +IEVSSF T A+   GK    S   P  CDE+
Sbjct: 79  LKEVRKVFPGCQIVGRRFPICHVYVDDIIIEVSSFSTSAR--TGKAPNKSFRRPAGCDER 138

Query: 181 DLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARIL 240
           D IRW+N L RDFT+N L FDP  NV+YDY  G+ DLR+ K+RT+  A+LSF ED ARIL
Sbjct: 139 DYIRWKNCLQRDFTVNGLMFDPSENVVYDYIGGVEDLRNSKVRTVSAANLSFVEDTARIL 198

Query: 241 RGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRFN 300
           R +RIAARLG SL+KD   ++++LSSS+  LD SR+ ME NYML+YG+A  SL LL RF 
Sbjct: 199 RAIRIAARLGFSLTKDVAISVKELSSSLLRLDPSRIRMEINYMLAYGSAEASLRLLWRFG 258

Query: 301 LLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMAL 360
           L+EILLP  A+YL  Q  ++    S ML+ LF NLD+LV+ DRP    +W+ +LAFH AL
Sbjct: 259 LMEILLPIQASYLVSQGFRRRDGRSNMLLSLFRNLDRLVAPDRPCSEFLWIGILAFHKAL 318

Query: 361 VNNPQNSLIVLAFAATLY-HGEWNEGVNYARENSLLQINLRPEITRSAQ--FKSAEELAE 420
           V+ P++  +V +F   +Y     +E +  AR NS    +   E++   +    S  ++++
Sbjct: 319 VDQPRDPTVVASFCLAIYSEVSLSEAIAIARSNSKQHNSHFQELSSPEKDTADSESKISQ 378

Query: 421 RVTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVE 480
           +V   A  ++     L + D +  AMS +P +P S +VF+S+   + V K+   +     
Sbjct: 379 QVIKLAESIRSAARKLNNRDYIANAMSKYPQAPGSDMVFLSRLMLERVEKMFGNVRRKGN 438

Query: 481 SYKNTRKNFE--IDYQLLKKGILNESRFVLGKVILETL 514
             ++   + E  I+Y+ L  G  +E+R V  +++ +T+
Sbjct: 439 QERDDVPSLERRINYKSLALGDFHETRRVFARIVFDTI 474

BLAST of CmoCh06G001410 vs. TAIR 10
Match: AT5G23690.1 (Polynucleotide adenylyltransferase family protein )

HSP 1 Score: 339.7 bits (870), Expect = 6.0e-93
Identity = 186/461 (40.35%), Positives = 278/461 (60.30%), Query Frame = 0

Query: 60  KWNKVDGRAFGISRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLNRAPKDFDVITTA 119
           +W +++ +  G+S SMI  S+  VL  L +KG + YLVGGCVRDL+L R PKDFD++T+A
Sbjct: 63  EWKQLNSKDLGLSSSMIAKSTRKVLNGLKSKGHDVYLVGGCVRDLILKRTPKDFDILTSA 122

Query: 120 GLRQIHKLFHHAQIVGRRFPICMVNIKGSVIEVSSFETVAKHSKGKETVTSSPTPRKCDE 179
            LR++ + F   +IVGRRFPIC V+I   +IEVSSF T A++S          +     +
Sbjct: 123 ELREVVRTFPRCEIVGRRFPICHVHIGDDLIEVSSFSTSAQNSSRNTRTECKESSGSDGD 182

Query: 180 KDLIRWRNSLHRDFTINSLFFDPFLNVIYDYAEGIADLRSLKLRTLIPASLSFKEDCARI 239
           +D IR  N L RDFTIN L FDP+  V+YDY  G+ D+R  K+RT+I A  SF +DCARI
Sbjct: 183 EDCIRLNNCLQRDFTINGLMFDPYAKVVYDYLGGMEDIRKAKVRTVIHAGTSFHQDCARI 242

Query: 240 LRGLRIAARLGLSLSKDTETAMRKLSSSIASLDKSRLMMEFNYMLSYGAAVPSLYLLQRF 299
           LR +RIAARLG  +SK+T   ++ LS  +  LDK R++ME NYML+YG+A  SL LL +F
Sbjct: 243 LRAIRIAARLGFRMSKETAHFIKNLSLLVQRLDKGRILMEMNYMLAYGSAEASLRLLWKF 302

Query: 300 NLLEILLPFHAAYLDKQDIKKSSLNSTMLMKLFSNLDKLVSCDRPSDCNIWVALLAFHMA 359
            +LEILLP  AAYL +   ++    + ML+ LF+NLDKL++ DRP   ++W+A+LAFH A
Sbjct: 303 GILEILLPIQAAYLARSGFRRRDKRTNMLLSLFANLDKLLAPDRPCHSSLWIAILAFHKA 362

Query: 360 LVNNPQNSLIVLAFAATLYH-GEWNEGVNYARENSLLQINLRPEITRSAQFKSAEELAER 419
           L + P++ ++V AF+  +++ G+  E V   ++ +        E+    +    + L + 
Sbjct: 363 LADKPRSPIVVAAFSLAVHNCGDILEAVEITKKITRPHDKSFFELVEPEENLDFQTLLDE 422

Query: 420 VTHFALKVQGCIAALTSADCLLEAMSTFPASPHSSLVFVSKKAAKDVAKIIEVLVNDVES 479
           V      ++  +  +T A  + +AMS +P +P+S LVF+  +      +I + + N+   
Sbjct: 423 VMDLDASIEDALNQMTDAYFISKAMSAYPQAPYSDLVFIPLQLYLRAGRIFDCVKNE--- 482

Query: 480 YKNTRKNFE------IDYQLLKKGILNESRFVLGKVILETL 514
              TR  FE      I+Y  L  G   E R V  +V+ +T+
Sbjct: 483 --ETRIGFEAKQGSKIEYGSLNSGYFPEIRHVFARVVFDTV 518
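
The percentages on each HSP summary line follow directly from the match counts and the alignment length (for example, 539 identical positions over 747 aligned columns gives 72.16%). The sketch below recomputes them from the text of such a line. It is a minimal illustration, not part of any BLAST tool: the function name `check_hsp_line` and the tolerance `tol` are hypothetical, and the pattern assumes the line layout shown in the reports above.

```python
import re

# Pattern for the two ratio fields of an HSP summary line.
# "Posi?tives" also accepts the "Postives" spelling that some report
# renderers emit. The layout is assumed from the reports above.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_line(line, tol=0.01):
    """Recompute identity and positives percentages from an HSP line.

    Returns (identity_pct, positives_pct, consistent), where `consistent`
    is True when both printed percentages agree with the recomputed
    values to within `tol` percentage points (hypothetical tolerance).
    """
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    identity = 100.0 * int(id_n) / int(id_len)
    positives = 100.0 * int(pos_n) / int(pos_len)
    consistent = (
        abs(identity - float(id_pct)) <= tol
        and abs(positives - float(pos_pct)) <= tol
    )
    return round(identity, 2), round(positives, 2), consistent


if __name__ == "__main__":
    line = "Identity = 186/461 (40.35%), Positives = 278/461 (60.30%), Query Frame = 0"
    print(check_hsp_line(line))
```

Identity counts only exact residue matches, while positives also count conservative substitutions (the `+` symbols in the alignment midline), so the positives figure is never lower than the identity figure.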

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
P44439 | 7.3e-35 | 33.45 | Poly(A) polymerase I OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / ...
P0ABF3 | 3.5e-29 | 32.95 | Poly(A) polymerase I OS=Escherichia coli O157:H7 OX=83334 GN=pcnB PE=3 SV=2
P0ABF2 | 3.5e-29 | 32.95 | Poly(A) polymerase I OS=Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UP...
P0ABF1 | 3.5e-29 | 32.95 | Poly(A) polymerase I OS=Escherichia coli (strain K12) OX=83333 GN=pcnB PE=1 SV=2
Q8Z9C3 | 1.0e-28 | 31.88 | Poly(A) polymerase I OS=Salmonella typhi OX=90370 GN=pcnB PE=3 SV=2

Match Name | E-value | Identity | Description
A0A6J1G6Y5 | 0.0e+00 | 100.00 | uncharacterized protein LOC111451426 OS=Cucurbita moschata OX=3662 GN=LOC1114514...
A0A6J1I3F4 | 0.0e+00 | 97.66 | uncharacterized protein LOC111470184 OS=Cucurbita maxima OX=3661 GN=LOC111470184...
A0A6J1CVT5 | 7.6e-285 | 73.23 | uncharacterized protein LOC111014843 isoform X4 OS=Momordica charantia OX=3673 G...
A0A6J1CWE8 | 2.4e-283 | 73.04 | uncharacterized protein LOC111014843 isoform X3 OS=Momordica charantia OX=3673 G...
A0A6J1CW57 | 2.7e-282 | 72.16 | uncharacterized protein LOC111014843 isoform X2 OS=Momordica charantia OX=3673 G...

Match Name | E-value | Identity | Description
AT2G17580.1 | 1.7e-140 | 50.63 | Polynucleotide adenylyltransferase family protein
AT1G28090.1 | 4.9e-103 | 45.41 | Polynucleotide adenylyltransferase family protein
AT1G28090.2 | 4.9e-103 | 45.41 | Polynucleotide adenylyltransferase family protein
AT1G28090.3 | 4.9e-103 | 45.41 | Polynucleotide adenylyltransferase family protein
AT5G23690.1 | 6.0e-93 | 40.35 | Polynucleotide adenylyltransferase family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 469..489
None | No IPR available | GENE3D | 1.10.3090.10 |  | coord: 224..388; e-value: 5.6E-30; score: 106.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 682..707
None | No IPR available | PANTHER | PTHR43051:SF1 | POLYNUCLEOTIDE ADENYLYLTRANSFERASE FAMILY PROTEIN | coord: 9..727
None | No IPR available | PANTHER | PTHR43051 | POLYNUCLEOTIDE ADENYLYLTRANSFERASE FAMILY PROTEIN | coord: 9..727
None | No IPR available | SUPERFAMILY | 81891 | Poly A polymerase C-terminal region-like | coord: 221..365
IPR002646 | Poly A polymerase, head domain | PFAM | PF01743 | PolyA_pol | coord: 94..223; e-value: 1.1E-22; score: 80.8
IPR002646 | Poly A polymerase, head domain | CDD | cd05398 | NT_ClassII-CCAase | coord: 85..218; e-value: 4.84613E-31; score: 116.538
IPR032828 | tRNA nucleotidyltransferase/poly(A) polymerase, RNA and SrmB-binding domain | PFAM | PF12627 | PolyA_pol_RNAbd | coord: 250..312; e-value: 4.2E-13; score: 48.8
IPR043519 | Nucleotidyltransferase superfamily | GENE3D | 3.30.460.10 | Beta Polymerase, domain 2 | coord: 76..223; e-value: 2.2E-38; score: 133.2
IPR043519 | Nucleotidyltransferase superfamily | SUPERFAMILY | 81301 | Nucleotidyltransferase | coord: 75..218
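
Each InterPro hit above reports its position on the protein as a `coord: start..end` pair. As a rough sketch, the span covered by a hit can be read off that string. The helper name `span_length` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Matches the "coord: start..end" field as it appears in the table above.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(cell):
    """Return the number of residues covered by a 'coord: a..b' field.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in
    the record). Returns None when the text has no coord field.
    """
    m = COORD_RE.search(cell)
    if m is None:
        return None
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


if __name__ == "__main__":
    # PF01743 (PolyA_pol) hit from the table above.
    print(span_length("coord: 94..223"))
```

Under that assumption the PF01743 head-domain hit (94..223) covers 130 residues and the PF12627 RNA-binding hit (250..312) covers 63.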

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh06G001410.1 | CmoCh06G001410.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0001680 | tRNA 3'-terminal CCA addition
biological_process | GO:0006396 | RNA processing
molecular_function | GO:0016779 | nucleotidyltransferase activity
molecular_function | GO:0003723 | RNA binding