Cp4.1LG17g02530 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG17g02530
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPoly(A) RNA polymerase cid11
LocationCp4.1LG17 : 2189250 .. 2195630 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGATTGGCAGCAAGAAGTTGCAAAGCAAATCGGATCCCAAATTTTGCATAGTCAATGAAGATTTTATGCTTTGATATTTCCTTCTTCCCTCCAATTCTTACTCATTCTTCGTTGCTTCTCCTTCCACTCGCCCTCTCTCTCTCTCTTTTGTTACTGACCGCCGCTGACTCGCCCTACCATGGCCGGCGACGCTGGTGGCGACGGTCACTCTCCCTTTCCCCCACCTTCTAACGGCGGAGAGTTTCTTCTCTCCCTCCTACAACGACCCCCAAATCGCCAATCGCATCTCAACATCAATCCTCAGCCTCAGCCTCACCTTCACCCTCCCCCTGCCCTTGATCCAGCCGTCGCCGCCGTTGGTCCCTCCCTTACCTCCCTCCCGCCGCCCTGGCCTTCCACTGGCTCCGATCTTCTATACCCAATTCCTCTTTCCCCTTGGTCACACTCCCACCAGTCCTTGTCTGCTCCTATTGCTTCTAACTTTGTGGGGTTTCAACACCTTCAGCAGAACCCTTTTCCTCTTCCTCGGAATCAGTTTGGGGGAGCCCAATTTGCGGCGAGTCACAGTTCGGGTGACCTAATTCAAGGGGGTCTTGGGGGTGCAGACGATTTGAAGAGGTTAGGGCTTCGTGGAAACCATGATAGACCGAATGGTACTGTTCATAACCTTTCGCAGCATAATGAACTGGAGAATAAGCTTCAGTTTGGCTCTTTTTCTCCCACTCAATTTTCTAGGGTTTTGGTTAGTGGGAATAATAGCTCTGCCAATGATTTAAACCGCGAAGTAGGGTTTAGGGAGACAATCCCTAATGGGATGAATAGAAATCAGGGATTGGATTCTCATGGGAATTCGAATTTCATATCATATGGGAATTCGATTTCCAATGCCAATGTTCATTCCTTTCGCCGTGGAGAGTTTGATTACTCAGAGCAAGAAAGAGGGCGTGTGTTGGGTGAAAACTACAGTTTTCATCCCTTGGTGAAGGTGTTGGAGCCATCTGGTTTTATGAGCAAGCCAAAGGGAGGAGGGCATTTGGATTCTGTGAACATTAGAAGGAGAGATTTTGATCATGTAGTAAATAGAGAGAGAGCTAGTTCAAGTCAACTTGGGGAGGCGTTTCACAGGCTCGAACTTGGTGCACAGCTTCATGACCCTGTACGCCCTTCTAGGAGTGATCTTCATTCAGCATCAGCATTGGATATGGAAGAACGTGGTTTGAATTTGCATCCCGATTTTGCTGAAGGCAGGCCCAGAGATAGTCATGAGGGGCGTGGTTGGATGAGAAAGGATGTTGATTCAACTAATGGTAATGATAGCCAGGAACTGGAGAACAATATAGGTGAACAGCTTGCTGATTCCTTGTTGCACGAGGAGGAACCTGATGAGAAAAGCGATGCTAAACACGTGCGTCGTGAGAAGGTTGAATATTCTCTTTTCATTTATCTTGTATTTTCATACAATGCCTTTTTTTGTGTGGAAACATCACTGTTGAAGTCTCATGTATGAATGATAAGATACTAAAAATGAATTTTGTTGAAGGAAAGAGGCATACCATTTTACTGATCATCTTAACGTTGAGATTGTATGCCCCATGAATTTACAAACTTGCATCTTTTCTAGAATATTGTACTTATGTCTTCAAAGGAACAACATGTGAGAAAATTCACTATATAAGTTTGTATTGAAACATTTGTGGATAGGGTCACTATATAAGTTTGTATTGAAACATTTGTGGATAGGGTATTATTAATATTAAAGGTCAGATTGGAGACAAAATTTATGTGAAATTTTCTACACAGGATTGCAGGGGGAATCGGCTACTTACACACAGGGAGAGGATCTCTCGAAGACATATAAAGTGCCGTGGTGACATAGATATGTTGAAAGTTCCTCTTCTTGCAATTTATGAATCTCTGATACCACCTAAAGAAGAAAAGGAAAAACAGATGCAGTTATTAACATCACTAGAGAAGTTGGTTGTTAATGAATGGCCTTGTGCTCGTCTATGTCTCTTTGGATCATGTGCGAACTCCTTCGGTGTTTCAAATAGTGATTTAGATGTGTGTCTTGTGCATAGAGATGCTGATATTGACAAGGCTGAGATCTTACTGAAGTTGGCAGATAGACTGCAATCAGCTAACTTCCAGAATGTGCAGGTACTTGGATTATCTACCATTCAGAATATTTTTCTTGATTGAAGTTGAAACGTCTTGGTAATGATGTCAAGATGTGTTTATAATTGTATGTTATATTTTAGTTGCATGCGTCATACGAGCTTGTCTTCTGTTCTCCGATGTACGTGTATCCTTTTTAGGTGATGAAATAGGCAGTATGCATGTATGAAAATAACAAAATGGTGACGTAATGTTTGGATACAACTTCAAGCGACATTCACTATCATGCACACAAATAAAAGCTCTTTAAATTTCTATTATTTGACTTCAAGTTACTATGTTTGTAGGCCCTGACGCATGCAAGGGTTCCTATTATAAAGCTCAAGGATCCGGTGACTGGAATATCTTGTGACATATGCATAAACAATGTTTTGGCTGTTGTAAATACAAAACTTCTCCGGGATTATGCACAAATAGATGTGAGATTACCACAATTGGCATTTATTGTGAAGCATTGGGCTAAGTCTAGAGGAGTGAATGAAACATACCAGGGAACACTTTCTAGCTATGCGTAAGTTGTCAAACTTTATGGAATTCTTCTACTGTGCAGTATGATCATGTTGTATGAATTAATCTTCTATCATGATTAGAGATAAGTGTCACTATGATAAGGATACGAACACGTAATTTCTTGGTTTTTGAGAGTTGCGAACACTTCCTACATTGTCAAGTTTTCTAATTTCACATGTTAACTTGATGGAGTATGCTAATATGCCTATTATCAATTCAGGTATGTTTTGATGTGCATCCATTTCTTACAACATCGAGATCCTCCTATCCTGCCTTGTTTACAGGTTTGTATCCAAGTCTCTTAATATATGCTCACAGTCCGAATTTGGATTTTAAATACTCCTTTTATGATGACCATCCATTCTCATGGGTGGCTATTCTACTATCTACATATGCTTATCTGTACAAATAAACATTTTAGATTTCTTTTATTCAAGCCTCTAGTATGAAATAGGTAGATAAGAAGCAAAGAACTAAAAAGATGAAAATTGAGCATAGAGGTTAAGTGCATCCCTGTTCATGTCAGCAGTGGTACAATTCTTAACATGTACAAGACACTGGAATTTTATGCACTCCTTAAATAAATGGAGCTCTCGTAACAAAATTATTTCTGAGACCCAAATGACCTGCAAACCGTAATTTTATTGTGTATATTGAAATCACTAGTATAAACTTGATTTTGAAGGCAAGAACAACTTTAAGGAAAGGTCATCATTCAAAACATTCTACAAGAAAATAAGAAAAAACAATTTTTAGAAAACAATTATCAAAGTAGACTTTTTGTTTAACAAAAGGTTGTGGAAACAGTTCGAAAAGAAGCCCTAAGCTGTTGATGTTGTACTATAGGAGCTACTTCCTTTTTCTATTTCAGTTATTTGTTATGGGTATTAATAGTTGTTGTTGAGTTGCTCAATTAGTATTTAATTTATGTCCAATAGTTTTAGTTTGATATGTTATGTTGTTTGGATAAGATGGCTGTATTCTTGATAATAGATTAGCTGACCAACAAAAGTTTCACCTTTGATATTTATATTTATGCAAGTTTATCTGGTCGTATATTTATTATTTGCATTTAGTGATCAGTATATCCCTCCACCACAGGAAACGAAGATTGTTACTTATCATGAAATTGTTGATAATATTGAATGTGCATACTTTGATCAAGTTGAAAATCTGAAAAGTTTTGGATCCAATAACAATGAAAGCGTTGCTCGACTCGTCTGGGGATTCTTCCATTATTGGGCATATTGTCATGATTATGCCAACACCGTCATATCCATTCGTACTAAAAGCACTATCAGGTAAGATATGTGATATAATATTTGTTTCTATCATTATATGAAAGAGCTTTTAAATTCAGTGTAACCGTTGTTAAGGAATAATAGACCAATGATAGTTTTTTGTTTTGGTACTAAAAAACATGATGCTGGACTGGATTAAAGAGGTAGGTGTGCGTGATTCTTTCCTAGGCAAAAAAGTAGAGGTCTTAAATAGCTTGAGTTGGGACCTTTTACAAACTGGTTAGCATCAGAAAACTCGTAAGATTGTGGTATTCTTGTTAGAAAACCACCTATCTTGAACCTATGACTTCAAAGTTCCTGAACTTTGTTCGGTGGTTTTTGGTTTTGCGGTTCTGGAAGTATGACGCTTGGAATAAGAGATGGATGGGCACAATTCTTAGAATATTTTTTAGCAAAAGATGGAAGCTATTCCACCCGCCATTTTGAGATAAGGAGACCCTTTGATTTGCAGCCTTTTTGGTTAATTTGTGGAATATTTGGCTCGACAGAAGTATGATAGTGGCATATTGTATCTAATTTGCACCCCAATTTGCTTTGCAAATTAGAGGAAGGGTGAGCATAGACAAGAGAGTTAGTTATGTAAAGAAAAGATATGTATGGTCGTTTATGAGCAGTAGTATTATATACAAGCAAATGAGTTAGAAAAGCGAAGAAGGTCGTTGTGAGATCCAACAGCGGTTGGGGGAGGAGAACAAAACATTCTTTATAAGTGTGTGGAAATCTCTCCCTAGCAGTCGCGTTTTAAAAACCTTGAGGGAACCGAAAGGGAAGGCCCAAAGAGGACAATATCTGCTAGCGGTGGGCTTGAGCATGTTACATTCGTTGTCACTATATAGTGTCTCTGTGGAGATTACATTTCCGAATAGGAGAAGTTTTTAGAGATGTCAAGAGAGGTCTTGAGGGAGGTGTGAACTTTGGTTAGATTTAATACCCTATTTCCGAGTATCTTTTGGTCATTGCACTTAGGGTATTATGCCTGTAGCAATGTCGAGTTTCATCATCGTTTAACTTTTAGTTGTAGTGATTCAGTTGAACTGCCTTAAAACCCCTTCTAGTGAAATTCTTATCAGTTGATTCTCTGTTTAGGCAAGATAATTGACAATCCTCTCCACTAGCTGTCCTTCTGTTAGGGGCATAGGCCTCATGTCTAGCAAGTGAACAAAGCAAGATAAACAAAAATGAGATTTTAGTTGGATCCACTCTGTTCTTCTTCTCATTACAGGGATTCAAATGGGGAAAAGGAAAAGAAAGAAAGACCTGCAACTAAATCTCTCTCTTTCCAGTTCTTTGTTATTGTTTTTGATAATACCCTGTCTTGAAAAGGAAAGGACTAGAGGCCTACAAAGGTAGTGCTTATTAGAAGAAACCAGAGAATGTTCAAATGGATTGAGAGCGCCCATGGAAGGTCTGTTTTATACCAAGGTTTCATGCCTCTCTTCGATGTCCGTGACCGTGTCAAAGAGTTTTCTCATCTTCTTAATCTCGTAATTTTTTTCAGTGGTATCGGTCGTTTTAAGTCCTTCTGTAGCGTTATCCGTTTGTTTTCCCTTTATTTATATCAATAAAAGCTTTTGATCTCTCTATTCTCAATATGTTTCTTTGCTTCCTAGTATTAAACTTGCAAGATATTAAAATTTTCATCATTTGTCTTTGTATGCTCTCTTTATGGATGGTAAAAAAGTCTGAATTCTGATGAAGGATCCAAGTGACACCTGGGGTTCTTTTCTCTGGTTAACTTTTGTTCTTACTTTTATTCTCAGATCCAGCCCTCTTTTGAATCAGCATTGAAACTTCTACTTTGTTTCTGAGTAGTTAATGGGCTTTAGCTTATTGTAGCCCAAAATTTTAATCTGGGGAGGAGTATGCTGATAGATATAGATATGGCTCTTCATCTCATTGTTGTCTTTCTTATGTTCATCTCATTGTCAACTGAATGTGCAGCAAGAGAGCAAAAGATTGGACGAGGAGAATCGGCAAGGATCGTCATTTGATATGCATCGAGGATCCATTTGAGACATCTCACGACCTCGGTCGAGTTGTCGACAAGTACAGTATCAAAGTTTTGAGGGAAGAATTTGAACGTGCTGCTACTATTCTACAAACCTATCCAAACCCGTGCGAGAAGCTCTTCGAACCCTTCATCCCGAGTTAGCACTTCTTGTTTTTCTTAATTATCAGAGAGCTATTTAAAAACACAGTCATTATTATGTTCATAGCTAATCAAGGTTGTAAATTGCTTGCTATGGTTTCTTTTTTTTTTCTTTTTTTTTTTTTGTGCTTATTTATGTGTCAAAGTTTGTTTGGGTTTTTGTAAAAAGATGTTGTGGTGATCTTTATACCATCTAGAATTTTGATTTTCCCAGTTTGTAATATTATTTCTAGCAAGAAATTCCTT

mRNA sequence

TGATTGGCAGCAAGAAGTTGCAAAGCAAATCGGATCCCAAATTTTGCATAGTCAATGAAGATTTTATGCTTTGATATTTCCTTCTTCCCTCCAATTCTTACTCATTCTTCGTTGCTTCTCCTTCCACTCGCCCTCTCTCTCTCTCTTTTGTTACTGACCGCCGCTGACTCGCCCTACCATGGCCGGCGACGCTGGTGGCGACGGTCACTCTCCCTTTCCCCCACCTTCTAACGGCGGAGAGTTTCTTCTCTCCCTCCTACAACGACCCCCAAATCGCCAATCGCATCTCAACATCAATCCTCAGCCTCAGCCTCACCTTCACCCTCCCCCTGCCCTTGATCCAGCCGTCGCCGCCGTTGGTCCCTCCCTTACCTCCCTCCCGCCGCCCTGGCCTTCCACTGGCTCCGATCTTCTATACCCAATTCCTCTTTCCCCTTGGTCACACTCCCACCAGTCCTTGTCTGCTCCTATTGCTTCTAACTTTGTGGGGTTTCAACACCTTCAGCAGAACCCTTTTCCTCTTCCTCGGAATCAGTTTGGGGGAGCCCAATTTGCGGCGAGTCACAGTTCGGGTGACCTAATTCAAGGGGGTCTTGGGGGTGCAGACGATTTGAAGAGGTTAGGGCTTCGTGGAAACCATGATAGACCGAATGGTACTGTTCATAACCTTTCGCAGCATAATGAACTGGAGAATAAGCTTCAGTTTGGCTCTTTTTCTCCCACTCAATTTTCTAGGGTTTTGGTTAGTGGGAATAATAGCTCTGCCAATGATTTAAACCGCGAAGTAGGGTTTAGGGAGACAATCCCTAATGGGATGAATAGAAATCAGGGATTGGATTCTCATGGGAATTCGAATTTCATATCATATGGGAATTCGATTTCCAATGCCAATGTTCATTCCTTTCGCCGTGGAGAGTTTGATTACTCAGAGCAAGAAAGAGGGCGTGTGTTGGGTGAAAACTACAGTTTTCATCCCTTGGTGAAGGTGTTGGAGCCATCTGGTTTTATGAGCAAGCCAAAGGGAGGAGGGCATTTGGATTCTGTGAACATTAGAAGGAGAGATTTTGATCATGTAGTAAATAGAGAGAGAGCTAGTTCAAGTCAACTTGGGGAGGCGTTTCACAGGCTCGAACTTGGTGCACAGCTTCATGACCCTGTACGCCCTTCTAGGAGTGATCTTCATTCAGCATCAGCATTGGATATGGAAGAACGTGGTTTGAATTTGCATCCCGATTTTGCTGAAGGCAGGCCCAGAGATAGTCATGAGGGGCGTGGTTGGATGAGAAAGGATGTTGATTCAACTAATGGTAATGATAGCCAGGAACTGGAGAACAATATAGGTGAACAGCTTGCTGATTCCTTGTTGCACGAGGAGGAACCTGATGAGAAAAGCGATGCTAAACACGTGCGTCGTGAGAAGGATTGCAGGGGGAATCGGCTACTTACACACAGGGAGAGGATCTCTCGAAGACATATAAAGTGCCGTGGTGACATAGATATGTTGAAAGTTCCTCTTCTTGCAATTTATGAATCTCTGATACCACCTAAAGAAGAAAAGGAAAAACAGATGCAGTTATTAACATCACTAGAGAAGTTGGTTGTTAATGAATGGCCTTGTGCTCGTCTATGTCTCTTTGGATCATGTGCGAACTCCTTCGGTGTTTCAAATAGTGATTTAGATGTGTGTCTTGTGCATAGAGATGCTGATATTGACAAGGCTGAGATCTTACTGAAGTTGGCAGATAGACTGCAATCAGCTAACTTCCAGAATGTGCAGGCCCTGACGCATGCAAGGGTTCCTATTATAAAGCTCAAGGATCCGGTGACTGGAATATCTTGTGACATATGCATAAACAATGTTTTGGCTGTTGTAAATACAAAACTTCTCCGGGATTATGCACAAATAGATGTGAGATTACCACAATTGGCATTTATTGTGAAGCATTGGGCTAAGTCTAGAGGAGTGAATGAAACATACCAGGGAACACTTTCTAGCTATGCGTATGTTTTGATGTGCATCCATTTCTTACAACATCGAGATCCTCCTATCCTGCCTTGTTTACAGGAAACGAAGATTGTTACTTATCATGAAATTGTTGATAATATTGAATGTGCATACTTTGATCAAGTTGAAAATCTGAAAAGTTTTGGATCCAATAACAATGAAAGCGTTGCTCGACTCGTCTGGGGATTCTTCCATTATTGGGCATATTGTCATGATTATGCCAACACCGTCATATCCATTCGTACTAAAAGCACTATCAGCAAGAGAGCAAAAGATTGGACGAGGAGAATCGGCAAGGATCGTCATTTGATATGCATCGAGGATCCATTTGAGACATCTCACGACCTCGGTCGAGTTGTCGACAAGTACAGTATCAAAGTTTTGAGGGAAGAATTTGAACGTGCTGCTACTATTCTACAAACCTATCCAAACCCGTGCGAGAAGCTCTTCGAACCCTTCATCCCGAGTTAGCACTTCTTGTTTTTCTTAATTATCAGAGAGCTATTTAAAAACACAGTCATTATTATGTTCATAGCTAATCAAGGTTGTAAATTGCTTGCTATGGTTTCTTTTTTTTTTCTTTTTTTTTTTTTGTGCTTATTTATGTGTCAAAGTTTGTTTGGGTTTTTGTAAAAAGATGTTGTGGTGATCTTTATACCATCTAGAATTTTGATTTTCCCAGTTTGTAATATTATTTCTAGCAAGAAATTCCTT

Coding sequence (CDS)

ATGGCCGGCGACGCTGGTGGCGACGGTCACTCTCCCTTTCCCCCACCTTCTAACGGCGGAGAGTTTCTTCTCTCCCTCCTACAACGACCCCCAAATCGCCAATCGCATCTCAACATCAATCCTCAGCCTCAGCCTCACCTTCACCCTCCCCCTGCCCTTGATCCAGCCGTCGCCGCCGTTGGTCCCTCCCTTACCTCCCTCCCGCCGCCCTGGCCTTCCACTGGCTCCGATCTTCTATACCCAATTCCTCTTTCCCCTTGGTCACACTCCCACCAGTCCTTGTCTGCTCCTATTGCTTCTAACTTTGTGGGGTTTCAACACCTTCAGCAGAACCCTTTTCCTCTTCCTCGGAATCAGTTTGGGGGAGCCCAATTTGCGGCGAGTCACAGTTCGGGTGACCTAATTCAAGGGGGTCTTGGGGGTGCAGACGATTTGAAGAGGTTAGGGCTTCGTGGAAACCATGATAGACCGAATGGTACTGTTCATAACCTTTCGCAGCATAATGAACTGGAGAATAAGCTTCAGTTTGGCTCTTTTTCTCCCACTCAATTTTCTAGGGTTTTGGTTAGTGGGAATAATAGCTCTGCCAATGATTTAAACCGCGAAGTAGGGTTTAGGGAGACAATCCCTAATGGGATGAATAGAAATCAGGGATTGGATTCTCATGGGAATTCGAATTTCATATCATATGGGAATTCGATTTCCAATGCCAATGTTCATTCCTTTCGCCGTGGAGAGTTTGATTACTCAGAGCAAGAAAGAGGGCGTGTGTTGGGTGAAAACTACAGTTTTCATCCCTTGGTGAAGGTGTTGGAGCCATCTGGTTTTATGAGCAAGCCAAAGGGAGGAGGGCATTTGGATTCTGTGAACATTAGAAGGAGAGATTTTGATCATGTAGTAAATAGAGAGAGAGCTAGTTCAAGTCAACTTGGGGAGGCGTTTCACAGGCTCGAACTTGGTGCACAGCTTCATGACCCTGTACGCCCTTCTAGGAGTGATCTTCATTCAGCATCAGCATTGGATATGGAAGAACGTGGTTTGAATTTGCATCCCGATTTTGCTGAAGGCAGGCCCAGAGATAGTCATGAGGGGCGTGGTTGGATGAGAAAGGATGTTGATTCAACTAATGGTAATGATAGCCAGGAACTGGAGAACAATATAGGTGAACAGCTTGCTGATTCCTTGTTGCACGAGGAGGAACCTGATGAGAAAAGCGATGCTAAACACGTGCGTCGTGAGAAGGATTGCAGGGGGAATCGGCTACTTACACACAGGGAGAGGATCTCTCGAAGACATATAAAGTGCCGTGGTGACATAGATATGTTGAAAGTTCCTCTTCTTGCAATTTATGAATCTCTGATACCACCTAAAGAAGAAAAGGAAAAACAGATGCAGTTATTAACATCACTAGAGAAGTTGGTTGTTAATGAATGGCCTTGTGCTCGTCTATGTCTCTTTGGATCATGTGCGAACTCCTTCGGTGTTTCAAATAGTGATTTAGATGTGTGTCTTGTGCATAGAGATGCTGATATTGACAAGGCTGAGATCTTACTGAAGTTGGCAGATAGACTGCAATCAGCTAACTTCCAGAATGTGCAGGCCCTGACGCATGCAAGGGTTCCTATTATAAAGCTCAAGGATCCGGTGACTGGAATATCTTGTGACATATGCATAAACAATGTTTTGGCTGTTGTAAATACAAAACTTCTCCGGGATTATGCACAAATAGATGTGAGATTACCACAATTGGCATTTATTGTGAAGCATTGGGCTAAGTCTAGAGGAGTGAATGAAACATACCAGGGAACACTTTCTAGCTATGCGTATGTTTTGATGTGCATCCATTTCTTACAACATCGAGATCCTCCTATCCTGCCTTGTTTACAGGAAACGAAGATTGTTACTTATCATGAAATTGTTGATAATATTGAATGTGCATACTTTGATCAAGTTGAAAATCTGAAAAGTTTTGGATCCAATAACAATGAAAGCGTTGCTCGACTCGTCTGGGGATTCTTCCATTATTGGGCATATTGTCATGATTATGCCAACACCGTCATATCCATTCGTACTAAAAGCACTATCAGCAAGAGAGCAAAAGATTGGACGAGGAGAATCGGCAAGGATCGTCATTTGATATGCATCGAGGATCCATTTGAGACATCTCACGACCTCGGTCGAGTTGTCGACAAGTACAGTATCAAAGTTTTGAGGGAAGAATTTGAACGTGCTGCTACTATTCTACAAACCTATCCAAACCCGTGCGAGAAGCTCTTCGAACCCTTCATCCCGAGTTAG

Protein sequence

MAGDAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNINPQPQPHLHPPPALDPAVAAVGPSLTSLPPPWPSTGSDLLYPIPLSPWSHSHQSLSAPIASNFVGFQHLQQNPFPLPRNQFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTVHNLSQHNELENKLQFGSFSPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHGNSNFISYGNSISNANVHSFRRGEFDYSEQERGRVLGENYSFHPLVKVLEPSGFMSKPKGGGHLDSVNIRRRDFDHVVNRERASSSQLGEAFHRLELGAQLHDPVRPSRSDLHSASALDMEERGLNLHPDFAEGRPRDSHEGRGWMRKDVDSTNGNDSQELENNIGEQLADSLLHEEEPDEKSDAKHVRREKDCRGNRLLTHRERISRRHIKCRGDIDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVSNSDLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYWAYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPFIPS
BLAST of Cp4.1LG17g02530 vs. Swiss-Prot
Match: URT1_ARATH (UTP:RNA uridylyltransferase 1 OS=Arabidopsis thaliana GN=URT1 PE=1 SV=2)

HSP 1 Score: 565.1 bits (1455), Expect = 1.2e-159
Identity = 375/808 (46.41%), Postives = 457/808 (56.56%), Query Frame = 1

Query: 5   AGGDGHSPFPPPS-NGGEFLLSLLQRPPNRQSHLNINPQPQPHLHPPPALDPAVAAVGPS 64
           A G    P PP S N GEFLLS+L   P+  S         P  H   ALDPA+AA+GP+
Sbjct: 2   ADGGAEPPAPPSSINAGEFLLSILHGSPSPSSQ-------GPQHHQSFALDPAIAAIGPT 61

Query: 65  LTSLPPP--W------PSTGSDLLYPIPLSPWSHSHQSLSAPIASNFVGFQHLQQNPFPL 124
           + +  PP  W      PS  +   +P+  SP  H+       ++ NF+GF   Q  P P 
Sbjct: 62  VNNPFPPSNWQSNGHRPSNHNPPSWPLAFSP-PHN-------LSPNFLGFP--QFPPSPF 121

Query: 125 PRNQFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTVHNLSQHNEL----- 184
             NQF G Q  +               +D  RLG  G  +    ++    Q  +L     
Sbjct: 122 TTNQFDGNQRVS--------------PEDAYRLGFPGTTNPAIQSMVQQQQQQQLPPPQS 181

Query: 185 -ENKLQFGSFS--PTQFSRVLVSGN-NSSANDLNREVGFRETIPNGMNRNQGLDSHGNSN 244
              KL FGSFS   TQ    L +GN    +N   + +   ++  +  N +  L  H    
Sbjct: 182 ETRKLVFGSFSGDATQSLNGLHNGNLKYDSNQHEQLMRHPQSTLSNSNMDPNLSHH---- 241

Query: 245 FISYGNSISNANVHSFRRGEFDYSEQERGRVLGENYSFHPLVKVLEPSGFMSKPKG---- 304
                    N ++H  R G   +S +     +G N           P GF S  +G    
Sbjct: 242 --------RNHDLHEQRGG---HSGRGNWGHIGNNGRGLKSTPPPPPPGFSSNQRGWDMS 301

Query: 305 -GGHLDSVNIRRR----------------DFDHVVNRERASSSQLGEAFHRLELGAQLHD 364
            G   D   + R                 DF    NR R  S Q    F+   L  Q+  
Sbjct: 302 LGSKDDDRGMGRNHDQAMGEHSKVWNQSVDFSAEANRLRGLSIQNESKFN---LSQQIDH 361

Query: 365 PVRPSRSDLHSASALDMEERGLNLHPDFAEGRPRDSHEGRGWMRKDVDSTNGNDSQELEN 424
           P  P  + LHS SA D  +    L+ +   G  R   E  G + K     N N S E+E 
Sbjct: 362 PGPPKGASLHSVSAADAADSFSMLNKEARRGGER--REELGQLSKAKREGNAN-SDEIE- 421

Query: 425 NIGEQLADSLLHEEEP------DEKSDAKHVRREK---DCRGNRLLTHRERISRRHIKCR 484
           + GE +  SLL E+E       D K D+K  R ++   D RG RLL  + R+ + ++ CR
Sbjct: 422 DFGEDIVKSLLLEDETGEKDANDGKKDSKTSREKESRVDNRGQRLLGQKARMVKMYMACR 481

Query: 485 GDIDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVS 544
            DI       +AIY+SLIP +EE EKQ QL+  LE LV  EWP A+L L+GSCANSFG  
Sbjct: 482 NDIHRYDATFIAIYKSLIPAEEELEKQRQLMAHLENLVAKEWPHAKLYLYGSCANSFGFP 541

Query: 545 NSDLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDI 604
            SD+DVCL     DI+K+E+LLKLA+ L+S N QNVQALT ARVPI+KL DPVTGISCDI
Sbjct: 542 KSDIDVCLAIEGDDINKSEMLLKLAEILESDNLQNVQALTRARVPIVKLMDPVTGISCDI 601

Query: 605 CINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHF 664
           CINNVLAVVNTKLLRDYAQIDVRL QLAFIVKHWAKSR VNETYQGTLSSYAYVLMCIHF
Sbjct: 602 CINNVLAVVNTKLLRDYAQIDVRLRQLAFIVKHWAKSRRVNETYQGTLSSYAYVLMCIHF 661

Query: 665 LQHRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYW 724
           LQ R PPILPCLQE +  TY   VDNI C YFD V+ L++FGSNN E++A LVWGFF+YW
Sbjct: 662 LQQRRPPILPCLQEME-PTYSVRVDNIRCTYFDNVDRLRNFGSNNRETIAELVWGFFNYW 721

Query: 725 AYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKV 765
           AY HDYA  V+S+RT S + KR KDWTRR+G DRHLICIEDPFETSHDLGRVVDK+SI+V
Sbjct: 722 AYAHDYAYNVVSVRTGSILGKREKDWTRRVGNDRHLICIEDPFETSHDLGRVVDKFSIRV 755

BLAST of Cp4.1LG17g02530 vs. Swiss-Prot
Match: CID11_SCHPO (Poly(A) RNA polymerase cid11 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=cid11 PE=3 SV=1)

HSP 1 Score: 184.9 bits (468), Expect = 3.3e-45
Identity = 105/314 (33.44%), Postives = 174/314 (55.41%), Query Frame = 1

Query: 449 IYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLF--GSCANSFGVSNSDLDVCLVH 508
           +Y  L P  EE  ++ Q +  L  ++  E   A+L LF  GS  N+  +  SD+DVC++ 
Sbjct: 54  LYMRLKPSNEEVSRRQQFVDKLRTILSTEIKDAKLDLFVFGSTENNLAIQQSDVDVCIIT 113

Query: 509 RDADIDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDICINNVLAVVN 568
             +    +    +LA  L S   + +  ++ ARVPI+K+ DP   I CD+ INN +A +N
Sbjct: 114 NGSKYLNSTC--QLAQLLYSYGMKQIVCVSRARVPIVKIWDPQFDIHCDLNINNDVAKIN 173

Query: 569 TKLLRDYAQIDVRLPQLAFIVKHWAKSRGV-NETYQGTLSSYAYVLMCIHFLQHRDPPIL 628
           TK+LR +  ID R+  L  I+K+WAK R + +    GT++SY    M ++FLQ R+PPIL
Sbjct: 174 TKMLRLFVSIDPRVRPLGLIIKYWAKQRALCDAAGSGTITSYTISCMLVNFLQTRNPPIL 233

Query: 629 PCLQETKIVTYHEIVDNIECAYF-DQVENLKSFGSNNNESVARLVWGFFHYWAYCHDYAN 688
           P +         +++ N +   F D +   K   + N  S+ RL+  FF+Y+ +  +Y +
Sbjct: 234 PAML--------DLMSNDDNKMFVDDIVGFKEKATLNKTSLGRLLIDFFYYYGFSFNYLD 293

Query: 689 TVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLREEFERA 748
           +V+S+R+ + ++K+ K W   +      +C+E+PF T+ +L    D  S+K L+ EF+RA
Sbjct: 294 SVVSVRSGTVLNKQEKGWAMEVNNS---LCVEEPFNTARNLANTADNPSVKGLQSEFQRA 353

Query: 749 ATILQTYPNPCEKL 759
              L +  N CE+L
Sbjct: 354 FR-LMSENNACERL 353

BLAST of Cp4.1LG17g02530 vs. Swiss-Prot
Match: CID1_SCHPO (Terminal uridylyltransferase cid1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=cid1 PE=1 SV=2)

HSP 1 Score: 183.7 bits (465), Expect = 7.4e-45
Identity = 119/328 (36.28%), Postives = 178/328 (54.27%), Query Frame = 1

Query: 451 ESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVSNSDLDVCLVHRDAD 510
           E  I  KE KEK+  L T L   +    P A L  FGS  +   + NSD+D+C++  D+ 
Sbjct: 54  EIKISDKEFKEKRAALDT-LRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVL-MDSR 113

Query: 511 IDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPV-----TGISCDICINNVLAVV 570
           +    I L+  + L +  F+  + L  AR+PIIKL             CDI  NN LA+ 
Sbjct: 114 VQSDTIALQFYEELIAEGFEG-KFLQRARIPIIKLTSDTKNGFGASFQCDIGFNNRLAIH 173

Query: 571 NTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQH-RDPPI 630
           NT LL  Y ++D RL  +  +VKHWAK + +N  Y GTLSSY YVLM +++L H   PP+
Sbjct: 174 NTLLLSSYTKLDARLKPMVLLVKHWAKRKQINSPYFGTLSSYGYVLMVLYYLIHVIKPPV 233

Query: 631 LPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYWAYCHDYAN 690
            P L  + +    +IVD  +  + D++E++    S N  S+  L+ GFF ++AY  +   
Sbjct: 234 FPNLLLSPL-KQEKIVDGFDVGFDDKLEDIPP--SQNYSSLGSLLHGFFRFYAYKFEPRE 293

Query: 691 TVISI-RTKSTISKRAKDWTR---------RIGKDRHLICIEDPFETSHDLGRVVDKYSI 750
            V++  R    ++K+ K WT          +I KDR+++ IEDPFE SH++GR V    +
Sbjct: 294 KVVTFRRPDGYLTKQEKGWTSATEHTGSADQIIKDRYILAIEDPFEISHNVGRTVSSSGL 353

Query: 751 KVLREEFERAATIL--QTYPNPCEKLFE 761
             +R EF  A+ +L  ++YP P + LFE
Sbjct: 354 YRIRGEFMAASRLLNSRSYPIPYDSLFE 375

BLAST of Cp4.1LG17g02530 vs. Swiss-Prot
Match: CID13_SCHPO (Poly(A) RNA polymerase cid13 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=cid13 PE=1 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 2.2e-44
Identity = 102/315 (32.38%), Postives = 173/315 (54.92%), Query Frame = 1

Query: 438 DIDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARL--CLFGSCANSFGV 497
           D D++   L  +Y+S+I      E++   +  LE+++  E+P   +   LFGS  +    
Sbjct: 47  DTDLISSQLYELYDSIILNDSGLERRYAFVQKLEQILKKEFPYKNIKTSLFGSTQSLLAS 106

Query: 498 SNSDLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCD 557
           + SD+D+C++        A    +++        + V  ++ A+VPI+K+ D    +SCD
Sbjct: 107 NASDIDLCIITDPPQC--APTTCEVSAAFARNGLKKVVCISTAKVPIVKVWDSELQLSCD 166

Query: 558 ICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQ-GTLSSYAYVLMCI 617
             IN  ++ +NT+L+R Y   D R+  L  ++K+WAK R +N+  + GTL+SY    M I
Sbjct: 167 CNINKTISTLNTRLMRSYVLCDPRVRPLIVMIKYWAKRRCLNDAAEGGTLTSYTISCMVI 226

Query: 618 HFLQHRDPPILPCLQE-TKIVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFF 677
           +FLQ RDPPILP LQ    +     + D ++ ++FD  + +  FG  N ES+  L   FF
Sbjct: 227 NFLQKRDPPILPSLQMLPHLQDSSTMTDGLDVSFFDDPDLVHGFGDKNEESLGILFVEFF 286

Query: 678 HYWAYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYS 737
            ++ Y  DY + V+SIR  + +SKRAK W  ++    + +C+E+PF TS +L    D+ +
Sbjct: 287 RFFGYLFDYEHFVLSIRHGTFLSKRAKGWQFQL---NNFLCVEEPFHTSRNLANTADEIT 346

Query: 738 IKVLREEFERAATIL 749
           +K ++ EF R   +L
Sbjct: 347 MKGIQLEFRRVFRLL 356

BLAST of Cp4.1LG17g02530 vs. Swiss-Prot
Match: TUT4_HUMAN (Terminal uridylyltransferase 4 OS=Homo sapiens GN=ZCCHC11 PE=1 SV=3)

HSP 1 Score: 181.4 bits (459), Expect = 3.7e-44
Identity = 116/318 (36.48%), Postives = 174/318 (54.72%), Query Frame = 1

Query: 450  YESLIPPKEEKEKQMQLLTSLEKLVVNEWP-CARLCLFGSCANSFGVSNSDLDVCLV--- 509
            ++ L PP  E+  + Q+L  LEK +  E+   ARLCLFGS  N FG  +SDLD+C+    
Sbjct: 959  FDELSPPCSEQHNREQILIGLEKFIQKEYDEKARLCLFGSSKNGFGFRDSDLDICMTLEG 1018

Query: 510  HRDAD-IDKAEILLKLADRLQS-ANFQNVQALTHARVPIIKLKDPVTGISCDICINNVLA 569
            H +A+ ++  EI+  LA  L+     +N+  +T A+VPI+K +   +G+  DI + N LA
Sbjct: 1019 HENAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRSGLEGDISLYNTLA 1078

Query: 570  VVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQHRDPP 629
              NT++L  YA ID R+  L + +K +AK   + +  +G+LSSYAY+LM ++FLQ R PP
Sbjct: 1079 QHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYILMVLYFLQQRKPP 1138

Query: 630  ILPCLQET---KIVTYHEIVDNIECAYFDQVENLK----SFGSNNNESVARLVWGFFHYW 689
            ++P LQE    K +    +VD     +FD+ E LK    S G  N ES+  L  G   ++
Sbjct: 1139 VIPVLQEIFDGKQIP-QRMVDGWNAFFFDKTEELKKRLPSLG-KNTESLGELWLGLLRFY 1198

Query: 690  AYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKV 749
                D+   VISIR K  ++   K WT +       I IEDPF+ +H+LG  V +     
Sbjct: 1199 TEEFDFKEYVISIRQKKLLTTFEKQWTSK------CIAIEDPFDLNHNLGAGVSRKMTNF 1258

Query: 750  LREEFERAATILQT--YP 753
            + + F     +  T  YP
Sbjct: 1259 IMKAFINGRKLFGTPFYP 1268

BLAST of Cp4.1LG17g02530 vs. TrEMBL
Match: A0A0A0KBJ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G076730 PE=4 SV=1)

HSP 1 Score: 1274.6 bits (3297), Expect = 0.0e+00
Identity = 639/766 (83.42%), Postives = 685/766 (89.43%), Query Frame = 1

Query: 1   MAGDAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNINPQPQPHLHPPPALDPAVAAV 60
           MAGDAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLN+N  P  HLHP  ++DPAVAAV
Sbjct: 1   MAGDAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNLNSLP--HLHPSSSIDPAVAAV 60

Query: 61  GPSLTSLPPPWPSTGSDLLYPIPLSPWSHSHQSLSAPIASNFVGFQHLQQNPFPLPRNQF 120
           GPSLTSLP PWPS+GSDLLYPIPLSPWSHSHQSLS PIA N+VGFQHLQQNPFPLPR+QF
Sbjct: 61  GPSLTSLPTPWPSSGSDLLYPIPLSPWSHSHQSLSTPIAPNYVGFQHLQQNPFPLPRSQF 120

Query: 121 GGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTV-HNLSQHNELENKLQFGSF 180
           GGAQFAAS +SGD IQGG GG DD KRLG  GNHDR NGTV HN SQHN+LENKLQFGSF
Sbjct: 121 GGAQFAASQTSGDQIQGGFGGVDDFKRLGFPGNHDRANGTVTHNFSQHNQLENKLQFGSF 180

Query: 181 SPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHGNSNFISYGNSISNANV 240
           SP+ F R+L++GN+S+A DLNREVGFRE+IPNG+NRNQGLDSHGNSNF SYGNS  NANV
Sbjct: 181 SPSLFPRILINGNSSTAKDLNREVGFRESIPNGLNRNQGLDSHGNSNFTSYGNSNPNANV 240

Query: 241 HSFRRGEFDYSEQERGRVLGENYSFHPLVKVLEPSGFMSKPKGGGHLDSVNIRRRDFDHV 300
           HSF RGE DYS+QERGRVLGENY+FHP VK  E SGFMS P GGGHLD  NIR+RDF+H 
Sbjct: 241 HSFGRGECDYSDQERGRVLGENYNFHPQVKASEVSGFMSNPTGGGHLDFGNIRKRDFEHG 300

Query: 301 VNRERASSSQLGEAFHRLELGAQLHDPVRPSRSDLHSASALDMEERGLNLHPDFAEGRPR 360
            NRER  SSQ GE   RLELGAQL DPVRPSRSDL SA AL++EER LNL  +  EGR R
Sbjct: 301 GNRERPRSSQFGEGSRRLELGAQLRDPVRPSRSDLQSALALNIEERVLNLDSEIDEGRHR 360

Query: 361 DSHEGRGWMRKDVDSTNGNDSQELENNIGEQLADSLLHEEEPDEKSDAKHVRREKDCRGN 420
           DS++             G+DSQEL+ NIGEQLADSLL E+EPDEKSD+K +RREKDCRGN
Sbjct: 361 DSYQ-------------GHDSQELD-NIGEQLADSLLLEDEPDEKSDSKFIRREKDCRGN 420

Query: 421 RLLTHRERISRRHIKCRGDIDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWP 480
           RLLTHRERI+R+HI CRGDIDML +PLL IYESLIPP+EEKEKQ QLL SLEKLVVNEWP
Sbjct: 421 RLLTHRERIARKHIHCRGDIDMLTIPLLRIYESLIPPEEEKEKQRQLLISLEKLVVNEWP 480

Query: 481 CARLCLFGSCANSFGVSNSDLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHAR 540
            A L LFGSCANSFGVSNSD+DVCLV RDADIDK+EILLKLA+ LQSANFQNVQALT AR
Sbjct: 481 HAHLFLFGSCANSFGVSNSDVDVCLVLRDADIDKSEILLKLAEILQSANFQNVQALTRAR 540

Query: 541 VPIIKLKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNET 600
           VPIIKLKDPVTG+SCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNET
Sbjct: 541 VPIIKLKDPVTGVSCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNET 600

Query: 601 YQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGS 660
           YQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYH+IVDNIECAYFDQVE LK+FGS
Sbjct: 601 YQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHKIVDNIECAYFDQVEKLKTFGS 660

Query: 661 NNNESVARLVWGFFHYWAYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPF 720
           +N ESVARLVWGFFHYWAYCHDYANTV+S+RTK+T+SKRAKDWTRRIGKDRHLICIEDPF
Sbjct: 661 DNKESVARLVWGFFHYWAYCHDYANTVVSVRTKNTVSKRAKDWTRRIGKDRHLICIEDPF 720

Query: 721 ETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPFIPS 766
           ETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPF+PS
Sbjct: 721 ETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPFVPS 750

BLAST of Cp4.1LG17g02530 vs. TrEMBL
Match: E6NU32_JATCU (JHL05D22.13 protein OS=Jatropha curcas GN=JHL05D22.13 PE=4 SV=1)

HSP 1 Score: 680.6 bits (1755), Expect = 2.2e-192
Identity = 421/807 (52.17%), Postives = 494/807 (61.21%), Query Frame = 1

Query: 4   DAGGDGHSPFPPPSNGGEFLLSLLQRP----------PNRQSHLNINPQPQPHLHPPP-- 63
           + GG    P  P  NGGEFLLSLLQRP          P+ Q  + I   PQ +       
Sbjct: 2   NGGGADAPPMQPAVNGGEFLLSLLQRPNHQLQTPAPPPHSQLPIPIPITPQQYQQQQQQQ 61

Query: 64  -----ALDPAVAAVGPSLTSLPPPWPSTGSDLLYPIPLSPWSHSHQSLSAPIASNFVGFQ 123
                ALDPAVAAVGPSL    P W S G D+L P    PW H+  +  AP+   F+GF 
Sbjct: 62  QQQSLALDPAVAAVGPSLPFSQPVWQSNGRDVLTP----PWPHNLSA--APLLPGFLGFP 121

Query: 124 HLQQNPFPLPRNQFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTVHN-LS 183
              QN +P P N     QF  +       QG LG  DDL+ LG  G   R N T+HN + 
Sbjct: 122 ---QNHWPSPANHLAAGQFQGNQ------QGVLG--DDLQILGFSGADVRANNTIHNRVQ 181

Query: 184 QHNELENKLQFGSF-SPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHGN 243
           Q  +LE KLQFGSF S  Q    L++ N+        EV       NG+  +Q  DS   
Sbjct: 182 QKQQLEQKLQFGSFRSDIQNVEALLNVNSKLNAAKELEVRLATRNLNGLESDQKFDSQLR 241

Query: 244 SNFISYGNSISNANVHSFRRGEFDYSEQERG----RVLGENYSFHPLVKVLEPSGFMSKP 303
           +                     FD  EQ+R     R      ++ P    + P GF +KP
Sbjct: 242 T---------------------FDLREQDRSGGGWRKQPHGGNYRPQETRMPPPGFSNKP 301

Query: 304 KGGGHLDSVNIRRRDFDHVVNRERASSSQL----------------GEAFHRLELGAQLH 363
           +GGG+ D V+ RRR+ D+ VN+E+ +  +L                G+    L L  QL 
Sbjct: 302 RGGGNWDYVS-RRRELDYNVNKEKGNQGELSNRNALFSSEDKIPRDGDRSRDLGLTGQLD 361

Query: 364 DPVRPSRSDLHSASALDMEERGLNLHPDFAEGRPRDSHEGRGWMRKDVDSTNGNDSQELE 423
            P  P+ S+L+S SA D+E   LN+  +  E                    +G D     
Sbjct: 362 RPGPPAGSNLYSVSAADVELSMLNVEAEVVE--------------------DGKDEGREL 421

Query: 424 NNIGEQLADSLLHEEEPDEKSDAKHVR--REK----DCRGNRLLTHRERISRRHIKCRGD 483
           +  GE+L DSLL E E D K+D K  R  REK    D RG R L+ R R+ +R ++CR D
Sbjct: 422 DEAGEELVDSLLLEGESDGKNDKKQNRHSREKESRSDNRGQRTLSQRMRMLKRQMECRRD 481

Query: 484 IDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVSNS 543
           ID L  P LAIYESL+PP+EEK KQ QLL+ LEKLV  EWP ARL L+GSCANSFGV  S
Sbjct: 482 IDRLNAPFLAIYESLVPPEEEKAKQKQLLSLLEKLVNKEWPQARLYLYGSCANSFGVLKS 541

Query: 544 DLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDICI 603
           D+DVCL  ++ADI+K+E+LLKLAD LQS N QNVQALT ARVPI+KL DPVTGISCDICI
Sbjct: 542 DIDVCLAIQNADINKSEVLLKLADILQSDNLQNVQALTRARVPIVKLMDPVTGISCDICI 601

Query: 604 NNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQ 663
           NNVLAVVNTKLL DYAQIDVRL QLAFIVKHWAKSRGVNETY GTLSSYAYVLMCIHFLQ
Sbjct: 602 NNVLAVVNTKLLWDYAQIDVRLRQLAFIVKHWAKSRGVNETYHGTLSSYAYVLMCIHFLQ 661

Query: 664 HRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYWAY 723
            R P ILPCLQE +  TY   VD+I+CAYFDQVE L+ FGS N E++A+LVW FF+YWAY
Sbjct: 662 QRRPAILPCLQEME-ATYSVAVDDIQCAYFDQVEKLRGFGSRNKETIAQLVWAFFNYWAY 721

Query: 724 CHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLR 766
            HDYAN VISIRT S ISKR KDWTRRIG DRHLICIEDPFE SHDLGRVVDKYSIKVLR
Sbjct: 722 RHDYANAVISIRTGSIISKREKDWTRRIGNDRHLICIEDPFEISHDLGRVVDKYSIKVLR 748

BLAST of Cp4.1LG17g02530 vs. TrEMBL
Match: A0A061E1N6_THECC (Nucleotidyltransferase family protein isoform 1 OS=Theobroma cacao GN=TCM_005467 PE=4 SV=1)

HSP 1 Score: 676.8 bits (1745), Expect = 3.1e-191
Identity = 415/786 (52.80%), Postives = 498/786 (63.36%), Query Frame = 1

Query: 6   GGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNIN-------------PQPQPHLHP--- 65
           G  G +P PP +NGGEFLLSLLQ+P   Q HL                PQPQ        
Sbjct: 3   GNGGEAPSPPAANGGEFLLSLLQKP---QQHLQQQQSPLFSRATPVTIPQPQQQQQQQQQ 62

Query: 66  -PPALDPAVAAVGPSLTSLPPPWPSTGSDL--LYPIPLSPWSHSHQSLSAPIASNFVGFQ 125
            P  +DPAVAAVGP+L    P WPS G DL  L+P          Q+LS P+A NF+GF 
Sbjct: 63  QPLVIDPAVAAVGPTLP-FRPLWPSNGRDLPGLWP----------QTLSPPLAPNFLGFP 122

Query: 126 HLQQNPFPLPRNQFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTVHNLSQ 185
               +P+  P NQF G Q A                DDL+RLGL G  +  N  + N  Q
Sbjct: 123 ---LSPWSSPGNQFAGNQGALM--------------DDLRRLGLSGIDNNKNHVIQNRVQ 182

Query: 186 HNELENKLQFGSFSPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMN-RNQGLDSHGNS 245
               + KL FGSF P+    +     + + N L           + +N  NQ LDS  NS
Sbjct: 183 QKHQDQKLVFGSF-PSDIQTLKTPEGSPNGNLLEN---------SKLNLSNQQLDSRLNS 242

Query: 246 NFISYGNSISNANVHSFR-RGEFDYSEQERGRVLGENYSFHPLVKVLE-PSGFMSKPKGG 305
           N         N + + F+ R   D  +Q++    G +Y   P  +    P GF+ KP+GG
Sbjct: 243 N--------PNTSPYVFQHRNSGDRGKQQQH---GGSYRPTPSPEARRSPPGFLGKPRGG 302

Query: 306 GHLDSVNIRRRDFDHVVNRERASSSQLGEAFHRLELGAQLHDPVRPSRSDLHSASALDME 365
           G       RRR F+H V++ +A  SQ   + + + L  QL  P  P+ S+L S SA D+E
Sbjct: 303 GGNRDFGNRRRHFEHNVDKAKAEYSQ-PSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIE 362

Query: 366 ERGLNLHPDFAEGRPRDSHEGRGWMRKDVDSTNGNDSQELENNIGEQLADSLLHEEEPDE 425
           E  L LH D   GR R S           D     D  E++  +GEQL +SLL E+E D+
Sbjct: 363 ESLLELHSD--GGRDRFSRR---------DKFRREDGGEVDE-VGEQLLESLLIEDESDD 422

Query: 426 KSDAKHVRREK----DCRGNRLLTHRERISRRHIKCRGDIDMLKVPLLAIYESLIPPKEE 485
           K+D K  RREK    D RG RLL+ R R+ +R ++CR DI  L  P LA+YESLIPP+EE
Sbjct: 423 KNDKKQHRREKESRIDNRGQRLLSQRMRMLKRQMECRSDIHRLNAPFLALYESLIPPEEE 482

Query: 486 KEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVSNSDLDVCLVHRDADIDKAEILLK 545
           + KQ QLL  LEKLV  EWP ARL L+GSCANSFGVS SD+DVCL   + D++K+EILLK
Sbjct: 483 RAKQKQLLALLEKLVCKEWPEARLYLYGSCANSFGVSKSDIDVCLAFNEMDVNKSEILLK 542

Query: 546 LADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVR 605
           LAD LQS N QNVQALT ARVPI+KL DP TGISCDICINNVLAVVNTKLLRDYA++D R
Sbjct: 543 LADILQSDNLQNVQALTRARVPIVKLMDPATGISCDICINNVLAVVNTKLLRDYAKLDAR 602

Query: 606 LPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHEI 665
           L QLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQ R P ILPCLQ  +  TY   
Sbjct: 603 LRQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQQRRPAILPCLQGME-TTYSVT 662

Query: 666 VDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYWAYCHDYANTVISIRTKSTISKRA 725
           VD++ECAYFDQVE L++FGS+N ESVA+LVW FF+YWAY HDYAN+VIS+RT S ISK+ 
Sbjct: 663 VDDVECAYFDQVERLRNFGSSNKESVAQLVWAFFNYWAYGHDYANSVISVRTGSIISKQE 722

Query: 726 KDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLF 766
           KDWTRRIG DRHLICIEDPFE SHDLGRVVDK+SI+V+REEFERAA ++Q  PNPC  LF
Sbjct: 723 KDWTRRIGNDRHLICIEDPFEISHDLGRVVDKFSIRVIREEFERAADVMQYDPNPCVTLF 722

BLAST of Cp4.1LG17g02530 vs. TrEMBL
Match: A0A151UCB0_CAJCA (Poly(A) RNA polymerase cid11 OS=Cajanus cajan GN=KK1_021216 PE=4 SV=1)

HSP 1 Score: 646.0 bits (1665), Expect = 5.9e-182
Identity = 404/793 (50.95%), Postives = 490/793 (61.79%), Query Frame = 1

Query: 7   GDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNINPQPQPHLHPPPALDPAVAAVGPSLTS 66
           G G    PPP+NGGEFLLSL+QRP     H N +P+PQ      PA+DPAVA +GP++  
Sbjct: 3   GGGGDLTPPPANGGEFLLSLIQRP-----HSNPHPRPQS-----PAIDPAVALIGPTIPV 62

Query: 67  LPPPWPSTGSDLLY----PIP-LSPWSHSHQSLSAPI-ASNFVGFQHLQQNPFPLPRNQF 126
              PWP  G+D  +    P+  L PWSH+   LS+P+   NF G  H   NPFP PR  F
Sbjct: 63  GARPWPIAGADQAHQHNHPLHHLPPWSHT---LSSPLYPPNFFGLPH---NPFPPPRTHF 122

Query: 127 GGAQFAASHSSGDLIQGGLGGADDLKRLGL----------RGNHDRPNGTVHNLSQHNEL 186
                     + + +  G     DL++LG              H   +  V+   Q  EL
Sbjct: 123 PAV-------TPNAVTNGASLTHDLRKLGFPIEENTAHAHTQTHTNNSNKVNAFVQQQEL 182

Query: 187 ENKLQFGSFSPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHGNSNFISY 246
             KLQFGS     +S   VS N  S  +L                N+G D +  S  + +
Sbjct: 183 --KLQFGSLPTVAYSAAEVSSNGDSLLNLKFN-------------NRGYDGYDGSLHVDH 242

Query: 247 GNSISNANVHSFRRGEFDYSEQERGRVLGENYSFHPLVKVLEPSGFMSKPKGGGHLDSVN 306
            NS S+ NV    +G  D  E+ER R LG   +         P GF +K +G G   S  
Sbjct: 243 PNSNSSGNV--VVQGNHDVVEKER-RGLGGYRTSGSFSSERVPPGFANKNRGKGLEGS-- 302

Query: 307 IRRRDFDHVVNRERASSSQLGEAFHRLELGAQLHDPVRPSRSDLHSASALDM------EE 366
             RRD      R         E  +    G ++   V   RS++      +M      + 
Sbjct: 303 --RRD------RVGEMGGGRNENLYGKREGVRM---VSGERSNVKGNVVREMGFVDQLDH 362

Query: 367 RGLNLHPD----FAEGRPRDS-HEGRGWMRKDVDSTNGNDSQELENNIGEQLADSLLHEE 426
            G+NLH        E   RDS H+G G +R +    +G    + +  +GEQLADSL+ E+
Sbjct: 363 AGINLHSVNETVIGEVGVRDSKHKGGGGLRVEGVPQSGGSGADADV-LGEQLADSLVVED 422

Query: 427 EPDEKSDAKHVR--REKDCR-----GNRLLTHRERISRRHIKCRGDIDMLKVPLLAIYES 486
           E D++S++K  R  REKD R     G  LL+ R R+ +R + CR DID+  VP LAIYES
Sbjct: 423 ESDDRSNSKQRRGPREKDARSLDSRGQHLLSQRARMYKRQMMCRRDIDVHNVPFLAIYES 482

Query: 487 LIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVSNSDLDVCLVHRDADID 546
           LIPP+EEK KQ QL+  LEKLV  EWP A+L L+GSCANSFGVS SD+DVCL   +AD+D
Sbjct: 483 LIPPEEEKLKQKQLVALLEKLVSKEWPTAKLFLYGSCANSFGVSKSDIDVCLAIEEADMD 542

Query: 547 KAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDICINNVLAVVNTKLLRD 606
           K++I++KLAD LQS N QNVQALT ARVPI+KL DPVTGISCDICINN+LAVVNTKLLRD
Sbjct: 543 KSKIIMKLADILQSDNLQNVQALTRARVPIVKLMDPVTGISCDICINNLLAVVNTKLLRD 602

Query: 607 YAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETK 666
           YA+ID RL QLAFI+KHWAKSRGVNETY GTLSSYAYVLMCIH+LQ R P ILPCLQE +
Sbjct: 603 YARIDARLRQLAFIIKHWAKSRGVNETYHGTLSSYAYVLMCIHYLQMRRPAILPCLQEME 662

Query: 667 IVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYWAYCHDYANTVISIRTK 726
             TY   VD+I CAYFDQVE L  FG +NNE++A+LVWGFF+YWAYCHDYAN VIS+R  
Sbjct: 663 -TTYSVTVDDINCAYFDQVEKLGDFGHHNNETIAQLVWGFFYYWAYCHDYANAVISVRAG 722

Query: 727 STISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLREEFERAATILQTYP 766
           S ISKR KDWTRRIG DRHLICIEDPFETSHDLGRVVDK+SIKVLREEFERAA I+Q  P
Sbjct: 723 SIISKREKDWTRRIGNDRHLICIEDPFETSHDLGRVVDKHSIKVLREEFERAAEIMQFDP 739

BLAST of Cp4.1LG17g02530 vs. TrEMBL
Match: W9RV71_9ROSA (Poly(A) RNA polymerase cid11 OS=Morus notabilis GN=L484_020763 PE=4 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 9.4e-180
Identity = 401/791 (50.70%), Postives = 489/791 (61.82%), Query Frame = 1

Query: 4   DAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNINPQPQPHLHPPP------------ 63
           + GG+  SP  P +NGGEFLLSLLQ+P   +S  +  PQP P   PPP            
Sbjct: 2   NGGGNAPSPPTPAANGGEFLLSLLQKPQAAKS-ASPPPQPPPPQPPPPQSQQRQQPQQSL 61

Query: 64  ALDPAVAAVGPSLTSLPPP--WPSTGSDLLYPIPLSPWSHSHQSLSAPIASN-FVGFQHL 123
           A+DPAVAA GPS+   PPP  WPS G DLL+P+    W     +   P A N F+GF H 
Sbjct: 62  AVDPAVAAGGPSVP-FPPPHLWPSNGQDLLHPLH---WPVHSLANPPPFAPNGFLGFPH- 121

Query: 124 QQNPFPLPRNQFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGN-HDRPN---GTVHNL 183
             + FP   NQF G Q          + G +G  +DL+RLG  G  +  PN     +H +
Sbjct: 122 --SFFP---NQFQGKQ----------VSGNVG--EDLRRLGFSGGVNSNPNLNLNPIHGI 181

Query: 184 -SQHNELENKLQFGSFSPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHG 243
             Q N+LE+KL+FGS  P++   +  +     A++ N          N ++R++ L S+ 
Sbjct: 182 VQQKNQLEHKLKFGSL-PSEIVIIPEALPKVDASNFN----------NLVDRSRRLSSNS 241

Query: 244 NSNFISYGNSISNANVHSFRRGEFDYSEQERGRVLGENYSFHPLVKVLEPSGFMSKPKGG 303
           +SN +  GN                Y  Q                    P GF SKPK  
Sbjct: 242 SSNAVRQGN----------------YEHQRTN----------------PPPGFRSKPKRT 301

Query: 304 GHLDSVNIRRRDFDHVVNRERASSSQLG---EAFHRLELGAQLHDPVRPSRSDLHSASAL 363
           G   S+         ++      +  +G   +    LEL AQL  P  PS S+L S  A 
Sbjct: 302 GLNHSIGGENSVSGDLMRTRDVLAEDIGIRGDGSRGLELSAQLDRPGPPSGSNLRSVLAS 361

Query: 364 DMEERGLNLHPDFAEGRPRDSHEGRGWMRKDVDSTNGNDSQELENNIGEQLADSLLHEEE 423
           D+EE  + L  D  E        G G    ++D            +IG++L DSLL E+E
Sbjct: 362 DVEESMMKLESDAVE-------VGGG---HEID------------DIGQRLVDSLLIEDE 421

Query: 424 PDEKSDAKHVRREKD------CRGNRLLTHRERISRRHIKCRGDIDMLKVPLLAIYESLI 483
            D+K++ K  +  +D       RG RLL+ R R+ +R ++CR DID L    +AI +SLI
Sbjct: 422 SDDKNETKKHKNSRDKDSRSDSRGQRLLSQRMRVYKRQMRCRSDIDRLDDAFIAIVKSLI 481

Query: 484 PPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVSNSDLDVCLVHRDADIDKA 543
           P +EEK KQ QLLT LEKL++ EWP ARL L+GSCANSFGVS SD+D+CLV  +AD++KA
Sbjct: 482 PAEEEKAKQQQLLTLLEKLIIKEWPKARLYLYGSCANSFGVSKSDVDLCLVMEEADVNKA 541

Query: 544 EILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDICINNVLAVVNTKLLRDYA 603
           E+LLKLAD LQS N QNVQALT ARVPI+KL DP TGISCDICINNVLAVVNT+LLRDYA
Sbjct: 542 EVLLKLADILQSDNLQNVQALTRARVPIVKLMDPSTGISCDICINNVLAVVNTRLLRDYA 601

Query: 604 QIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIV 663
           +IDVRL QLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQ R P ILPCLQ  +  
Sbjct: 602 RIDVRLRQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQQRRPAILPCLQGME-A 661

Query: 664 TYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYWAYCHDYANTVISIRTKST 723
           TY   VDNI CAYFDQVE L  F S+N E+VA+LVWGFF+YWAYCHDY ++VIS+RT S 
Sbjct: 662 TYSVTVDNIGCAYFDQVEKLSDFRSHNKETVAQLVWGFFNYWAYCHDYTDSVISVRTGSI 703

Query: 724 ISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLREEFERAATILQTYPNP 766
           ISKR KDWTRRIG DRHLICIEDPFE SHDLGRVVDK+SI+VLREEFERAA I+Q  PNP
Sbjct: 722 ISKREKDWTRRIGNDRHLICIEDPFEISHDLGRVVDKHSIRVLREEFERAAEIMQYDPNP 703

BLAST of Cp4.1LG17g02530 vs. TAIR10
Match: AT2G45620.1 (AT2G45620.1 Nucleotidyltransferase family protein)

HSP 1 Score: 565.1 bits (1455), Expect = 6.7e-161
Identity = 375/808 (46.41%), Postives = 457/808 (56.56%), Query Frame = 1

Query: 5   AGGDGHSPFPPPS-NGGEFLLSLLQRPPNRQSHLNINPQPQPHLHPPPALDPAVAAVGPS 64
           A G    P PP S N GEFLLS+L   P+  S         P  H   ALDPA+AA+GP+
Sbjct: 2   ADGGAEPPAPPSSINAGEFLLSILHGSPSPSSQ-------GPQHHQSFALDPAIAAIGPT 61

Query: 65  LTSLPPP--W------PSTGSDLLYPIPLSPWSHSHQSLSAPIASNFVGFQHLQQNPFPL 124
           + +  PP  W      PS  +   +P+  SP  H+       ++ NF+GF   Q  P P 
Sbjct: 62  VNNPFPPSNWQSNGHRPSNHNPPSWPLAFSP-PHN-------LSPNFLGFP--QFPPSPF 121

Query: 125 PRNQFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTVHNLSQHNEL----- 184
             NQF G Q  +               +D  RLG  G  +    ++    Q  +L     
Sbjct: 122 TTNQFDGNQRVS--------------PEDAYRLGFPGTTNPAIQSMVQQQQQQQLPPPQS 181

Query: 185 -ENKLQFGSFS--PTQFSRVLVSGN-NSSANDLNREVGFRETIPNGMNRNQGLDSHGNSN 244
              KL FGSFS   TQ    L +GN    +N   + +   ++  +  N +  L  H    
Sbjct: 182 ETRKLVFGSFSGDATQSLNGLHNGNLKYDSNQHEQLMRHPQSTLSNSNMDPNLSHH---- 241

Query: 245 FISYGNSISNANVHSFRRGEFDYSEQERGRVLGENYSFHPLVKVLEPSGFMSKPKG---- 304
                    N ++H  R G   +S +     +G N           P GF S  +G    
Sbjct: 242 --------RNHDLHEQRGG---HSGRGNWGHIGNNGRGLKSTPPPPPPGFSSNQRGWDMS 301

Query: 305 -GGHLDSVNIRRR----------------DFDHVVNRERASSSQLGEAFHRLELGAQLHD 364
            G   D   + R                 DF    NR R  S Q    F+   L  Q+  
Sbjct: 302 LGSKDDDRGMGRNHDQAMGEHSKVWNQSVDFSAEANRLRGLSIQNESKFN---LSQQIDH 361

Query: 365 PVRPSRSDLHSASALDMEERGLNLHPDFAEGRPRDSHEGRGWMRKDVDSTNGNDSQELEN 424
           P  P  + LHS SA D  +    L+ +   G  R   E  G + K     N N S E+E 
Sbjct: 362 PGPPKGASLHSVSAADAADSFSMLNKEARRGGER--REELGQLSKAKREGNAN-SDEIE- 421

Query: 425 NIGEQLADSLLHEEEP------DEKSDAKHVRREK---DCRGNRLLTHRERISRRHIKCR 484
           + GE +  SLL E+E       D K D+K  R ++   D RG RLL  + R+ + ++ CR
Sbjct: 422 DFGEDIVKSLLLEDETGEKDANDGKKDSKTSREKESRVDNRGQRLLGQKARMVKMYMACR 481

Query: 485 GDIDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVS 544
            DI       +AIY+SLIP +EE EKQ QL+  LE LV  EWP A+L L+GSCANSFG  
Sbjct: 482 NDIHRYDATFIAIYKSLIPAEEELEKQRQLMAHLENLVAKEWPHAKLYLYGSCANSFGFP 541

Query: 545 NSDLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDI 604
            SD+DVCL     DI+K+E+LLKLA+ L+S N QNVQALT ARVPI+KL DPVTGISCDI
Sbjct: 542 KSDIDVCLAIEGDDINKSEMLLKLAEILESDNLQNVQALTRARVPIVKLMDPVTGISCDI 601

Query: 605 CINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHF 664
           CINNVLAVVNTKLLRDYAQIDVRL QLAFIVKHWAKSR VNETYQGTLSSYAYVLMCIHF
Sbjct: 602 CINNVLAVVNTKLLRDYAQIDVRLRQLAFIVKHWAKSRRVNETYQGTLSSYAYVLMCIHF 661

Query: 665 LQHRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYW 724
           LQ R PPILPCLQE +  TY   VDNI C YFD V+ L++FGSNN E++A LVWGFF+YW
Sbjct: 662 LQQRRPPILPCLQEME-PTYSVRVDNIRCTYFDNVDRLRNFGSNNRETIAELVWGFFNYW 721

Query: 725 AYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKV 765
           AY HDYA  V+S+RT S + KR KDWTRR+G DRHLICIEDPFETSHDLGRVVDK+SI+V
Sbjct: 722 AYAHDYAYNVVSVRTGSILGKREKDWTRRVGNDRHLICIEDPFETSHDLGRVVDKFSIRV 755

BLAST of Cp4.1LG17g02530 vs. TAIR10
Match: AT3G45750.1 (AT3G45750.1 Nucleotidyltransferase family protein)

HSP 1 Score: 102.4 bits (254), Expect = 1.2e-21
Identity = 75/249 (30.12%), Postives = 126/249 (50.60%), Query Frame = 1

Query: 439 IDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVN-----EWPCARLCLFGSCANSF 498
           +D+ KV L  +Y S  P   +   + +L+ +L  + ++     E     L  +GS     
Sbjct: 41  LDLDKV-LNDVYCSFRPVSADYNTRKELVKNLNTMALDIYGKSEESSPVLEAYGSFVMDM 100

Query: 499 GVSNSDLDVCLVHRDADID-----KAEILLKLADRLQS----ANFQNVQALTHARVPIIK 558
             S SDLDV +   +   +     K EIL + A +L+S       +NV+++  A+VPI+K
Sbjct: 101 YSSQSDLDVSINFGNGTSEIPREKKLEILKRFAKKLRSLQGEGQVKNVESIFSAKVPIVK 160

Query: 559 LKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTL 618
             D  TG+ CD+ + N   ++N++++R  +QID R  +L  +VKHWAK+  VN     TL
Sbjct: 161 FSDQGTGVECDLSVENKDGILNSQIVRIISQIDGRFQKLCLLVKHWAKAHEVNSALHRTL 220

Query: 619 SSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGSNNNES 674
           +S +  L+    LQ ++PPILP    + ++       N+E     + +   ++G  N ES
Sbjct: 221 NSVSITLLVALHLQTQNPPILPPF--SMLLKDGMDPPNVE----KRAQKFLNWGQRNQES 280

BLAST of Cp4.1LG17g02530 vs. TAIR10
Match: AT3G45760.1 (AT3G45760.1 Nucleotidyltransferase family protein)

HSP 1 Score: 102.1 bits (253), Expect = 1.6e-21
Identity = 70/239 (29.29%), Postives = 115/239 (48.12%), Query Frame = 1

Query: 450 YESLIPPKEEKEKQMQLLTSLEKLVVN-----EWPCARLCLFGSCANSFGVSNSDLDVCL 509
           Y S  P   +   + +L+ +L  + ++     E     L  +GS A +   S  DLDV +
Sbjct: 51  YSSFRPVSADYNTRKELVKNLNAMAIDIFGKSEESSPVLEAYGSFAMNTFSSQKDLDVSI 110

Query: 510 VHRDADID-----KAEILLKLADRLQSANFQ----NVQALTHARVPIIKLKDPVTGISCD 569
                  +     K EIL + A +L+S   Q    NV  +  ARVPI++  D  TGI CD
Sbjct: 111 NFSSGTSEFYREKKLEILTRFATKLRSLEGQGFVRNVVPILSARVPIVRFCDQGTGIECD 170

Query: 570 ICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIH 629
           + + +   ++ ++++R  +QID R  +L  ++KHWA++ GVN     TL+S +  ++  H
Sbjct: 171 LTVESKDGILTSQIIRIISQIDDRFQKLCLLIKHWARAHGVNNASHNTLNSISITMLVAH 230

Query: 630 FLQHRDPPILPCLQETKIVTYHEIVDNIECAYFD-QVENLKSFGSNNNESVARLVWGFF 674
            LQ + PPILP              D I+    + + +   ++G  N ES+ RL   FF
Sbjct: 231 HLQTQSPPILPPFSTL-------FKDGIDPPIVEKRTQKFLNWGQRNQESLGRLFATFF 282

BLAST of Cp4.1LG17g02530 vs. TAIR10
Match: AT2G39740.1 (AT2G39740.1 Nucleotidyltransferase family protein)

HSP 1 Score: 75.9 bits (185), Expect = 1.2e-13
Identity = 80/327 (24.46%), Postives = 147/327 (44.95%), Query Frame = 1

Query: 446 LLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFG--VSN-----S 505
           L  I + + P + +++ ++ ++  L  ++ +       CL G+    FG  VSN      
Sbjct: 11  LQEILQVIKPTRADRDTRITVIDQLRDVLQSVE-----CLRGATVQPFGSFVSNLFTRWG 70

Query: 506 DLDVCL-VHRDADI------DKAEILLKLADRLQSAN-FQNVQALTHARVPIIKLKDPVT 565
           DLD+ + +   + I       K  +L  L   L+++  +  +Q + HARVPI+K+     
Sbjct: 71  DLDISVDLFSGSSILFTGKKQKQTLLGHLLRALRASGLWYKLQFVIHARVPILKVVSGHQ 130

Query: 566 GISCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYV 625
            ISCDI I+N+  ++ ++ L   ++ID R   L  +VK WAK+  +N++  GT +SY+  
Sbjct: 131 RISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFNSYSLS 190

Query: 626 LMCIHFLQHRDPPILPCLQETKIVTYHEIVDNI--------ECAYFDQVENLKSFGSN-- 685
           L+ I   Q   P ILP L   +++     VD++        E        N+  F S   
Sbjct: 191 LLVIFHFQTCVPAILPPL---RVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSERA 250

Query: 686 ---NNESVARLVWGFFHYWAYCHDYANT--VISIRTKSTISKRAKDWTRRIGKDRHLICI 743
              N  S++ L+  FF  ++  +  A    V     +         W  +     + + +
Sbjct: 251 KSVNRSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPK----TYSLFV 310

BLAST of Cp4.1LG17g02530 vs. NCBI nr
Match: gi|659120337|ref|XP_008460141.1| (PREDICTED: uncharacterized protein LOC103499041 [Cucumis melo])

HSP 1 Score: 1285.0 bits (3324), Expect = 0.0e+00
Identity = 647/767 (84.35%), Postives = 686/767 (89.44%), Query Frame = 1

Query: 1   MAGDAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQS-HLNINPQPQPHLHPPPALDPAVAA 60
           MAGDAGGDGHSP PPPSNGGEFLLSLLQRPPNRQS HLN+N  P  HLHP  ++DPAVAA
Sbjct: 1   MAGDAGGDGHSPLPPPSNGGEFLLSLLQRPPNRQSQHLNLNSLP--HLHPSSSIDPAVAA 60

Query: 61  VGPSLTSLPPPWPSTGSDLLYPIPLSPWSHSHQSLSAPIASNFVGFQHLQQNPFPLPRNQ 120
           VGPSLTSLP PWPS+GSDLLYPIPLSPWSHSHQSLS PIA N+VGFQHLQQNPFPLPRNQ
Sbjct: 61  VGPSLTSLPTPWPSSGSDLLYPIPLSPWSHSHQSLSTPIAPNYVGFQHLQQNPFPLPRNQ 120

Query: 121 FGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTV-HNLSQHNELENKLQFGS 180
           FGGAQFAA+ +SGD IQGG GG DD KRLG  GNHDR NGTV  N SQHN+LENKLQFGS
Sbjct: 121 FGGAQFAATQTSGDQIQGGFGGVDDFKRLGFPGNHDRANGTVAQNFSQHNQLENKLQFGS 180

Query: 181 FSPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHGNSNFISYGNSISNAN 240
           FSPT F RVL++GN+SSA DLNREVGFRETIPNG+NRNQGLDSHGNSNF SYGNS  NAN
Sbjct: 181 FSPTLFPRVLINGNSSSAKDLNREVGFRETIPNGLNRNQGLDSHGNSNFTSYGNSNPNAN 240

Query: 241 VHSFRRGEFDYSEQERGRVLGENYSFHPLVKVLEPSGFMSKPKGGGHLDSVNIRRRDFDH 300
           VHSFRRGE DYS+QERGRVLGENY+FHP VK  E SGFMS PKGGGHLD  NIR+RDF+H
Sbjct: 241 VHSFRRGECDYSDQERGRVLGENYNFHPQVKTSELSGFMSNPKGGGHLDFGNIRKRDFEH 300

Query: 301 VVNRERASSSQLGEAFHRLELGAQLHDPVRPSRSDLHSASALDMEERGLNLHPDFAEGRP 360
             NRER  SSQ GE  HRLELGAQLHDPVRPSRSD  SA A ++EER LNL  +  EGR 
Sbjct: 301 GGNRERPRSSQFGEGSHRLELGAQLHDPVRPSRSDPQSALAFNIEERVLNLDSEIDEGRH 360

Query: 361 RDSHEGRGWMRKDVDSTNGNDSQELENNIGEQLADSLLHEEEPDEKSDAKHVRREKDCRG 420
           RDSH+             G+DSQEL+ NIGEQLADSLL E+EPDEKSDAK +RREKDCRG
Sbjct: 361 RDSHQ-------------GHDSQELD-NIGEQLADSLLLEDEPDEKSDAKFIRREKDCRG 420

Query: 421 NRLLTHRERISRRHIKCRGDIDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEW 480
           NRLLTHRERI+R+HI CRGDIDML VPLL IYESLIPPKEEKEKQMQLLTSLEKLVV+EW
Sbjct: 421 NRLLTHRERIARKHINCRGDIDMLTVPLLRIYESLIPPKEEKEKQMQLLTSLEKLVVSEW 480

Query: 481 PCARLCLFGSCANSFGVSNSDLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHA 540
           P A LCLFGSCANSFGVSNSD+DVCLV RDADIDK+EILLKLA+ L SANFQNVQALT A
Sbjct: 481 PHAHLCLFGSCANSFGVSNSDVDVCLVLRDADIDKSEILLKLAEILLSANFQNVQALTRA 540

Query: 541 RVPIIKLKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNE 600
           RVPIIKLKDPVTG+SCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNE
Sbjct: 541 RVPIIKLKDPVTGVSCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNE 600

Query: 601 TYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFG 660
           TYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYH+IVDNIECAYFDQVE LK+FG
Sbjct: 601 TYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHKIVDNIECAYFDQVEKLKTFG 660

Query: 661 SNNNESVARLVWGFFHYWAYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDP 720
           S N E+VARLVWGFFHYWAYCHDYANTV+S+RTK+T+SKRAKDWTRRIGKDRHLICIEDP
Sbjct: 661 SRNKETVARLVWGFFHYWAYCHDYANTVVSVRTKNTVSKRAKDWTRRIGKDRHLICIEDP 720

Query: 721 FETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPFIPS 766
           FETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPF+PS
Sbjct: 721 FETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPFVPS 751

BLAST of Cp4.1LG17g02530 vs. NCBI nr
Match: gi|778711120|ref|XP_011656690.1| (PREDICTED: uncharacterized protein LOC101204551 [Cucumis sativus])

HSP 1 Score: 1274.6 bits (3297), Expect = 0.0e+00
Identity = 639/766 (83.42%), Postives = 685/766 (89.43%), Query Frame = 1

Query: 1   MAGDAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNINPQPQPHLHPPPALDPAVAAV 60
           MAGDAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLN+N  P  HLHP  ++DPAVAAV
Sbjct: 1   MAGDAGGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNLNSLP--HLHPSSSIDPAVAAV 60

Query: 61  GPSLTSLPPPWPSTGSDLLYPIPLSPWSHSHQSLSAPIASNFVGFQHLQQNPFPLPRNQF 120
           GPSLTSLP PWPS+GSDLLYPIPLSPWSHSHQSLS PIA N+VGFQHLQQNPFPLPR+QF
Sbjct: 61  GPSLTSLPTPWPSSGSDLLYPIPLSPWSHSHQSLSTPIAPNYVGFQHLQQNPFPLPRSQF 120

Query: 121 GGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTV-HNLSQHNELENKLQFGSF 180
           GGAQFAAS +SGD IQGG GG DD KRLG  GNHDR NGTV HN SQHN+LENKLQFGSF
Sbjct: 121 GGAQFAASQTSGDQIQGGFGGVDDFKRLGFPGNHDRANGTVTHNFSQHNQLENKLQFGSF 180

Query: 181 SPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHGNSNFISYGNSISNANV 240
           SP+ F R+L++GN+S+A DLNREVGFRE+IPNG+NRNQGLDSHGNSNF SYGNS  NANV
Sbjct: 181 SPSLFPRILINGNSSTAKDLNREVGFRESIPNGLNRNQGLDSHGNSNFTSYGNSNPNANV 240

Query: 241 HSFRRGEFDYSEQERGRVLGENYSFHPLVKVLEPSGFMSKPKGGGHLDSVNIRRRDFDHV 300
           HSF RGE DYS+QERGRVLGENY+FHP VK  E SGFMS P GGGHLD  NIR+RDF+H 
Sbjct: 241 HSFGRGECDYSDQERGRVLGENYNFHPQVKASEVSGFMSNPTGGGHLDFGNIRKRDFEHG 300

Query: 301 VNRERASSSQLGEAFHRLELGAQLHDPVRPSRSDLHSASALDMEERGLNLHPDFAEGRPR 360
            NRER  SSQ GE   RLELGAQL DPVRPSRSDL SA AL++EER LNL  +  EGR R
Sbjct: 301 GNRERPRSSQFGEGSRRLELGAQLRDPVRPSRSDLQSALALNIEERVLNLDSEIDEGRHR 360

Query: 361 DSHEGRGWMRKDVDSTNGNDSQELENNIGEQLADSLLHEEEPDEKSDAKHVRREKDCRGN 420
           DS++             G+DSQEL+ NIGEQLADSLL E+EPDEKSD+K +RREKDCRGN
Sbjct: 361 DSYQ-------------GHDSQELD-NIGEQLADSLLLEDEPDEKSDSKFIRREKDCRGN 420

Query: 421 RLLTHRERISRRHIKCRGDIDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWP 480
           RLLTHRERI+R+HI CRGDIDML +PLL IYESLIPP+EEKEKQ QLL SLEKLVVNEWP
Sbjct: 421 RLLTHRERIARKHIHCRGDIDMLTIPLLRIYESLIPPEEEKEKQRQLLISLEKLVVNEWP 480

Query: 481 CARLCLFGSCANSFGVSNSDLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHAR 540
            A L LFGSCANSFGVSNSD+DVCLV RDADIDK+EILLKLA+ LQSANFQNVQALT AR
Sbjct: 481 HAHLFLFGSCANSFGVSNSDVDVCLVLRDADIDKSEILLKLAEILQSANFQNVQALTRAR 540

Query: 541 VPIIKLKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNET 600
           VPIIKLKDPVTG+SCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNET
Sbjct: 541 VPIIKLKDPVTGVSCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNET 600

Query: 601 YQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGS 660
           YQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYH+IVDNIECAYFDQVE LK+FGS
Sbjct: 601 YQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHKIVDNIECAYFDQVEKLKTFGS 660

Query: 661 NNNESVARLVWGFFHYWAYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPF 720
           +N ESVARLVWGFFHYWAYCHDYANTV+S+RTK+T+SKRAKDWTRRIGKDRHLICIEDPF
Sbjct: 661 DNKESVARLVWGFFHYWAYCHDYANTVVSVRTKNTVSKRAKDWTRRIGKDRHLICIEDPF 720

Query: 721 ETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPFIPS 766
           ETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPF+PS
Sbjct: 721 ETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLFEPFVPS 750

BLAST of Cp4.1LG17g02530 vs. NCBI nr
Match: gi|802698669|ref|XP_012083529.1| (PREDICTED: terminal uridylyltransferase 7 [Jatropha curcas])

HSP 1 Score: 680.6 bits (1755), Expect = 3.1e-192
Identity = 421/807 (52.17%), Postives = 494/807 (61.21%), Query Frame = 1

Query: 4   DAGGDGHSPFPPPSNGGEFLLSLLQRP----------PNRQSHLNINPQPQPHLHPPP-- 63
           + GG    P  P  NGGEFLLSLLQRP          P+ Q  + I   PQ +       
Sbjct: 2   NGGGADAPPMQPAVNGGEFLLSLLQRPNHQLQTPAPPPHSQLPIPIPITPQQYQQQQQQQ 61

Query: 64  -----ALDPAVAAVGPSLTSLPPPWPSTGSDLLYPIPLSPWSHSHQSLSAPIASNFVGFQ 123
                ALDPAVAAVGPSL    P W S G D+L P    PW H+  +  AP+   F+GF 
Sbjct: 62  QQQSLALDPAVAAVGPSLPFSQPVWQSNGRDVLTP----PWPHNLSA--APLLPGFLGFP 121

Query: 124 HLQQNPFPLPRNQFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTVHN-LS 183
              QN +P P N     QF  +       QG LG  DDL+ LG  G   R N T+HN + 
Sbjct: 122 ---QNHWPSPANHLAAGQFQGNQ------QGVLG--DDLQILGFSGADVRANNTIHNRVQ 181

Query: 184 QHNELENKLQFGSF-SPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHGN 243
           Q  +LE KLQFGSF S  Q    L++ N+        EV       NG+  +Q  DS   
Sbjct: 182 QKQQLEQKLQFGSFRSDIQNVEALLNVNSKLNAAKELEVRLATRNLNGLESDQKFDSQLR 241

Query: 244 SNFISYGNSISNANVHSFRRGEFDYSEQERG----RVLGENYSFHPLVKVLEPSGFMSKP 303
           +                     FD  EQ+R     R      ++ P    + P GF +KP
Sbjct: 242 T---------------------FDLREQDRSGGGWRKQPHGGNYRPQETRMPPPGFSNKP 301

Query: 304 KGGGHLDSVNIRRRDFDHVVNRERASSSQL----------------GEAFHRLELGAQLH 363
           +GGG+ D V+ RRR+ D+ VN+E+ +  +L                G+    L L  QL 
Sbjct: 302 RGGGNWDYVS-RRRELDYNVNKEKGNQGELSNRNALFSSEDKIPRDGDRSRDLGLTGQLD 361

Query: 364 DPVRPSRSDLHSASALDMEERGLNLHPDFAEGRPRDSHEGRGWMRKDVDSTNGNDSQELE 423
            P  P+ S+L+S SA D+E   LN+  +  E                    +G D     
Sbjct: 362 RPGPPAGSNLYSVSAADVELSMLNVEAEVVE--------------------DGKDEGREL 421

Query: 424 NNIGEQLADSLLHEEEPDEKSDAKHVR--REK----DCRGNRLLTHRERISRRHIKCRGD 483
           +  GE+L DSLL E E D K+D K  R  REK    D RG R L+ R R+ +R ++CR D
Sbjct: 422 DEAGEELVDSLLLEGESDGKNDKKQNRHSREKESRSDNRGQRTLSQRMRMLKRQMECRRD 481

Query: 484 IDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVSNS 543
           ID L  P LAIYESL+PP+EEK KQ QLL+ LEKLV  EWP ARL L+GSCANSFGV  S
Sbjct: 482 IDRLNAPFLAIYESLVPPEEEKAKQKQLLSLLEKLVNKEWPQARLYLYGSCANSFGVLKS 541

Query: 544 DLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDICI 603
           D+DVCL  ++ADI+K+E+LLKLAD LQS N QNVQALT ARVPI+KL DPVTGISCDICI
Sbjct: 542 DIDVCLAIQNADINKSEVLLKLADILQSDNLQNVQALTRARVPIVKLMDPVTGISCDICI 601

Query: 604 NNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQ 663
           NNVLAVVNTKLL DYAQIDVRL QLAFIVKHWAKSRGVNETY GTLSSYAYVLMCIHFLQ
Sbjct: 602 NNVLAVVNTKLLWDYAQIDVRLRQLAFIVKHWAKSRGVNETYHGTLSSYAYVLMCIHFLQ 661

Query: 664 HRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYWAY 723
            R P ILPCLQE +  TY   VD+I+CAYFDQVE L+ FGS N E++A+LVW FF+YWAY
Sbjct: 662 QRRPAILPCLQEME-ATYSVAVDDIQCAYFDQVEKLRGFGSRNKETIAQLVWAFFNYWAY 721

Query: 724 CHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLR 766
            HDYAN VISIRT S ISKR KDWTRRIG DRHLICIEDPFE SHDLGRVVDKYSIKVLR
Sbjct: 722 RHDYANAVISIRTGSIISKREKDWTRRIGNDRHLICIEDPFEISHDLGRVVDKYSIKVLR 748

BLAST of Cp4.1LG17g02530 vs. NCBI nr
Match: gi|590722773|ref|XP_007051991.1| (Nucleotidyltransferase family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 676.8 bits (1745), Expect = 4.5e-191
Identity = 415/786 (52.80%), Postives = 498/786 (63.36%), Query Frame = 1

Query: 6   GGDGHSPFPPPSNGGEFLLSLLQRPPNRQSHLNIN-------------PQPQPHLHP--- 65
           G  G +P PP +NGGEFLLSLLQ+P   Q HL                PQPQ        
Sbjct: 3   GNGGEAPSPPAANGGEFLLSLLQKP---QQHLQQQQSPLFSRATPVTIPQPQQQQQQQQQ 62

Query: 66  -PPALDPAVAAVGPSLTSLPPPWPSTGSDL--LYPIPLSPWSHSHQSLSAPIASNFVGFQ 125
            P  +DPAVAAVGP+L    P WPS G DL  L+P          Q+LS P+A NF+GF 
Sbjct: 63  QPLVIDPAVAAVGPTLP-FRPLWPSNGRDLPGLWP----------QTLSPPLAPNFLGFP 122

Query: 126 HLQQNPFPLPRNQFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTVHNLSQ 185
               +P+  P NQF G Q A                DDL+RLGL G  +  N  + N  Q
Sbjct: 123 ---LSPWSSPGNQFAGNQGALM--------------DDLRRLGLSGIDNNKNHVIQNRVQ 182

Query: 186 HNELENKLQFGSFSPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMN-RNQGLDSHGNS 245
               + KL FGSF P+    +     + + N L           + +N  NQ LDS  NS
Sbjct: 183 QKHQDQKLVFGSF-PSDIQTLKTPEGSPNGNLLEN---------SKLNLSNQQLDSRLNS 242

Query: 246 NFISYGNSISNANVHSFR-RGEFDYSEQERGRVLGENYSFHPLVKVLE-PSGFMSKPKGG 305
           N         N + + F+ R   D  +Q++    G +Y   P  +    P GF+ KP+GG
Sbjct: 243 N--------PNTSPYVFQHRNSGDRGKQQQH---GGSYRPTPSPEARRSPPGFLGKPRGG 302

Query: 306 GHLDSVNIRRRDFDHVVNRERASSSQLGEAFHRLELGAQLHDPVRPSRSDLHSASALDME 365
           G       RRR F+H V++ +A  SQ   + + + L  QL  P  P+ S+L S SA D+E
Sbjct: 303 GGNRDFGNRRRHFEHNVDKAKAEYSQ-PSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIE 362

Query: 366 ERGLNLHPDFAEGRPRDSHEGRGWMRKDVDSTNGNDSQELENNIGEQLADSLLHEEEPDE 425
           E  L LH D   GR R S           D     D  E++  +GEQL +SLL E+E D+
Sbjct: 363 ESLLELHSD--GGRDRFSRR---------DKFRREDGGEVDE-VGEQLLESLLIEDESDD 422

Query: 426 KSDAKHVRREK----DCRGNRLLTHRERISRRHIKCRGDIDMLKVPLLAIYESLIPPKEE 485
           K+D K  RREK    D RG RLL+ R R+ +R ++CR DI  L  P LA+YESLIPP+EE
Sbjct: 423 KNDKKQHRREKESRIDNRGQRLLSQRMRMLKRQMECRSDIHRLNAPFLALYESLIPPEEE 482

Query: 486 KEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSFGVSNSDLDVCLVHRDADIDKAEILLK 545
           + KQ QLL  LEKLV  EWP ARL L+GSCANSFGVS SD+DVCL   + D++K+EILLK
Sbjct: 483 RAKQKQLLALLEKLVCKEWPEARLYLYGSCANSFGVSKSDIDVCLAFNEMDVNKSEILLK 542

Query: 546 LADRLQSANFQNVQALTHARVPIIKLKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVR 605
           LAD LQS N QNVQALT ARVPI+KL DP TGISCDICINNVLAVVNTKLLRDYA++D R
Sbjct: 543 LADILQSDNLQNVQALTRARVPIVKLMDPATGISCDICINNVLAVVNTKLLRDYAKLDAR 602

Query: 606 LPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHEI 665
           L QLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQ R P ILPCLQ  +  TY   
Sbjct: 603 LRQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMCIHFLQQRRPAILPCLQGME-TTYSVT 662

Query: 666 VDNIECAYFDQVENLKSFGSNNNESVARLVWGFFHYWAYCHDYANTVISIRTKSTISKRA 725
           VD++ECAYFDQVE L++FGS+N ESVA+LVW FF+YWAY HDYAN+VIS+RT S ISK+ 
Sbjct: 663 VDDVECAYFDQVERLRNFGSSNKESVAQLVWAFFNYWAYGHDYANSVISVRTGSIISKQE 722

Query: 726 KDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLREEFERAATILQTYPNPCEKLF 766
           KDWTRRIG DRHLICIEDPFE SHDLGRVVDK+SI+V+REEFERAA ++Q  PNPC  LF
Sbjct: 723 KDWTRRIGNDRHLICIEDPFEISHDLGRVVDKFSIRVIREEFERAADVMQYDPNPCVTLF 722

BLAST of Cp4.1LG17g02530 vs. NCBI nr
Match: gi|1009162216|ref|XP_015899318.1| (PREDICTED: UTP:RNA uridylyltransferase 1 [Ziziphus jujuba])

HSP 1 Score: 674.5 bits (1739), Expect = 2.2e-190
Identity = 426/812 (52.46%), Postives = 504/812 (62.07%), Query Frame = 1

Query: 9   GHSPFPPPSNGGEFLLSLLQRPPNRQSH-----LNINPQPQPHLHP------PPALDPAV 68
           G++P P  SNGGEFLLSLLQRP  + +H      N +  P P + P      P  LDPAV
Sbjct: 5   GNAPPPSDSNGGEFLLSLLQRPNYQNTHRFQPQTNSSSPPPPPVQPSHQPPQPVVLDPAV 64

Query: 69  AAVGPSLTSLPPP-WPSTGSDLLY----PIPLSPWSHSHQSLSAPIASN-FVGFQHLQQN 128
           AAVGP+L   PP  WPS G DL +    P+PL         +S P  SN F+GF H   N
Sbjct: 65  AAVGPTLPFPPPSSWPSNGRDLPHHPHWPVPLG--------VSPPFPSNGFLGFPH---N 124

Query: 129 PFPLPRN-QFGGAQFAASHSSGDLIQGGLGGADDLKRLGLRGNHDRPNGTVHNLSQHN-E 188
            FP P N QF G Q  A+ SS           DD +RLG  G+    N  V NL  HN E
Sbjct: 125 LFPPPANHQFPGNQIPANPSS-----------DDFRRLGFPGS----NSMVQNLVHHNHE 184

Query: 189 LENKLQFGSFSPTQFSRVLVSGNNSSANDLNREVGFRETIPNGMNRNQGLDSHGNSNFIS 248
           LE+KL+FGSF+                     ++   E + NG++RN  L S        
Sbjct: 185 LEHKLKFGSFTS--------------------DIRVPEELSNGLDRNGRLGS-------- 244

Query: 249 YGNSISNANVHSFRRGEFDYSEQER-------GRVLGENYSFHPLVKVLEPSGFMSKP-- 308
                   N+ + R G  D  EQER       G   G     H   +   P GF S+P  
Sbjct: 245 --------NLDAVRHGNCDSWEQERRGGGGGGGGSGGGRGKQH--TRTTPPPGFPSRPIG 304

Query: 309 --KGGGHLDSVNIRR-RDFDHVVNRERASSSQL---------------GEAFHRLELGAQ 368
              GGG+ DS N  R R  +H V+RE+ SS +                G+ F+ L L AQ
Sbjct: 305 GAGGGGNWDSANRDRGRGINHNVDREKISSDEFVSNRDVFSAEAVRIRGDKFNGLGLRAQ 364

Query: 369 LHDPVRPSRSDLHSASALDMEERGLNLHPDFAEGRPRDSHEGR---GWMRKDVDSTNGND 428
           L  P  PS S LHS S  D+E+  +NL     E   RD H  +   G   +++D      
Sbjct: 365 LDRPGPPSGSSLHSVSVSDVEDSFMNLENGNFE--VRDGHRQQLPVGAGEREID------ 424

Query: 429 SQELENNIGEQLADSLLHEEEPDEKSDAKHVR--REKDCR----GNRLLTHRERISRRHI 488
                 +IGE+L  SLL EEE D+K+++K  +  REKD R    G  LL+ R R  +R +
Sbjct: 425 ------DIGERLVGSLLIEEESDDKNESKQHQHSREKDSRSDGRGQHLLSQRMRNYKRQM 484

Query: 489 KCRGDIDMLKVPLLAIYESLIPPKEEKEKQMQLLTSLEKLVVNEWPCARLCLFGSCANSF 548
           +CR DI+ L  P LAI ESLIP +EEK KQ QLLT LEKL+  EWP A+L L+GSCANSF
Sbjct: 485 RCRCDIERLNAPFLAIVESLIPAEEEKAKQNQLLTLLEKLICREWPEAQLYLYGSCANSF 544

Query: 549 GVSNSDLDVCLVHRDADIDKAEILLKLADRLQSANFQNVQALTHARVPIIKLKDPVTGIS 608
           G SNSD+D+CL   DADI+K+EIL+KLAD LQS N QNVQALT ARVPI+KL DPVTGIS
Sbjct: 545 GFSNSDIDLCLAIGDADINKSEILIKLADILQSDNLQNVQALTRARVPIVKLMDPVTGIS 604

Query: 609 CDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMC 668
           CDICINNVLAVVNT+LLRDYA+ID RL QLAFIVKHWAKSRGVNETYQGTLSSYAYVLMC
Sbjct: 605 CDICINNVLAVVNTRLLRDYARIDARLRQLAFIVKHWAKSRGVNETYQGTLSSYAYVLMC 664

Query: 669 IHFLQHRDPPILPCLQETKIVTYHEIVDNIECAYFDQVENLKSFGSNNNESVARLVWGFF 728
           IHFLQ R P ILPCLQ  +  TY   +DN+ECAYFDQVE L+ FGS N E+VA+LVW FF
Sbjct: 665 IHFLQQRKPAILPCLQGME-ATYSVTIDNVECAYFDQVEMLRDFGSRNRETVAQLVWEFF 724

Query: 729 HYWAYCHDYANTVISIRTKSTISKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYS 766
           +YWAY HDY ++V+S+RT + ISKRAKDWTRR+G DRHLICIEDPFE SHDLGRVVDKYS
Sbjct: 725 NYWAYRHDYTDSVVSVRTGNIISKRAKDWTRRVGNDRHLICIEDPFEVSHDLGRVVDKYS 737

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
URT1_ARATH1.2e-15946.41UTP:RNA uridylyltransferase 1 OS=Arabidopsis thaliana GN=URT1 PE=1 SV=2[more]
CID11_SCHPO3.3e-4533.44Poly(A) RNA polymerase cid11 OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
CID1_SCHPO7.4e-4536.28Terminal uridylyltransferase cid1 OS=Schizosaccharomyces pombe (strain 972 / ATC... [more]
CID13_SCHPO2.2e-4432.38Poly(A) RNA polymerase cid13 OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
TUT4_HUMAN3.7e-4436.48Terminal uridylyltransferase 4 OS=Homo sapiens GN=ZCCHC11 PE=1 SV=3[more]
Match NameE-valueIdentityDescription
A0A0A0KBJ1_CUCSA0.0e+0083.42Uncharacterized protein OS=Cucumis sativus GN=Csa_6G076730 PE=4 SV=1[more]
E6NU32_JATCU2.2e-19252.17JHL05D22.13 protein OS=Jatropha curcas GN=JHL05D22.13 PE=4 SV=1[more]
A0A061E1N6_THECC3.1e-19152.80Nucleotidyltransferase family protein isoform 1 OS=Theobroma cacao GN=TCM_005467... [more]
A0A151UCB0_CAJCA5.9e-18250.95Poly(A) RNA polymerase cid11 OS=Cajanus cajan GN=KK1_021216 PE=4 SV=1[more]
W9RV71_9ROSA9.4e-18050.70Poly(A) RNA polymerase cid11 OS=Morus notabilis GN=L484_020763 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G45620.16.7e-16146.41 Nucleotidyltransferase family protein[more]
AT3G45750.11.2e-2130.12 Nucleotidyltransferase family protein[more]
AT3G45760.11.6e-2129.29 Nucleotidyltransferase family protein[more]
AT2G39740.11.2e-1324.46 Nucleotidyltransferase family protein[more]
Match NameE-valueIdentityDescription
gi|659120337|ref|XP_008460141.1|0.0e+0084.35PREDICTED: uncharacterized protein LOC103499041 [Cucumis melo][more]
gi|778711120|ref|XP_011656690.1|0.0e+0083.42PREDICTED: uncharacterized protein LOC101204551 [Cucumis sativus][more]
gi|802698669|ref|XP_012083529.1|3.1e-19252.17PREDICTED: terminal uridylyltransferase 7 [Jatropha curcas][more]
gi|590722773|ref|XP_007051991.1|4.5e-19152.80Nucleotidyltransferase family protein isoform 1 [Theobroma cacao][more]
gi|1009162216|ref|XP_015899318.1|2.2e-19052.46PREDICTED: UTP:RNA uridylyltransferase 1 [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0016779nucleotidyltransferase activity
Vocabulary: INTERPRO
TermDefinition
IPR002934Polymerase_NTP_transf_dom
IPR002058PAP_assoc
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:1903705 positive regulation of production of siRNA involved in RNA interference
biological_process GO:0060964 regulation of gene silencing by miRNA
biological_process GO:0071076 RNA 3' uridylation
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016779 nucleotidyltransferase activity
molecular_function GO:0003729 mRNA binding
molecular_function GO:0050265 RNA uridylyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG17g02530.1Cp4.1LG17g02530.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002058PAP/25A-associatedPFAMPF03828PAP_assoccoord: 664..723
score: 1.3
IPR002934Polymerase, nucleotidyl transferase domainPFAMPF01909NTP_transf_2coord: 471..559
score: 1.
NoneNo IPR availableGENE3DG3DSA:1.10.1410.10coord: 659..760
score: 3.0E-8coord: 595..618
score: 3.
NoneNo IPR availableGENE3DG3DSA:3.30.460.10coord: 456..594
score: 3.1
NoneNo IPR availablePANTHERPTHR12271POLY A POLYMERASE CID PAP -RELATEDcoord: 414..762
score: 1.1E-182coord: 133..157
score: 1.1E
NoneNo IPR availablePANTHERPTHR12271:SF53SUBFAMILY NOT NAMEDcoord: 133..157
score: 1.1E-182coord: 414..762
score: 1.1E
NoneNo IPR availableunknownSSF81301Nucleotidyltransferasecoord: 441..575
score: 4.67
NoneNo IPR availableunknownSSF81631PAP/OAS1 substrate-binding domaincoord: 579..751
score: 2.43

The following gene(s) are paralogous to this gene:

None