CaUC03G059830 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC03G059830
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionReverse transcriptase
LocationCiama_Chr03: 17671240 .. 17673498 (+)
RNA-Seq ExpressionCaUC03G059830
SyntenyCaUC03G059830
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCATTCGGGCTATGTAACGCACCAGGGACGTTCCAAAGGTGCATGATGGCAATATTTTCTGACTATTTGGAACAGTCAGTAGAGGTGTTTATGGACGACTTCTCAGTATTTGGGAAATCTTATGATGAATGCTTGACCAACCTAGAACGAGTGCTCAAACGATGCGAAGACACAAATCTAGTCCTCAACTGGGAAAAATGCCATTTTATGGTTACTGAGGGTATTGTGTTGGGGCATAAAATCTCCAAGGTTGGATTGGAAGTGGATCAGGCAAAGATAGATGCGATAGCAAAACTCCCAGCCCCCACAAATGTTAAGACATTGCGAAGTTTCCTAGGCCATGCGGGCTTTTATAGAAGATTCATTAAGGGATTCTCGCAAGTAGCAAAACCACTGAGCGAGTTGCTGGAAGTCAACAGGGAATTCAATTTTGACGGTAAATGCTTAAACGCATTCGAGTCTCTAAGGCAAGCTCTGATTTCAGCACCTATTTTAGTTGCACCTGACTGGTCTCTCCCGTTTGAATTAATGTGCGACGCAAGTAACCATGCGGTGGGAGCAGTACTGGGGCAGAGAAAAGAGAAAATTATGCACCCGATCTATTATGTTAGTAAAACACTGAATTCATCTCAGGAGAACTACACTACTACTGAGAAGGAAATGTTAGCCATAGTCTTTGCGGTTGACAAATTTAGAGCATACTTGATAGGGTCAAAGGTGACCATCTATAGTGATCATTCTGCGATCAAGTATTTGATGGCGAAAAAGGACGCAAAGCCTAGACTTATCCGTTGGGTCCTGCTGTTGCAAGAATTCGATTTGGAGATCAAAGATAGAAAGGGAACCGAGAATCAGGTTGCGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAGAGTTGGAATGATATAGAGGAACGATTCCCAGACGAGCATGTAATGAACGCAGAGAGTCAGGAACCATGGTACGCAGACATAGTCAACTATCTGGTTTGCAACCAATGGCCTGAAGAATTCAACGCTCAACAAAAGAAAAAGCTCCGACATGAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTGGCCCGGACCACATACTACGTCGATGCGTTCCAGAATATGAAACGCATAGCATTTTGAGAAGCTGTCATGAAGCACCTTACGGAGGACACTTTGGAGGGCAGAGAACAACTGCAAAGGTGCTGCAAAGTGGGTATTTCTGGCCCACATTATTCAAAGACGCAAGGGCATATGCGGTAGCTTGCGATCGTTGTCAGAGAATAGGCAACATTTCCAACCGGAATGAGATGCCTCTGAATTCAATGCTGGAAGTTGAGTTGTTCGACGTATGGGGAATCGACTTCATGGGACCATTCCCTCCTTCTTGCGGCAATCAATATATTTTAGTAGCGGTCGACTATGTATCAAAGTGGGTAGAAGCAGCAGCCTGTGCAAAGAATGACGCAAACACAGTGTCCAAGTTCTTAAAGAAGCAAATATTCTCTCGATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGTACACATTTTATAAATCGCATCATCACTAATTTACTGACGAAGTTTAATGTCTCGCACAGGGTAGCAACTGCCTATCACCCGCAAACAAACGGCCAAGCTGAAATAACCAACCGGGAGATCAAGTCCATACTTGAAAAGGTCGTGAGCACATCAAGGAAGGACTGGACGAAGAGATTAGATGAAGCTCTATGGGCCTACAGAACGGCATTCAAAACACCGATAGGCATGTCACCCTATGCGTTGGTGTTTGGTAAAGCATGCCATCTCCCACCTGAGCTGGAACACAAGGGCATCTGGGCCATGAAGAAGCTCAATTTAGACCAGGAGGCAAGCGGAGAAGCCAGAAAGCTTCAACTAAATGAACTCCTGGAGTGGAGGCATTCAGCTTACGAAAACGCAAAGCTGTACAAAGAAAGGACCAAGAAATGGCACGACAAAAACATCAGTAAGAAAACTCTACACGTCGGCCAGAAGGTCCTGTTATTTAATTCAAGGTTGCGTTTGTTTCCAGGTAAGCTGAAGTCTCGATGGTCGGGTCCATTCATAATCAAGGAAGTGTTCCCGCATGGTGCGGTGATGCTGACAAATGAAAATGGAACCACATCCTTCAAGGTCAATGGACAAAGGGTAAAGCCCTACCACAGTGGAGAGTTCGAAATTAACAAGACCTCCATTGACCTACGTGAGTGTAATGACTAA

mRNA sequence

ATGCCATTCGGGCTATGTAACGCACCAGGGACGTTCCAAAGGTGCATGATGGCAATATTTTCTGACTATTTGGAACAGTCAGTAGAGGTGTTTATGGACGACTTCTCAGTATTTGGGAAATCTTATGATGAATGCTTGACCAACCTAGAACGAGTGCTCAAACGATGCGAAGACACAAATCTAGTCCTCAACTGGGAAAAATGCCATTTTATGGTTACTGAGGGTATTGTGTTGGGGCATAAAATCTCCAAGGTTGGATTGGAAGTGGATCAGGCAAAGATAGATGCGATAGCAAAACTCCCAGCCCCCACAAATGTTAAGACATTGCGAAGTTTCCTAGGCCATGCGGGCTTTTATAGAAGATTCATTAAGGGATTCTCGCAAGTAGCAAAACCACTGAGCGAGTTGCTGGAAGTCAACAGGGAATTCAATTTTGACGGTAAATGCTTAAACGCATTCGAGTCTCTAAGGCAAGCTCTGATTTCAGCACCTATTTTAGTTGCACCTGACTGGTCTCTCCCGTTTGAATTAATGTGCGACGCAAGTAACCATGCGGTGGGAGCAGTACTGGGGCAGAGAAAAGAGAAAATTATGCACCCGATCTATTATGTTAGTAAAACACTGAATTCATCTCAGGAGAACTACACTACTACTGAGAAGGAAATGTTAGCCATAGTCTTTGCGGTTGACAAATTTAGAGCATACTTGATAGGGTCAAAGGTGACCATCTATAGTGATCATTCTGCGATCAAGTATTTGATGGCGAAAAAGGACGCAAAGCCTAGACTTATCCGTTGGGTCCTGCTGTTGCAAGAATTCGATTTGGAGATCAAAGATAGAAAGGGAACCGAGAATCAGGTTGCGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAGAGTTGGAATGATATAGAGGAACGATTCCCAGACGAGCATGTAATGAACGCAGAGAGTCAGGAACCATGGTACGCAGACATAGTCAACTATCTGGTTTGCAACCAATGGCCTGAAGAATTCAACGCTCAACAAAAGAAAAAGCTCCGACATGAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTGGCCCGGACCACATACTACGTCGATGCGTTCCAGAATATGAAACGCATAGCATTTTGAGAAGCTGTCATGAAGCACCTTACGGAGGACACTTTGGAGGGCAGAGAACAACTGCAAAGGTGCTGCAAAGTGGGTATTTCTGGCCCACATTATTCAAAGACGCAAGGGCATATGCGGTAGCTTGCGATCGTTGTCAGAGAATAGGCAACATTTCCAACCGGAATGAGATGCCTCTGAATTCAATGCTGGAAGTTGAGTTGTTCGACGTATGGGGAATCGACTTCATGGGACCATTCCCTCCTTCTTGCGGCAATCAATATATTTTAGTAGCGGTCGACTATGTATCAAAGTGGGTAGAAGCAGCAGCCTGTGCAAAGAATGACGCAAACACAGTGTCCAAGTTCTTAAAGAAGCAAATATTCTCTCGATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGCATGTCACCCTATGCGTTGGTGTTTGGTAAAGCATGCCATCTCCCACCTGAGCTGGAACACAAGGGCATCTGGGCCATGAAGAAGCTCAATTTAGACCAGGAGGCAAGCGGAGAAGCCAGAAAGCTTCAACTAAATGAACTCCTGGAGTGGAGGCATTCAGCTTACGAAAACGCAAAGCTGTACAAAGAAAGGACCAAGAAATGGCACGACAAAAACATCAGTAAGAAAACTCTACACGTCGGCCAGAAGGTCCTGTTATTTAATTCAAGGTTGCGTTTGTTTCCAGGTAAGCTGAAGTCTCGATGGTCGGGTCCATTCATAATCAAGGAAGTGTTCCCGCATGGTGCGGTGATGCTGACAAATGAAAATGGAACCACATCCTTCAAGGTCAATGGACAAAGGGTAAAGCCCTACCACAGTGGAGAGTTCGAAATTAACAAGACCTCCATTGACCTACGTGAGTGTAATGACTAA

Coding sequence (CDS)

ATGCCATTCGGGCTATGTAACGCACCAGGGACGTTCCAAAGGTGCATGATGGCAATATTTTCTGACTATTTGGAACAGTCAGTAGAGGTGTTTATGGACGACTTCTCAGTATTTGGGAAATCTTATGATGAATGCTTGACCAACCTAGAACGAGTGCTCAAACGATGCGAAGACACAAATCTAGTCCTCAACTGGGAAAAATGCCATTTTATGGTTACTGAGGGTATTGTGTTGGGGCATAAAATCTCCAAGGTTGGATTGGAAGTGGATCAGGCAAAGATAGATGCGATAGCAAAACTCCCAGCCCCCACAAATGTTAAGACATTGCGAAGTTTCCTAGGCCATGCGGGCTTTTATAGAAGATTCATTAAGGGATTCTCGCAAGTAGCAAAACCACTGAGCGAGTTGCTGGAAGTCAACAGGGAATTCAATTTTGACGGTAAATGCTTAAACGCATTCGAGTCTCTAAGGCAAGCTCTGATTTCAGCACCTATTTTAGTTGCACCTGACTGGTCTCTCCCGTTTGAATTAATGTGCGACGCAAGTAACCATGCGGTGGGAGCAGTACTGGGGCAGAGAAAAGAGAAAATTATGCACCCGATCTATTATGTTAGTAAAACACTGAATTCATCTCAGGAGAACTACACTACTACTGAGAAGGAAATGTTAGCCATAGTCTTTGCGGTTGACAAATTTAGAGCATACTTGATAGGGTCAAAGGTGACCATCTATAGTGATCATTCTGCGATCAAGTATTTGATGGCGAAAAAGGACGCAAAGCCTAGACTTATCCGTTGGGTCCTGCTGTTGCAAGAATTCGATTTGGAGATCAAAGATAGAAAGGGAACCGAGAATCAGGTTGCGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAGAGTTGGAATGATATAGAGGAACGATTCCCAGACGAGCATGTAATGAACGCAGAGAGTCAGGAACCATGGTACGCAGACATAGTCAACTATCTGGTTTGCAACCAATGGCCTGAAGAATTCAACGCTCAACAAAAGAAAAAGCTCCGACATGAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTGGCCCGGACCACATACTACGTCGATGCGTTCCAGAATATGAAACGCATAGCATTTTGAGAAGCTGTCATGAAGCACCTTACGGAGGACACTTTGGAGGGCAGAGAACAACTGCAAAGGTGCTGCAAAGTGGGTATTTCTGGCCCACATTATTCAAAGACGCAAGGGCATATGCGGTAGCTTGCGATCGTTGTCAGAGAATAGGCAACATTTCCAACCGGAATGAGATGCCTCTGAATTCAATGCTGGAAGTTGAGTTGTTCGACGTATGGGGAATCGACTTCATGGGACCATTCCCTCCTTCTTGCGGCAATCAATATATTTTAGTAGCGGTCGACTATGTATCAAAGTGGGTAGAAGCAGCAGCCTGTGCAAAGAATGACGCAAACACAGTGTCCAAGTTCTTAAAGAAGCAAATATTCTCTCGATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGCATGTCACCCTATGCGTTGGTGTTTGGTAAAGCATGCCATCTCCCACCTGAGCTGGAACACAAGGGCATCTGGGCCATGAAGAAGCTCAATTTAGACCAGGAGGCAAGCGGAGAAGCCAGAAAGCTTCAACTAAATGAACTCCTGGAGTGGAGGCATTCAGCTTACGAAAACGCAAAGCTGTACAAAGAAAGGACCAAGAAATGGCACGACAAAAACATCAGTAAGAAAACTCTACACGTCGGCCAGAAGGTCCTGTTATTTAATTCAAGGTTGCGTTTGTTTCCAGGTAAGCTGAAGTCTCGATGGTCGGGTCCATTCATAATCAAGGAAGTGTTCCCGCATGGTGCGGTGATGCTGACAAATGAAAATGGAACCACATCCTTCAAGGTCAATGGACAAAGGGTAAAGCCCTACCACAGTGGAGAGTTCGAAATTAACAAGACCTCCATTGACCTACGTGAGTGTAATGACTAA

Protein sequence

MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCDASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQESWNDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKDARAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGMSPYALVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLHVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAVMLTNENGTTSFKVNGQRVKPYHSGEFEINKTSIDLRECND
Homology
BLAST of CaUC03G059830 vs. NCBI nr
Match: PIM97577.1 (DNA-directed DNA polymerase [Handroanthus impetiginosus])

HSP 1 Score: 933.3 bits (2411), Expect = 1.2e-267
Identity = 459/750 (61.20%), Postives = 537/750 (71.60%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGLCNAP TFQRCMMAIF+D +E  +EVFMDDFSV+G S+DECL NL  VLKRCEDTN
Sbjct: 826  MPFGLCNAPATFQRCMMAIFTDMVENCLEVFMDDFSVYGNSFDECLNNLSCVLKRCEDTN 885

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            L+LNWEKCHFMV EGIVLGHK+S  G+EVD+AK++ I KLP PT+VK +RSFLGHAGFYR
Sbjct: 886  LILNWEKCHFMVQEGIVLGHKVSNRGIEVDKAKLETIEKLPPPTSVKGVRSFLGHAGFYR 945

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS+++KPL  LLE +  FNFD  C +AF  L+  LISAPI+  PDWS PFELMCD
Sbjct: 946  RFIKDFSKISKPLCNLLEKDIPFNFDDACRDAFNDLKGRLISAPIITVPDWSFPFELMCD 1005

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS+ AVGAVLGQRK+KI   IYY SKTLN +Q NYTTTEKE+LA+VFA DKFR+YL+G+K
Sbjct: 1006 ASDFAVGAVLGQRKDKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLVGTK 1065

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
            V +Y+DH+AI+YL+ KKDAKPRLIRWVLLLQEFDLEI+DRKGTENQ+ADHLSRLE+    
Sbjct: 1066 VIVYTDHAAIRYLIEKKDAKPRLIRWVLLLQEFDLEIRDRKGTENQIADHLSRLESPAKT 1125

Query: 301  ESWNDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
            +  N I + FPDE ++    S  PWYADIVNYL C   P + +AQQKKK   +++ Y WD
Sbjct: 1126 DEPNLINDNFPDEQLLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKFLFDTRRYFWD 1185

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKD 420
            +P+L++ GPD+ILRRCVPE E + IL  CH +PYGGHF G RT AK+LQSG+FWP LFKD
Sbjct: 1186 DPFLFKQGPDNILRRCVPEIEMNDILEQCHASPYGGHFHGDRTAAKILQSGFFWPNLFKD 1245

Query: 421  ARAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDY 480
            A ++   CDRCQR GNIS R+EMPLN++LEVELFDVWGIDFMGPF PS GN YILVAVDY
Sbjct: 1246 AHSFVANCDRCQRTGNISRRHEMPLNTILEVELFDVWGIDFMGPFIPSFGNMYILVAVDY 1305

Query: 481  VSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE-------------------- 540
            VSKWVEAAA   ND+  V  F+KK IF+RFGTPRAIISD                     
Sbjct: 1306 VSKWVEAAAVPNNDSKVVVNFIKKNIFTRFGTPRAIISDGGTHFCNRSFEALLSKYGVKH 1365

Query: 541  -------------------------------------------------------GMSPY 600
                                                                   GMSPY
Sbjct: 1366 KISTPYHPQTSGQVEVSNREIKRILEKTVSSTRKDWSKRLDEALWAYRTAYKTPIGMSPY 1425

Query: 601  ALVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKER 660
             LVFGKACHLP ELEH   WA++KLN D +A+GE R LQLNEL E+R  AYENAK+YKE+
Sbjct: 1426 RLVFGKACHLPVELEHNAYWAIRKLNFDMQAAGEKRLLQLNELDEFRLHAYENAKIYKEK 1485

Query: 661  TKKWHDKNISKKTLHVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAVMLTNENG 675
             K+WH+K I ++    GQ VLLFNSRL+LFPGKLKSRWSGPF I EVFPHGAV L N+N 
Sbjct: 1486 KKRWHEKKIVERHFEPGQYVLLFNSRLKLFPGKLKSRWSGPFRITEVFPHGAVELENKNS 1545

BLAST of CaUC03G059830 vs. NCBI nr
Match: BBH06778.1 (transposable element gene [Prunus dulcis])

HSP 1 Score: 907.9 bits (2345), Expect = 5.2e-260
Identity = 445/750 (59.33%), Postives = 535/750 (71.33%), Query Frame = 0

Query: 1   MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
           MPFGLCNAP TFQRCMM+IFSD +E+ +EVFMDDFSVFG S+D CL NL  VL RCE+TN
Sbjct: 148 MPFGLCNAPATFQRCMMSIFSDMVERFIEVFMDDFSVFGSSFDSCLDNLALVLARCEETN 207

Query: 61  LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
           LVLNWEKCHFMV EGIVLGHKIS  G+EVD+AKI+ I KLP P+ VK +RSFLGHAGFYR
Sbjct: 208 LVLNWEKCHFMVQEGIVLGHKISARGIEVDRAKIETIEKLPPPSTVKGIRSFLGHAGFYR 267

Query: 121 RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
           RFIK FS++ KPL +LL  + EFNFD  CL AF  L+  L +AP+++APDW LPFE+MCD
Sbjct: 268 RFIKDFSKITKPLCKLLLKDSEFNFDSDCLEAFNLLKTKLTTAPVIMAPDWELPFEIMCD 327

Query: 181 ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
           AS++A+GAVLGQRK K++H I+Y S+TLN +Q NY TTEKE+LA+VFA+DKFR+YL+G+K
Sbjct: 328 ASDYAIGAVLGQRKNKLLHVIHYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLLGAK 387

Query: 241 VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRL-ENKEV 300
           V +Y+DH+A+K+L+AKK+AKPRLIRWVLLLQEFD+EI+D+KG+EN VADHLSRL    EV
Sbjct: 388 VIVYTDHAALKFLLAKKEAKPRLIRWVLLLQEFDIEIRDKKGSENVVADHLSRLVREDEV 447

Query: 301 QESWNDIEERFPDEHVMNAESQE----PWYADIVNYLVCNQWPEEFNAQQKKKLRHESKF 360
            E    I E FPDE + +  S +    PWYAD VNYL C   P + +  QKKK     K 
Sbjct: 448 IEDVGPILETFPDEQLYSIYSAKEFITPWYADFVNYLACGILPPDMSFYQKKKFLSLVKH 507

Query: 361 YCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPT 420
           Y WD+PYL++ GPD ++RRCVPE E   IL  CH    GGH+G  +TTAKVLQSG+FWPT
Sbjct: 508 YYWDDPYLWKHGPDQVIRRCVPETEMADILLHCHTLACGGHYGASKTTAKVLQSGFFWPT 567

Query: 421 LFKDARAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILV 480
           LFKDA+ +   CD CQR GNIS+RN+MPLN++LEVELFDVWGIDFMGPFP S GN YILV
Sbjct: 568 LFKDAQDFVARCDPCQRTGNISSRNQMPLNNILEVELFDVWGIDFMGPFPASYGNLYILV 627

Query: 481 AVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE---------------- 540
           AVDYVSKWVEAAA   NDA  V +FL+K IF+RFG PRAIISD                 
Sbjct: 628 AVDYVSKWVEAAALPTNDAKVVVRFLRKNIFTRFGVPRAIISDGGTHFCNRQFNSLLAKY 687

Query: 541 -----------------------------------------------------------G 600
                                                                      G
Sbjct: 688 GITHKVSTPYHPQTSGQVEVSNRELKKILEKTVSASRKDWSLKLDDALWAYRTAFKAPIG 747

Query: 601 MSPYALVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKL 660
           MSPY LVFGKACHLP ELEHK  WA+K LN D  ++GE RKLQLNEL E R+ +YENAK+
Sbjct: 748 MSPYRLVFGKACHLPVELEHKAFWAIKTLNFDMSSAGEKRKLQLNELEELRNESYENAKI 807

Query: 661 YKERTKKWHDKNISKKTLHVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAVMLT 671
           YK+RTKKWHDK+I KK  +VGQ VLL+NSRL+LFPGKL+SRWSGPF +  V+P+G V + 
Sbjct: 808 YKDRTKKWHDKHILKKEFYVGQSVLLYNSRLKLFPGKLRSRWSGPFTVLTVYPYGTVEIK 867

BLAST of CaUC03G059830 vs. NCBI nr
Match: XP_012833379.1 (PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata])

HSP 1 Score: 902.5 bits (2331), Expect = 2.2e-258
Identity = 439/734 (59.81%), Postives = 525/734 (71.53%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGLCNAP TFQRCMM+IF D +E+ +EVFMDDFSVFG S+D C+ NLE VLKRC +TN
Sbjct: 918  MPFGLCNAPATFQRCMMSIFHDMVEEFLEVFMDDFSVFGSSFDHCVHNLELVLKRCTETN 977

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            LVLNWEKCHFMV EGIVLGHK+SK GLEVD+AKI+ I KLP P +VK +RSFLGHAGFYR
Sbjct: 978  LVLNWEKCHFMVREGIVLGHKVSKKGLEVDRAKIETIEKLPPPKDVKGVRSFLGHAGFYR 1037

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++ KPL  LLE    F+FD  CL AF  L++ L  +PI++ P+W  PFE+MCD
Sbjct: 1038 RFIKDFSKIVKPLCHLLEKEAVFDFDSACLQAFTFLKEKLTQSPIMITPNWEEPFEIMCD 1097

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS++AVGAVLGQR++KI   IYY S+TL+ +Q+NY+TTEKEMLA+V+AVDKFR Y++GS+
Sbjct: 1098 ASDYAVGAVLGQRRDKIFKAIYYSSRTLDQAQKNYSTTEKEMLAVVYAVDKFRPYILGSQ 1157

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
            V IY+DH+AI+YL AKKDAKPRLIRWVLLLQEFDLEI+D+KG+EN VADHLSRL  +EV 
Sbjct: 1158 VIIYTDHAAIRYLFAKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRLILEEVP 1217

Query: 301  ESWNDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDE 360
               N I+E FPDE ++   +  PWYAD+ N+L     P++ +  QKKK  H+S+FY WDE
Sbjct: 1218 AEGN-IQESFPDEQLLAISTHTPWYADVANFLASGIIPDDLSYHQKKKFLHDSRFYLWDE 1277

Query: 361  PYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKDA 420
            P L+R GPD ++RRCVPE E   IL  CH +P GGH G  RT AKVLQSG+FWPTLF+D+
Sbjct: 1278 PLLFRTGPDRVIRRCVPETEVREILTHCHSSPCGGHHGESRTAAKVLQSGFFWPTLFRDS 1337

Query: 421  RAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYV 480
              +   CDRCQR GN+SN+++MPLN+M EVELFDVWGIDFMGPFP S G  YIL+AVDYV
Sbjct: 1338 YDFVKRCDRCQRTGNLSNKSQMPLNNMQEVELFDVWGIDFMGPFPSSNGKLYILLAVDYV 1397

Query: 481  SKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE--------------------- 540
            SKWVEA A   NDA TV KF  K IFSRFGTPRAIISDE                     
Sbjct: 1398 SKWVEAIATTANDARTVLKFFHKNIFSRFGTPRAIISDEGSHFCNKLLTNLTNKLGIRHK 1457

Query: 541  ------------------------------------------------------GMSPYA 600
                                                                  GMSPY 
Sbjct: 1458 IALAYHPQTNGLAELSNREIKQILEKTVSTNRKDWALKLDDALWAYRTAFKTPIGMSPYK 1517

Query: 601  LVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERT 660
            LVFGKACHLP ELEH+  WA+KKLN DQ A+G+ R LQLNE+ E+R+ AYENAK+YKE+T
Sbjct: 1518 LVFGKACHLPVELEHRAYWAVKKLNFDQTATGDRRLLQLNEMEEFRNDAYENAKIYKEKT 1577

BLAST of CaUC03G059830 vs. NCBI nr
Match: XP_012858910.1 (PREDICTED: uncharacterized protein LOC105978045 [Erythranthe guttata])

HSP 1 Score: 902.5 bits (2331), Expect = 2.2e-258
Identity = 440/734 (59.95%), Postives = 524/734 (71.39%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGLCNAP TFQRCMM+IF D +E+ +EVFMDDFSVFG S+D C+ NLE VLKRC +TN
Sbjct: 958  MPFGLCNAPATFQRCMMSIFHDMVEEFLEVFMDDFSVFGSSFDHCVHNLELVLKRCTETN 1017

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            LVLNWEKCHFMV EGIVLGHK+SK GLEVD+AKI+ I KLP P +VK +RSFLGHAGFYR
Sbjct: 1018 LVLNWEKCHFMVREGIVLGHKVSKKGLEVDRAKIETIEKLPPPKDVKGVRSFLGHAGFYR 1077

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++ KPL  LLE    F+FD  CL AF  L++ L  +PI++ P+W  PFE+MCD
Sbjct: 1078 RFIKDFSKIVKPLCHLLEKEAVFDFDSACLQAFTFLKEKLTQSPIMITPNWEEPFEIMCD 1137

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS++AVGAVLGQR++KI   IYY S+TL+ +Q+NY+TTEKEMLA+V+AVDKFR Y++GS+
Sbjct: 1138 ASDYAVGAVLGQRRDKIFKAIYYSSRTLDQAQKNYSTTEKEMLAVVYAVDKFRPYILGSQ 1197

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
            V IY+DH+AI+YL AKKDAKPRLIRWVLLLQEFDLEI+D+KG+EN VADHLSRL  +EV 
Sbjct: 1198 VIIYTDHAAIRYLFAKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRLILEEVP 1257

Query: 301  ESWNDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDE 360
               N I+E FPDE ++   +  PWYAD+ N+L     P++    QKKK  H+S+FY WDE
Sbjct: 1258 AEGN-IQESFPDEQLLAISTHTPWYADVANFLASGIIPDDLYYHQKKKFLHDSRFYLWDE 1317

Query: 361  PYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKDA 420
            P L+R GPD ++RRCVPE E   IL  CH +P GGH G  RT AKVLQSG+FWPTLF+D+
Sbjct: 1318 PLLFRTGPDRVIRRCVPETEVREILTHCHSSPCGGHHGESRTAAKVLQSGFFWPTLFRDS 1377

Query: 421  RAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYV 480
              +   CDRCQR GN+SN+++MPLN+M EVELFDVWGIDFMGPFP S G  YIL+AVDYV
Sbjct: 1378 YEFVKRCDRCQRTGNLSNKSQMPLNNMQEVELFDVWGIDFMGPFPSSNGKLYILLAVDYV 1437

Query: 481  SKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE--------------------- 540
            SKWVEA A   NDA TV KF  K IFSRFGTPRAIISDE                     
Sbjct: 1438 SKWVEAIATTTNDARTVLKFFHKNIFSRFGTPRAIISDEGSHFCNKLLTNLTNKLGIRHK 1497

Query: 541  ------------------------------------------------------GMSPYA 600
                                                                  GMSPY 
Sbjct: 1498 IALAYHPQTNGLVELSNREIKQILEKTVSTNRKDWALKLDDALWAYRTAFKTPIGMSPYK 1557

Query: 601  LVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERT 660
            LVFGKACHLP ELEH+  WA+KKLN DQ A+G+ R LQLNEL E+R+ AYENAK+YKE+T
Sbjct: 1558 LVFGKACHLPVELEHRAYWAVKKLNFDQTATGDRRLLQLNELEEFRNDAYENAKIYKEKT 1617

BLAST of CaUC03G059830 vs. NCBI nr
Match: XP_012853783.1 (PREDICTED: uncharacterized protein LOC105973307 [Erythranthe guttata])

HSP 1 Score: 902.1 bits (2330), Expect = 2.9e-258
Identity = 439/734 (59.81%), Postives = 523/734 (71.25%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGLCNAP TFQRCMM+IF D +E+ +EVFMDDFSVFG S+D C+ NLE VLKRC +TN
Sbjct: 993  MPFGLCNAPATFQRCMMSIFHDMVEEFLEVFMDDFSVFGSSFDRCVHNLELVLKRCTETN 1052

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            LVLNWEKCHFMV EGIVLGHK+SK GLEVD+AKI+ I KLP P +VK +RSFLGHAGFYR
Sbjct: 1053 LVLNWEKCHFMVREGIVLGHKVSKKGLEVDRAKIETIEKLPPPKDVKGVRSFLGHAGFYR 1112

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++ KPL  LLE    F+FD  CL AF  L++ L  +PI++ PDW  PFE+MCD
Sbjct: 1113 RFIKDFSKIVKPLCHLLEKEAVFDFDSACLQAFTFLKEKLTQSPIMITPDWEEPFEIMCD 1172

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS++AVGAVLGQR++KI   IYY+S+TL+ +Q+NY+TTEKEMLA+V+AVDKFR Y++GS+
Sbjct: 1173 ASDYAVGAVLGQRRDKIFKAIYYLSRTLDQAQKNYSTTEKEMLAVVYAVDKFRPYILGSQ 1232

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
            V IY+DH+AI+YL AKKDAKPRLIRWVLLLQEFDLEI+D+KG+EN VADHLSRL   EV 
Sbjct: 1233 VIIYTDHAAIRYLFAKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRLILGEVP 1292

Query: 301  ESWNDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDE 360
               N I+E FPDE ++   +  PWYAD+ N+L     P++ +  QKKK  H+S+FY WDE
Sbjct: 1293 AEGN-IQESFPDEQLLAISTHTPWYADVANFLASGIIPDDLSYHQKKKFLHDSRFYLWDE 1352

Query: 361  PYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKDA 420
            P L+R GPD ++RRCVPE E   IL  CH +P GGH G  RT AKVLQSG+FWPTLF+D+
Sbjct: 1353 PLLFRTGPDRVIRRCVPETEVREILTHCHSSPCGGHHGESRTAAKVLQSGFFWPTLFRDS 1412

Query: 421  RAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYV 480
              +   CDRCQR GN+SN+++MPLN M EVELFDVWGIDFMGPFP S G  YIL+AVDYV
Sbjct: 1413 YEFVKRCDRCQRTGNLSNKSQMPLNDMQEVELFDVWGIDFMGPFPSSNGKLYILLAVDYV 1472

Query: 481  SKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE--------------------- 540
            SKWVEA A   NDA TV KF  K IFSRFGTPRAIISDE                     
Sbjct: 1473 SKWVEAIATTTNDARTVLKFFHKNIFSRFGTPRAIISDEGSHFCNKLLTNLTNKLGIRHK 1532

Query: 541  ------------------------------------------------------GMSPYA 600
                                                                  GMSPY 
Sbjct: 1533 IALAYHPQTNGLAELSNREIKQILEKTVSTNRKDWALKLDDALWAYRTAFKTPIGMSPYK 1592

Query: 601  LVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERT 660
            LV+GKACHLP ELEH+  WA+KKLN DQ A G+ R LQLNE+ E+R+ AYENAK+YKE+T
Sbjct: 1593 LVYGKACHLPVELEHRAYWAVKKLNFDQTAIGDRRLLQLNEMEEFRNDAYENAKIYKEKT 1652

BLAST of CaUC03G059830 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 1.4e-61
Identity = 130/299 (43.48%), Postives = 183/299 (61.20%), Query Frame = 0

Query: 1   MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
           MPFGL NAP TFQRCM  I    L +   V++DD  VF  S DE L +L  V ++    N
Sbjct: 334 MPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKAN 393

Query: 61  LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
           L L  +KC F+  E   LGH ++  G++ +  KI+AI K P PT  K +++FLG  G+YR
Sbjct: 394 LKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYR 453

Query: 121 RFIKGFSQVAKPLSELLEVNREFN-FDGKCLNAFESLRQALISAPILVAPDWSLPFELMC 180
           +FI  F+ +AKP+++ L+ N + +  + +  +AF+ L+  +   PIL  PD++  F L  
Sbjct: 454 KFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTT 513

Query: 181 DASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGS 240
           DAS+ A+GAVL Q      HP+ Y+S+TLN  + NY+T EKE+LAIV+A   FR YL+G 
Sbjct: 514 DASDVALGAVLSQDG----HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGR 573

Query: 241 KVTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKE 299
              I SDH  + +L   KD   +L RW + L EFD +IK  KG EN VAD LSR++ +E
Sbjct: 574 HFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEE 628

BLAST of CaUC03G059830 vs. ExPASy Swiss-Prot
Match: P10394 (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster OX=7227 GN=POL PE=4 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 1.2e-60
Identity = 168/593 (28.33%), Postives = 263/593 (44.35%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            +PFGL  AP +FQR M   FS        ++MDD  V G S    L NL  V  +C + N
Sbjct: 442  LPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYN 501

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            L L+ EKC F + E   LGHK +  G+  D  K D I   P P +  + R F+    +YR
Sbjct: 502  LKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYR 561

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK F+  ++ ++ L + N  F +  +C  AF  L+  LI+  +L  PD+S  F +  D
Sbjct: 562  RFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTD 621

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS  A GAVL Q       P+ Y S+     + N +TTE+E+ AI +A+  FR Y+ G  
Sbjct: 622  ASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKH 681

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
             T+ +DH  + YL +  +   +L R  L L+E++  ++  KG +N VAD LSR+  KE++
Sbjct: 682  FTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSRITIKELK 741

Query: 301  ESWNDI----------------EERFPDEHVMNAESQEPWYADIV-------------NY 360
            +   +I                +E+   +      + EP   +++             N 
Sbjct: 742  DITGNILKVTTRFQSRQKSCAGKEQLDLQKQTKEIASEPNVYEVITNDEVRKVVTLQLND 801

Query: 361  LVC-------------------------NQWPEEFNAQQ-------------KKKLRHES 420
             +C                         +Q+ +    Q              KK   H S
Sbjct: 802  SICLFKHGKKIIARYDVGDLYTNGILDLDQFLQRLELQAGIYDISQIKMAPWKKIFEHVS 861

Query: 421  --KFYCWDEPYLYRLGPDHI--LRRCVPEYETHSILRSCHEAP-YGGHFGGQRTTAKVLQ 480
              KF       L  L    +  + +   E E  +IL + H+ P  GGH G  +T AKV +
Sbjct: 862  IDKFKNMGNKILKNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKV-K 921

Query: 481  SGYFWPTLFKDARAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPS- 521
              Y+W  + K  + Y   C +CQ+     +       +      FD   +D +GP P S 
Sbjct: 922  RHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSE 981

BLAST of CaUC03G059830 vs. ExPASy Swiss-Prot
Match: P20825 (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 1.4e-58
Identity = 124/303 (40.92%), Postives = 178/303 (58.75%), Query Frame = 0

Query: 1   MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
           MPFGL NAP TFQRCM  I    L +   V++DD  +F  S  E L +++ V  +  D N
Sbjct: 333 MPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADAN 392

Query: 61  LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
           L L  +KC F+  E   LGH ++  G++ +  K+ AI   P PT  K +R+FLG  G+YR
Sbjct: 393 LKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYR 452

Query: 121 RFIKGFSQVAKPLSELLEVNREFNFDG-KCLNAFESLRQALISAPILVAPDWSLPFELMC 180
           +FI  ++ +AKP++  L+   + +    + + AFE L+  +I  PIL  PD+   F L  
Sbjct: 453 KFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTT 512

Query: 181 DASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGS 240
           DASN A+GAVL Q      HPI ++S+TLN  + NY+  EKE+LAIV+A   FR YL+G 
Sbjct: 513 DASNLALGAVLSQNG----HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGR 572

Query: 241 KVTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEV 300
           +  I SDH  +++L   K+   +L RW + L E+  +I   KG EN VAD LSR++ +E 
Sbjct: 573 QFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEEN 631

Query: 301 QES 303
             S
Sbjct: 633 HHS 631

BLAST of CaUC03G059830 vs. ExPASy Swiss-Prot
Match: Q8I7P9 (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 4.5e-57
Identity = 119/306 (38.89%), Postives = 183/306 (59.80%), Query Frame = 0

Query: 1   MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
           +PFGL NAP  FQR +  I  +++ +   V++DD  VF + YD    NL  VL      N
Sbjct: 250 LPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKAN 309

Query: 61  LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
           L +N EK HF+ T+   LG+ ++  G++ D  K+ AI+++P PT+VK L+ FLG   +YR
Sbjct: 310 LQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYR 369

Query: 121 RFIKGFSQVAKPLSEL---LEVNRE--------FNFDGKCLNAFESLRQALISAPILVAP 180
           +FI+ +++VAKPL+ L   L  N +           D   L +F  L+  L S+ IL  P
Sbjct: 370 KFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFP 429

Query: 181 DWSLPFELMCDASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAV 240
            ++ PF L  DASN A+GAVL Q  +    PI Y+S++LN ++ENY T EKEMLAI++++
Sbjct: 430 CFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSL 489

Query: 241 DKFRAYLIGS-KVTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVA 295
           D  RAYL G+  + +Y+DH  + + +  ++   +L RW   ++E++ E+  + G  N VA
Sbjct: 490 DNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVA 549

BLAST of CaUC03G059830 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 223.0 bits (567), Expect = 1.0e-56
Identity = 172/530 (32.45%), Postives = 256/530 (48.30%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGL NAP TF R M   F D   + V V++DD  +F +S +E   +L+ VL+R ++ N
Sbjct: 744  MPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNEN 803

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            L++  +KC F   E   LG+ I    +   Q K  AI   P P  VK  + FLG   +YR
Sbjct: 804  LIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYR 863

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFI   S++A+P+   L +  +  +  K   A E L+ AL ++P+LV  +    + L  D
Sbjct: 864  RFIPNCSKIAQPIQ--LFICDKSQWTEKQDKAIEKLKAALCNSPVLVPFNNKANYRLTTD 923

Query: 181  ASNHAVGAVLGQ--RKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIG 240
            AS   +GAVL +   K K++  + Y SK+L S+Q+NY   E E+L I+ A+  FR  L G
Sbjct: 924  ASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHG 983

Query: 241  SKVTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKE 300
               T+ +DH ++  L  K +   R+ RW+  L  +D  ++   G +N VAD +SR     
Sbjct: 984  KHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAIYTI 1043

Query: 301  VQESWNDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEE---FNAQQKKKLRHES-- 360
              E+   I+      +  +          +      N  PE+   F + QKK    E+  
Sbjct: 1044 TPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFR 1103

Query: 361  KFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHE-APYGGHFGGQRTTAKVLQSGYF 420
            K Y  ++  +Y     +  R  VP  + ++++R  H+   +GGHFG   T AK+    Y+
Sbjct: 1104 KNYSLEDEMIY-----YQDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKI-SPIYY 1163

Query: 421  WPTLFKDARAYAVACDRCQRIGNISNRNEMPLNSM--LEVELFDVWGIDFMGPFPPSCGN 480
            WP L      Y   C +CQ I +   R    L  +   E    D+  +DF+   PP+  N
Sbjct: 1164 WPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDI-SMDFVTGLPPTSNN 1223

Query: 481  -QYILVAVDYVSKWVEAAACAKN-DANTVSKFLKKQIFSRFGTPRAIISD 519
               ILV VD  SK     A  K  DA  +   L + IFS  G PR I SD
Sbjct: 1224 LNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSD 1262

BLAST of CaUC03G059830 vs. ExPASy TrEMBL
Match: A0A2G9FWY3 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_29952 PE=4 SV=1)

HSP 1 Score: 933.3 bits (2411), Expect = 5.6e-268
Identity = 459/750 (61.20%), Postives = 537/750 (71.60%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGLCNAP TFQRCMMAIF+D +E  +EVFMDDFSV+G S+DECL NL  VLKRCEDTN
Sbjct: 826  MPFGLCNAPATFQRCMMAIFTDMVENCLEVFMDDFSVYGNSFDECLNNLSCVLKRCEDTN 885

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            L+LNWEKCHFMV EGIVLGHK+S  G+EVD+AK++ I KLP PT+VK +RSFLGHAGFYR
Sbjct: 886  LILNWEKCHFMVQEGIVLGHKVSNRGIEVDKAKLETIEKLPPPTSVKGVRSFLGHAGFYR 945

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS+++KPL  LLE +  FNFD  C +AF  L+  LISAPI+  PDWS PFELMCD
Sbjct: 946  RFIKDFSKISKPLCNLLEKDIPFNFDDACRDAFNDLKGRLISAPIITVPDWSFPFELMCD 1005

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS+ AVGAVLGQRK+KI   IYY SKTLN +Q NYTTTEKE+LA+VFA DKFR+YL+G+K
Sbjct: 1006 ASDFAVGAVLGQRKDKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLVGTK 1065

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
            V +Y+DH+AI+YL+ KKDAKPRLIRWVLLLQEFDLEI+DRKGTENQ+ADHLSRLE+    
Sbjct: 1066 VIVYTDHAAIRYLIEKKDAKPRLIRWVLLLQEFDLEIRDRKGTENQIADHLSRLESPAKT 1125

Query: 301  ESWNDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
            +  N I + FPDE ++    S  PWYADIVNYL C   P + +AQQKKK   +++ Y WD
Sbjct: 1126 DEPNLINDNFPDEQLLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKFLFDTRRYFWD 1185

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKD 420
            +P+L++ GPD+ILRRCVPE E + IL  CH +PYGGHF G RT AK+LQSG+FWP LFKD
Sbjct: 1186 DPFLFKQGPDNILRRCVPEIEMNDILEQCHASPYGGHFHGDRTAAKILQSGFFWPNLFKD 1245

Query: 421  ARAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDY 480
            A ++   CDRCQR GNIS R+EMPLN++LEVELFDVWGIDFMGPF PS GN YILVAVDY
Sbjct: 1246 AHSFVANCDRCQRTGNISRRHEMPLNTILEVELFDVWGIDFMGPFIPSFGNMYILVAVDY 1305

Query: 481  VSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE-------------------- 540
            VSKWVEAAA   ND+  V  F+KK IF+RFGTPRAIISD                     
Sbjct: 1306 VSKWVEAAAVPNNDSKVVVNFIKKNIFTRFGTPRAIISDGGTHFCNRSFEALLSKYGVKH 1365

Query: 541  -------------------------------------------------------GMSPY 600
                                                                   GMSPY
Sbjct: 1366 KISTPYHPQTSGQVEVSNREIKRILEKTVSSTRKDWSKRLDEALWAYRTAYKTPIGMSPY 1425

Query: 601  ALVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKER 660
             LVFGKACHLP ELEH   WA++KLN D +A+GE R LQLNEL E+R  AYENAK+YKE+
Sbjct: 1426 RLVFGKACHLPVELEHNAYWAIRKLNFDMQAAGEKRLLQLNELDEFRLHAYENAKIYKEK 1485

Query: 661  TKKWHDKNISKKTLHVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAVMLTNENG 675
             K+WH+K I ++    GQ VLLFNSRL+LFPGKLKSRWSGPF I EVFPHGAV L N+N 
Sbjct: 1486 KKRWHEKKIVERHFEPGQYVLLFNSRLKLFPGKLKSRWSGPFRITEVFPHGAVELENKNS 1545

BLAST of CaUC03G059830 vs. ExPASy TrEMBL
Match: A0A4Y1RS99 (Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018514 PE=4 SV=1)

HSP 1 Score: 907.9 bits (2345), Expect = 2.5e-260
Identity = 445/750 (59.33%), Postives = 535/750 (71.33%), Query Frame = 0

Query: 1   MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
           MPFGLCNAP TFQRCMM+IFSD +E+ +EVFMDDFSVFG S+D CL NL  VL RCE+TN
Sbjct: 148 MPFGLCNAPATFQRCMMSIFSDMVERFIEVFMDDFSVFGSSFDSCLDNLALVLARCEETN 207

Query: 61  LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
           LVLNWEKCHFMV EGIVLGHKIS  G+EVD+AKI+ I KLP P+ VK +RSFLGHAGFYR
Sbjct: 208 LVLNWEKCHFMVQEGIVLGHKISARGIEVDRAKIETIEKLPPPSTVKGIRSFLGHAGFYR 267

Query: 121 RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
           RFIK FS++ KPL +LL  + EFNFD  CL AF  L+  L +AP+++APDW LPFE+MCD
Sbjct: 268 RFIKDFSKITKPLCKLLLKDSEFNFDSDCLEAFNLLKTKLTTAPVIMAPDWELPFEIMCD 327

Query: 181 ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
           AS++A+GAVLGQRK K++H I+Y S+TLN +Q NY TTEKE+LA+VFA+DKFR+YL+G+K
Sbjct: 328 ASDYAIGAVLGQRKNKLLHVIHYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLLGAK 387

Query: 241 VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRL-ENKEV 300
           V +Y+DH+A+K+L+AKK+AKPRLIRWVLLLQEFD+EI+D+KG+EN VADHLSRL    EV
Sbjct: 388 VIVYTDHAALKFLLAKKEAKPRLIRWVLLLQEFDIEIRDKKGSENVVADHLSRLVREDEV 447

Query: 301 QESWNDIEERFPDEHVMNAESQE----PWYADIVNYLVCNQWPEEFNAQQKKKLRHESKF 360
            E    I E FPDE + +  S +    PWYAD VNYL C   P + +  QKKK     K 
Sbjct: 448 IEDVGPILETFPDEQLYSIYSAKEFITPWYADFVNYLACGILPPDMSFYQKKKFLSLVKH 507

Query: 361 YCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPT 420
           Y WD+PYL++ GPD ++RRCVPE E   IL  CH    GGH+G  +TTAKVLQSG+FWPT
Sbjct: 508 YYWDDPYLWKHGPDQVIRRCVPETEMADILLHCHTLACGGHYGASKTTAKVLQSGFFWPT 567

Query: 421 LFKDARAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILV 480
           LFKDA+ +   CD CQR GNIS+RN+MPLN++LEVELFDVWGIDFMGPFP S GN YILV
Sbjct: 568 LFKDAQDFVARCDPCQRTGNISSRNQMPLNNILEVELFDVWGIDFMGPFPASYGNLYILV 627

Query: 481 AVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE---------------- 540
           AVDYVSKWVEAAA   NDA  V +FL+K IF+RFG PRAIISD                 
Sbjct: 628 AVDYVSKWVEAAALPTNDAKVVVRFLRKNIFTRFGVPRAIISDGGTHFCNRQFNSLLAKY 687

Query: 541 -----------------------------------------------------------G 600
                                                                      G
Sbjct: 688 GITHKVSTPYHPQTSGQVEVSNRELKKILEKTVSASRKDWSLKLDDALWAYRTAFKAPIG 747

Query: 601 MSPYALVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKL 660
           MSPY LVFGKACHLP ELEHK  WA+K LN D  ++GE RKLQLNEL E R+ +YENAK+
Sbjct: 748 MSPYRLVFGKACHLPVELEHKAFWAIKTLNFDMSSAGEKRKLQLNELEELRNESYENAKI 807

Query: 661 YKERTKKWHDKNISKKTLHVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAVMLT 671
           YK+RTKKWHDK+I KK  +VGQ VLL+NSRL+LFPGKL+SRWSGPF +  V+P+G V + 
Sbjct: 808 YKDRTKKWHDKHILKKEFYVGQSVLLYNSRLKLFPGKLRSRWSGPFTVLTVYPYGTVEIK 867

BLAST of CaUC03G059830 vs. ExPASy TrEMBL
Match: A0A2K3PBF7 (Reverse transcriptase OS=Trifolium pratense OX=57577 GN=L195_g009250 PE=4 SV=1)

HSP 1 Score: 890.6 bits (2300), Expect = 4.2e-255
Identity = 436/749 (58.21%), Postives = 529/749 (70.63%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGLCNAP TFQRCM AIFSD +E+ +EVFMDDFSVFG S+  CL NL+ VLKRC +TN
Sbjct: 1066 MPFGLCNAPATFQRCMQAIFSDLIEKCIEVFMDDFSVFGPSFHGCLKNLDTVLKRCVETN 1125

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            LVLNWEKCHFMVTEGIVLGHKIS  G+EVD+AK++ I KLP P NVK +RSFLGHAGFYR
Sbjct: 1126 LVLNWEKCHFMVTEGIVLGHKISAKGIEVDKAKVEVIEKLPPPVNVKGIRSFLGHAGFYR 1185

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++AKPLS LL  ++ FNFD  CLNAFE L+  L +API++APDW+L FELMCD
Sbjct: 1186 RFIKDFSKIAKPLSNLLNKDKSFNFDNSCLNAFEELKMRLTTAPIIIAPDWTLKFELMCD 1245

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS++AVGAVLGQRK+KI H I+Y SK LN +Q NY TTEKE+LAIV+A++KFR+YLIGSK
Sbjct: 1246 ASDYAVGAVLGQRKDKIFHAIHYASKVLNEAQINYATTEKELLAIVYALEKFRSYLIGSK 1305

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
            + +Y+DH+AIKYL+ K D+KPRLIRW+LLLQEFDLEIKD+KGTEN VADHLSRL NKEV 
Sbjct: 1306 IVVYTDHAAIKYLITKSDSKPRLIRWMLLLQEFDLEIKDKKGTENLVADHLSRLVNKEVT 1365

Query: 301  ESWNDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDE 360
            +  +++ E FPDE ++  + + PW+AD+ NY      PE+ N  QKKK    +  Y WD+
Sbjct: 1366 KHEHEVREEFPDEKLLMMQ-ERPWFADMANYKASGLIPEDLNWHQKKKFLRNANQYVWDD 1425

Query: 361  PYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKDA 420
            PYL+++G D++LRRCV   E  SIL  CH +PYGGH+ G+RT AKVLQSG+FWPTLFKDA
Sbjct: 1426 PYLFKIGADNLLRRCVTTEEATSILWHCHNSPYGGHYNGERTAAKVLQSGFFWPTLFKDA 1485

Query: 421  RAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYV 480
              +A  CD+CQ  G IS RNEMPL ++L VE+FD WGIDF+GPFP S  N+YILVAVDYV
Sbjct: 1486 YQHAQKCDKCQMTGGISKRNEMPLQNILVVEVFDCWGIDFVGPFPSSFSNEYILVAVDYV 1545

Query: 481  SKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE--------------------- 540
            SKWVEA A  K D  TV KFLKK IF+RFGTPR +ISD                      
Sbjct: 1546 SKWVEAIASPKADGKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLEKALEHYGVRHK 1605

Query: 541  ------------------------------------------------------GMSPYA 600
                                                                  G++P+ 
Sbjct: 1606 VASPYHPQTNGQAEVSNREIKRILEKTVSTSRKDWSSKLDDALWAYRTAFKSPIGLTPFQ 1665

Query: 601  LVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERT 660
            +V+GKACHLP ELEHK  WA+K LN D   SG+ RKLQL+EL E R  AYE++KLYKE+ 
Sbjct: 1666 MVYGKACHLPVELEHKAYWALKFLNFDPCFSGDKRKLQLHELEEMRAQAYESSKLYKEKV 1725

Query: 661  KKWHDKNISKKTLHVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAVMLTNENGT 675
            K +HDK I  K    GQ VLLFNSRL+LFPGKLKS+WSGPF IKE+ P+GAV+L +    
Sbjct: 1726 KSYHDKKILSKEFKPGQMVLLFNSRLKLFPGKLKSKWSGPFRIKEIKPYGAVLLEDPKTK 1785

BLAST of CaUC03G059830 vs. ExPASy TrEMBL
Match: A0A2K3NPD0 (Reverse transcriptase OS=Trifolium pratense OX=57577 GN=L195_g001324 PE=4 SV=1)

HSP 1 Score: 888.6 bits (2295), Expect = 1.6e-254
Identity = 430/749 (57.41%), Postives = 529/749 (70.63%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGLCNAP TFQRCM AIFSD +E+ +EVFMDDFSVFG S+D CL NL+ VLKRC +TN
Sbjct: 633  MPFGLCNAPATFQRCMQAIFSDLIEKCIEVFMDDFSVFGSSFDCCLANLDTVLKRCVETN 692

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            LVLNWEKCHFMVTEGIVLGHKIS  G+EVD+AK++ I KLP P N+K +RSFLGHAGFYR
Sbjct: 693  LVLNWEKCHFMVTEGIVLGHKISSKGIEVDKAKVEVIEKLPPPINIKGIRSFLGHAGFYR 752

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++AKPLS LL  ++ FNFD  CL AF  L++ L +API+ APDWSL FELMCD
Sbjct: 753  RFIKDFSKIAKPLSNLLNKDKPFNFDKSCLIAFNDLKERLTTAPIITAPDWSLDFELMCD 812

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS++AVGAVLGQRK K  H I+Y SK LN +Q NY TTEKE+LAIV+A++KFR+YLIGSK
Sbjct: 813  ASDYAVGAVLGQRKNKFFHAIHYASKVLNDAQINYATTEKELLAIVYALEKFRSYLIGSK 872

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
            + +Y+DH+AIKYL+ K D+K RLIRW+LLLQEFDLEIKD+KGTEN VADHLSRL NK V 
Sbjct: 873  IIVYTDHAAIKYLITKSDSKQRLIRWMLLLQEFDLEIKDKKGTENLVADHLSRLVNKGVT 932

Query: 301  ESWNDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDE 360
            E   ++ E FPDE ++  + + PW+AD+ NY      P++FN  QKK+    +  + WD+
Sbjct: 933  EQEREVLEEFPDEKLLMVQ-ERPWFADMANYKASGLIPDDFNWHQKKRFLRIANQFVWDD 992

Query: 361  PYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKDA 420
            PYL++LG D++LRRCV + E  SIL  CH +PYGGH+ G+RT AK+LQ+G+FWPT+FKD+
Sbjct: 993  PYLFKLGADNLLRRCVTKEEATSILWHCHNSPYGGHYNGERTAAKILQAGFFWPTVFKDS 1052

Query: 421  RAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYV 480
              Y  +CD CQR G IS RNEMPL S+LEVE+FD WGIDF+GPFP S  N+YILVAVDYV
Sbjct: 1053 YEYVQSCDNCQRTGGISRRNEMPLQSILEVEVFDCWGIDFVGPFPSSLSNEYILVAVDYV 1112

Query: 481  SKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE--------------------- 540
            SKWVEA A  K D  TV KFLK+ IF+RFGTPR +ISD                      
Sbjct: 1113 SKWVEAIASPKADGKTVIKFLKRNIFTRFGTPRVLISDGGSHFCNSQLAKALEHYGVKHK 1172

Query: 541  ------------------------------------------------------GMSPYA 600
                                                                  G++P+ 
Sbjct: 1173 IASPYHPQTNGQAEVSNREIKKILEKTVSTSRKDWSLKLDEALWAYRTAFKSPIGLTPFQ 1232

Query: 601  LVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERT 660
            +++GKACHLP ELEHK  WA+K LN D+  +GE RK QL+EL E R  AYE++KLYK++ 
Sbjct: 1233 MIYGKACHLPVELEHKAFWALKFLNFDENQAGEKRKFQLHELEEMRFHAYESSKLYKQKV 1292

Query: 661  KKWHDKNISKKTLHVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAVMLTNENGT 675
            K +HDK I K+    GQKVLLFNSRL+LFPGKLKS+WSGPFIIKEV P+GAV + +   +
Sbjct: 1293 KSYHDKQIVKRDFQPGQKVLLFNSRLKLFPGKLKSKWSGPFIIKEVKPYGAVEIEDVEMS 1352

BLAST of CaUC03G059830 vs. ExPASy TrEMBL
Match: A0A6P8CBX2 (Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116194359 PE=4 SV=1)

HSP 1 Score: 887.9 bits (2293), Expect = 2.7e-254
Identity = 436/749 (58.21%), Postives = 532/749 (71.03%), Query Frame = 0

Query: 1    MPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLERVLKRCEDTN 60
            MPFGLCNAP TFQRCMM+IFSD LE  +E+FMDDFSVFGKS++ CLTNL  VLKRC++TN
Sbjct: 927  MPFGLCNAPATFQRCMMSIFSDMLENFIEIFMDDFSVFGKSFESCLTNLGCVLKRCKETN 986

Query: 61   LVLNWEKCHFMVTEGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYR 120
            L+LNWEKCHFMV EGIVLGHK+SK G+EVD+AK++ I KLP PT+ K +RSFLGHAGFYR
Sbjct: 987  LLLNWEKCHFMVREGIVLGHKVSKKGIEVDRAKVEIIEKLPPPTSTKGVRSFLGHAGFYR 1046

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++++PL  LLE +  F F+  CL AF  L++ L SAP++VAP+W LPFELMCD
Sbjct: 1047 RFIKDFSKISRPLCNLLEKDSAFVFNDNCLQAFNLLKEKLTSAPVIVAPNWELPFELMCD 1106

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYVSKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSK 240
            AS++AVGAVLGQR+ K+ H IYY S+TLN +Q+NY TTEKE+LA++FA DKFR YLIGSK
Sbjct: 1107 ASDYAVGAVLGQRRGKVFHAIYYASRTLNEAQKNYATTEKELLAVIFACDKFRPYLIGSK 1166

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQ 300
            + +Y+DH+A+KYL AK DAKPRLIRW+LLLQEFDLEI+D KGTEN VADHLSRLE+  + 
Sbjct: 1167 IIVYTDHAALKYLFAKADAKPRLIRWILLLQEFDLEIRDTKGTENVVADHLSRLESDCLD 1226

Query: 301  ESWNDIEERFPDEHVMNAESQE-PWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
               + I E+FPDE +  AE Q  PWYADIVNY+V N  P   ++QQKKK  H+ K+Y WD
Sbjct: 1227 ---SPINEKFPDEQLHVAEIQGLPWYADIVNYMVSNITPYGLSSQQKKKFLHDVKYYFWD 1286

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTTAKVLQSGYFWPTLFKD 420
            EPYL++   D ++RRCVPE E  SI++ CH    GGHFG +RT  K+L  G++WP +F D
Sbjct: 1287 EPYLFKYCADQVIRRCVPETEQLSIIQHCHSKEAGGHFGVERTATKILSCGFYWPRVFHD 1346

Query: 421  ARAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDY 480
             R Y ++C  CQR GNIS R+E+P NS+L +ELFDVWGIDFMGPFP S  N+YILVAVDY
Sbjct: 1347 CRNYIMSCAPCQRTGNISRRHEVPQNSILVIELFDVWGIDFMGPFPSSFSNKYILVAVDY 1406

Query: 481  VSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE-------------------- 540
            VSKWVEA A   NDA  V +FLKK IFSRFG PRAIISD                     
Sbjct: 1407 VSKWVEAVALQSNDARVVIRFLKKNIFSRFGVPRAIISDGGSHFCNRQFEKLLSKYGVTH 1466

Query: 541  -------------------------------------------------------GMSPY 600
                                                                   GMSPY
Sbjct: 1467 KIATPYHPQTCGQVEVSNREIKRILEKTVNASRKDWSLKLDDALWAYRTAFKTPIGMSPY 1526

Query: 601  ALVFGKACHLPPELEHKGIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKER 660
             +V+GK+CHLP ELEHK  WA+K LN D +A+GE R LQLN++ E R  AYENA++YKER
Sbjct: 1527 KIVYGKSCHLPVELEHKAYWAIKYLNFDLQAAGEKRLLQLNQMAEMREEAYENARIYKER 1586

Query: 661  TKKWHDKNISKKTLHVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAVMLTNENG 673
             K+WHD+NI K+    GQKVLL+NSRL+LFPGKLKSRWSGPF+I  VFP+GAV L +E+ 
Sbjct: 1587 AKRWHDRNILKREFLPGQKVLLYNSRLKLFPGKLKSRWSGPFVISNVFPYGAVELKSEDD 1646

BLAST of CaUC03G059830 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 92.8 bits (229), Expect = 1.1e-18
Identity = 49/132 (37.12%), Postives = 74/132 (56.06%), Query Frame = 0

Query: 46  LTNLERVLKRCEDTNLVLNWEKCHFMVTEGIVLGHK--ISKVGLEVDQAKIDAIAKLPAP 105
           + +L  VL+  E      N +KC F   +   LGH+  IS  G+  D AK++A+   P P
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 106 TNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFESLRQALISA 165
            N   LR FLG  G+YRRF+K + ++ +PL+ELL+ N    +      AF++L+ A+ + 
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKN-SLKWTEMAALAFKALKGAVTTL 120

Query: 166 PILVAPDWSLPF 176
           P+L  PD  LPF
Sbjct: 121 PVLALPDLKLPF 131

BLAST of CaUC03G059830 vs. TAIR 10
Match: ATMG00750.1 (GAG/POL/ENV polyprotein )

HSP 1 Score: 85.9 bits (211), Expect = 1.4e-16
Identity = 35/56 (62.50%), Postives = 44/56 (78.57%), Query Frame = 0

Query: 406 VLQSGYFWPTLFKDARAYAVACDRCQRIGNISNRNEMPLNSMLEVELFDVWGIDFM 462
           VLQ+G++WPT FKDA  +  +CD CQR GN + RNEMP + +LEVE+FDVWGI FM
Sbjct: 35  VLQAGFYWPTTFKDAHGFVSSCDACQRKGNFTKRNEMPQHFILEVEVFDVWGIYFM 90

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PIM97577.11.2e-26761.20DNA-directed DNA polymerase [Handroanthus impetiginosus][more]
BBH06778.15.2e-26059.33transposable element gene [Prunus dulcis][more]
XP_012833379.12.2e-25859.81PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata][more]
XP_012858910.12.2e-25859.95PREDICTED: uncharacterized protein LOC105978045 [Erythranthe guttata][more]
XP_012853783.12.9e-25859.81PREDICTED: uncharacterized protein LOC105973307 [Erythranthe guttata][more]
Match NameE-valueIdentityDescription
P043231.4e-6143.48Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast... [more]
P103941.2e-6028.33Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaste... [more]
P208251.4e-5840.92Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste... [more]
Q8I7P94.5e-5738.89Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast... [more]
Q7LHG51.0e-5632.45Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A0A2G9FWY35.6e-26861.20Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_29952 PE=... [more]
A0A4Y1RS992.5e-26059.33Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018514 PE=4 SV=1[more]
A0A2K3PBF74.2e-25558.21Reverse transcriptase OS=Trifolium pratense OX=57577 GN=L195_g009250 PE=4 SV=1[more]
A0A2K3NPD01.6e-25457.41Reverse transcriptase OS=Trifolium pratense OX=57577 GN=L195_g001324 PE=4 SV=1[more]
A0A6P8CBX22.7e-25458.21Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116194359 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
ATMG00860.11.1e-1837.12DNA/RNA polymerases superfamily protein [more]
ATMG00750.11.4e-1662.50GAG/POL/ENV polyprotein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR041373Reverse transcriptase, RNase H-like domainPFAMPF17917RT_RNaseHcoord: 170..273
e-value: 2.2E-35
score: 121.1
NoneNo IPR availableGENE3D1.10.340.70coord: 339..433
e-value: 1.0E-16
score: 62.9
NoneNo IPR availablePANTHERPTHR24559:SF373REVERSE TRANSCRIPTASE DOMAIN, RIBONUCLEASE H-LIKE DOMAIN PROTEIN-RELATEDcoord: 24..520
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 24..520
NoneNo IPR availableCDDcd09274RNase_HI_RT_Ty3coord: 176..294
e-value: 5.74964E-61
score: 197.715
NoneNo IPR availableCDDcd01647RT_LTRcoord: 1..82
e-value: 2.73112E-34
score: 126.941
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 1..81
e-value: 2.9E-12
score: 46.6
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 1..86
e-value: 7.0E-29
score: 102.4
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 91..186
e-value: 3.6E-26
score: 93.0
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 449..521
e-value: 8.1E-20
score: 72.9
IPR041588Integrase zinc-binding domainPFAMPF17921Integrase_H2C2coord: 376..432
e-value: 1.0E-11
score: 44.7
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 439..520
score: 11.730016
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 452..532
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 1..277

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC03G059830.1CaUC03G059830.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0016779 nucleotidyltransferase activity