CmUC11G214470 (gene) Watermelon (USVL531) v1

Overview
NameCmUC11G214470
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationCmU531Chr11: 20025574 .. 20029704 (-)
RNA-Seq ExpressionCmUC11G214470
SyntenyCmUC11G214470
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATAATGCTAAATCTAGTTTTGTGCCTATATGTCATAATTGTGGTGTTGAAGGTCATATTAGACCTAACTGCTTTAAATTGAAGTATGCTCATAATACTTCTTCAAGAAGAAATTTTTCTCAAAGAGCAAAGTTCTACAATGCTCCAAGAGAGAATTTCTCCAAGAAAAGTAAAGTGCACAAATATGTCTTCAAAGATAGATCCTTGCATAATCTTGTTTGTTTTTCTTGTGGCAAGTATGGACATAAAGCTTATTCTTGTTACTTGTCTAGATCCAATGCTTTTAATGCAAATGCAAAAATGAAATGGATCCCAAAGTTTGTAAATGCTAACACTCTTGGACCCAAACAAGTATGGGTACCAAAGAATCAAACTTGAATTTTTTGTATGTAGGTTTGTTTGAAAGCCTCCAAGAAAAATAGATGGTACTTGGATAGTGGTTGCTCAAGGCACATGACCGGAGACCGATCCAAGTTTATTTCTTTCTCTAGAAAAGATGGAGGCATGGTAACCTTTGGTGATAACAAGAAAGGTAAAATAATTGGTAAGGGTGATATAGGTAATGAGTCTTCTATTTTTATTGAAAATGTGCTTTTAGTTGATGGTTTAAAGCATGATTTACTTAGTGTTAGTCAATTATGTGATAAAGGATTTAGAATTGTTTTTGATAAAAAGAATTGTATCATCAAAAATGCTAGTGATAGAAAAATCTTGTTCGTTGGAAATAGAGATGGAAATGTATACACTATTGATTTGAATGATTTCCCTATTAGTGACAAATGTCTTTCGGCTTTGCATGATGATTCTTGGTTATGGCATAGAAGATTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCCAAGAATTCTTTGGTTAGAGGTCTTCCCAAATTTAAATTTGAAAAGGATAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCCTTCAAGTCTAAAAATGTGATCTCTACTACTAGACCCTACAACTATTACACATGGACTTATTTGGTCCTTCTAGAATTGCTAGCTTTGGAGGGAACTATTATGCATTTGTGATAGTTGATGATTTTTCAAGATTTACTTGGGTTTTGATGATAAAACATAAGGACGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTTCAAAACGAAAAAGGATTTTTTATTTCTAAAATTAGAAGTGATCATGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTATTGTGAAGAAAATGGTTTTTCTCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTACTAGGTCAATGTTGCATGAGTATGGTTTACCTAAATATTTTTGGGCGGAAGCCATTAACACCGCTTGCTATGTTTTAAATAGAGTTTTAATTAGACCATCTTTAGATAAAACTCCTTATGAGCTTTGGCATGACAAAAGCCCAAATATTGGGTATTTCAAAGTTTTTGGTTGTAAATGTTTTATTTTGAACAATAAAGGAAAACTTGTAAAATTTGATTCTAAGACGGATGTTGGTATTTTCCTTGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAGAACTTTAGTTATTGAGGAATCTATGCATGTTGTGTTTGATGAATCTTGTAATATTGTTTCTAATGAGTCTATTTATAGTGATGATTTAGAAAAAGATTTTGAAGATTTACTTGTAAGTGAAAAAGGTAAAGAAATTGTTCCAAGTATGGAAGAAGTGAGCATCAATGAAAAGAAGGAAGATGGTTCTTCATCTATGCCCAAAGAATGGAGATATGCTCCATCCCATCCCAAGGAATTAATTCTTGGTGATCCCGAACAAGGTGTGAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTTTCTCAAGTTGAACCTAAAAGTTTTAAGGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTACATCAATTTGAAAGGTACAATGTTTGGGAATTAGTCCCTAGGCCTTCTAATGCTTCTATAATTGGGACTAAATGGGTTTTTAGAAACAAGATGGATGAACATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAAACTTTTGCACCAGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCCTCTTATAAGAATTTCATTTTGTATCAAATGGATGTGAAAAGTGCGTTTTTAAATGGTTATATTTTCGAGGAAGTTTATGTAGAACAACCTCTGGGCTTTGAAAATTTTGAATTGCCTAGTCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGGCTTAGTAATTTTCTACTTAAGAATGACTTTAAAACAGGAAAAATTGATACTACTCTCTTTATTAAGGTTAAAGAAAATGATATGCTTATAGTGCAAATATATGTGGATGATATTATTTTTGGTTCTACTAATCAATCTTTGTGTGAAGAATTTTCTAAGTGTATGCATAGTGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGGCTCCAAGTCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGAGATTTGCTCAAGAGATTCAAATTCAATGAAGGTAAAATTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAATGTGTGGACATAAAGATTTATCGAGGTATGATTGGATCTTTACTTTATTTGACCTCTAGTAGACCCGATATTATGTTTAGTGTATGTCCTTGTGCTAGATTTCAATCTTGTCCTAAGGAATCACATTTGCATGCCGTTAAAAGAATATTTAAATATTTGCTTGGAACTATTGATGTAGGCTTGTGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGTTATTCCAATGCGGATTTTGCCAGTAGCTTACTTGACCGTAAGAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTAAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCTTTATCCACTACCGAAGCGGAATATATTGCGCTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAAACTCTTTGTGATTTTGGATTAAAATTTGATAGTGTGCCCATATTTTGTGACAATACTAGTGCTATTAATTTGACTAAAAATCCTATACATCATTCTAGGACTAAGCATATTGATATTAGACATCATTTTATTAGAGAACATGTACAAAATGGTCATATTACTCTTGAATTTGTGAGTTCTAATAATCAATTAGCGGATATTTTTACCAAGCCTTTGAATGAAGAAAGTTTTTGCAAAAATAGGCTTGAACTTGGTATCATTCGTTGTGATGCATCTTGATATTATTATAGAATTCATATTTTTAGGGGGAGCTTTTCACTTAACATATATATGAATTTCATGTTAAAATGAAATTGATTCCTACATATTGGTTTATTGTTGTGTGTCCCTCTTATGTATATAGTTTATCCTCTTTGGGGGAGTTATGCATATTGTTTATCATTTTAGGGGGAGTAATTATCTTTGAAATAAAATTTAATTTAAAAATTAATTGATATTTCTACCCTCAAGTTGATATGTGTTAAATGTCTTTTTGTCCCTTATTAAGAGGGAGTATTCAATTGATTTTAAAGGGGGGGAGAAATTTGTGAAATTGGTTTTTATCTTGAGTTTGGTTGATACTTTGTGATGGCTCTTAAATTTACAATGTTTGTTGTATTTCTTTTAGAGGACATGTTGATAGGAGAAGAATTGCAAAATTATGTGTTCATCTTTTTAAGGAAAACTTTCATGTCATTGCTTATTTTTCTTAATTGCTATTTTTGATTGATTCCAAAAGGGGGAGAGGTAATAAGAAAAATGAAAGTATGTTAGTTTTGAATATCTTTGTTGCTATTGAATTGTGAATTGTTGTTATTGAAATATATCATTTGTTTACAAGTGAATTAAAATTATGTATGTGTGTTGATTGCAATTTAAATTTATATCGCAC

mRNA sequence

AATAATGCTAAATCTAGTTTTGTGCCTATATGTCATAATTGTGGTGTTGAAGGTCATATTAGACCTAACTGCTTTAAATTGAAGTATGCTCATAATACTTCTTCAAGAAGAAATTTTTCTCAAAGAGCAAAGTTCTACAATGCTCCAAGAGAGAATTTCTCCAAGAAAAGTAAAGTGCACAAATATGTCTTCAAAGATAGATCCTTGCATAATCTTGTTTGTTTTTCTTGTGGCAAGTATGGACATAAAGCTTATTCTTGTTACTTGTCTAGATCCAATGCTTTTAATGCAAATGCAAAAATGAAATGGATCCCAAAGTTTGTAAATGCTAACACTCTTGGACCCAAACAAGTATGGGTACCAAAGAATCAAACTTGAATTTTTTGTATGTAGGTTTGTTTGAAAGCCTCCAAGAAAAATAGATGGTACTTGGATAGTGGTTGCTCAAGGCACATGACCGGAGACCGATCCAAGTTTATTTCTTTCTCTAGAAAAGATGGAGGCATGGTAACCTTTGGTGATAACAAGAAAGGTAAAATAATTGGTAAGGGTGATATAGGTAATGAGTCTTCTATTTTTATTGAAAATGTGCTTTTAGTTGATGGTTTAAAGCATGATTTACTTAGTGTTAGTCAATTATGTGATAAAGGATTTAGAATTGTTTTTGATAAAAAGAATTGTATCATCAAAAATGCTAGTGATAGAAAAATCTTGTTCGTTGGAAATAGAGATGGAAATGTATACACTATTGATTTGAATGATTTCCCTATTAGTGACAAATGTCTTTCGGCTTTGCATGATGATTCTTGGTTATGGCATAGAAGATTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCCAAGAATTCTTTGGTTAGAGGTCTTCCCAAATTTAAATTTGAAAAGGATAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCCTTCAAGTCTAAAAATGTGATCTCTACTACTAGACCCTACAACTATTACACATGGACTTATTTGGTCCTTCTAGAATTGCTAGCTTTGGAGGGAACTATTATGCATTTGTGATAGTTGATGATTTTTCAAGATTTACTTGGGTTTTGATGATAAAACATAAGGACGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTTCAAAACGAAAAAGGATTTTTTATTTCTAAAATTAGAAGTGATCATGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTATTGTGAAGAAAATGGTTTTTCTCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTACTAGGTCAATGTTGCATGAGTATGGTTTACCTAAATATTTTTGGGCGGAAGCCATTAACACCGCTTGCTATGTTTTAAATAGAGTTTTAATTAGACCATCTTTAGATAAAACTCCTTATGAGCTTTGGCATGACAAAAGCCCAAATATTGGGTATTTCAAAGTTTTTGGTTGTAAATGTTTTATTTTGAACAATAAAGGAAAACTTGTAAAATTTGATTCTAAGACGGATGTTGGTATTTTCCTTGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAGAACTTTAGTTATTGAGGAATCTATGCATGTTGTGTTTGATGAATCTTGTAATATTGTTTCTAATGAGTCTATTTATAGTGATGATTTAGAAAAAGATTTTGAAGATTTACTTGTAAGTGAAAAAGGTAAAGAAATTGTTCCAAGTATGGAAGAAGTGAGCATCAATGAAAAGAAGGAAGATGGTTCTTCATCTATGCCCAAAGAATGGAGATATGCTCCATCCCATCCCAAGGAATTAATTCTTGGTGATCCCGAACAAGGTGTGAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTTTCTCAAGTTGAACCTAAAAGTTTTAAGGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTACATCAATTTGAAAGGTACAATGTTTGGGAATTAGTCCCTAGGCCTTCTAATGCTTCTATAATTGGGACTAAATGGGTTTTTAGAAACAAGATGGATGAACATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAAACTTTTGCACCAGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCCTCTTATAAGAATTTCATTTTGTATCAAATGGATGTGAAAAGTGCGTTTTTAAATGGTTATATTTTCGAGGAAGTTTATGTAGAACAACCTCTGGGCTTTGAAAATTTTGAATTGCCTAGTCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGGCTTAGTAATTTTCTACTTAAGAATGACTTTAAAACAGGAAAAATTGATACTACTCTCTTTATTAAGGTTAAAGAAAATGATATGCTTATAGTGCAAATATATGTGGATGATATTATTTTTGGTTCTACTAATCAATCTTTGTGTGAAGAATTTTCTAAGTGTATGCATAGTGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGGCTCCAAGTCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGAGATTTGCTCAAGAGATTCAAATTCAATGAAGGTAAAATTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAATGTGTGGACATAAAGATTTATCGAGGTATGATTGGATCTTTACTTTATTTGACCTCTAGTAGACCCGATATTATGTTTAGTGTATGTCCTTGTGCTAGATTTCAATCTTGTCCTAAGGAATCACATTTGCATGCCGTTAAAAGAATATTTAAATATTTGCTTGGAACTATTGATGTAGGCTTGTGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGTTATTCCAATGCGGATTTTGCCAGTAGCTTACTTGACCGTAAGAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTAAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCTTTATCCACTACCGAAGCGGAATATATTGCGCTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAAACTCTTTGTGATTTTGGATTAAAATTTGATAGTGTGCCCATATTTTGTGACAATACTAGTGCTATTAATTTGACTAAAAATCCTATACATCATTCTAGGACTAAGCATATTGATATTAGACATCATTTTATTAGAGAACATGTACAAAATGGTCATATTACTCTTGAATTTGTGAGTTCTAATAATCAATTAGCGGATATTTTTACCAAGCCTTTGAATGAAGAAAGTTTTTGCAAAAATAGGCTTGAACTTGGTATCATTCGTTGTGATGCATCTTGATATTATTATAGAATTCATATTTTTAGGGGGAGCTTTTCACTTAACATATATATGAATTTCATGTTAAAATGAAATTGATTCCTACATATTGGTTTATTGTTGTGTGTCCCTCTTATGTATATAGTTTATCCTCTTTGGGGGAGTTATGCATATTGTTTATCATTTTAGGGGGAGTAATTATCTTTGAAATAAAATTTAATTTAAAAATTAATTGATATTTCTACCCTCAAGTTGATATGTGTTAAATGTCTTTTTGTCCCTTATTAAGAGGGAGTATTCAATTGATTTTAAAGGGGGGGAGAAATTTGTGAAATTGGTTTTTATCTTGAGTTTGGTTGATACTTTGTGATGGCTCTTAAATTTACAATGTTTGTTGTATTTCTTTTAGAGGACATGTTGATAGGAGAAGAATTGCAAAATTATGTGTTCATCTTTTTAAGGAAAACTTTCATGTCATTGCTTATTTTTCTTAATTGCTATTTTTGATTGATTCCAAAAGGGGGAGAGGTAATAAGAAAAATGAAAGTATGTTAGTTTTGAATATCTTTGTTGCTATTGAATTGTGAATTGTTGTTATTGAAATATATCATTTGTTTACAAGTGAATTAAAATTATGTATGTGTGTTGATTGCAATTTAAATTTATATCGCAC

Coding sequence (CDS)

ATGGACTTATTTGGTCCTTCTAGAATTGCTAGCTTTGGAGGGAACTATTATGCATTTGTGATAGTTGATGATTTTTCAAGATTTACTTGGGTTTTGATGATAAAACATAAGGACGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTTCAAAACGAAAAAGGATTTTTTATTTCTAAAATTAGAAGTGATCATGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTATTGTGAAGAAAATGGTTTTTCTCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTACTAGGTCAATGTTGCATGAGTATGGTTTACCTAAATATTTTTGGGCGGAAGCCATTAACACCGCTTGCTATGTTTTAAATAGAGTTTTAATTAGACCATCTTTAGATAAAACTCCTTATGAGCTTTGGCATGACAAAAGCCCAAATATTGGGTATTTCAAAGTTTTTGGTTGTAAATGTTTTATTTTGAACAATAAAGGAAAACTTGTAAAATTTGATTCTAAGACGGATGTTGGTATTTTCCTTGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAGAACTTTAGTTATTGAGGAATCTATGCATGTTGTGTTTGATGAATCTTGTAATATTGTTTCTAATGAGTCTATTTATAGTGATGATTTAGAAAAAGATTTTGAAGATTTACTTGTAAGTGAAAAAGGTAAAGAAATTGTTCCAAGTATGGAAGAAGTGAGCATCAATGAAAAGAAGGAAGATGGTTCTTCATCTATGCCCAAAGAATGGAGATATGCTCCATCCCATCCCAAGGAATTAATTCTTGGTGATCCCGAACAAGGTGTGAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTTTCTCAAGTTGAACCTAAAAGTTTTAAGGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTACATCAATTTGAAAGGTACAATGTTTGGGAATTAGTCCCTAGGCCTTCTAATGCTTCTATAATTGGGACTAAATGGGTTTTTAGAAACAAGATGGATGAACATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAAACTTTTGCACCAGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCCTCTTATAAGAATTTCATTTTGTATCAAATGGATGTGAAAAGTGCGTTTTTAAATGGTTATATTTTCGAGGAAGTTTATGTAGAACAACCTCTGGGCTTTGAAAATTTTGAATTGCCTAGTCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGGCTTAGTAATTTTCTACTTAAGAATGACTTTAAAACAGGAAAAATTGATACTACTCTCTTTATTAAGGTTAAAGAAAATGATATGCTTATAGTGCAAATATATGTGGATGATATTATTTTTGGTTCTACTAATCAATCTTTGTGTGAAGAATTTTCTAAGTGTATGCATAGTGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGGCTCCAAGTCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGAGATTTGCTCAAGAGATTCAAATTCAATGAAGGTAAAATTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAATGTGTGGACATAAAGATTTATCGAGGTATGATTGGATCTTTACTTTATTTGACCTCTAGTAGACCCGATATTATGTTTAGTGTATGTCCTTGTGCTAGATTTCAATCTTGTCCTAAGGAATCACATTTGCATGCCGTTAAAAGAATATTTAAATATTTGCTTGGAACTATTGATGTAGGCTTGTGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGTTATTCCAATGCGGATTTTGCCAGTAGCTTACTTGACCGTAAGAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTAA

Protein sequence

MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGLPKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNNKGKLVKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRSSLNLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNFLLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMMGELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCVDIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLWYPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS
Homology
BLAST of CmUC11G214470 vs. NCBI nr
Match: RVW71911.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 996.9 bits (2576), Expect = 8.8e-287
Identity = 487/712 (68.40%), Postives = 576/712 (80.90%), Query Frame = 0

Query: 1   MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
           MDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+
Sbjct: 239 MDLFGPSRTPSLGGKSYAYVIVDDFSRYTWVLFLSQKSEAFYEFSKFCNKVQNEKGFSIT 298

Query: 61  KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
            IRSDHG EF+N  F+ YC ++G +HNFS+PRTPQQNGVVERKNRTLQE  R+ML+E  L
Sbjct: 299 CIRSDHGREFENFDFEEYCNKHGINHNFSAPRTPQQNGVVERKNRTLQEMARTMLNENNL 358

Query: 121 PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNNKGKLV 180
           PKYFWAEA+NT+CYVLNR+L+RP L KTPYELW +K PNI YFKVFGCKCFILN K  L 
Sbjct: 359 PKYFWAEAVNTSCYVLNRILLRPILKKTPYELWKNKKPNISYFKVFGCKCFILNTKDNLG 418

Query: 181 KFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDD--LEK 240
           KFD+K+DVGIFLGYS++SKA+RVFNKRT+V+EES+HV+FDES N +       DD  LE 
Sbjct: 419 KFDAKSDVGIFLGYSTSSKAFRVFNKRTMVVEESIHVIFDESNNSLQERESVDDDLGLET 478

Query: 241 DFEDLLVSEKGKEIVPSMEEVSINEKKED--------------GSSSMPKEWRYAPSHPK 300
               L + +K ++     EE   + KKED               S  +PK+W++  +HP+
Sbjct: 479 SMGKLQIEDKRQQ-----EESGEDPKKEDSPLALPPPQQVQGESSQDLPKDWKFVINHPQ 538

Query: 301 ELILGDPEQGVKTRSSL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNV 360
           + I+G+P  GV+TRSSL N+ +NLAF+SQ+EPK+ KDA  DE W++AMQEEL+QFER  V
Sbjct: 539 DQIIGNPSSGVRTRSSLRNICNNLAFISQIEPKNIKDAIVDENWMIAMQEELNQFERSEV 598

Query: 361 WELVPRPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLE 420
           WELVPRPSN S+IGTKWVFRNKMDE+G I+RNKARLVAQGY QEEGIDYEETFAPVARLE
Sbjct: 599 WELVPRPSNQSVIGTKWVFRNKMDENGIIVRNKARLVAQGYNQEEGIDYEETFAPVARLE 658

Query: 421 AIRMLLAFASYKNFILYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYG 480
           AIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQP GF++F  P+HV+KLKKALYG
Sbjct: 659 AIRMLLAFACFKDFILYQMDVKSAFLNGFINEEVYVEQPPGFQSFNFPNHVFKLKKALYG 718

Query: 481 LKQAPRAWYDRLSNFLLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCE 540
           LKQAPRAWY+RLS FLLK  FK GKIDTTLFIK KE DML+VQIYVDDIIFG+TN SLCE
Sbjct: 719 LKQAPRAWYERLSKFLLKKGFKMGKIDTTLFIKTKEKDMLLVQIYVDDIIFGATNDSLCE 778

Query: 541 EFSKCMHSEFEMSMMGELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPM 600
           +FSKCMHSEFEMSMMGEL++FLGLQ+KQLK+G FI+Q KY +DLLKRF   E K+ KTPM
Sbjct: 779 DFSKCMHSEFEMSMMGELNYFLGLQIKQLKEGTFINQAKYIKDLLKRFNMEEAKVMKTPM 838

Query: 601 STTTKLDKDEKGKCVDIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVK 660
           S++ KLD DEKGK +D  +YRGMIGSLLYLT+SRPDIM+SVC CARFQSCPKESHL AVK
Sbjct: 839 SSSIKLDMDEKGKSIDSTMYRGMIGSLLYLTASRPDIMYSVCLCARFQSCPKESHLSAVK 898

Query: 661 RIFKYLLGTIDVGLWYPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
           RI +YL GT+++GLWYP+   F L+G+S+ADFA   ++RKSTSGTC FLG S
Sbjct: 899 RILRYLKGTMNIGLWYPKGDNFELIGFSDADFAGCRVERKSTSGTCHFLGHS 945

BLAST of CmUC11G214470 vs. NCBI nr
Match: KYP33754.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 905.2 bits (2338), Expect = 3.5e-259
Identity = 448/697 (64.28%), Postives = 548/697 (78.62%), Query Frame = 0

Query: 1   MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
           MDLFGPSR  SFGG+YY  V+VDDFSR+TW L + +K D    F  FAK +QN+K   I 
Sbjct: 186 MDLFGPSRTMSFGGSYYGLVLVDDFSRYTWTLFLANKSDTFGVFRKFAKLIQNKKNLKIV 245

Query: 61  KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
            IRSDHG EF+N  F  +CEENG  HNFS+PRTPQQNGVVERKNR+L+E  R+ML++  L
Sbjct: 246 SIRSDHGKEFENKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKL 305

Query: 121 PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNN-KGKL 180
           PKYFWAEA+NTACY +NR LIRP L KTPYEL++ + PNI +  +FGCKCF+LNN K  L
Sbjct: 306 PKYFWAEAVNTACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNL 365

Query: 181 VKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKD 240
            KFD+K+D GIFLGYS  SK++R++NKRT+ IEES+HVVFDE+ N+V       D++ + 
Sbjct: 366 GKFDAKSDEGIFLGYSLNSKSFRIYNKRTMTIEESIHVVFDET-NLVCPRRDIIDEIVES 425

Query: 241 FEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRS 300
           FED  ++E+  +     E+     ++   + +  +EWR + +HP E I+GD  +GV TR+
Sbjct: 426 FEDTHINEQTHKDDKDKEKEDSTIQEGQTNINPQREWRISRNHPLENIIGDITKGVITRN 485

Query: 301 SL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGT 360
           SL    +N++FVS++E K+  +A  DE WI AMQEEL+QFER  VW+LV RP+N  IIGT
Sbjct: 486 SLKEACNNMSFVSEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGT 545

Query: 361 KWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI 420
           KW+FRNK+DEHG +IRNKARLVA+GY QEEGIDYEET+APVARLEAIRMLLA+AS  +F 
Sbjct: 546 KWIFRNKLDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFK 605

Query: 421 LYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNF 480
           LYQMDVKSAFLNG+I EEVYVEQP GFEN E P+HV+KLKKALYGLKQAPRAWY+RLS F
Sbjct: 606 LYQMDVKSAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKF 665

Query: 481 LLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMM 540
           LL+ +F  GK+DTTLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M SEFEMSMM
Sbjct: 666 LLEKEFTRGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMM 725

Query: 541 GELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCV 600
           GEL+FFLGLQ++Q K+GIFI+Q KY ++LLKRF     K   TPMSTT  LDKDE GK +
Sbjct: 726 GELNFFLGLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSI 785

Query: 601 DIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLW 660
           D+K YRGMIGSLLYL++SRPDIMFSVC CAR+QS PKESHL AVKRI +YLL T ++GLW
Sbjct: 786 DVKKYRGMIGSLLYLSASRPDIMFSVCFCARYQSNPKESHLSAVKRIMRYLLRTTNLGLW 845

Query: 661 YPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
           YP+N+ FNLVGYS++DFA    DRKSTSGTC F+GS+
Sbjct: 846 YPKNMSFNLVGYSDSDFAGCKTDRKSTSGTCHFIGSA 881

BLAST of CmUC11G214470 vs. NCBI nr
Match: KYP66812.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 904.4 bits (2336), Expect = 5.9e-259
Identity = 448/697 (64.28%), Postives = 548/697 (78.62%), Query Frame = 0

Query: 1   MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
           MDLFGPSR  SFGG+YY  V+VDDFSR+TW L + +K D    F  FAK +QN+K   I 
Sbjct: 75  MDLFGPSRTMSFGGSYYGLVLVDDFSRYTWTLFLANKSDTFGVFRKFAKLIQNKKNLKIV 134

Query: 61  KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
            IRSDHG EF+N  F  +CEENG  HNFS+PRTPQQNGVVERKNR+L+E  R+ML++  L
Sbjct: 135 SIRSDHGKEFENKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKL 194

Query: 121 PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNN-KGKL 180
           PKYFWAEA+NTACY +NR LIRP L KTPYEL++ + PNI +  +FGCKCF+LNN K  L
Sbjct: 195 PKYFWAEAVNTACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNL 254

Query: 181 VKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKD 240
            KFD+K+D GIFLGYS  SK++R++NKRT+ IEES+HVVFDE+ N+V       D++ + 
Sbjct: 255 GKFDAKSDEGIFLGYSLNSKSFRIYNKRTMTIEESVHVVFDET-NLVCPRRDVFDEIVES 314

Query: 241 FEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRS 300
           FED  ++E+  +     E+     ++   + +  +EWR + +HP E I+GD  +GV TR+
Sbjct: 315 FEDTHLNEQTHKDDKDKEKEDSTIQEGQTNINSEREWRISRNHPLENIIGDITKGVITRN 374

Query: 301 SL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGT 360
           SL    +N++FVS++E K+  +A  DE WI AMQEEL+QFER  VW+LV RP+N  IIGT
Sbjct: 375 SLKEACNNMSFVSEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGT 434

Query: 361 KWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI 420
           KW+FRNK+DEHG +IRNKARLVA+GY QEEGIDYEET+APVARLEAIRMLLA+AS  +F 
Sbjct: 435 KWIFRNKLDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFK 494

Query: 421 LYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNF 480
           LYQMDVKSAFLNG+I EEVYVEQP GFEN E P+HV+KLKKALYGLKQAPRAWY+RLS F
Sbjct: 495 LYQMDVKSAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKF 554

Query: 481 LLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMM 540
           LL+ +F  GK+DTTLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M SEFEMSMM
Sbjct: 555 LLEKEFTRGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMM 614

Query: 541 GELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCV 600
           GEL+FFLGLQ++Q K+GIFI+Q KY ++LLKRF     K   TPMSTT  LDKDE GK +
Sbjct: 615 GELNFFLGLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSI 674

Query: 601 DIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLW 660
           D+K YRGMIGSLLYL++SRPDIMFSVC CAR+QS PKESHL AVKRI + LLGT ++GLW
Sbjct: 675 DVKKYRGMIGSLLYLSASRPDIMFSVCLCARYQSNPKESHLSAVKRIMRCLLGTTNLGLW 734

Query: 661 YPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
           YP+N+ FNLVGYS++DFA    DRKSTSGTC F+GS+
Sbjct: 735 YPKNMPFNLVGYSDSDFAGCKTDRKSTSGTCHFIGSA 770

BLAST of CmUC11G214470 vs. NCBI nr
Match: KYP64004.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan])

HSP 1 Score: 899.8 bits (2324), Expect = 1.5e-257
Identity = 445/697 (63.85%), Postives = 546/697 (78.34%), Query Frame = 0

Query: 1   MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
           MDLFGPSR  SFGG+YY  V+VDDFSR+TW L + +K D    F  FAK +QN+K   I 
Sbjct: 196 MDLFGPSRTMSFGGSYYGLVLVDDFSRYTWTLFLANKSDTFGVFRKFAKLIQNKKNLKIV 255

Query: 61  KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
            IRSDHG EF+N  F  +CEENG  HNFS+PRTPQQNGVVERKNR+L+E  R+ML++  L
Sbjct: 256 SIRSDHGKEFENKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKL 315

Query: 121 PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNN-KGKL 180
           PKYFWAEA+NTACY +NR LIRP L KTPYEL++ + PNI +  +FGCKCF+LNN K  L
Sbjct: 316 PKYFWAEAVNTACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNL 375

Query: 181 VKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKD 240
            KFD+K+D GIFLGYS  SK++R++NKRT+ IEES+HVVFDE+ N+V       D++ + 
Sbjct: 376 GKFDAKSDEGIFLGYSLNSKSFRIYNKRTMTIEESIHVVFDET-NLVCPRRDIIDEIVES 435

Query: 241 FEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRS 300
           FED  ++E+  +     E+     ++   + +  +EWR + +HP E I+GD  +GV TR+
Sbjct: 436 FEDTHINEQTHKDDKDKEKEDSTIQEGQTNINSQREWRISRNHPLENIIGDITKGVITRN 495

Query: 301 SL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGT 360
           SL    +N++FVS++E K+  +A  DE WI AMQEEL+QFER  VW+LV RP+N  IIGT
Sbjct: 496 SLKEACNNMSFVSEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGT 555

Query: 361 KWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI 420
           KW+FRNK+DEHG +IRNKARLVA+GY QEEGIDYEET+APVARLEAIRMLLA+AS  +F 
Sbjct: 556 KWIFRNKLDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFK 615

Query: 421 LYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNF 480
           LYQMDVKSAFLNG+I EEVYVEQP GFEN E P+HV+KLKKALYGLKQAPRAWY+RLS F
Sbjct: 616 LYQMDVKSAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKF 675

Query: 481 LLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMM 540
           LL+ +F  GK+DTTLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M SEFEMSMM
Sbjct: 676 LLEKEFTRGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMM 735

Query: 541 GELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCV 600
           GEL+FFLGLQ++Q K+GIFI+Q KY ++LLKRF     K   TPMSTT  LDKDE GK +
Sbjct: 736 GELNFFLGLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSI 795

Query: 601 DIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLW 660
           D+K YRGMIGSLLYL++SRP+IMFSVC C R+QS PKESHL AVKRI +YLLGT ++GLW
Sbjct: 796 DVKKYRGMIGSLLYLSTSRPNIMFSVCLCTRYQSNPKESHLSAVKRIMRYLLGTTNLGLW 855

Query: 661 YPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
           Y +N+ FNLVGYS++DFA    DRKS SGTC F+GS+
Sbjct: 856 YSKNMPFNLVGYSDSDFAGCKTDRKSISGTCHFIGSA 891

BLAST of CmUC11G214470 vs. NCBI nr
Match: KYP78729.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 897.1 bits (2317), Expect = 9.4e-257
Identity = 443/697 (63.56%), Postives = 548/697 (78.62%), Query Frame = 0

Query: 1    MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
            MDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+
Sbjct: 743  MDLFGPSRTMSLGGNYYGLVIVDDYSRYTWVMFLANKNDAFNAFRKFAKLVQNEKCSNIT 802

Query: 61   KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
             IRSDHGGEF N  F+ +CEE+G +HNFS+PRTPQQNGVVERKNR+L+E  R+ML+E  L
Sbjct: 803  SIRSDHGGEFQNILFQKFCEEHGINHNFSAPRTPQQNGVVERKNRSLEELARTMLNETNL 862

Query: 121  PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNN-KGKL 180
            PKYFWA+AINTAC+VLN+VLIRP L KTPYE++  K PNI YF+VFGCKC++LNN K +L
Sbjct: 863  PKYFWADAINTACHVLNKVLIRPILKKTPYEIYKGKKPNISYFRVFGCKCYVLNNGKEQL 922

Query: 181  VKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKD 240
             KFD+K D  IFLGYS+ SKAYR++NKRTLV+EES+HVVFDES N         +DL + 
Sbjct: 923  GKFDAKADEAIFLGYSTNSKAYRIYNKRTLVVEESVHVVFDES-NKQETRQTEIEDLNEL 982

Query: 241  FEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRS 300
             +  L+  +  E+    E +   EK ++    +PKEW+ +     + I+G+  +GV TRS
Sbjct: 983  LDQSLLENEPNEVPKESESL---EKAKETCEQLPKEWKTSRDLSMDNIIGNIGKGVSTRS 1042

Query: 301  SL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGT 360
            ++ N+ + +AFVSQVEPK+  +A  DE W++AMQEEL+QFER  VW+LVP P +  IIGT
Sbjct: 1043 AIKNICNTMAFVSQVEPKNIDEALKDEHWLMAMQEELNQFERNEVWDLVPLPKDYPIIGT 1102

Query: 361  KWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI 420
            KWVFRNK+DE G I+RNKARLVA+GY QEEGIDY+ETFAPVAR+EAIR+LLA++S KNF 
Sbjct: 1103 KWVFRNKLDESGIILRNKARLVAKGYNQEEGIDYDETFAPVARIEAIRLLLAYSSIKNFK 1162

Query: 421  LYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNF 480
            LYQMDVKSAFLNG+I EEVYVEQP GF +++ P+HVYKLKKALYGLKQAPR+WYDRLS F
Sbjct: 1163 LYQMDVKSAFLNGFIQEEVYVEQPPGFVDYKNPNHVYKLKKALYGLKQAPRSWYDRLSKF 1222

Query: 481  LLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMM 540
            L++ND++ GK+D TLF+K  +ND + VQIYVDDI+FGSTN SLC+EF+K M  EFEMSMM
Sbjct: 1223 LIENDYERGKVDNTLFVKKFKNDTMYVQIYVDDIVFGSTNTSLCKEFAKTMQGEFEMSMM 1282

Query: 541  GELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCV 600
            GEL+FFLGLQ+KQ+ DGIFISQ KY  +LLK+F     K A TP+S    LD DEKG  V
Sbjct: 1283 GELTFFLGLQIKQMHDGIFISQSKYCNELLKKFGMEGCKEAATPISNNCNLDLDEKGIAV 1342

Query: 601  DIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLW 660
            D   YRG+IGSLLYLT+SRPDIMF+VC CARFQ+ PKESH+ +VKRI KYL GT +VGLW
Sbjct: 1343 DSSKYRGIIGSLLYLTASRPDIMFAVCLCARFQANPKESHMKSVKRILKYLKGTTNVGLW 1402

Query: 661  YPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
            YP+ V  +L+GYS++D+A   LDRKSTSGTC  LGS+
Sbjct: 1403 YPKGVSLSLIGYSDSDYAGCRLDRKSTSGTCHLLGSA 1435

BLAST of CmUC11G214470 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 2.1e-102
Identity = 242/722 (33.52%), Postives = 375/722 (51.94%), Query Frame = 0

Query: 2    DLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISK 61
            D+ GP  I S GGN Y    +DD SR  WV ++K KD   + F  F   V+ E G  + +
Sbjct: 487  DVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKR 546

Query: 62   IRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGLP 121
            +RSD+GGE+ +  F+ YC  +G  H  + P TPQ NGV ER NRT+ E  RSML    LP
Sbjct: 547  LRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLP 606

Query: 122  KYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNNKGKLVK 181
            K FW EA+ TACY++NR    P   + P  +W +K  +  + KVFGC+ F    K +  K
Sbjct: 607  KSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTK 666

Query: 182  FDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDES-----------------CN 241
             D K+   IF+GY      YR+++     +  S  VVF ES                  N
Sbjct: 667  LDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPN 726

Query: 242  IVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEE-VSINEKKEDGSSSMPKEWRYAPSHP 301
             V+  S  ++    +     VSE+G++    +E+   ++E  E+       E ++ P   
Sbjct: 727  FVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRR 786

Query: 302  KELILGDPEQGVKTRSSLNLFSNLAFV---SQVEPKSFKDA----ECDEFWILAMQEELH 361
             E          + R     + +  +V      EP+S K+     E ++  + AMQEE+ 
Sbjct: 787  SE----------RPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQL-MKAMQEEME 846

Query: 362  QFERYNVWELVPRPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETF 421
              ++   ++LV  P     +  KWVF+ K D    ++R KARLV +G+ Q++GID++E F
Sbjct: 847  SLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIF 906

Query: 422  APVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYK 481
            +PV ++ +IR +L+ A+  +  + Q+DVK+AFL+G + EE+Y+EQP GFE       V K
Sbjct: 907  SPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCK 966

Query: 482  LKKALYGLKQAPRAWYDRLSNFLLKNDF-KTGKIDTTLFIKVKENDMLIVQIYVDDIIFG 541
            L K+LYGLKQAPR WY +  +F+    + KT       F +  EN+ +I+ +YVDD++  
Sbjct: 967  LNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIV 1026

Query: 542  STNQSLCEEFSKCMHSEFEMSMMGELSFFLGLQV--KQLKDGIFISQEKYTRDLLKRFKF 601
              ++ L  +    +   F+M  +G     LG+++  ++    +++SQEKY   +L+RF  
Sbjct: 1027 GKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNM 1086

Query: 602  NEGKIAKTPMSTTTKLDK-------DEKGKCVDIKIYRGMIGSLLY-LTSSRPDIMFSVC 661
               K   TP++   KL K       +EKG    +  Y   +GSL+Y +  +RPDI  +V 
Sbjct: 1087 KNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVP-YSSAVGSLMYAMVCTRPDIAHAVG 1146

Query: 662  PCARFQSCPKESHLHAVKRIFKYLLGTIDVGLWYPRNVEFNLVGYSNADFASSLLDRKST 688
              +RF   P + H  AVK I +YL GT    L +  +    L GY++AD A  + +RKS+
Sbjct: 1147 VVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPI-LKGYTDADMAGDIDNRKSS 1195

BLAST of CmUC11G214470 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 8.4e-99
Identity = 245/805 (30.43%), Postives = 376/805 (46.71%), Query Frame = 0

Query: 7    SRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDH 66
            S I S     Y  + VD F+R+TW+  +K K    ++FI+F   ++N     I    SD+
Sbjct: 534  SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDN 593

Query: 67   GGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGLPKYFWA 126
            GGEF   A   Y  ++G SH  S P TP+ NG+ ERK+R + E   ++L    +PK +W 
Sbjct: 594  GGEF--VALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWP 653

Query: 127  EAINTACYVLNRVLIRPSLD-KTPYELWHDKSPNIGYFKVFGCKCFILNNKGKLVKFDSK 186
             A   A Y++NR L  P L  ++P++     SPN    +VFGC C+         K D K
Sbjct: 654  YAFAVAVYLINR-LPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDK 713

Query: 187  TDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVS------------------ 246
            +   +FLGYS T  AY   + +T  +  S HV FDE+C   S                  
Sbjct: 714  SRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSC 773

Query: 247  --------------------------------------NESIYSDDLEKDFEDLLVS--- 306
                                                  N  + S +L+  F     S   
Sbjct: 774  VWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPE 833

Query: 307  -----------------------------------EKGKEIVPSMEEVSINEKKEDGSSS 366
                                               E   ++  S+   + +       ++
Sbjct: 834  PTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTT 893

Query: 367  MPKEWRYAPSHPKELILGDP---------------EQGVKTRSSLNLFS-------NLAF 426
                   +P+ P  LI   P                  + TR+   +          ++ 
Sbjct: 894  SASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSL 953

Query: 427  VSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELV-PRPSNASIIGTKWVFRNKMDE 486
             ++ EP++   A  DE W  AM  E++     + W+LV P PS+ +I+G +W+F  K + 
Sbjct: 954  AAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNS 1013

Query: 487  HGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAF 546
             G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A  +++ + Q+DV +AF
Sbjct: 1014 DGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAF 1073

Query: 547  LNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNFLLKNDFKTGK 606
            L G + ++VY+ QP GF + + P++V KL+KALYGLKQAPRAWY  L N+LL   F    
Sbjct: 1074 LQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSV 1133

Query: 607  IDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMMGELSFFLGLQ 666
             DT+LF+  +   ++ + +YVDDI+    + +L       +   F +    EL +FLG++
Sbjct: 1134 SDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIE 1193

Query: 667  VKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCVDIKIYRGMIG 694
             K++  G+ +SQ +Y  DLL R      K   TPM+ + KL      K  D   YRG++G
Sbjct: 1194 AKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVG 1253

BLAST of CmUC11G214470 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 2.5e-95
Identity = 236/809 (29.17%), Postives = 367/809 (45.36%), Query Frame = 0

Query: 7    SRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDH 66
            S I S     Y  + VD F+R+TW+  +K K     +FI F   V+N     I  + SD+
Sbjct: 513  SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDN 572

Query: 67   GGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGLPKYFWA 126
            GGEF     + Y  ++G SH  S P TP+ NG+ ERK+R + E   ++L    +PK +W 
Sbjct: 573  GGEF--VVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWP 632

Query: 127  EAINTACYVLNRVLIRPSLD-KTPYELWHDKSPNIGYFKVFGCKCFILNNKGKLVKFDSK 186
             A + A Y++NR L  P L  ++P++    + PN    KVFGC C+         K + K
Sbjct: 633  YAFSVAVYLINR-LPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDK 692

Query: 187  TDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLV 246
            +    F+GYS T  AY   +  T  +  S HV FDE C   S  +      ++   D   
Sbjct: 693  SKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAP 752

Query: 247  SEKGKEIVPS-----------------------------MEEVSINEKKEDGSSSMPKEW 306
            +      +P+                               +VS +       SS     
Sbjct: 753  NWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSE 812

Query: 307  RYAPSHPKELILGDPEQGVKTRSSLNLFSN------------------------------ 366
              APSH        P Q   + S+  + +N                              
Sbjct: 813  PTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTP 872

Query: 367  ------------------------------------------------------------ 426
                                                                        
Sbjct: 873  STSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSY 932

Query: 427  -LAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELV-PRPSNASIIGTKWVFRN 486
              +  +  EP++   A  D+ W  AM  E++     + W+LV P P + +I+G +W+F  
Sbjct: 933  ATSLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTK 992

Query: 487  KMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDV 546
            K +  G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A  +++ + Q+DV
Sbjct: 993  KFNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDV 1052

Query: 547  KSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNFLLKNDF 606
             +AFL G + +EVY+ QP GF + + P +V +L+KA+YGLKQAPRAWY  L  +LL   F
Sbjct: 1053 NNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGF 1112

Query: 607  KTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMMGELSFF 666
                 DT+LF+  +   ++ + +YVDDI+    +  L +     +   F +    +L +F
Sbjct: 1113 VNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYF 1172

Query: 667  LGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCVDIKIYR 694
            LG++ K++  G+ +SQ +YT DLL R      K   TPM+T+ KL      K  D   YR
Sbjct: 1173 LGIEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTEYR 1232

BLAST of CmUC11G214470 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 340.1 bits (871), Expect = 5.8e-92
Identity = 237/791 (29.96%), Postives = 391/791 (49.43%), Query Frame = 0

Query: 2    DLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISK 61
            D+ GP    +     Y  + VD F+ +    +IK+K D    F  F  + +      +  
Sbjct: 487  DVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVY 546

Query: 62   IRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGLP 121
            +  D+G E+ ++  + +C + G S++ + P TPQ NGV ER  RT+ E  R+M+    L 
Sbjct: 547  LYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLD 606

Query: 122  KYFWAEAINTACYVLNRVLIRPSLD--KTPYELWHDKSPNIGYFKVFGCKCFILNNKGKL 181
            K FW EA+ TA Y++NR+  R  +D  KTPYE+WH+K P + + +VFG   ++ + K K 
Sbjct: 607  KSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYV-HIKNKQ 666

Query: 182  VKFDSKTDVGIFLGYSSTS-KAYRVFNK-----RTLVIEES---------MHVVF----- 241
             KFD K+   IF+GY     K +   N+     R +V++E+            VF     
Sbjct: 667  GKFDDKSFKSIFVGYEPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSK 726

Query: 242  --------DESCNIVS----NESIYSDDLE-----KDFEDLLVSEKGKEIVPS------- 301
                    ++S  I+     NES   D+++     K+ E+       ++I+ +       
Sbjct: 727  ESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESK 786

Query: 302  -------------MEEVSINEKK---------EDGSSSMPKEWRYAPS--HPKELILGDP 361
                           +  +NE K         E   S  P E R + +  H KE+ + +P
Sbjct: 787  ECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNP 846

Query: 362  ------------EQGVKTR---------SSLN-LFSNLAFVSQVEPKSFKDAECDE---F 421
                         + +KT+         +SLN +  N   +    P SF + +  +    
Sbjct: 847  TKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSS 906

Query: 422  WILAMQEELHQFERYNVWELVPRPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQ 481
            W  A+  EL+  +  N W +  RP N +I+ ++WVF  K +E GN IR KARLVA+G+ Q
Sbjct: 907  WEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQ 966

Query: 482  EEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIFEEVYVEQPLGFE 541
            +  IDYEETFAPVAR+ + R +L+     N  ++QMDVK+AFLNG + EE+Y+  P G  
Sbjct: 967  KYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGIS 1026

Query: 542  NFELPSHVYKLKKALYGLKQAPRAWYDRLSNFLLKNDFKTGKIDTTLFI--KVKENDMLI 601
                  +V KL KA+YGLKQA R W++     L + +F    +D  ++I  K   N+ + 
Sbjct: 1027 CNS--DNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIY 1086

Query: 602  VQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMMGELSFFLGLQVKQLKDGIFISQEKYT 661
            V +YVDD++  + + +    F + +  +F M+ + E+  F+G++++  +D I++SQ  Y 
Sbjct: 1087 VLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYV 1146

Query: 662  RDLLKRFKFNEGKIAKTPMSTTTKL-----DKDEKGKCVDIKIYRGMIGSLLY-LTSSRP 688
            + +L +F         TP+ +         D+D    C      R +IG L+Y +  +RP
Sbjct: 1147 KKILSKFNMENCNAVSTPLPSKINYELLNSDEDCNTPC------RSLIGCLMYIMLCTRP 1206

BLAST of CmUC11G214470 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 131.7 bits (330), Expect = 3.2e-29
Identity = 76/254 (29.92%), Postives = 133/254 (52.36%), Query Frame = 0

Query: 422 MDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNFLLK 481
           MDV +AFLN  + E +YV+QP GF N   P +V++L   +YGLKQAP  W + ++N L K
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 482 NDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMMGEL 541
             F   + +  L+ +   +  + + +YVDD++  + +  + +   + +   + M  +G++
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 542 SFFLGLQVKQLKDG-IFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCVDI 601
             FLGL + Q  +G I +S + Y        + N  K+ +TP+  +  L +       DI
Sbjct: 121 DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

Query: 602 KIYRGMIGSLLYLTSS-RPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLWY 661
             Y+ ++G LL+  ++ RPDI + V   +RF   P+  HL + +R+ +YL  T  + L Y
Sbjct: 181 TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKY 240

Query: 662 PRNVEFNLVGYSNA 674
               +  L  Y +A
Sbjct: 241 RSGSQLALTVYCDA 254

BLAST of CmUC11G214470 vs. ExPASy TrEMBL
Match: A0A438GI90 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2030 PE=4 SV=1)

HSP 1 Score: 996.9 bits (2576), Expect = 4.2e-287
Identity = 487/712 (68.40%), Postives = 576/712 (80.90%), Query Frame = 0

Query: 1   MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
           MDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+
Sbjct: 239 MDLFGPSRTPSLGGKSYAYVIVDDFSRYTWVLFLSQKSEAFYEFSKFCNKVQNEKGFSIT 298

Query: 61  KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
            IRSDHG EF+N  F+ YC ++G +HNFS+PRTPQQNGVVERKNRTLQE  R+ML+E  L
Sbjct: 299 CIRSDHGREFENFDFEEYCNKHGINHNFSAPRTPQQNGVVERKNRTLQEMARTMLNENNL 358

Query: 121 PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNNKGKLV 180
           PKYFWAEA+NT+CYVLNR+L+RP L KTPYELW +K PNI YFKVFGCKCFILN K  L 
Sbjct: 359 PKYFWAEAVNTSCYVLNRILLRPILKKTPYELWKNKKPNISYFKVFGCKCFILNTKDNLG 418

Query: 181 KFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDD--LEK 240
           KFD+K+DVGIFLGYS++SKA+RVFNKRT+V+EES+HV+FDES N +       DD  LE 
Sbjct: 419 KFDAKSDVGIFLGYSTSSKAFRVFNKRTMVVEESIHVIFDESNNSLQERESVDDDLGLET 478

Query: 241 DFEDLLVSEKGKEIVPSMEEVSINEKKED--------------GSSSMPKEWRYAPSHPK 300
               L + +K ++     EE   + KKED               S  +PK+W++  +HP+
Sbjct: 479 SMGKLQIEDKRQQ-----EESGEDPKKEDSPLALPPPQQVQGESSQDLPKDWKFVINHPQ 538

Query: 301 ELILGDPEQGVKTRSSL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNV 360
           + I+G+P  GV+TRSSL N+ +NLAF+SQ+EPK+ KDA  DE W++AMQEEL+QFER  V
Sbjct: 539 DQIIGNPSSGVRTRSSLRNICNNLAFISQIEPKNIKDAIVDENWMIAMQEELNQFERSEV 598

Query: 361 WELVPRPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLE 420
           WELVPRPSN S+IGTKWVFRNKMDE+G I+RNKARLVAQGY QEEGIDYEETFAPVARLE
Sbjct: 599 WELVPRPSNQSVIGTKWVFRNKMDENGIIVRNKARLVAQGYNQEEGIDYEETFAPVARLE 658

Query: 421 AIRMLLAFASYKNFILYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYG 480
           AIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQP GF++F  P+HV+KLKKALYG
Sbjct: 659 AIRMLLAFACFKDFILYQMDVKSAFLNGFINEEVYVEQPPGFQSFNFPNHVFKLKKALYG 718

Query: 481 LKQAPRAWYDRLSNFLLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCE 540
           LKQAPRAWY+RLS FLLK  FK GKIDTTLFIK KE DML+VQIYVDDIIFG+TN SLCE
Sbjct: 719 LKQAPRAWYERLSKFLLKKGFKMGKIDTTLFIKTKEKDMLLVQIYVDDIIFGATNDSLCE 778

Query: 541 EFSKCMHSEFEMSMMGELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPM 600
           +FSKCMHSEFEMSMMGEL++FLGLQ+KQLK+G FI+Q KY +DLLKRF   E K+ KTPM
Sbjct: 779 DFSKCMHSEFEMSMMGELNYFLGLQIKQLKEGTFINQAKYIKDLLKRFNMEEAKVMKTPM 838

Query: 601 STTTKLDKDEKGKCVDIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVK 660
           S++ KLD DEKGK +D  +YRGMIGSLLYLT+SRPDIM+SVC CARFQSCPKESHL AVK
Sbjct: 839 SSSIKLDMDEKGKSIDSTMYRGMIGSLLYLTASRPDIMYSVCLCARFQSCPKESHLSAVK 898

Query: 661 RIFKYLLGTIDVGLWYPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
           RI +YL GT+++GLWYP+   F L+G+S+ADFA   ++RKSTSGTC FLG S
Sbjct: 899 RILRYLKGTMNIGLWYPKGDNFELIGFSDADFAGCRVERKSTSGTCHFLGHS 945

BLAST of CmUC11G214470 vs. ExPASy TrEMBL
Match: A0A151QU14 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_045365 PE=4 SV=1)

HSP 1 Score: 905.2 bits (2338), Expect = 1.7e-259
Identity = 448/697 (64.28%), Postives = 548/697 (78.62%), Query Frame = 0

Query: 1   MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
           MDLFGPSR  SFGG+YY  V+VDDFSR+TW L + +K D    F  FAK +QN+K   I 
Sbjct: 186 MDLFGPSRTMSFGGSYYGLVLVDDFSRYTWTLFLANKSDTFGVFRKFAKLIQNKKNLKIV 245

Query: 61  KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
            IRSDHG EF+N  F  +CEENG  HNFS+PRTPQQNGVVERKNR+L+E  R+ML++  L
Sbjct: 246 SIRSDHGKEFENKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKL 305

Query: 121 PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNN-KGKL 180
           PKYFWAEA+NTACY +NR LIRP L KTPYEL++ + PNI +  +FGCKCF+LNN K  L
Sbjct: 306 PKYFWAEAVNTACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNL 365

Query: 181 VKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKD 240
            KFD+K+D GIFLGYS  SK++R++NKRT+ IEES+HVVFDE+ N+V       D++ + 
Sbjct: 366 GKFDAKSDEGIFLGYSLNSKSFRIYNKRTMTIEESIHVVFDET-NLVCPRRDIIDEIVES 425

Query: 241 FEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRS 300
           FED  ++E+  +     E+     ++   + +  +EWR + +HP E I+GD  +GV TR+
Sbjct: 426 FEDTHINEQTHKDDKDKEKEDSTIQEGQTNINPQREWRISRNHPLENIIGDITKGVITRN 485

Query: 301 SL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGT 360
           SL    +N++FVS++E K+  +A  DE WI AMQEEL+QFER  VW+LV RP+N  IIGT
Sbjct: 486 SLKEACNNMSFVSEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGT 545

Query: 361 KWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI 420
           KW+FRNK+DEHG +IRNKARLVA+GY QEEGIDYEET+APVARLEAIRMLLA+AS  +F 
Sbjct: 546 KWIFRNKLDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFK 605

Query: 421 LYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNF 480
           LYQMDVKSAFLNG+I EEVYVEQP GFEN E P+HV+KLKKALYGLKQAPRAWY+RLS F
Sbjct: 606 LYQMDVKSAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKF 665

Query: 481 LLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMM 540
           LL+ +F  GK+DTTLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M SEFEMSMM
Sbjct: 666 LLEKEFTRGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMM 725

Query: 541 GELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCV 600
           GEL+FFLGLQ++Q K+GIFI+Q KY ++LLKRF     K   TPMSTT  LDKDE GK +
Sbjct: 726 GELNFFLGLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSI 785

Query: 601 DIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLW 660
           D+K YRGMIGSLLYL++SRPDIMFSVC CAR+QS PKESHL AVKRI +YLL T ++GLW
Sbjct: 786 DVKKYRGMIGSLLYLSASRPDIMFSVCFCARYQSNPKESHLSAVKRIMRYLLRTTNLGLW 845

Query: 661 YPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
           YP+N+ FNLVGYS++DFA    DRKSTSGTC F+GS+
Sbjct: 846 YPKNMSFNLVGYSDSDFAGCKTDRKSTSGTCHFIGSA 881

BLAST of CmUC11G214470 vs. ExPASy TrEMBL
Match: A0A151TIF5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_013123 PE=4 SV=1)

HSP 1 Score: 904.4 bits (2336), Expect = 2.9e-259
Identity = 448/697 (64.28%), Postives = 548/697 (78.62%), Query Frame = 0

Query: 1   MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
           MDLFGPSR  SFGG+YY  V+VDDFSR+TW L + +K D    F  FAK +QN+K   I 
Sbjct: 75  MDLFGPSRTMSFGGSYYGLVLVDDFSRYTWTLFLANKSDTFGVFRKFAKLIQNKKNLKIV 134

Query: 61  KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
            IRSDHG EF+N  F  +CEENG  HNFS+PRTPQQNGVVERKNR+L+E  R+ML++  L
Sbjct: 135 SIRSDHGKEFENKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKL 194

Query: 121 PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNN-KGKL 180
           PKYFWAEA+NTACY +NR LIRP L KTPYEL++ + PNI +  +FGCKCF+LNN K  L
Sbjct: 195 PKYFWAEAVNTACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNL 254

Query: 181 VKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKD 240
            KFD+K+D GIFLGYS  SK++R++NKRT+ IEES+HVVFDE+ N+V       D++ + 
Sbjct: 255 GKFDAKSDEGIFLGYSLNSKSFRIYNKRTMTIEESVHVVFDET-NLVCPRRDVFDEIVES 314

Query: 241 FEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRS 300
           FED  ++E+  +     E+     ++   + +  +EWR + +HP E I+GD  +GV TR+
Sbjct: 315 FEDTHLNEQTHKDDKDKEKEDSTIQEGQTNINSEREWRISRNHPLENIIGDITKGVITRN 374

Query: 301 SL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGT 360
           SL    +N++FVS++E K+  +A  DE WI AMQEEL+QFER  VW+LV RP+N  IIGT
Sbjct: 375 SLKEACNNMSFVSEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGT 434

Query: 361 KWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI 420
           KW+FRNK+DEHG +IRNKARLVA+GY QEEGIDYEET+APVARLEAIRMLLA+AS  +F 
Sbjct: 435 KWIFRNKLDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFK 494

Query: 421 LYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNF 480
           LYQMDVKSAFLNG+I EEVYVEQP GFEN E P+HV+KLKKALYGLKQAPRAWY+RLS F
Sbjct: 495 LYQMDVKSAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKF 554

Query: 481 LLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMM 540
           LL+ +F  GK+DTTLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M SEFEMSMM
Sbjct: 555 LLEKEFTRGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMM 614

Query: 541 GELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCV 600
           GEL+FFLGLQ++Q K+GIFI+Q KY ++LLKRF     K   TPMSTT  LDKDE GK +
Sbjct: 615 GELNFFLGLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSI 674

Query: 601 DIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLW 660
           D+K YRGMIGSLLYL++SRPDIMFSVC CAR+QS PKESHL AVKRI + LLGT ++GLW
Sbjct: 675 DVKKYRGMIGSLLYLSASRPDIMFSVCLCARYQSNPKESHLSAVKRIMRCLLGTTNLGLW 734

Query: 661 YPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
           YP+N+ FNLVGYS++DFA    DRKSTSGTC F+GS+
Sbjct: 735 YPKNMPFNLVGYSDSDFAGCKTDRKSTSGTCHFIGSA 770

BLAST of CmUC11G214470 vs. ExPASy TrEMBL
Match: A0A151TAG4 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_018591 PE=4 SV=1)

HSP 1 Score: 899.8 bits (2324), Expect = 7.1e-258
Identity = 445/697 (63.85%), Postives = 546/697 (78.34%), Query Frame = 0

Query: 1   MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
           MDLFGPSR  SFGG+YY  V+VDDFSR+TW L + +K D    F  FAK +QN+K   I 
Sbjct: 196 MDLFGPSRTMSFGGSYYGLVLVDDFSRYTWTLFLANKSDTFGVFRKFAKLIQNKKNLKIV 255

Query: 61  KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
            IRSDHG EF+N  F  +CEENG  HNFS+PRTPQQNGVVERKNR+L+E  R+ML++  L
Sbjct: 256 SIRSDHGKEFENKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKL 315

Query: 121 PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNN-KGKL 180
           PKYFWAEA+NTACY +NR LIRP L KTPYEL++ + PNI +  +FGCKCF+LNN K  L
Sbjct: 316 PKYFWAEAVNTACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNL 375

Query: 181 VKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKD 240
            KFD+K+D GIFLGYS  SK++R++NKRT+ IEES+HVVFDE+ N+V       D++ + 
Sbjct: 376 GKFDAKSDEGIFLGYSLNSKSFRIYNKRTMTIEESIHVVFDET-NLVCPRRDIIDEIVES 435

Query: 241 FEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRS 300
           FED  ++E+  +     E+     ++   + +  +EWR + +HP E I+GD  +GV TR+
Sbjct: 436 FEDTHINEQTHKDDKDKEKEDSTIQEGQTNINSQREWRISRNHPLENIIGDITKGVITRN 495

Query: 301 SL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGT 360
           SL    +N++FVS++E K+  +A  DE WI AMQEEL+QFER  VW+LV RP+N  IIGT
Sbjct: 496 SLKEACNNMSFVSEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGT 555

Query: 361 KWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI 420
           KW+FRNK+DEHG +IRNKARLVA+GY QEEGIDYEET+APVARLEAIRMLLA+AS  +F 
Sbjct: 556 KWIFRNKLDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFK 615

Query: 421 LYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNF 480
           LYQMDVKSAFLNG+I EEVYVEQP GFEN E P+HV+KLKKALYGLKQAPRAWY+RLS F
Sbjct: 616 LYQMDVKSAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKF 675

Query: 481 LLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMM 540
           LL+ +F  GK+DTTLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M SEFEMSMM
Sbjct: 676 LLEKEFTRGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMM 735

Query: 541 GELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCV 600
           GEL+FFLGLQ++Q K+GIFI+Q KY ++LLKRF     K   TPMSTT  LDKDE GK +
Sbjct: 736 GELNFFLGLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSI 795

Query: 601 DIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLW 660
           D+K YRGMIGSLLYL++SRP+IMFSVC C R+QS PKESHL AVKRI +YLLGT ++GLW
Sbjct: 796 DVKKYRGMIGSLLYLSTSRPNIMFSVCLCTRYQSNPKESHLSAVKRIMRYLLGTTNLGLW 855

Query: 661 YPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
           Y +N+ FNLVGYS++DFA    DRKS SGTC F+GS+
Sbjct: 856 YSKNMPFNLVGYSDSDFAGCKTDRKSISGTCHFIGSA 891

BLAST of CmUC11G214470 vs. ExPASy TrEMBL
Match: A0A151UHG7 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_048795 PE=4 SV=1)

HSP 1 Score: 897.1 bits (2317), Expect = 4.6e-257
Identity = 443/697 (63.56%), Postives = 548/697 (78.62%), Query Frame = 0

Query: 1    MDLFGPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFIS 60
            MDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+
Sbjct: 743  MDLFGPSRTMSLGGNYYGLVIVDDYSRYTWVMFLANKNDAFNAFRKFAKLVQNEKCSNIT 802

Query: 61   KIRSDHGGEFDNDAFKAYCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFTRSMLHEYGL 120
             IRSDHGGEF N  F+ +CEE+G +HNFS+PRTPQQNGVVERKNR+L+E  R+ML+E  L
Sbjct: 803  SIRSDHGGEFQNILFQKFCEEHGINHNFSAPRTPQQNGVVERKNRSLEELARTMLNETNL 862

Query: 121  PKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYFKVFGCKCFILNN-KGKL 180
            PKYFWA+AINTAC+VLN+VLIRP L KTPYE++  K PNI YF+VFGCKC++LNN K +L
Sbjct: 863  PKYFWADAINTACHVLNKVLIRPILKKTPYEIYKGKKPNISYFRVFGCKCYVLNNGKEQL 922

Query: 181  VKFDSKTDVGIFLGYSSTSKAYRVFNKRTLVIEESMHVVFDESCNIVSNESIYSDDLEKD 240
             KFD+K D  IFLGYS+ SKAYR++NKRTLV+EES+HVVFDES N         +DL + 
Sbjct: 923  GKFDAKADEAIFLGYSTNSKAYRIYNKRTLVVEESVHVVFDES-NKQETRQTEIEDLNEL 982

Query: 241  FEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKELILGDPEQGVKTRS 300
             +  L+  +  E+    E +   EK ++    +PKEW+ +     + I+G+  +GV TRS
Sbjct: 983  LDQSLLENEPNEVPKESESL---EKAKETCEQLPKEWKTSRDLSMDNIIGNIGKGVSTRS 1042

Query: 301  SL-NLFSNLAFVSQVEPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGT 360
            ++ N+ + +AFVSQVEPK+  +A  DE W++AMQEEL+QFER  VW+LVP P +  IIGT
Sbjct: 1043 AIKNICNTMAFVSQVEPKNIDEALKDEHWLMAMQEELNQFERNEVWDLVPLPKDYPIIGT 1102

Query: 361  KWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI 420
            KWVFRNK+DE G I+RNKARLVA+GY QEEGIDY+ETFAPVAR+EAIR+LLA++S KNF 
Sbjct: 1103 KWVFRNKLDESGIILRNKARLVAKGYNQEEGIDYDETFAPVARIEAIRLLLAYSSIKNFK 1162

Query: 421  LYQMDVKSAFLNGYIFEEVYVEQPLGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSNF 480
            LYQMDVKSAFLNG+I EEVYVEQP GF +++ P+HVYKLKKALYGLKQAPR+WYDRLS F
Sbjct: 1163 LYQMDVKSAFLNGFIQEEVYVEQPPGFVDYKNPNHVYKLKKALYGLKQAPRSWYDRLSKF 1222

Query: 481  LLKNDFKTGKIDTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMM 540
            L++ND++ GK+D TLF+K  +ND + VQIYVDDI+FGSTN SLC+EF+K M  EFEMSMM
Sbjct: 1223 LIENDYERGKVDNTLFVKKFKNDTMYVQIYVDDIVFGSTNTSLCKEFAKTMQGEFEMSMM 1282

Query: 541  GELSFFLGLQVKQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCV 600
            GEL+FFLGLQ+KQ+ DGIFISQ KY  +LLK+F     K A TP+S    LD DEKG  V
Sbjct: 1283 GELTFFLGLQIKQMHDGIFISQSKYCNELLKKFGMEGCKEAATPISNNCNLDLDEKGIAV 1342

Query: 601  DIKIYRGMIGSLLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLW 660
            D   YRG+IGSLLYLT+SRPDIMF+VC CARFQ+ PKESH+ +VKRI KYL GT +VGLW
Sbjct: 1343 DSSKYRGIIGSLLYLTASRPDIMFAVCLCARFQANPKESHMKSVKRILKYLKGTTNVGLW 1402

Query: 661  YPRNVEFNLVGYSNADFASSLLDRKSTSGTCQFLGSS 696
            YP+ V  +L+GYS++D+A   LDRKSTSGTC  LGS+
Sbjct: 1403 YPKGVSLSLIGYSDSDYAGCRLDRKSTSGTCHLLGSA 1435

BLAST of CmUC11G214470 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 275.0 bits (702), Expect = 1.6e-73
Identity = 151/386 (39.12%), Postives = 222/386 (57.51%), Query Frame = 0

Query: 314 EPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGTKWVFRNKMDEHGNII 373
           EP ++ +A+    W  AM +E+   E  + WE+   P N   IG KWV++ K +  G I 
Sbjct: 85  EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 144

Query: 374 RNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYI 433
           R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++  NF L+Q+D+ +AFLNG +
Sbjct: 145 RYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDL 204

Query: 434 FEEVYVEQPLGFENFE----LPSHVYKLKKALYGLKQAPRAWYDRLSNFLLKNDFKTGKI 493
            EE+Y++ P G+   +     P+ V  LKK++YGLKQA R W+ + S  L+   F     
Sbjct: 205 DEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHS 264

Query: 494 DTTLFIKVKENDMLIVQIYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMMGELSFFLGLQV 553
           D T F+K+     L V +YVDDII  S N +  +E    + S F++  +G L +FLGL++
Sbjct: 265 DHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEI 324

Query: 554 KQLKDGIFISQEKYTRDLLKRFKFNEGKIAKTPMSTTTKLDKDEKGKCVDIKIYRGMIGS 613
            +   GI I Q KY  DLL        K +  PM  +        G  VD K YR +IG 
Sbjct: 325 ARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGR 384

Query: 614 LLYLTSSRPDIMFSVCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLWYPRNVEFNLVG 673
           L+YL  +R DI F+V   ++F   P+ +H  AV +I  Y+ GT+  GL+Y    E  L  
Sbjct: 385 LMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQV 444

Query: 674 YSNADFASSLLDRKSTSGTCQFLGSS 696
           +S+A F S    R+ST+G C FLG+S
Sbjct: 445 FSDASFQSCKDTRRSTNGYCMFLGTS 470

BLAST of CmUC11G214470 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 110.2 bits (274), Expect = 7.0e-24
Identity = 63/190 (33.16%), Postives = 102/190 (53.68%), Query Frame = 0

Query: 507 IYVDDIIFGSTNQSLCEEFSKCMHSEFEMSMMGELSFFLGLQVKQLKDGIFISQEKYTRD 566
           +YVDDI+   ++ +L       + S F M  +G + +FLG+Q+K    G+F+SQ KY   
Sbjct: 5   LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 567 LLKRFKFNEGKIAKTPMSTTTKLDKDEK---GKCVDIKIYRGMIGSLLYLTSSRPDIMFS 626
           +L     N G +   PMST   L  +      K  D   +R ++G+L YLT +RPDI ++
Sbjct: 65  ILN----NAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 124

Query: 627 VCPCARFQSCPKESHLHAVKRIFKYLLGTIDVGLWYPRNVEFNLVGYSNADFASSLLDRK 686
           V    +    P  +    +KR+ +Y+ GTI  GL+  +N + N+  + ++D+A     R+
Sbjct: 125 VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRR 184

Query: 687 STSGTCQFLG 694
           ST+G C FLG
Sbjct: 185 STTGFCTFLG 190

BLAST of CmUC11G214470 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 95.5 bits (236), Expect = 1.8e-19
Identity = 50/99 (50.51%), Postives = 61/99 (61.62%), Query Frame = 0

Query: 314 EPKSFKDAECDEFWILAMQEELHQFERYNVWELVPRPSNASIIGTKWVFRNKMDEHGNII 373
           EPKS   A  D  W  AMQEEL    R   W LVP P N +I+G KWVF+ K+   G + 
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 374 RNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA 413
           R KARLVA+G+ QEEGI + ET++PV R   IR +L  A
Sbjct: 87  RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CmUC11G214470 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 63.9 bits (154), Expect = 5.7e-10
Identity = 32/76 (42.11%), Postives = 43/76 (56.58%), Query Frame = 0

Query: 104 NRTLQEFTRSMLHEYGLPKYFWAEAINTACYVLNRVLIRPSLDKTPYELWHDKSPNIGYF 163
           NRT+ E  RSML E GLPK F A+A NTA +++N+          P E+W    P   Y 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 164 KVFGCKCFILNNKGKL 180
           + FGC  +I  ++GKL
Sbjct: 62  RRFGCVAYIHCDEGKL 77

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RVW71911.18.8e-28768.40Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
KYP33754.13.5e-25964.28Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
KYP66812.15.9e-25964.28Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
KYP64004.11.5e-25763.85Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus ca... [more]
KYP78729.19.4e-25763.56Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
Match NameE-valueIdentityDescription
P109782.1e-10233.52Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q94HW28.4e-9930.43Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT942.5e-9529.17Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P041465.8e-9229.96Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P256003.2e-2929.92Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
A0A438GI904.2e-28768.40Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A151QU141.7e-25964.28Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A0A151TIF52.9e-25964.28Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A0A151TAG47.1e-25863.85Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanu... [more]
A0A151UHG74.6e-25763.56Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
Match NameE-valueIdentityDescription
AT4G23160.11.6e-7339.12cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.17.0e-2433.16DNA/RNA polymerases superfamily protein [more]
ATMG00820.11.8e-1950.51Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00710.15.7e-1042.11Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 1..94
e-value: 4.5E-11
score: 42.9
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 1..157
score: 24.278612
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 342..583
e-value: 5.6E-75
score: 252.1
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 1..165
e-value: 1.2E-41
score: 144.2
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 2..652
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 343..662
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 1..166

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC11G214470.1CmUC11G214470.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0004491 methylmalonate-semialdehyde dehydrogenase (acylating) activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding