Moc02g17040 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc02g17040
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Locationchr2: 12786345 .. 12796343 (-)
RNA-Seq ExpressionMoc02g17040
SyntenyMoc02g17040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGCTATGCAGTAGCGCCATGGCACTGCGGGGACAGCACACAGCACCACGGCGCTGCACTGTAGCACCGCGGCCCTGCTGCTGCGGCATTTTGGTGCAACAGTGCCGTGGCGCTGCCCTTAGGCACCGAGACGCTGTCCCGAGTGTTTTTCGACGTGCTTCCGTGGCTCCGGTTCGCGGTTCGAGGGCGGTTGCAGTCGTTTTCTTAAGTTTTTGTATTATTTTGTTTCCTTAGGGTGCCAAAAACCGGTCCAACTTTGCTATTCAATTATTTTATGCATTATGAATGTATATTTTTAATTAAAAGCATATTGCATATTGTATGCCATATAGTTTTAAAATCCCACCATAGATTACATGCATAATATGTATGTTTGATATATAGTATGTATGTATTATCATGAATCATATAGTTATAAATGTTATAATGTATGAATGCATGTTTTATTTTCATGATATATAAGTGTTGTATTCATTAGATGGAAATAAAGTTGCATTGAGCATGACATTACTTTCATTAATATAATTGTTATATTGATGTATGCTTACAATGCATGATTATAGTTTTAATTTCTTAAATTTTATAATAGTTATAAAATGTGGTAGATCGAAATTAAAATCTATAATAATGAATTGCATGCAAATATAGGGTCTATGTTAATTAATTTTAAAGGGGTTTAAAATTAAGTAGACCTTAGGTTATATCTTCTAATATGATTAGGGGGTAGTCTTTTATTTTGTTTTAAGTAGGTTTAAAATGAAATGATAAAAGACTAAATATAAAATATTGTCTATAAGGGAACCTTGTCTAAGGAAGGTTCTGTCTAGGTTGAGGTACTTAAGCTGACCGTAAGGGAACACCTCCCCATGTAACCGACCTGGGGGTTGAATTAGTCAATATTTTATATACATGCATTAATGTCGTTGGTTTATTAAAATGTTTAATGAACTAAAACATTTATGTACAACTACCAATATAATAGTTATATTGGGCCGACTAAAAATTCACTTAGTTAATTTTACTTAGCCGGGATTTATCTAAGTAATATACCCTAGTCTTAGAATACTAAGTGGGAGCAAAAGAGAATATGTAATATACAGCGTATATATTTGATAGACAAAGTATATATAGCATACTTTTCTCTCTCTCACGTTCTCCTTTATATTGACACTGTGAGTTCCATGCTCCGTCTCGCGTCGCCCTTGGCGCGGCCTCCCTACGGAAGGTGTTTGCATGGTTCAATATCGAAGTGAATGGAGAAAGTATTCATAGTAAGTGGGAGAAGGACGTGTGACAACACATCCTGTGGTCTCCGCCATTGGTTTGCACCGTGAGGTTTCATACATGACTTGCGTGTCGTCCTGAAGCGACCATCCCTACGGAGGGTTCATTGTATGGAATCAAAACCAAGGCAAACTTCAGAAATGGATAAGGGTTTCTTAGGTTTTGCTCCAATATTTTCCTTCCATACGGCAGGAATATTGGGGCGGATCTCTGAGGTCTGAAAATGATGGGTCACACTTACGGGGAGTTGTTAAGTTAGTTAGCAATTTCCTGACCAAATAAACGATGACTAAAGTTTATAGGAATAAGAGTTATTCTGGTGTAAAATTTAGTTAAAAACGTCTTAATTCAGTGAAGGAGTACCTGTCTGCCCTACGGTAGCTGTTGCTCTAAATCACTGAAGCGTCGTTGCAAAACAATTTTGTTAGGGTGCTTAATTACTTTTCCTAAAATTTGATTGGATTTAAAGGCTAATGCATGAAACTGATATAGGTTTGTTTTTACTTTCAGCATGTCTGCTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAACGGCGAGAATTACAAACAATGGAAATCGAATCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGTTTAACACCACTGTGGCGGTGCGCAACGTCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCTTGACGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCACGGACTCGCTGCAGAGCATGTTTCGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCTTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAATGTGGCAGAGTCGAACAGGGCTGTCATAAACGAGCAAAGTCAGGTTAACTTCCTTCTGGAATCTCTTCCGAAGAGTTTCCTTCCATTCCGCAGCAATGTGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAGAGTAAGGGACAAGAAGGGGAGGCAAACGTTGCCACCTCAAAGAGGTTCAATTGAGGTTCGTCCTCTGGAACCAAGTCTGCACCATCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAAGCTGCTGGTAAGGCGTCTAAACCTGACTCCGCTGCTGCCGCTGCCAAGAAAGGCAAGACCAAGGTTGCAGAGAAAGGAAAGTGTTTCAACTGCAATGTGGACAGACATTGGAAGCGCAACTGCCCAAAGTACTTTGCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTTTGGAAACATGTTTAGTGGAGAACGATGACTGCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATCTCAGGGAATTAGTTCCTGGAGGCAGCTTGACGCTGGAGAGATGACTCTCAAGGTCGAAACGGGAGAGGTCGTCTCAACTGTGGCGGTAGGGGAGCTCAAGTTGTTTACAAACAAGAATATGTATATATTATTAGATAATGTGTACGTGGTTCCTAAAATTAAAAGGAACTTAATTTCAGTTTTTTGTTTGTTAGAACATTTGTATTCCATTTCATTTAATTTGAATGAAGCGTTAATTTCAAGAAATGGTGTTAGTATTTGTTCTGCTTTGCTTGAAAACAACTTGTATTTGCTAAGACCAACCATAACTAAAGCAATTTTAAATACCGAGTTGTTTAAAAATGCTAGAACCCAGAATAAAAGAGGCAAAAAGTTTCTCAAAAAGAAAATACCTATCTTTGACACTTATGATTAGGTCACACTAATCTCAATAGGATTTGGAGGTTGGTTAAGAATGGCATTCTACGTGAGTTAGAAGATGATTTTTTACCACCATGTGAGTCTTGTCTTGAAGGCAAGATGACTAAAAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCCTTAGAACTTATACATTCAGACCTTTGTGGTCCAATGAATGTTAAAGCTCGAGGAGGGTATGAATACTTCATCTCTTTTATAGATGTTTATTCAAGGTATGGTTATTTATACCTAAAGCAACATAAGTCTGAAACCCTTGAAAAGTTCAAGGAGTACAAGACTGAAGTTGAAAATCAATTAGGTAAAACCATTAAAACACTTCGATCTGATCGCAGTGGAGAGTATATAGACTTGAGATTCCAGGACTATATGATAGAACATGGAATCACATCCTAACTCTCAGCTCCTGGTACAACTCAACAAAACGTTGTATCAGAAAAGAGAAACAGAATCCTGTTGGACATGGTTCGATCAATGATGAGCTATGCTCAGTTGCCTAACTCATTTTGGGGTTATGCAGTGGAGACTGCGGTTTATATTTTGAACGTAGTTCCATCAAAGAGTGTATCTGAAACACCCTTTGAGCTATGGAAGGGTCGTAAAGCTAGTTTACGTCATTTCCGTATTTGGGGATGCTCGGCACACGTGCTCATGACTAACCCAAAGAAACTAGAGCCTCGTTCAAAAGTATGCCTATTCGTAGGTTACCCTAAAGAGATGAGAGGTGGTCTCTTCTTCGATCCTCAAGAAAACAGAGTGTTTGTATCGACAAATGCCACTTTCTTGGAAGAATACCACGTTCGAGATCATAAACCACGGAGTAAGATAGTGATGAACGAGATTTCCAAAGAGGCTACAAACACGTCAACAAGAGTTGTTGATCAAATTGGCACTACGACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAGTCACATCCACCTCAAGTATTGAAGGTGCCACGACGTAGTGGGAGGATTGTGTCACAACCTGACCGCTATATGGGTTTAACTGAAGCCCAAGTTGTCATACCTGATGACGGTGTCGAGGATCCATTGACCTACAAAAAGGCAATGGAAGATACTGACAAGGACAAAGGGTCAAAGCAATGAACCTGTAAATGGAGTCGATGTACTTCAATTCCGTTTGGGAACGTGTAGATCAACCTGATAGGGTAAACCCATAGGGTGCAAATGGATCTATAAGCAAAAACGAGACGCCGAAGGAAAAGTACGGATCTATAAGGCGAGACTTGTGGCAAAAGGCTACACACAAAGGGAAGGGGTGGACTATGAAGAAACTTTTTCCCTTGTAGCTATGCTTAAGTCCATTAGAATATTTTTGTCCATTGCCACATATTATGACTATGAGATATGACAAATGGACGTCAAGACTGCTTTTCTGAATGACAATCTTGAAGAAAGTATTTGTATGTCTCAACCCGAAGGGTTCATAGCTCAAGGTCAAGAGCAAAAGGTTTGCAAGCTTAAACGGTCTATATATGGACTGAAACAAGCATCTCGATCTTGTAACATAAGGTTCGATACTGCGATCAAATCTTATGGTTTTGACCAAAACGTTGATGAGCCTTGTGTATACAAGAAGGTCGTCAACAACAATGTAGCCTTCTTAGTGTTGTACGTGGATGATATCCTACTCATTGGGAATGATGTAGGATACCTCATTGACATAAAGAAGTGACTAGCGACTGAGTTCCAAATGAAGGATTTCGGAGAAGCTCAGTATGTTCTTGGCATCCAAATTCTAAGGGATCGCAAGAACAAAACGTTAGCCCTGTCTCAAGCAACTTATATCGACAAGATGCTTGTTCGATATTCGATGCAGAATTCCAAGAAAGGCTTATTACCCTTCAGGCATGGAATTCATCTGTCTAATGAACAGTGTCCTGAGACACCTCAAGAGGTTGAGGATATGAGACGTATACCCTATGCTTCAGCAGTGGGCAGCCTAATGTACGTTATACTTGCACTAGGCCTGACATTTGCTATGCAGTAGGGATGGTCAGTAGGTACCAATCCAACCCAGGATATGATCACTGGACTGCTGTTAAGGGAATCCTCAAGTATCTTAGGAGAACGAGGGACTACAAGCTCGTGTATGGCGCTAAGGATTTGATCCTTACAAGATACACTGACTCTGACTTCCAAACAGACAAGGATTCAAGGAAATCCACGTCGGGCTCAGTATTCACTCTTAACGGAGGAGCGGTAGTTTGGCGAAGCATCAAACAAGGATGCATCGCAGACTCCACCATGGAGGTTGAGTATGTTGCAGCTTGTGAAGCAGCAAAAAAGGCGGTTCGGCTTAGAAAGTTCTTGACAGATTTGGAAGTCGTTCCAAACATGAACTTCCCCATCACACTTTATTGTGATAACAGTGGTGCTGTGGCAAATTCTAAAGAACCTCGAAGCCACAAGAGAGGCAAACACATCGAAAGGAAATACCACCTTATAATGGAGATAGTGTAACGAGGAGATGTGATCGTCATAGAGATTGCTTCGGAGCACAATATTGCTGATCCTTTTACGAAGGCACTCACGGCTAAAGTGTTTGAGAGCCATCTAGAAAGTCTAGGTCTACGAGCATTGTACAGTTAGTCTACGGCAAGTGGGAGAAATGGAATGGGTATATGTATGCCCTAGTTTATTGTATTTTCCCTTTGCTTTATGATAATGTACACCCCACTCGAGTTAAGTTCAAGTAGGAGTTTGTTGGGTGTGTTGCCCTAAACTCGTAGTTTGTATTTTGTGTAAAAACAAATGTATATTCAATAAATTTGTTGTTGAATTCATAGCATTGCATTGTCCAAAATCCAATAAACAAACATTAGAGGTTATTTTAGTGTGACTGGGACAGTATATGTACTGGGTATATATTTGATATACTAGGTATATAATTGATATACCCCTGATTTACGAGGTATATGTTTGATATACCCCTGATATATAAGTGGATAATGTCCCGTGAATAACCTAAAGGGTCTATAGTATATGGATAAGGCTGGGTACCTTATCCTGACAAGACTATGGATACGGCCCACTCTATAAAGGTTACATACGATTCGATCCAGGTCGTTCGTGTGGAGACATGTGAGTGGGGGTATCCTATACAATGAGTTTGTATAAGACCGGACCACGAAATAGCCAATCTTTAGATGTAACACCGTTAACTAATAGATTGCTGTTTCTTAGGATGACCAGACAACTCATTCTCCATCCTGAGTGAGTTATGGACTCCCGCCCGTGAGGGCTCGTCCTTTAATCTGTATGGGTGAGAGTGGCCCAAGTTGCCGACTCAATATTCCTACCATTTTGGGGACGAGACAGAGTGGGGAGCTGGGAACATAACTTCACAAGATGGAATTACTCCTTCCCGACTTTAGGGGAATAGATGAGTGTTCTCTTAAATGCTGACTCCGGGACTTGAACAAGGGGCCCCACCCTCTCATTGGCTCGAGAAGGACTTCTGGTTATTGGTCGGACCATAACCGAGTTGTTCATTAGAGGAGCAGTGGTATTTAAGGAGTAAGATGTAACTTAGGGGTAAAACGGTAATTTGACCCAACTTCAGTTACGAGCACTCGTGAAGGATTGACTTGTCATTTATGGCGTATATCCGTGGACATGAAAACATTCTGCAGTGAGAAGAGTGCAACTGCGGGTCTTTAGTGGAATGAGCCGTAGTTAACGAATGTTGATTAACTCGGTCAATGAGTTTGATCGATTAATCTCGTATCGTTGGAGCTTCTGATCTGTAGGTCCATTAGGTCCCCTTGCTAGCTCATAACGGGTAAAACAAGGATCAAGTTTTTGGAAGAATTTGAACTGTTCAAATTGATTAAGGGATTATATGTATATTGATACATTATAATATATAGTTAATTCTAAATTAAAACCATATATTATTGAGAGAAATTATTTGAATGAGATTCAAATAAAATTATTGAGAGAAATTAAGCATTACCATTGGCTTCATTGGCTTAAGGCTCTAACTTTGGCGGAGCTGATTTTGCCTGCCTTAATCTTTCATAGGTTTCTAACTCATTGGCTATGGCCAGCTGTTTTACACCCGTTAGTCTCGTTGTCGTTTGACTTTTTGTTTTTGCTCCACCTTGTGTTTTATCCGTGCTAGGCCAAGGGGCCCGCTTTTGGTTTGGCCTTGATTGAGCTCTCCTTAATTGTCCCTTCCAACTTTTGTTGATCTAGTTGATCCATATATTATCATGCCCCCTACTTGAGGAGTACATGGAAAGAAGTAACCGTGGTTGCTTAAGTATGTAGATAATAGAAATTTTATTGAACCAAAATGTAACAACCAAAAAAAATATATAAATATATATATTTAAATTATTATTTTGTTGTTTTCTTTTATTTTCTCAAGGATAAACCTTTTTTTTTAAAAAATTATTTTTAAAAGATAATTTGTTCTTAGAATTCGAAAAAAAAATAGAAAGAAAGAGTTTTATTAAATCTTTATCTTTTTGTTATCCTTTTTAAATAAGGATTCATTAGGGTTTATTATAAAAGCCCTTAAACCCTAGGTTTTATTTTCTTAAACTGTTAGACCTAGTTTTCTCTCAAAGAGAAAAAGAAAAGAAAAGAAGACGAGTGTTGGTATTTTCTCTCAAAGATCGGTAAGATTTTCTTCTCATTGTTTTAGTTTTATTTTTCTTGATTTGTAGTTCTTTAATTCAGTTTATTGGGAAAAGTTTTGAGGTCTTTGTTTTTTTGAATAAAATATTTTAAGGGCTTTTGCCCTCAAAAAAAATTTTCCAGAATTTTTAGTTTGAATCTTTTTGTTTATGAGATTGTTTTTGGGCTATTTTGAGACTCACTTTGCTTAAAAAGTTTACCTATTTATATCCTGTAAATTATAGTTTGTTGTTATGATTTTTACATGAAAACTTATACGTTTTTACCAAGTTTTAAAGGTTTTATATACGGTTTTTCAGATCTGATTTTTTTATAAAAATTAGTAAATTGTTAATTGTTTGATGGAAAATTCTTGGAAGTTTTATTTTGTAAAATAGAAAATTTTTCAGATTTAAAATCCCCAAATTTTCAATTGTTTAGAAGGTTTTTAAGTTGGTTTTATTGAGAAACTTTTTGTAAATTTAGTTTGTATTGATTTATATCGATTCTTCTAATTTTAAAATTTGTTATTATTGAATTTAGAGGCTTTTTAACCTAGTTTTTATATCTGGTTTTTTTAGCCCTAAATCCATACTTTTCTAATTATTTTGCATAAAAAAGTGTTTTGAATGGGAAAATTGGGTAGGTAAGATTTGGGTTAGTAATTGGTTGAATTTTAGATCCTTGGATAGGATTTAAGAATTGATTTTGGATATTGGTTGTTTCTAGGCCTCTGAAAGAGGTAAGTAATTTGTCATATCCCTTCGAAAAAGTATTGTAAGTGATTTACTGTATCTTTCGAAAAGTATTGTAAGTAATTTGTCATATCCGTTCGAAAAAGTGTTGTAAGTAATTTGTCATATCCCTTCGAAAAAGTATTGTAAGTAATTTACCGTATCCCTTCAAAAAGTTTTGTAAGTGATTTGTCATATTTCAAAAACTAAAATGTATTTTGGATTTGATAGGAAGATTTGAAATTGATATCTTGCATATGAAATTGAAGTTTTATTATGGATTTGAAAACTAATTTGGCAATTGAAATTATGTTTATTTGCAAAGTATTTGAAGTATTCTTAAATGATTTGATAAAATTTTGTTTAAATTTATTTAAAGGTTTGAAATTGATATTTTGTTTATTTGTGAAATATTTGAAATCCTGTTTTGGACGATTGATAAAGTTTGTTTTAAATTATTTCAATGTTTTGAATTAATTTAATATTGAAAGTAAGTATTTGTTTTGCATTTCATTTGAAAGATATTTTGATTGATTTCATAGGGCTATCCGGATATGCCATGATTTTGTTTTGAGGAATGTTTTGCTTATTTGGGGTAGTAGCCTAATTTGGAATTGATGGATTTCTGTTCTTACTTATTTGTGGTTTGGCATTCCTTGGGAATGAAGATTCATGTGTGTGTTATTCACTCACATGACCTTACACTTAAGTAACTCTGTACTGGTCGATATTATTTCGACAACTCTGTTCTGGTCGATATTATTTCGACAACTCTGTTCTGGTCGATATTATTTCGACAACTCTGTTCTGGTCGATATTATTTCGACAACTCTATTCTGGTCGATATTAATTCGACAACTCTGATAGAAACTTCTGATTGATTTGGTGTTACCGACGGGTAACCACATATGTTTTGAAGGCTACCCAAATAATGCAACCTTCCATTCTGATTTTGAAAGTATGAATGAATGTAAGATTCTTTTAATGTTGAATGATTTAGTTATTCAAAAGTTGGAAAATTGTACCAAGTGTTTTAGAATGAAGTTTGAAAGTTTTAATTGAAAGAAAGTTTGTTTTGAAGGATTTGAAGTTTGATCGAATTAAAGTTTGTTATGAAAGAATGAAAGTTTGTTTTGATTGACTGAAAATTTTATCTTATTTGAAGACTTTGTTTCCAAAAATCTAGTTTATTTTGAAAGTTTTGAAAATTGTATTAAACATTTTAAAGGACTTTAAAAGTTTTGGTTTGTTCGTATTTTTGTTTTGAAAGGAAAATGAATTTCGAAATTCGATATGGAAATTGCTTATTGAGTATTTTTTTGTACTCATTTTTTTTCTATTTTCTAAATGTTTTCAGATGAAGGTTCAAGTGACTGCTCAGATAGTGATTGAGGAATAG

mRNA sequence

ATGGCGCTATGCAGTAGCGCCATGGCACTGCGGGGACAGCACACAGCACCACGGCGCTGCACTGTAGCACCGCGGCCCTGCTGCTGCGGCATTTTGGTGCAACAGTGCCGTGGCGCTGCCCTTAGGCACCGAGACGCTGTCCCGAGTGTTTTTCGACGTGCTTCCGTGGCTCCGGAATATTGGGGCGGATCTCTGAGGTCTGAAAATGATGGGTCACACTTACGGGGAGTTGTTAACATGTCTGCTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAACGGCGAGAATTACAAACAATGGAAATCGAATCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGTTTAACACCACTGTGGCGGTGCGCAACGTCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCTTGACGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCACGGACTCGCTGCAGAGCATGTTTCGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCTTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAATGTGGCAGAGTCGAACAGGGCTGTCATAAACGAGCAAAGTCAGGTTAACTTCCTTCTGGAATCTCTTCCGAAGAGTTTCCTTCCATTCCGCAGCAATGTGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAGAGTAAGGGACAAGAAGGGGAGGCAAACACTTTTAAGAAGAAAGCTGCTGGTAAGGCGTCTAAACCTGACTCCGCTGCTGCCGCTGCCAAGAAAGGCAAGACCAAGGTTGCAGAGAAAGGAAAGTGTTTCAACTGCAATGTGGACAGACATTGGAAGCGCAACTGCCCAAAGTACTTTGCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTTTGGAAACATGTTTAGTGGAGAACGATGACTGCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATCTCAGGGAATTAGTTCCTGGAGGCAGCTTGACGCTGGAGAGATGACTCTCAAGGTCGAAACGGGAGAGGTCGTCTCAACTGTGGCGATGAAGGTTCAAGTGACTGCTCAGATAGTGATTGAGGAATAG

Coding sequence (CDS)

ATGGCGCTATGCAGTAGCGCCATGGCACTGCGGGGACAGCACACAGCACCACGGCGCTGCACTGTAGCACCGCGGCCCTGCTGCTGCGGCATTTTGGTGCAACAGTGCCGTGGCGCTGCCCTTAGGCACCGAGACGCTGTCCCGAGTGTTTTTCGACGTGCTTCCGTGGCTCCGGAATATTGGGGCGGATCTCTGAGGTCTGAAAATGATGGGTCACACTTACGGGGAGTTGTTAACATGTCTGCTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAACGGCGAGAATTACAAACAATGGAAATCGAATCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGTTTAACACCACTGTGGCGGTGCGCAACGTCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCTTGACGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCACGGACTCGCTGCAGAGCATGTTTCGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCTTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAATGTGGCAGAGTCGAACAGGGCTGTCATAAACGAGCAAAGTCAGGTTAACTTCCTTCTGGAATCTCTTCCGAAGAGTTTCCTTCCATTCCGCAGCAATGTGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAGAGTAAGGGACAAGAAGGGGAGGCAAACACTTTTAAGAAGAAAGCTGCTGGTAAGGCGTCTAAACCTGACTCCGCTGCTGCCGCTGCCAAGAAAGGCAAGACCAAGGTTGCAGAGAAAGGAAAGTGTTTCAACTGCAATGTGGACAGACATTGGAAGCGCAACTGCCCAAAGTACTTTGCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTTTGGAAACATGTTTAGTGGAGAACGATGACTGCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATCTCAGGGAATTAGTTCCTGGAGGCAGCTTGACGCTGGAGAGATGACTCTCAAGGTCGAAACGGGAGAGGTCGTCTCAACTGTGGCGATGAAGGTTCAAGTGACTGCTCAGATAGTGATTGAGGAATAG

Protein sequence

MALCSSAMALRGQHTAPRRCTVAPRPCCCGILVQQCRGAALRHRDAVPSVFRRASVAPEYWGGSLRSENDGSHLRGVVNMSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYDRWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKSKGQEGEANTFKKKAAGKASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLETCLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAMKVQVTAQIVIEE
Homology
BLAST of Moc02g17040 vs. NCBI nr
Match: TYK14550.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 433.3 bits (1113), Expect = 2.3e-117
Identity = 226/353 (64.02%), Postives = 270/353 (76.49%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K + AAA   K KTK A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKANLAAAKTTK-KTKAA-KGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. NCBI nr
Match: KAA0035879.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051221.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051893.1 gag/pol protein [Cucumis melo var. makuwa] >TYK00551.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 433.3 bits (1113), Expect = 2.3e-117
Identity = 226/353 (64.02%), Postives = 270/353 (76.49%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K + AAA   K KTK A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKANLAAAKTTK-KTKAA-KGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. NCBI nr
Match: KAA0054490.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 432.2 bits (1110), Expect = 5.2e-117
Identity = 224/353 (63.46%), Postives = 268/353 (75.92%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K +   AAAK  K   A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKAN--LAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. NCBI nr
Match: KAA0047792.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 432.2 bits (1110), Expect = 5.2e-117
Identity = 224/353 (63.46%), Postives = 268/353 (75.92%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K +   AAAK  K   A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKAN--LAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. NCBI nr
Match: KAA0031826.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0032384.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0039313.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0043789.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0048789.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 432.2 bits (1110), Expect = 5.2e-117
Identity = 224/353 (63.46%), Postives = 268/353 (75.92%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K +   AAAK  K   A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKAN--LAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 2.8e-09
Identity = 68/303 (22.44%), Postives = 127/303 (41.91%), Query Frame = 0

Query: 92  KLNGEN-YKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYDRWIKANDKAKV 151
           K NG+N +  W+  +  +L+   L  VL  D  +              + W   +++A  
Sbjct: 10  KFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKA--------EDWADLDERAAS 69

Query: 152 YILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNSRMKEGSSVREH 211
            I   +SD +     D  TA+ I   L+S++   +   +    K +Y   M EG++   H
Sbjct: 70  AIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSH 129

Query: 212 VLNLMVHFNVAESNRAV-INEQSQVNFLLESLPKSFLPFRSNVVMNKLEYTLTTLLNELQ 271
            LN+        +N  V I E+ +   LL SLP S+    + ++  K    L  + + L 
Sbjct: 130 -LNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 189

Query: 272 TYQSLMKSKGQEGEANTFKKKAAGKASKPDSAAAAAKKGKTKVAEKGK---CFNCNVDRH 331
             + + K    +G+A   + +        ++   +  +GK+K   K +   C+NCN   H
Sbjct: 190 LNEKMRKKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGH 249

Query: 332 WKRNCPKYFAEKKKANEGKYD-------------LLVL---ETCL-VENDDCAWILDSGA 373
           +KR+CP     K + +  K D             +L +   E C+ +   +  W++D+ A
Sbjct: 250 FKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAA 303

BLAST of Moc02g17040 vs. ExPASy TrEMBL
Match: A0A5D3CPJ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00040 PE=4 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 1.1e-117
Identity = 226/353 (64.02%), Postives = 270/353 (76.49%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K + AAA   K KTK A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKANLAAAKTTK-KTKAA-KGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. ExPASy TrEMBL
Match: A0A5D3CSZ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold552G00320 PE=4 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 1.1e-117
Identity = 226/353 (64.02%), Postives = 270/353 (76.49%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K + AAA   K KTK A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKANLAAAKTTK-KTKAA-KGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. ExPASy TrEMBL
Match: A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 2.5e-117
Identity = 224/353 (63.46%), Postives = 268/353 (75.92%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K +   AAAK  K   A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKAN--LAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. ExPASy TrEMBL
Match: A0A5A7TWB9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G00310 PE=4 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 2.5e-117
Identity = 224/353 (63.46%), Postives = 268/353 (75.92%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K +   AAAK  K   A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKAN--LAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

BLAST of Moc02g17040 vs. ExPASy TrEMBL
Match: A0A5A7UGV2 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold24G002690 PE=4 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 2.5e-117
Identity = 224/353 (63.46%), Postives = 268/353 (75.92%), Query Frame = 0

Query: 80  MSASIIALLAAQKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAFNTTVAVRNVYD 139
           M+++ + +LAA KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA N T  VR  Y+
Sbjct: 1   MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 140 RWIKANDKAKVYILTSISDVLAKKHEDTVTAKEITDSLQSMFRQPSSQARHEALKFIYNS 199
           RW KAN+KA+ YIL S+S+VLAKKHE  +TA+EI DSLQ MF Q S Q +H+ALK+IYN+
Sbjct: 61  RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 200 RMKEGSSVREHVLNLMVHFNVAESNRAVINEQSQVNFLLESLPKSFLPFRSNVVMNKLEY 259
           RM EG+SVREHVLN+MVHFNVAE N AVI+E SQV+F+LESLP+SFL FRSN VMNK+ Y
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 260 TLTTLLNELQTYQSLMKSKGQEGEANT--------------------------FKKKAAG 319
           TLTTLLNELQT++SLMK KGQ+GEAN                           +KKK  G
Sbjct: 181 TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 320 KASKPDSAAAAAKKGKTKVAEKGKCFNCNVDRHWKRNCPKYFAEKKKANEGKYDLLVLET 379
           + +K +   AAAK  K   A KG CF+CN + HWKRNCPKY AEKKKA +GKYDLLVLET
Sbjct: 241 QGNKAN--LAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLET 300

Query: 380 CLVENDDCAWILDSGATNHVCSSSQGISSWRQLDAGEMTLKVETGEVVSTVAM 407
           CLVENDD AWI+DSGATNHVCSS QGISSWRQL+ GEMT++V TG VVS +A+
Sbjct: 301 CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAV 351

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
TYK14550.12.3e-11764.02gag/pol protein [Cucumis melo var. makuwa][more]
KAA0035879.12.3e-11764.02gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumi... [more]
KAA0054490.15.2e-11763.46gag/pol protein [Cucumis melo var. makuwa][more]
KAA0047792.15.2e-11763.46gag/pol protein [Cucumis melo var. makuwa][more]
KAA0031826.15.2e-11763.46gag/pol protein [Cucumis melo var. makuwa] >KAA0032384.1 gag/pol protein [Cucumi... [more]
Match NameE-valueIdentityDescription
P109782.8e-0922.44Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Match NameE-valueIdentityDescription
A0A5D3CPJ61.1e-11764.02Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G0004... [more]
A0A5D3CSZ61.1e-11764.02Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold552G0032... [more]
A0A5A7SMH82.5e-11763.46Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025... [more]
A0A5A7TWB92.5e-11763.46Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G0031... [more]
A0A5A7UGV22.5e-11763.46Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold24G00269... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001878Zinc finger, CCHC-typeSMARTSM00343c2hcfinal6coord: 317..333
e-value: 0.007
score: 24.5
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 141..273
e-value: 1.5E-17
score: 63.7
NoneNo IPR availablePANTHERPTHR35317:SF16ZINC FINGER, CCHC-TYPE-RELATEDcoord: 86..337
NoneNo IPR availablePANTHERPTHR35317OS04G0629600 PROTEINcoord: 86..337
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 307..337

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc02g17040.1Moc02g17040.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0043170 macromolecule metabolic process
biological_process GO:0006807 nitrogen compound metabolic process
biological_process GO:0044238 primary metabolic process
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding