Moc03g19760 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc03g19760
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr3: 13317015 .. 13326312 (+)
RNA-Seq ExpressionMoc03g19760
SyntenyMoc03g19760
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGTTCGATGGGGCCAATTTCCGATATTGGAAAATGCAGATTAAGGATTACTTGACCTGCAAGATGTTGAATATAGCTTTAAAGGAACGACCAGCAGAGGTGAAAGATGAAGATTGGGTAGAGACAAATCAACAGGCAGTTGCCTTTATCAGATTATCTTTGTTGATGAATGTGGCAAGTCTCGTAGCAAACGAAATAAATGCAATAGATCTGATGAAAGCACTGGAGAATAGGTACGAGAAATCCTCAGCTAATAATAAGGTATATCTCGTAAGGAGATTCTTTAACATTCAGATGGAAGAGAACACTTCTATCAATTCCTACATAAATGAAGTCACAAAATTAATCAATCAATTGGCATCAGTCAAGATTACTTTTAGTGACAAAGTGAATGTTATTTTGTTGCTAACATCTTTATCTGAAAGCTTGGAAATGATGAAGACAACAATGTCCAATTCGTTAGGAGGAAAATCTTTGAAATTTTCAAAAAATTTTGATGCCGCAACTGCTGAAGAAATTTGCAAAAAGAGGAGTGAAAAAGAATCTACTTCCGAAAAAACTCAGCATTGGTTGTCGAAAAAGGTAAAGAGAAGGTTGCATCTGATGAAAGATAGCAGAAACATAGTAGGAGGGATTGGAAGAGAGATGCTGATTGTTTTCACTGCCATAAAAAAGAGCCACATCAAGAAAAATTGTATAATTTTAAAAGAGGATCTGAAAAGGTATATGACAGAGTCAAATGCAGTTGTAGACGATGCCCTCGTGTGTGTTGAAAGCAACACAAAAACAGGAAACCAGTAATCGGAGTGAGTAATAGACAGCGCAGCTTCAATACTCATATCTTTAAATAAAAGATTATTCACATCATTCAGAAGAGGCAATCGCGGCCACATGAGGATGGCGAATGAAATTCTTTCCAAGACAAAAGGATTGAAAATATATGATTGAAGACCGATAATGGGACTGAGTTATTACTGCAAGATGTCAGGTATGTACCCAGTATCAGAATGAATCTTATCTTGATAGGAAAGTTGGACGATGATGATTATCAGAGTGAGTTTGGTGGGAACCAGTGGAAGGTCACCAAAGGATCCAAGTTGGTGGCAATTGGCCATAGAAGATCTACAGTTTACACGTCACAATTGAGTGTTGCCAGAAGATCTTTGAAACAACGAATGTAAGTTGCAGATGGTGTCCAAAGAGGTAGGACTGAACTAGCAAGAATGACAACCAGAACAGATCAGGAGGACTTGCCATCAATTCAGGAACAACAATTGGGAAGTTAAAGAACGAGAACAGGTTCTTTAGGTTGTCAGGGATTATCCCTAGTTGTCAGACGGATAGGTGAATTGATGAAGTCGCGTCAGCGAACAGTTGCATCAGAGAAGATGAACCCAATAGGTGCTGAGGTAAAGAATAGAGTTTCAATTCCGGCAACAGGCTTGAATAGAGTTGTCAAGTCATCAGTGAGAAGTTCTTTCTTCAAGAATCAGTGTTTATGAACAAAGAAGAAAGCTTCAGATAGCACCACTTAGTTTGAGTGGGAGCACGTAGTTGTGTCTCTTGTGTTTGGGAGACAACACCACGTAGATTGCTCGACCTCAGGGCAATTACAGATGAGACCAATGATTTCTGACATTTTGTTTTACAAGTTTTTAAACTTGCAAAAGAAGCCTTGAAGATTTTTTCAAGAAGATTAAAAGTTGTTGCAGAAGAAGCCTTGGAGTTTTAATTTCCAAGATTATAAGAGACTGATGTTGCAGTGGGAGGCAAGATCGATGTTTTGTCTCCAAGTGGGAGATTGTTGGGATAATGGAGCCAAAACAGATGACCCGTGTCCTATGAGGAAAGGATTTTCAACACTACTACCAATTCCTACAAGGAGTAGGATATGATTGTCTTATAAGAATTACAAAATTGTTTTCCTATAAGGATTGAGATATTTGACCTCTTATAAATGGAACTCTTTTGTTAAGGTTTCATTGGTATAGAATCTAAATAAGAGAGATTTGAGTGGTTGTGGGCAGTATGTGAAAAAATGAGTTTGTATCTCGTGAGTTTAATGAAGGCGTTTTCAATTGTGTAGTCCTAGGTACCTTTTTTTAGGATACTTAGAAGCAAAATAATAATATGTTGAGTGAGATCTCCATAGTATAATCCATCCCTATCTCTTATTCCTTTGTTATTGTTACTTACTTACATGAGGATCGATTAATCTATATTTTTATCTCTACAATGGTATGATATTGTCCGCTTTGGGCATATCCCTCACGGCTTTGCTTTTGATTTCACCCAAAAGGCCTCATACCAATGGAAATAATTGTCCTCTCTTATATACCATGGATCACCCCTCCCCTAGCCAATGTGGGACATATATAAGATTTTCGTGATAGTAAAATGTGTCAATATAAAAGTATTAATGACAATTAAAAATTGTCAATATAAAGATTTATATGACAGTTAAAACTGTCAATATTGGCCGTTAAAACTATCAAAATAAAAGTTTTTGTAACAGTTTTAAATAATCATTATTATGTTTTTCTTCATGTTTATAAACAGTGGTAAAACAATATTTTGTGTCAATTTTTTTACCGTTAAAAAACATTTGATGATAATTATAAACTGTCGTTATTAAGTTTTTGTTGACATAAGAAACAAATCTTTTATTGTTTCAGTGACGTTTGTAAATTGTCATCAAATAAGATTTTGTAACATTTTTTAATTGTCATGGAAAATCTATTCGTGACAATTTATAACTGTAGTTGTAAATACATTGATGACAATTTTCAAAAACTGTCAAGTTCTCATTTTTCTTGATGCCTTAAATCATTTCATCACTTAATTTTGATTTTGATAAATTATTCTCACAAACATACAATATATCAAAATCAATAGATAACAAGAACGTTTTCATACGATGTCTCCAAATATCATATTTTCTACCATCAAAGAATGGAGGTCTATCAATGCAATAACTTTCATAAAAATCTAAAGTTGTCATAGCTCTAGCAATTAAGCTTTTCAAATAGTTTAAAATAAAAAATTAACAAGATACTTAGTATTGAGATAAAATTTGAAAAATAATTTATGTAGAAAAGCGAAACCGAGCTCTGATACCAACTTGTTGGATCGGTATGACTACCCTAGAGGGGTAAATAGGGTTTAATGAAACTTTATAAACTTTTTAACATGGTGGGCCCAACAATTATGTAAACTTTTTGCTAACAAATAATTTTATCATGTATGAAACATACCTACAAAAATAAGGCTCAACCAAAAGATCTTGGTACCCATTTCTTAGGTCTTCATTGTTAGGAAGCATAGAGTTTGGAACTCATTTCAAATTAGTTTCAACATTCTTTGTATTTAATTTTAGCAAATAACATGTATGAAATTTATGACCAATTTTGCCACAAGAATAACATATAACATTAGACAAGAATTTCTTTTTAACAAAATGATTTGGCACAATAATTTTCTTATAGAATTTCTTCCTTGGAGCATTTTTCAATTTAAAACAATTTGACCTAATATGACTTTTAACTCCACAAAAATGACATGTAGGCATAAATTTTGATTTTACAAGCATATCTCTAGCTTTTCATATCTATGTGCAAATTTAGTACTATGCACACATTTAGATTTGGTATAAACATGTTTAGACACAAATTTAGTATTAGGCATATAACTAGTTCTACCACAAGCATGATTAAGCACAAAGTTTGATTTTGTATGCATATTTCTATTATTTTCATACTTATTCACAAATTTCACATTATGGTCTAATTTTGATCTAGCATGAAATTGTCTAGGCACAAACTTTGAATTTCCATGCATATTTATAGATTTTTCATGCTTTGGCACAAATTTAGATTTAACAAGATTAGGTACATTGGGTAAAGCATCTTTAAGCATAGAAGAAGTAGAATTTTTATCCACCTAGACTAATCCCCTTTTATCATTAAAAGATTTACTCATGTCTATAATCTTATATAACCTTCGTGCACCTATAGTAAATTTCTTAATAGATTCTTTTGCAATTTCAAGCTTTAATATCAAACAATTCTTTTTTTTTAAGCACATGCAAAAAATCATTTTTTTTCACCATTATCACATTTAAGAATTTAATTTTATCAATAAAAGAAAATTTTTCTTTTTCTAAAGATGTATTACTAGTTTCTAAAATAATTATTTCATTTTTCAAACTAGCAAATTTGTCTTTTAATTCAAAAGAAAAAAAATTCAATTTTTTCATTTCTATTTCTCAAATCATTTAAATCAACTTTTAATTCACATGCTAAAGAATTTGAGATCCTCATTTTATTTTTCGATAAATTTAACTCTATTAATCAAAGCAACTTTCTCATTAGTAGTTCTATTTTTCAAACAAGAGTTTTGATCTATAAAAGATTTATTTTCAATAGAGAATGTTTTATACTTCTTTGATAGTCTAACACATTTAGAACTAACTTTTTCTAAATTATGTTGCATATTTTCAAAAGCATCAAACAATTCATCATATGAAGGGGGTTTATGGTTTACCTCGTCCTCATCACTTTCGAGATCGGTTCCGTCGTCGAATGAGCCATCAAACACATGGAGGATCTATCACATTTTTCTTTCTCCACTTTATGGAATTTTTCTTGATTTTGAGAATTGTAATTCTCCTTCACATCAAATAAATCATTAGTAGTACAAGGAGAATTTTCAACACTATACATATTATCAAGAAAATTCCAAATATCATATGCATATGTAAAATGTGAAATTTTATCACATATATTTTTATCTAAAGCACAACATATGATATTTTTGGCTTGATTATTAAAAGAAATCATTTCATCACTTAAATTTGATTTTGATACATTATTCTCACAAACATACAATATATTTATCAATACATAACAAGAACATTTTCATACGATGTCTCCAAATATCATATTTTCTACCATGAAAGAATGGAGGTCTATCAATGCAATAACTTTTAAGAAAATCTAAAATTGTCATAGTTCTAGCAATTAAGCTTTTCAAATAATTTTTCAAAAATTAACAAGACGATACTTAGTATCGAGATAAAATTTGAAAAATAATTTATGTAGAAGAGCAAAATCGAGCTCTGGTACCAACTTGTTGGATCGGTATGACAACTCTAGGGGGTGAATAGGGTTTAATGAAACTTTATAAACTTTTTAACATGGTGGGTCCAATTTAAACAATTATGTAAACTTTTTACTAACAAATCATGTTATCTCAAGTTAAGTAATAAATTAGAAATGCAAGTAACCTTATTAATTCTCTCACCCAAATAATTATCAAGTAATTGAACTAAGAAGTCGCAATTAAAAATCACAATTAATTCTAGCAATACATTACAAGCCTCATAATTAATTCTATGTAAAGATTCACAATAAACATGAAGAACAAAATATAGCAATTAATTCATGCAACAAATCAAAATATAAAATTAATGCATAAATTAAGTAAGAGAGGGTTAGAGAGAATGACACCGGAAATTTATAGTGGTTCGGCAACACCGGCCTATTCCACTTCCAAGCTCTTCTTTGGGGGTTCAATTGAAGAACTTGACTCTTTCCACGACATTGAGTCCAACCAGTATAATGTTCTTTTTCCGGGAGCAAGAACAAACCCTATCCTTTCCACGGTTCAGGATCAAATCATTACAATGCTCTTTTTCGGGCTCAAGAGCAACCCTTACAATCAAATTGTAAATTTGAGAACACACTTGCAAACTCTCTCAAAGAGTGGATATACAAATTTTCGGTCACAAATTTTGTAAGAAAAATTTTTACAAAATAAAGCCCTCAAATAGCTAAGAAAAATAAAATTGGAAACACAAGTATCTCAAAAATTAAATCTCCCTCAAGAATTTAGAGCAAAAATTGAAGCTTTTAAGATAGATTAACTATTGATGCAAAAATCTGCCAAGGCATGGAGAATGACTTGGGAGGTCGAGGTGCAATACTTGAAGTATTCTTCGGGGGTACCTACAAGGAGGTAAGCCACTCCGACGTCAAAGTCAATAAGTAATCTAAGTTCAACAGATCAGCTTCAGACAGATTTTCATACCTGGGCATTTGATGTGGTGGTGTATTTATTGTTGTTGACAAGGTAGTTAGCTATCCCATCGGGGTATTTCCTTGCTCGCACGGGGTCCCTTGAGCGACCTTTGGTCCTTGCTTTGGACAGGTCCTTTCCCTCCTTTGGCAGGTGGGACTTGGCGACCTGCTTCCCCTTCAGAGTAGTCGAGGTCGACTTGTTAGGTCGGTCTTTTCAATTTTGCTCGAACAAGTCGAACGTCTACAGTTTGACCCCCAATATTGGCCCCCTGTTGGGTCGGGCGACTGGTCGCTCAGTTGAGAGACGTACGGGGGGCAGTGGTCGACCTGTTCTTGACCTTGCCTCCGAGTTGGCCATGGACAACGTGGACCAATCAGTCTCTCCGACAGGCGCATTTAATAGCCGTCAAGGGAGTTTCACATCGGGCACGACCTCGTAAGCGTTCTTGAACACATAACCGTTGCTAAGTACTCGATTTGGACCACTCACTAGGACGACACATGTAGGGCTGAGATTGAATCCTGTAGCTTATATATAAGCTTGAAGGCTTCCCCCTCACTTTACGCATCCAAGACTTCCTTTTGCACTTAGTGAAGGGATGTGAATCTTGGAGCGGAAGCGGACCGTTGTTCGTGAAATTCGGGAAACAAAATCAAAATGTGTAAAGATGAAAATACAGAGGAAAATCATACCTTTGAATAACTTTTCTTCAACGGTTTCCACCCAAACACCACCACCGTGTCTTCCTTGCTATCCTCTTGGGTCTCGAGAGCGTGCTTGTGGGAGCACCGATTTGGAATGTGAGGGAGAGTTGAGAGAGATCAAATTGAGAGAGTTGGGTGGGAGTTCTTTTTTTTTTTTCTTCTGTTTTGTTCTCTTCTTTGCAGAATAGTTCTGCAAATCAAAAAAGAGAAGAAGAATAACGTGGGGTAACTCCCCACGTCATATTAACTCCCCACTCCCCTGCAAATTTGCAGGGGAGTTTAATAATATTATTAACATTATTATTAATATTAATAATATATAACAAATATATATATATATATAATAATTAATATATATATATAAATAATTAATCATATTAATTAATAAAAGTTAATTAATTATTAATCTAATCAATAATTAATTAATATACATATATATCAAATAAATAATTAACCCTCAATTATTTATCACTTGAATAGAATCATATTCTATAACATTTCTGTCGTATAACCTATAGTTCTAATCTACATCTTACACACATTATTTGTAACCTATAGTTTTAACTATGTATTATAATGTATCAATATACATATAATCCCTTAAGTAATTTGAACAATTCAAATTCTTCCAAAACTTGATCCTCGTTTTACCCGTTATGAGCTAGCAAGGGGACCTAATGTACCTATAGATCCGAAGCTCCAACAATACGAGATTAATCGGTCAAACTCATTTACTGAGTTAATCAATATTCGTTAACTACGGGTCACTGCACTAAAGACCCGCAGTTGCACTCTTCTCACTACAGAATGTTTTCGTGTCCATGGATATACGACCATTAATGACAAGTCAATCCTTCACGAGTGTTCGTAACTCAAGTTGGGTTAAATTACCGTTTTACCCCTGAGTTACATCTTACTCCTTAAATACCACTGCTCCTCTAATGAACAACCCGGTTATGGTCCAACCAATAACCGGAAGTCCCTCTCGGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGTCCCGGAGTTAGCATTTAAGAGAACACTCATCTACTCCCCTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAACTTCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATATTGAGTCGGCAACTCGGGCCACTCTCACCCATACAGATCAAAGAATGAGCCCTCACGGGCAGGAGTCCATAACTCACTCAGGATTGAGAATGAGTTGTCTAGTCATCCTAAGAAACAACAATCTATTAGTTAACGGTGTTACATCTAAAGATTGGCTATTTCGTGGTCCAGTCTTATACAAACTCATTGTATAGGATACCCTCACTCGCATGTCTCCACACGAACGACCTGGACGAGTCGTCTGTAACCTTTACAGAGTGGACCGTATCCATAGTGTTGCCAGTATAAGGTACCCAGCCTTATCCATATACTATAGATCCTTTAGGTTATTCACGGGACATTATCCACTTGTATATCAGAGCATATCAAATATATACCTCGTATATAGAGGTATATCAAATATATACCCAGTATATCACATATATACTCAGTACATATATTGTCCCGGTTACACTAAAATAACCTCGGTTGTTTAGTTTATTGGATTTTGGACAATGCAATGCTATGAATTCAACAACAAATTTATTGAATAAAACTTTATTGAATATAGATTTGTTTTTACACAAAATACAAACTACGAGTTTAGGGCAACACACCCAACACTTAGGTTGTAAGAGATGTCCAATAGGGAGAGCAATAGTTCTAGGTCGGGGCCAAGGAGCTCGGAAGAGAGCTCTAGGTCGTCTTTGGATGGGTCACCCCGGTAGCGTTTTCGAGCGGTAGGTCAGCCTCCAACGGGTCAGCCTCCGCCGAATCTTCCAATGACAGGTCTTTTTCCCCCGCATGTGGGGATGAAGTGCCTCCATGCCTAGCTATTCAACCTTACGTCCCTCCCCCCAGGGTCTCAAAAAAACCGAAGGGAGCCCGTCCAACCTGGGCTACGGCTGCCTAGCCTATTTTAGAGATCGGTACGACATTTCTGCGTTTATTGGGATGACGTTTCCCAGGAAGGATGAGGCTATAAACAATCCTCTTGAAGGGCACATCGCCTTTTACGAGAAAATGTTTGAGTCCGGGGTCAGGTTCCCCTTCATCCTTTTGCGCAAGAGTTTCTAGACGAGGTTAACATCGCCCATACGTAGCTCGTCCCCAATGGTTGGGGCATTCTTTTGGGCTACTTGATGCTTTGGTGGGGGCTGTGAATAGATATCGTAGGCACCGAATACCTCACCCCAAGCCAATTTCTTTCCTTTTATACCGTCAAGAACCTACCAAGGAAGCCAGGTCAGTTCTACCTCACAGCTCGACAGGGTGTCGAGAAGTTGTCTGTGGATCGTCTTCCATCAAGCACTGGAAGGAGGACTGGTTCTACTTGTATGGTGGCTGGTTGGCTAAGGGTCATTTTGTCAAGCTGGCGGTTCTTTGCTAA

mRNA sequence

ATGAAGTTCGATGGGGCCAATTTCCGATATTGGAAAATGCAGATTAAGGATTACTTGACCTGCAAGATGTTGAATATAGCTTTAAAGGAACGACCAGCAGAGGTGAAAGATGAAGATTGGGTAGAGACAAATCAACAGGCAGTTGCCTTTATCAGATTATCTTTGTTGATGAATGTGGCAAGTCTCGTAGCAAACGAAATAAATGCAATAGATCTGATGAAAGCACTGGAGAATAGGTACGAGAAATCCTCAGCTAATAATAAGGTATATCTCGTAAGGAGATTCTTTAACATTCAGATGGAAGAGAACACTTCTATCAATTCCTACATAAATGAAGTCACAAAATTAATCAATCAATTGGCATCAGTCAAGATTACTTTTAGTGACAAAGTGAATGTTATTTTGTTGCTAACATCTTTATCTGAAAGCTTGGAAATGATGAAGACAACAATGTCCAATTCGTTAGGAGGAAAATCTTTGAAATTTTCAAAAAATTTTGATGCCGCAACTGCTGAAGAAATTTGCAAAAAGAGGAGTGAAAAAGAATCTACTTCCGAAAAAACTCAGCATTGGTTGTCGAAAAAGGTAAAGAGAAGGTTGCATCTGATGAAAGATAGCAGAAACATAGTAGGAGGGATTGGAAGAGAGATGCTGATTGTTTTCACTGCCATAAAAAAGAGCCACATCAAGAAAAATTGTATAATTTTAAAAGAGGATCTGAAAAGGTATATGACAGAGTCAAATGCAGTTGTAGACGATGCCCTCGTGTGTGTTGAAAGCAACACAAAAACAGGAAACCAGTATGTACCCAGTATCAGAATGAATCTTATCTTGATAGGAAAGTTGGACGATGATGATTATCAGAGTGAGTTTGGTGGGAACCAGTGGAAGGTCACCAAAGGATCCAAGTTGGTGGCAATTGGCCATAGAAGATCTACAGTTTACACGTCACAATTGAGTGTTGCCAGAAGATCTTTGAAACAACGAATACGGATAGGTGAATTGATGAAGTCGCGTCAGCGAACAGTTGCATCAGAGAAGATGAACCCAATAGGTGCTGAGGTAAAGAATAGAGTTTCAATTCCGGCAACAGGCTTGAATAGAGTTGTCAAGTCATCATGGGAGGCAAGATCGATGTTTTGTCTCCAAGTGGGAGATTGTTGGGATAATGGAGCCAAAACAGATGACCCGTGTCCTATGAGGAAAGGATTTTCAACACTACTACCAATTCCTACAAGGAGCATGGAGAATGACTTGGGAGGTCGAGGTGCAATACTTGAAGTATTCTTCGGGGGTACCTACAAGGAGTTAGCTATCCCATCGGGGTATTTCCTTGCTCGCACGGGGTCCCTTGAGCGACCTTTGGTCCTTGCTTTGGACAGGTCCTTTCCCTCCTTTGGCAGGTGGGACTTGGCGACCTGCTTCCCCTTCAGAGTAGTCGAGGTCGACTTGTTAGGTCGGTCTTTTCAATTTTGCTCGAACAAGTCGAACGTCTACAGTTTGACCCCCAATATTGGCCCCCTGTTGGGTCGGGCGACTGGTCGCTCAGTTGAGAGACGTACGGGGGGCAGTGGTCGACCTGTTCTTGACCTTGCCTCCGAGTTGGCCATGGACAACGTGGACCAATCAGATACCCTCACTCGCATGTCTCCACACGAACGACCTGGACGAGTCGTCTGTAACCTTTACAGAGTGGACCGTATCCATAGTGTTGCCAGTATAAGTGCCTCCATGCCTAGCTATTCAACCTTACGTCCCTCCCCCCAGGGTCTCAAAAAAACCGAAGGGAGCCCGTCCAACCTGGGCTACGGCTGCCTAGCCTATTTTAGAGATCGGTACGACATTTCTGCGTTTATTGGGATGACGTTTCCCAGGAAGGATGAGGCTATAAACAATCCTCTTGAAGGGCACATCGCCTTTTACGAGAAAATGTTTGAGTCCGGGGTCAGATATCGTAGGCACCGAATACCTCACCCCAAGCCAATTTCTTTCCTTTTATACCGTCAAGAACCTACCAAGGAAGCCAGGTCAGTTCTACCTCACAGCTCGACAGGGTGTCGAGAAGTTGTCTGTGGATCGTCTTCCATCAAGCACTGGAAGGAGGACTGGTTCTACTTGTATGGTGGCTGGTTGGCTAAGGGTCATTTTGTCAAGCTGGCGGTTCTTTGCTAA

Coding sequence (CDS)

ATGAAGTTCGATGGGGCCAATTTCCGATATTGGAAAATGCAGATTAAGGATTACTTGACCTGCAAGATGTTGAATATAGCTTTAAAGGAACGACCAGCAGAGGTGAAAGATGAAGATTGGGTAGAGACAAATCAACAGGCAGTTGCCTTTATCAGATTATCTTTGTTGATGAATGTGGCAAGTCTCGTAGCAAACGAAATAAATGCAATAGATCTGATGAAAGCACTGGAGAATAGGTACGAGAAATCCTCAGCTAATAATAAGGTATATCTCGTAAGGAGATTCTTTAACATTCAGATGGAAGAGAACACTTCTATCAATTCCTACATAAATGAAGTCACAAAATTAATCAATCAATTGGCATCAGTCAAGATTACTTTTAGTGACAAAGTGAATGTTATTTTGTTGCTAACATCTTTATCTGAAAGCTTGGAAATGATGAAGACAACAATGTCCAATTCGTTAGGAGGAAAATCTTTGAAATTTTCAAAAAATTTTGATGCCGCAACTGCTGAAGAAATTTGCAAAAAGAGGAGTGAAAAAGAATCTACTTCCGAAAAAACTCAGCATTGGTTGTCGAAAAAGGTAAAGAGAAGGTTGCATCTGATGAAAGATAGCAGAAACATAGTAGGAGGGATTGGAAGAGAGATGCTGATTGTTTTCACTGCCATAAAAAAGAGCCACATCAAGAAAAATTGTATAATTTTAAAAGAGGATCTGAAAAGGTATATGACAGAGTCAAATGCAGTTGTAGACGATGCCCTCGTGTGTGTTGAAAGCAACACAAAAACAGGAAACCAGTATGTACCCAGTATCAGAATGAATCTTATCTTGATAGGAAAGTTGGACGATGATGATTATCAGAGTGAGTTTGGTGGGAACCAGTGGAAGGTCACCAAAGGATCCAAGTTGGTGGCAATTGGCCATAGAAGATCTACAGTTTACACGTCACAATTGAGTGTTGCCAGAAGATCTTTGAAACAACGAATACGGATAGGTGAATTGATGAAGTCGCGTCAGCGAACAGTTGCATCAGAGAAGATGAACCCAATAGGTGCTGAGGTAAAGAATAGAGTTTCAATTCCGGCAACAGGCTTGAATAGAGTTGTCAAGTCATCATGGGAGGCAAGATCGATGTTTTGTCTCCAAGTGGGAGATTGTTGGGATAATGGAGCCAAAACAGATGACCCGTGTCCTATGAGGAAAGGATTTTCAACACTACTACCAATTCCTACAAGGAGCATGGAGAATGACTTGGGAGGTCGAGGTGCAATACTTGAAGTATTCTTCGGGGGTACCTACAAGGAGTTAGCTATCCCATCGGGGTATTTCCTTGCTCGCACGGGGTCCCTTGAGCGACCTTTGGTCCTTGCTTTGGACAGGTCCTTTCCCTCCTTTGGCAGGTGGGACTTGGCGACCTGCTTCCCCTTCAGAGTAGTCGAGGTCGACTTGTTAGGTCGGTCTTTTCAATTTTGCTCGAACAAGTCGAACGTCTACAGTTTGACCCCCAATATTGGCCCCCTGTTGGGTCGGGCGACTGGTCGCTCAGTTGAGAGACGTACGGGGGGCAGTGGTCGACCTGTTCTTGACCTTGCCTCCGAGTTGGCCATGGACAACGTGGACCAATCAGATACCCTCACTCGCATGTCTCCACACGAACGACCTGGACGAGTCGTCTGTAACCTTTACAGAGTGGACCGTATCCATAGTGTTGCCAGTATAAGTGCCTCCATGCCTAGCTATTCAACCTTACGTCCCTCCCCCCAGGGTCTCAAAAAAACCGAAGGGAGCCCGTCCAACCTGGGCTACGGCTGCCTAGCCTATTTTAGAGATCGGTACGACATTTCTGCGTTTATTGGGATGACGTTTCCCAGGAAGGATGAGGCTATAAACAATCCTCTTGAAGGGCACATCGCCTTTTACGAGAAAATGTTTGAGTCCGGGGTCAGATATCGTAGGCACCGAATACCTCACCCCAAGCCAATTTCTTTCCTTTTATACCGTCAAGAACCTACCAAGGAAGCCAGGTCAGTTCTACCTCACAGCTCGACAGGGTGTCGAGAAGTTGTCTGTGGATCGTCTTCCATCAAGCACTGGAAGGAGGACTGGTTCTACTTGTATGGTGGCTGGTTGGCTAAGGGTCATTTTGTCAAGCTGGCGGTTCTTTGCTAA

Protein sequence

MKFDGANFRYWKMQIKDYLTCKMLNIALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVASLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQLASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSEKESTSEKTQHWLSKKVKRRLHLMKDSRNIVGGIGREMLIVFTAIKKSHIKKNCIILKEDLKRYMTESNAVVDDALVCVESNTKTGNQYVPSIRMNLILIGKLDDDDYQSEFGGNQWKVTKGSKLVAIGHRRSTVYTSQLSVARRSLKQRIRIGELMKSRQRTVASEKMNPIGAEVKNRVSIPATGLNRVVKSSWEARSMFCLQVGDCWDNGAKTDDPCPMRKGFSTLLPIPTRSMENDLGGRGAILEVFFGGTYKELAIPSGYFLARTGSLERPLVLALDRSFPSFGRWDLATCFPFRVVEVDLLGRSFQFCSNKSNVYSLTPNIGPLLGRATGRSVERRTGGSGRPVLDLASELAMDNVDQSDTLTRMSPHERPGRVVCNLYRVDRIHSVASISASMPSYSTLRPSPQGLKKTEGSPSNLGYGCLAYFRDRYDISAFIGMTFPRKDEAINNPLEGHIAFYEKMFESGVRYRRHRIPHPKPISFLLYRQEPTKEARSVLPHSSTGCREVVCGSSSIKHWKEDWFYLYGGWLAKGHFVKLAVLC
Homology
BLAST of Moc03g19760 vs. NCBI nr
Match: RVW14934.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 174.5 bits (441), Expect = 3.4e-39
Identity = 118/348 (33.91%), Postives = 193/348 (55.46%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNI-ALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIEDYLYGRKLHLPLLGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             + NE    DLMKAL   YEKSSANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNIVNEKTTADLMKALSGMYEKSSANNKVHLMKKLFNLKMVENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSE 181
           +SV+I F D++  +++L SL  S E M+ T+SNS G + LK++   D   AEEI ++ + 
Sbjct: 133 SSVEIDFDDEIRALIVLASLPNSWEAMRMTVSNSTGKEKLKYNDIRDLILAEEIRRRDAG 192

Query: 182 KESTSEKTQHWLSKKVKRRLHLMK-----------DSRNIVGGIGREMLI---------- 241
           + S S      L+ +++ + H  +           D  N+V    ++ L+          
Sbjct: 193 ETSGSGSA---LNLEIRGKGHFRRQCKSHKKNNEDDFANVVTEEVQDALLLAVDSPLDDW 252

Query: 242 VFTAIKKSHIKKNCIILK-----EDLKRYMTESNAVVDDALVCVESNTKTGN-------Q 301
           V  +    H   +  I++     E  K Y+ + +A+    L  V  +   G+       +
Sbjct: 253 VLDSRASFHTTPHREIIQNYAASEFGKVYLADGSALDVVGLGDVRISLPNGSVWLLEKVR 312

Query: 302 YVPSIRMNLILIGKLDDDDYQSEFGGNQWKVTKGSKLVAIGHRRSTVY 316
           ++P +R NLI +G+LDD+ +   F G  WKVTKG++++A G +  T+Y
Sbjct: 313 HIPDLRRNLISVGQLDDERHAILFVGGTWKVTKGARVLARGKKTGTLY 357

BLAST of Moc03g19760 vs. NCBI nr
Match: RVW62803.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 174.5 bits (441), Expect = 3.4e-39
Identity = 124/367 (33.79%), Postives = 197/367 (53.68%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNIA-LKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIEDYLYGRKLHLPFLGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             V  E    DLMKAL + YEK SANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNVVKEKTTADLMKALSDMYEKPSANNKVHLMKKLFNLKMAENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSE 181
           +SV+I F D++  +++L SL  S E M+  +SNS G + LK++   D   AEEI +KR  
Sbjct: 133 SSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNSTGKEKLKYNDIRDLILAEEIRRKRCR 192

Query: 182 KESTS-EKTQHWLSKKV---KRRLHLMK-----DSRNIVGGIGREMLIV----------- 241
            +S S ++ Q W   K    KR+    K     DS N V    ++ L++           
Sbjct: 193 SKSRSGQQVQCWNCGKTGHFKRQCKSPKKKNEDDSANAVTEEVQDALLLAVDNPLDDWVL 252

Query: 242 -----FTAIKKSHIKKNCIILKEDLKRYMTESNAVVDDALVCVESNTKTGN-------QY 301
                F       I +N  +  +  K Y+ + +A+    L  V  +   G+       ++
Sbjct: 253 DSGASFHTTPHREIIQN-YVASDFGKVYLADGSALDVVGLGDVRISLPNGSVWLLEKVRH 312

Query: 302 VPSIRMNLILIGKLDDDDYQSEFGGNQWKVTKGSKLVAIGHRRSTVYTSQLSVARRSLKQ 336
           +P +R NLI +G+LDD+ +   F G  WKVTKG++++A G +  T+         + L  
Sbjct: 313 IPDLRRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKTGTLLGHMSEKGMKMLLS 372

BLAST of Moc03g19760 vs. NCBI nr
Match: CAN60331.1 (hypothetical protein VITISV_024553 [Vitis vinifera])

HSP 1 Score: 174.1 bits (440), Expect = 4.4e-39
Identity = 114/331 (34.44%), Postives = 186/331 (56.19%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNI-ALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIEDYLYGRKLHLPILGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             V  E    DLMKAL   YEK SANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNVVKEKTTADLMKALSGMYEKPSANNKVHLMKKLFNLKMAENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKK--- 181
           +SV+I F D++  +++L SL  S   M+  +SNS G +  K++   D   AEEIC++   
Sbjct: 133 SSVEIDFDDEIRALIVLASLPNSWGAMRMAVSNSTGKEKFKYNDIRDLILAEEICRRDAS 192

Query: 182 ------RSEKESTSEKTQHWLSKKVKRRLHLMKDSRNIVGGIGREMLIVFTAIKKSHIKK 241
                 +S K+   + + + ++++V+  L L  DS   +     +    F       I +
Sbjct: 193 HFKRQCKSPKKKNEDDSANAVTEEVQDALLLAVDSP--LDDWVLDSGASFHTTPHREIIQ 252

Query: 242 NCIILKEDLKRYMTESNAVVDDALVCVESNTKTGN-------QYVPSIRMNLILIGKLDD 301
           N  +  +  K Y+ + +A+    L  V  +   G+       +++P +R NLI IG+LDD
Sbjct: 253 N-YVAGDFGKVYLADGSALDVVGLGDVRISLPNGSVWLLEKVRHIPDLRRNLIXIGQLDD 312

Query: 302 DDYQSEFGGNQWKVTKGSKLVAIGHRRSTVY 316
           + +   F G  WKVTKG++++A G +  T+Y
Sbjct: 313 EGHAILFVGGTWKVTKGARVLARGKKTGTLY 340

BLAST of Moc03g19760 vs. NCBI nr
Match: RVW77419.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 173.7 bits (439), Expect = 5.7e-39
Identity = 114/326 (34.97%), Postives = 184/326 (56.44%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNI-ALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIEDYLYGRKLHLPLLGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             V  E    DLMKAL   YEK SANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNVVKEKTTADLMKALSGMYEKPSANNKVHLMKKLFNLKMAENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSE 181
           +SV+I F D++  +++L SL  S E M+  +SNS G + LK++   D   +EEI ++R E
Sbjct: 133 SSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNSTGKEKLKYNDIRDLILSEEI-RRRDE 192

Query: 182 KESTSEKTQHWLSK-KVKRRLHLMKDSRNIVGGIGREML---IVFTAIKKSHIKKNCIIL 241
             S S  +    SK +  +++      + +   +   +L     F       I +N  + 
Sbjct: 193 GRSNSRNSNRNRSKSRSSQQVQCWNCGKTVDSPLDDWVLDSGASFHTTPHREIIQN-YVA 252

Query: 242 KEDLKRYMTESNAVVDDALVCVESNTKTGN-------QYVPSIRMNLILIGKLDDDDYQS 301
            +  K Y+ + +A+    L  V  +   G+       +++P +R NLI IG+LDD+ +  
Sbjct: 253 GDFGKVYLADGSALDVVGLGDVRISLPNGSVWLLEKVRHIPDLRRNLISIGQLDDEGHAI 312

Query: 302 EFGGNQWKVTKGSKLVAIGHRRSTVY 316
            F G  WKVTKG++++A G +  T+Y
Sbjct: 313 LFVGGTWKVTKGARVLACGKKTGTLY 336

BLAST of Moc03g19760 vs. NCBI nr
Match: CAN69998.1 (hypothetical protein VITISV_006839 [Vitis vinifera])

HSP 1 Score: 172.9 bits (437), Expect = 9.8e-39
Identity = 113/329 (34.35%), Postives = 182/329 (55.32%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNI-ALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG NF YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTNFAYWRMQIEDYLYGRKLHLPLLGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             V  E    DLMKAL   YEK SANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNVVKEKTTADLMKALSGMYEKPSANNKVHLMKKLFNLKMAENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSE 181
           +S++I F  ++  +++L SL  S E M+  +SNS G + LK++   D   AEEI ++ + 
Sbjct: 133 SSIEIDFDYEIRALIVLASLPNSWEAMRMAVSNSTGKEKLKYNDIRDLILAEEIRRRDAG 192

Query: 182 KESTSEKTQHWLSKKVKRR----------LHLMKDSRNIVGGIGREMLIVFTAIKKSHIK 241
           + S S      L+ + + R           +    +RN       + +  +   K  H K
Sbjct: 193 ETSGSGSA---LNLETRGRGNNRNSNXGXSNSRXSNRNRSKSRSGQQVQCWNCGKTGHFK 252

Query: 242 KNCIILK----EDLKRYMTESNAVVDDALVCVESNTKTGNQYVPSIRMNLILIGKLDDDD 301
           + C   K    +D    +TE  ++ + ++  +E       +++P +R NLI IG+LDD+ 
Sbjct: 253 RQCKSPKKKNEDDSANAVTEEISLPNGSVWLLEK-----VRHIPELRRNLISIGQLDDEG 312

Query: 302 YQSEFGGNQWKVTKGSKLVAIGHRRSTVY 316
           +   F G  WKVTKG +++A G +  T+Y
Sbjct: 313 HAILFVGGTWKVTKGXRVLAHGKKTGTLY 333

BLAST of Moc03g19760 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 7.1e-16
Identity = 103/407 (25.31%), Postives = 179/407 (43.98%), Query Frame = 0

Query: 2   KFDGAN-FRYWKMQIKDYLTCKMLNIAL---KERPAEVKDEDWVETNQQAVAFIRLSLLM 61
           KF+G N F  W+ +++D L  + L+  L    ++P  +K EDW + +++A + IRL L  
Sbjct: 10  KFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIRLHLSD 69

Query: 62  NVASLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLI 121
           +V + + +E  A  +   LE+ Y   +  NK+YL ++ + + M E T+  S++N    LI
Sbjct: 70  DVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFNGLI 129

Query: 122 NQLASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATA--EEIC 181
            QLA++ +   ++   ILLL SL  S + + TT+   L GK+    K+  +A    E++ 
Sbjct: 130 TQLANLGVKIEEEDKAILLLNSLPSSYDNLATTI---LHGKTTIELKDVTSALLLNEKMR 189

Query: 182 KK--------------RSEKESTSEKTQHWLSKKVKRRL--------------HLMKDSR 241
           KK              RS + S++   +     K K R               H  +D  
Sbjct: 190 KKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCP 249

Query: 242 NIVGG----IGREMLIVFTAIKKSHIKKNCIILKEDLKRYMT--ESNAVVDDAL------ 301
           N   G     G++      A+ +++      I +E+   +++  ES  VVD A       
Sbjct: 250 NPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHATP 309

Query: 302 -------------------------------VCVESN-----TKTGNQYVPSIRMNLILI 327
                                          +C+++N          ++VP +RMNLI  
Sbjct: 310 VRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISG 369

BLAST of Moc03g19760 vs. ExPASy TrEMBL
Match: A0A438BVC7 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1673 PE=4 SV=1)

HSP 1 Score: 174.5 bits (441), Expect = 1.6e-39
Identity = 118/348 (33.91%), Postives = 193/348 (55.46%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNI-ALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIEDYLYGRKLHLPLLGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             + NE    DLMKAL   YEKSSANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNIVNEKTTADLMKALSGMYEKSSANNKVHLMKKLFNLKMVENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSE 181
           +SV+I F D++  +++L SL  S E M+ T+SNS G + LK++   D   AEEI ++ + 
Sbjct: 133 SSVEIDFDDEIRALIVLASLPNSWEAMRMTVSNSTGKEKLKYNDIRDLILAEEIRRRDAG 192

Query: 182 KESTSEKTQHWLSKKVKRRLHLMK-----------DSRNIVGGIGREMLI---------- 241
           + S S      L+ +++ + H  +           D  N+V    ++ L+          
Sbjct: 193 ETSGSGSA---LNLEIRGKGHFRRQCKSHKKNNEDDFANVVTEEVQDALLLAVDSPLDDW 252

Query: 242 VFTAIKKSHIKKNCIILK-----EDLKRYMTESNAVVDDALVCVESNTKTGN-------Q 301
           V  +    H   +  I++     E  K Y+ + +A+    L  V  +   G+       +
Sbjct: 253 VLDSRASFHTTPHREIIQNYAASEFGKVYLADGSALDVVGLGDVRISLPNGSVWLLEKVR 312

Query: 302 YVPSIRMNLILIGKLDDDDYQSEFGGNQWKVTKGSKLVAIGHRRSTVY 316
           ++P +R NLI +G+LDD+ +   F G  WKVTKG++++A G +  T+Y
Sbjct: 313 HIPDLRRNLISVGQLDDERHAILFVGGTWKVTKGARVLARGKKTGTLY 357

BLAST of Moc03g19760 vs. ExPASy TrEMBL
Match: A0A438FS76 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1049 PE=4 SV=1)

HSP 1 Score: 174.5 bits (441), Expect = 1.6e-39
Identity = 124/367 (33.79%), Postives = 197/367 (53.68%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNIA-LKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIEDYLYGRKLHLPFLGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             V  E    DLMKAL + YEK SANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNVVKEKTTADLMKALSDMYEKPSANNKVHLMKKLFNLKMAENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSE 181
           +SV+I F D++  +++L SL  S E M+  +SNS G + LK++   D   AEEI +KR  
Sbjct: 133 SSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNSTGKEKLKYNDIRDLILAEEIRRKRCR 192

Query: 182 KESTS-EKTQHWLSKKV---KRRLHLMK-----DSRNIVGGIGREMLIV----------- 241
            +S S ++ Q W   K    KR+    K     DS N V    ++ L++           
Sbjct: 193 SKSRSGQQVQCWNCGKTGHFKRQCKSPKKKNEDDSANAVTEEVQDALLLAVDNPLDDWVL 252

Query: 242 -----FTAIKKSHIKKNCIILKEDLKRYMTESNAVVDDALVCVESNTKTGN-------QY 301
                F       I +N  +  +  K Y+ + +A+    L  V  +   G+       ++
Sbjct: 253 DSGASFHTTPHREIIQN-YVASDFGKVYLADGSALDVVGLGDVRISLPNGSVWLLEKVRH 312

Query: 302 VPSIRMNLILIGKLDDDDYQSEFGGNQWKVTKGSKLVAIGHRRSTVYTSQLSVARRSLKQ 336
           +P +R NLI +G+LDD+ +   F G  WKVTKG++++A G +  T+         + L  
Sbjct: 313 IPDLRRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKTGTLLGHMSEKGMKMLLS 372

BLAST of Moc03g19760 vs. ExPASy TrEMBL
Match: A5AK46 (Integrase catalytic domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_024553 PE=4 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 2.1e-39
Identity = 114/331 (34.44%), Postives = 186/331 (56.19%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNI-ALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIEDYLYGRKLHLPILGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             V  E    DLMKAL   YEK SANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNVVKEKTTADLMKALSGMYEKPSANNKVHLMKKLFNLKMAENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKK--- 181
           +SV+I F D++  +++L SL  S   M+  +SNS G +  K++   D   AEEIC++   
Sbjct: 133 SSVEIDFDDEIRALIVLASLPNSWGAMRMAVSNSTGKEKFKYNDIRDLILAEEICRRDAS 192

Query: 182 ------RSEKESTSEKTQHWLSKKVKRRLHLMKDSRNIVGGIGREMLIVFTAIKKSHIKK 241
                 +S K+   + + + ++++V+  L L  DS   +     +    F       I +
Sbjct: 193 HFKRQCKSPKKKNEDDSANAVTEEVQDALLLAVDSP--LDDWVLDSGASFHTTPHREIIQ 252

Query: 242 NCIILKEDLKRYMTESNAVVDDALVCVESNTKTGN-------QYVPSIRMNLILIGKLDD 301
           N  +  +  K Y+ + +A+    L  V  +   G+       +++P +R NLI IG+LDD
Sbjct: 253 N-YVAGDFGKVYLADGSALDVVGLGDVRISLPNGSVWLLEKVRHIPDLRRNLIXIGQLDD 312

Query: 302 DDYQSEFGGNQWKVTKGSKLVAIGHRRSTVY 316
           + +   F G  WKVTKG++++A G +  T+Y
Sbjct: 313 EGHAILFVGGTWKVTKGARVLARGKKTGTLY 340

BLAST of Moc03g19760 vs. ExPASy TrEMBL
Match: A0A438GYZ2 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_346 PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 2.8e-39
Identity = 114/326 (34.97%), Postives = 184/326 (56.44%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNI-ALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI+DYL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIEDYLYGRKLHLPLLGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             V  E    DLMKAL   YEK SANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNVVKEKTTADLMKALSGMYEKPSANNKVHLMKKLFNLKMAENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSE 181
           +SV+I F D++  +++L SL  S E M+  +SNS G + LK++   D   +EEI ++R E
Sbjct: 133 SSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNSTGKEKLKYNDIRDLILSEEI-RRRDE 192

Query: 182 KESTSEKTQHWLSK-KVKRRLHLMKDSRNIVGGIGREML---IVFTAIKKSHIKKNCIIL 241
             S S  +    SK +  +++      + +   +   +L     F       I +N  + 
Sbjct: 193 GRSNSRNSNRNRSKSRSSQQVQCWNCGKTVDSPLDDWVLDSGASFHTTPHREIIQN-YVA 252

Query: 242 KEDLKRYMTESNAVVDDALVCVESNTKTGN-------QYVPSIRMNLILIGKLDDDDYQS 301
            +  K Y+ + +A+    L  V  +   G+       +++P +R NLI IG+LDD+ +  
Sbjct: 253 GDFGKVYLADGSALDVVGLGDVRISLPNGSVWLLEKVRHIPDLRRNLISIGQLDDEGHAI 312

Query: 302 EFGGNQWKVTKGSKLVAIGHRRSTVY 316
            F G  WKVTKG++++A G +  T+Y
Sbjct: 313 LFVGGTWKVTKGARVLACGKKTGTLY 336

BLAST of Moc03g19760 vs. ExPASy TrEMBL
Match: A5ASH8 (gag_pre-integrs domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_040580 PE=4 SV=1)

HSP 1 Score: 172.9 bits (437), Expect = 4.7e-39
Identity = 112/322 (34.78%), Postives = 185/322 (57.45%), Query Frame = 0

Query: 2   KFDGANFRYWKMQIKDYLTCKMLNI-ALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVA 61
           KFDG +F YW+MQI++YL  + L++  L  +P  +K E+W   ++Q +  IRL+L  +VA
Sbjct: 13  KFDGTDFAYWRMQIENYLYGRKLHLPLLGTKPESMKAEEWALLDRQVLGVIRLTLSRSVA 72

Query: 62  SLVANEINAIDLMKALENRYEKSSANNKVYLVRRFFNIQMEENTSINSYINEVTKLINQL 121
             +  E   IDLMKAL   YEK SANNKV+L+++ FN++M EN S+  ++NE   + NQL
Sbjct: 73  HNIVKEKTTIDLMKALSGMYEKPSANNKVHLMKKLFNLKMAENASVAQHLNEFNTITNQL 132

Query: 122 ASVKITFSDKVNVILLLTSLSESLEMMKTTMSNSLGGKSLKFSKNFDAATAEEICKKRSE 181
           +SVKI F D+++ +++L SL  S ++M+  +SNS G + LK++   D   AEEI ++R  
Sbjct: 133 SSVKIDFDDEIHALIVLASLPNSWKVMRMAVSNSTGKEKLKYNDIRDLILAEEI-RRRDV 192

Query: 182 KESTSEKTQHWLSKKVKRRLHLMKDSRNIVGGIGREMLIVFTAIKKSHIKKNCIILKEDL 241
            E++   +   L+ + + R+    D   +  G        F       I +N  +  +  
Sbjct: 193 GETSGSGSD--LNLETRGRVDSPLDDWVLDSGAS------FHTTPHREIIQN-YVADDFG 252

Query: 242 KRYMTESNAVVDDALVCVESNTKTGN-------QYVPSIRMNLILIGKLDDDDYQSEFGG 301
           K Y+ + +A+    L  V  +   G+       +++P +R NLI +G+LDD+ +   F G
Sbjct: 253 KVYLADGSALDVVGLGDVRISLPNGSIWLLEKVRHIPDLRRNLISVGQLDDEXHAILFVG 312

Query: 302 NQWKVTKGSKLVAIGHRRSTVY 316
             WKVTKG++++A G +  T+Y
Sbjct: 313 GTWKVTKGARVLARGKKTGTLY 324

BLAST of Moc03g19760 vs. TAIR 10
Match: AT3G29785.1 (unknown protein; Has 90 Blast hits to 90 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 90; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 57.8 bits (138), Expect = 4.3e-08
Identity = 32/88 (36.36%), Postives = 51/88 (57.95%), Query Frame = 0

Query: 2  KFDGANFRYWKMQIKDYLTCKMLNIALKERPAEVKDEDWVETNQQAVAFIRLSLLMNVAS 61
          K DG ++ + +M+I+DYL  K L+  L ++   +  +DW    +Q +  IRL++  N+A 
Sbjct: 8  KVDGTSYSFCRMKIEDYLYGKKLHQPLGKKVETMSQDDWNILYRQVLDVIRLTISKNIAH 67

Query: 62 LVANEINAIDLMKALENRYEKSSANNKV 90
           VA E +   LMK L + Y+K S NN V
Sbjct: 68 NVAKEKSPDGLMKVLSDIYKKPSTNNTV 95

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RVW14934.13.4e-3933.91Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
RVW62803.13.4e-3933.79Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
CAN60331.14.4e-3934.44hypothetical protein VITISV_024553 [Vitis vinifera][more]
RVW77419.15.7e-3934.97Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
CAN69998.19.8e-3934.35hypothetical protein VITISV_006839 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
P109787.1e-1625.31Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Match NameE-valueIdentityDescription
A0A438BVC71.6e-3933.91Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438FS761.6e-3933.79Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A5AK462.1e-3934.44Integrase catalytic domain-containing protein OS=Vitis vinifera OX=29760 GN=VITI... [more]
A0A438GYZ22.8e-3934.97Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A5ASH84.7e-3934.78gag_pre-integrs domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_0... [more]
Match NameE-valueIdentityDescription
AT3G29785.14.3e-0836.36unknown protein; Has 90 Blast hits to 90 proteins in 7 species: Archae - 0; Bact... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 40..161
e-value: 2.7E-22
score: 79.0
NoneNo IPR availablePANTHERPTHR34676:SF1ZINC FINGER, CCHC-TYPE, TUBBY C-TERMINAL-LIKE DOMAIN PROTEIN-RELATEDcoord: 2..191
NoneNo IPR availablePANTHERPTHR34676FAMILY NOT NAMEDcoord: 2..191

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc03g19760.1Moc03g19760.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:1901363 heterocyclic compound binding
molecular_function GO:0097159 organic cyclic compound binding