Moc03g21560 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc03g21560
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr3: 14717292 .. 14723471 (-)
RNA-Seq ExpressionMoc03g21560
SyntenyMoc03g21560
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACACTATCAATAGGGGTATACAGTCTGGTGCCGAAAGAGACTACAGCGAAAGAATTGTTGCAGGCCTTGCAAGATAGGTATGAAAAACCTTCTGCCAATACAAAAATACTTCTGTGGACGAAGTATTTTAATATCCACATGGAGGAGGGAACCTCGGTGAATTCACACATTAATGAGCTCACCGATATCTTGAACAAATTAGAAGGGATGAGTGTCAAGATTGAGGAAGAGATGAAGGCTATGAGGCTGTTGACATCTTTGCCTGACAGTTAGGAGACGATGATGATCGCGGTGTCGAATTCGCTAGGAGAAAATAGCTTGAAATTTTCAACTATTTGTGATGCCGCCTTATCTGAAGAAGCCCGGAGAAAATTAGGGAAAATGTCTGCATCTACTTTAAGAGCAGAGCACAGAGTTGAATAAACTTTGGTAGCTTAGAACAAAGGGAAGGCAAAGATGAATTACAATGGGAAGCAGCAGCAAAGATATAGCAGGGTAGTGAGAGTTCCAGTGGAGAAGTTGAAAGTTGTTACTGCCATAAGAAGGGTCACTTCAAGAAACACGGCAGGAAGCTTAAGGAGGATCAGGAAAATGAGGACACTCTAAATTACATGTCAGCGAAGGTGTTAGCTTGTATTGAAGGTAACACAACACCTGTAGACCAGTCATCAGAGTGGGTAGTGGACAGTGCAGCTTCGGTGCATGTATCTTCAGACAGGAGTTGGTTCACGTCCTTTACTACAGGAAATCATGGTGCGGTGAGGATGGGAAATGAGAAACTCTCCAAGATCAGAGGACTTGGGGATGTTCATTTGAAGACTGACAGTGGGACCGAGTGAATATTGTATGATGTGAGGTATGTACCGAGTCTCAGGATGAATTTGATATCAGCGGGGAAGCTGGATGACGAATGCTACAGAAGTGAGTTTGGAGAGAAAAGATGGAAACTCATAAGGGGATCTGAGGTAGTGGTTGTTGGCCACAGAAAAAGCTTCAGTGTATGTGTTGAGGTTTGGTGTTGCCAGAGGATTAGAGAGACGGATTATGCACAAGGCTGCAGATAGTTTAGGGGGAGACTTTAAAAAACTAGCAGCATTGACAGCCGAGACAGATCAAGAGAATATGCCATCAATTCAAGTACAACAGCTAGGAAGTAGAGGAAAGGGAAAGGGGAACAGCTCAGTGAGGTGTTCAACAAACTGTCAGTTTTGAGCCCCAGTTGTCGGACGGATTAGCGAGCTGATGAAGTCGCATAGGCGAAAGAGTGCATCGAGAAATACTACAATTGGCGTTGTGGTCGAGGGTGAATTCTCTAAGGTGGCAACGGACTTTGGTGGGAGTGTCAAGTCATCAGTGAAAGGGCTTTCCTTCAAAAGTCGTTGGGTGCAAGTGAAGAAGGAAATGTCAGGAACCATTTAGTTCGAGTGGGAGTATGTGCCTGTGTCTCTTACGTCTAGTAGACAAGACCACGTAGGATTGCTCAGTCTCAGGGCAATCACAGATGAAGTCTTTGTTGGTGCCAAGAAGATGTTAGAAGCTGTTGGTGTAGTGGGAGTCGAGTCTTGGTTGGTGCCAAAAGGACGTTGATCAGTTGGAAGCTGTTGGTGCAATTAGAACTGAGTCATGACAGACACTATAGTGGGAGACGAGATTGTTGTTTTGTTTCCAAGTGGGCGATTGTTGGGAAAACAAAATTGAAAAATAAGACAAGAATGAAAAAATAAAAGGTTTCAATTTTTGGGCCAAATAAAAGAAAAGGCCCAAACCTAATTTTTACCTTCTTCACCAAATTTGTTTTTTAGAAGGATATATAGTGGTGAGTGAAAATTAAAAGAGAGAGAGAGAGAATTTAGTTGCGTTGTTGCACAACATTAAAGGGGGCGGCAGTGGTGCTTTTGGCAGAGAAGAAAAATGAGACTATGGATTTTGTAGTGTGTCATCGGCAGTCCGTTTTCTTGAGGTTAGTGAGGTTTTTCATTCTGTATTCTTAAGTAGTTATTGTCGTAGTGCTCAGAAACTAGTGACTATTGTTGTGGCATCTAAAGAGAACTTTGTAAAGCGTTAGTGAGACTCATTGTATTTATCCCAATATCATAGTAGAATTTGTTGTCACGGAAGACTGGAGTCCATAGTTGGGTGAACCAATATAAATTTGCTTGTTTCTGTGCTTTCCCTATTTTTATTATTATTGTGTGTGACTTAGTTTTTATTGACGAACTGTGGAAGGAGAAAATATTATTGGTAAGTAAATTTTCCAAACACAGACATAAACTAAAATGTTAATAAAATTTATTTTGTATAGATATATTATAATTTAGAGGCATTTATATAAATAACCTAGTATAAATATATTTCTCAAGAGAGAAAGCCAACCGCGCGCGACCTCTGGCACATTCAACACAGTGTTCTAACTCATGAGTTTATAGAAAAATGTTTTTTTAGTCTAAAAAGTGGAGTTGGGAGATTCAAACTCCCGACTTCTTAGTCAAAAATATGTGTCAATTACTGTTGAGTTAAGTTCATATTCGTAACAAAATGTTTGAAATTTTACAAATATATATATATATAAAAAGTCATTTTACACTGTAGTAAAATTTCTCAACAATTTGCATCTTAAGATGCATGATTTCAATTAGGATAAGAGACGGACAATTTTATTTGAGCTAATCATGATGTCGTGGCTAACATTCGCCTAATTGTTAGATATATGTCACTATTTTTATTATTTTCAATTGAAGATCATATAATAATTGTGGTTAAATCAAAGTTTAGTCAACCATTAACGTGGCCTAATAATTAATCAAAGAAATGGTCACAAATTTAATCAAGACTAGAGTTCTAATCATAACTAATAAATCAAAGTACTTTTGGACCATTTAAATTCAAAAGTGCTACATACATTTTATGAAGGAAAATGTTGGGGAAGTTGAAAGCTCTGTTAGAGTAGTCGTTGAACCAAATTTTCTACTCCTTTATTAAATGATGTGCCATAAACGCAGTTTAAGATTTCGTTCACAATTTTAAAGTCTTACGTTTATGGTGTTAGAGTTTAGAAGTCTGAATTTGTATAATTTATATTGTTTATGCAAGGATATGCACAACAATGTATTTCAGATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAGAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAATCATCGTACTTGATCGTGGAGTGAGGTCCTCTACTTATGGGTGGTAGTCCCGCTAATCTCGCAACGGTTATACCCGGTAATCTCGGGACCATCGGTTATATCCGGTAATCTCGGGACCCACGGTTACACCCGATAATCACGCCGCTGACAGTAGCTCACATCGGCCCTTACCGAGCTTCCCGGTAGGTCGGACCTCGGCCAGGTTCACCTCGGCCCTCATACTTAGCATCTGTCAACACTAGTGGTGGTGATCCCGGCGGCCCGAGCTGGACATGACTCATGAGTCATCTTGGAGCACCAATAGGGGTCCTCCACGTGTCCAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCTGAGAAGTTCATTCGACTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTTGTGGTAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAACCGCTCGTTGATTACACGTGTACGGTGGGTAAATCTTTCCGACGAGCTATAAATACCTCCAATCCTTCAGGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGTAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGACAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGGCAATCCTCCAGAGGAATGGGTCACCCTCTACTTCAAAATGTTTGAGTACAGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTAGACGTAGACCAGCTTCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATATGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGCCCGACCTCTATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGAAACCTAGGTACGGAGTAGTTCTTACTTCTTGTGATAAGTTTCTTTCTTGTTATTCAGGTCTGACTCAAATTTTTGTCTTTGTGCAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGTGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACTGACCAGCTGCTGCTCGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGACCGAACTCCGAACTTGGTATGCCGAGCTTGTTCCCCCCTTTTTTCCTCTAACTTTGTGTTGACTCCTGTTTTTGTTTTGCAGCCATGGTTTGTGGATTTGCGAGCAACGTGAAGCGCAAGTATCGACTACGCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGGGGTGAGGGAGGAAGTCCCTCTGAAGTGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCACAGATCGGGTGGACGATCCTAAGGCCAGGATGAGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGCTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCTGCCGAGGTAAGACTAGTGTCTCCATTTTTGCTTAATTTACCTAACAGGTAGCTCGGTCTAACTTCTTATTTGTATCTTTTCTCAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTACTGAAGGCTCACTCTGAGGTGGAAACTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATTACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCATTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACGTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCGTGGCTCCCAAGCGTTGGTGGATAAGTACGTCGGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGACACCACTCAAGAGGGCGTTTCTCAAGCAGGTTCTTAG

mRNA sequence

ATGACACTATCAATAGGGGTATACAGTCTGGTGCCGAAAGAGACTACAGCGAAAGAATTGTTGCAGGCCTTGCAAGATAGGTATGAAAAACCTTCTGCCAATACAAAAATACTTCTGTGGACGAAGTATTTTAATATCCACATGGAGGAGGGAACCTCGGTGAATTCACACATTAATGAGCTCACCGATATCTTGAACAAATTAGAAGGGATGAGTGTCAAGATTGAGGAAGAGATGAAGGCTATGAGGCTGTTGACATCTTTGCCTGACAAAAAAGCTTCAGTGTATGTGTTGAGGTTTGGTGTTGCCAGAGGATTAGAGAGACGGATTATGCACAAGGCTGCAGATAGTTTAGGGGGAGACTTTAAAAAACTAGCAGCATTGACAGCCGAGACAGATCAAGAGAATATGCCATCAATTCAAGTACAACAGCTAGGAAGTAGAGGAAAGGGAAAGGGGAACAGCTCAGTGAGACAAGACCACGTAGGATTGCTCAGTCTCAGGGCAATCACAGATGAAGTCTTTGTTGGTGCCAAGAAGATGTTAGAAGCTGTTGGTGTAGTGGGAGTCGAGTCTTGTGCTCAGAAACTAGTGACTATTGTTGTGGCATCTAAAGAGAACTTTATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAGAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCTGAGAAGTTCATTCGACTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTTGTGGTAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAACCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGACAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGGCAATCCTCCAGAGGAATGGGTCACCCTCTACTTCAAAATGTTTGAGTACAGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTAGACGTAGACCAGCTTCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATATGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGCCCGACCTCTATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGAAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGTGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACTGACCAGCTGCTGCTCGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGACCGAACTCCGAACTTGCCATGGTTTGTGGATTTGCGAGCAACGTGAAGCGCAAGTATCGACTACGCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGGGATCGGGTGGACGATCCTAAGGCCAGGATGAGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGCTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCTGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTACTGAAGGCTCACTCTGAGGTGGAAACTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATTACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCATTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACGTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCGTGGCTCCCAAGCGTTGGTGGATAAGTACGTCGGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGACACCACTCAAGAGGGCGTTTCTCAAGCAGGTTCTTAG

Coding sequence (CDS)

ATGACACTATCAATAGGGGTATACAGTCTGGTGCCGAAAGAGACTACAGCGAAAGAATTGTTGCAGGCCTTGCAAGATAGGTATGAAAAACCTTCTGCCAATACAAAAATACTTCTGTGGACGAAGTATTTTAATATCCACATGGAGGAGGGAACCTCGGTGAATTCACACATTAATGAGCTCACCGATATCTTGAACAAATTAGAAGGGATGAGTGTCAAGATTGAGGAAGAGATGAAGGCTATGAGGCTGTTGACATCTTTGCCTGACAAAAAAGCTTCAGTGTATGTGTTGAGGTTTGGTGTTGCCAGAGGATTAGAGAGACGGATTATGCACAAGGCTGCAGATAGTTTAGGGGGAGACTTTAAAAAACTAGCAGCATTGACAGCCGAGACAGATCAAGAGAATATGCCATCAATTCAAGTACAACAGCTAGGAAGTAGAGGAAAGGGAAAGGGGAACAGCTCAGTGAGACAAGACCACGTAGGATTGCTCAGTCTCAGGGCAATCACAGATGAAGTCTTTGTTGGTGCCAAGAAGATGTTAGAAGCTGTTGGTGTAGTGGGAGTCGAGTCTTGTGCTCAGAAACTAGTGACTATTGTTGTGGCATCTAAAGAGAACTTTATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAGAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCTGAGAAGTTCATTCGACTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTTGTGGTAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAACCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGACAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGGCAATCCTCCAGAGGAATGGGTCACCCTCTACTTCAAAATGTTTGAGTACAGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTAGACGTAGACCAGCTTCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATATGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGCCCGACCTCTATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGAAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGTGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACTGACCAGCTGCTGCTCGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGACCGAACTCCGAACTTGCCATGGTTTGTGGATTTGCGAGCAACGTGAAGCGCAAGTATCGACTACGCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGGGATCGGGTGGACGATCCTAAGGCCAGGATGAGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGCTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCTGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTACTGAAGGCTCACTCTGAGGTGGAAACTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATTACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCATTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACGTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCGTGGCTCCCAAGCGTTGGTGGATAAGTACGTCGGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGACACCACTCAAGAGGGCGTTTCTCAAGCAGGTTCTTAG

Protein sequence

MTLSIGVYSLVPKETTAKELLQALQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELTDILNKLEGMSVKIEEEMKAMRLLTSLPDKKASVYVLRFGVARGLERRIMHKAADSLGGDFKKLAALTAETDQENMPSIQVQQLGSRGKGKGNSSVRQDHVGLLSLRAITDEVFVGAKKMLEAVGVVGVESCAQKLVTIVVASKENFIAARTRPPDRSEYLGGPAQKGEHSDDQVSIGRIPSLVRGYSLPQTLAPSLSGPISTWLRSSFDLLWTRGDFLFVGKYNRCGRFIVGIFKYSDASDLREDPNRSLITRLEPLVGRSLPSLSLSNVVAMSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPDNILLRIPEEGERAGNPPEEWVTLYFKMFEYSLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKYRLRCCGRASLGRSSPSDRAGVFWGSFEGEAPQGSDRGGGRLALGRGDRVDDPKARMSGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDVQIDLGGLKKRYAEQWASGPSGTRGSQALVDKYVGDLDSDYSDLEEDQVDTTQEGVSQAGS
Homology
BLAST of Moc03g21560 vs. NCBI nr
Match: XP_022159252.1 (uncharacterized protein LOC111025665 [Momordica charantia])

HSP 1 Score: 575.9 bits (1483), Expect = 6.8e-160
Identity = 333/533 (62.48%), Postives = 385/533 (72.23%), Query Frame = 0

Query: 508 ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPEL 567
           +CARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL
Sbjct: 1   MCARKGTGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDVPTRFGNLVSIKLIPEL 60

Query: 568 TQASFDTLKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAVRPIESSRPNSELAMVCGF 627
            QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP VR IE+SRPNSELAMVCGF
Sbjct: 61  AQATFDTLKHYKDHFPRDRKIVTLVTDKLLLESGLLDYNPLVRLIEASRPNSELAMVCGF 120

Query: 628 ASNVKRKYRLRCCGRASL----------------GRSSPS----------DRAGVFWG-- 687
             +VKRK + R     ++                G S PS          D +G   G  
Sbjct: 121 TGSVKRKSKGRAHALKTVVGTEPVTPTVPRTXAQGNSGPSSAVPTPVIELDLSGGRSGEK 180

Query: 688 ---------------SFEGEAP-----------QGSDRGG-GRLALGRGDRVDDPKARMS 747
                             GE+P             S+ G  G L     D VDDP+ARM 
Sbjct: 181 RSREESEALDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADLVDDPEARMR 240

Query: 748 GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV 807
           GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+
Sbjct: 241 GTSNVRMRFGMEPSSSGVKDQVSRISATCLDRYLRRASKFVSDPGSVLQRTIDNVAEAFI 300

Query: 808 ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVETLKAEVETKA 867
           ASI  A+ VKAELDGRE LAA+E+E   AALEAA++ +K ELLKA  EV+ L+AEV+ K 
Sbjct: 301 ASIHLAVMVKAELDGREALAAKERENSFAALEAATT-LKGELLKAQGEVDILRAEVDAKV 360

Query: 868 ELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVK 927
           +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +K
Sbjct: 361 DLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTTELKDLK 420

Query: 928 ERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDVQIDLGGLKKRYAEQW 977
           ERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP +QIDL GLKK+Y+E+W
Sbjct: 421 ERLTNGTLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLNGLKKKYSEKW 480

BLAST of Moc03g21560 vs. NCBI nr
Match: XP_022159063.1 (uncharacterized protein LOC111025502, partial [Momordica charantia])

HSP 1 Score: 574.3 bits (1479), Expect = 2.0e-159
Identity = 287/322 (89.13%), Postives = 296/322 (91.93%), Query Frame = 0

Query: 335 MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGS 394
           MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGS
Sbjct: 1   MSSSISSNL--ESDLARRLESKLEEIENXRISDDGEDSDASTSGQGLEYPSRIPEHYLGS 60

Query: 395 LRRGFAIPDNILLRIPEEGERAGNPPEEWVTLYFKMFEYSLRLPLHPFVQEFLFRTGLAP 454
           LRRGFAIP+NILLR+PEEGERA NPPE WVTLYFKMFEY LRLPLHPFVQEFLFRTGLAP
Sbjct: 61  LRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAP 120

Query: 455 AQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGA 514
           AQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFY+CARKGA
Sbjct: 121 AQVAPNGWGVIFALAILFWLRARDSEEAELXDVDQLLACFEAKRIAKKPGRFYMCARKGA 180

Query: 515 GGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT 574
           GGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT
Sbjct: 181 GGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT 240

Query: 575 LKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK 634
           LKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Sbjct: 241 LKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRK 300

Query: 635 YRLRCCGRASLGRSSPSDRAGV 657
            + R     +   S P+  A V
Sbjct: 301 SKGRAHALEAAQSSKPATPAVV 320

BLAST of Moc03g21560 vs. NCBI nr
Match: XP_022150343.1 (uncharacterized protein LOC111018538 [Momordica charantia])

HSP 1 Score: 456.4 bits (1173), Expect = 6.0e-124
Identity = 248/285 (87.02%), Postives = 265/285 (92.98%), Query Frame = 0

Query: 693 GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV 752
           G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFV
Sbjct: 16  GEQRILAKDRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFV 75

Query: 753 ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVETLKAEVETKA 812
           ASIQSALAVKAELDGREVLAAREKEEFSAALE ASS MKDELLKAHSEVETLKAEVE++A
Sbjct: 76  ASIQSALAVKAELDGREVLAAREKEEFSAALETASSTMKDELLKAHSEVETLKAEVESQA 135

Query: 813 ELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVK 872
           ELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET K
Sbjct: 136 ELLKKEEDRRQAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK 195

Query: 873 ERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDVQIDLGGLKKRYAEQW 932
           ERLSNG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPD+QIDL GLK+RYAE+W
Sbjct: 196 ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKW 255

Query: 933 ASGPSGTRGSQALVDKYVGDLDSDYSDLEEDQVDTTQEGVSQAGS 978
           ASGP GT G QALVD+YV DLDSDYSD EEDQV +TQEG S  GS
Sbjct: 256 ASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGASPTGS 300

BLAST of Moc03g21560 vs. NCBI nr
Match: XP_022144034.1 (uncharacterized protein LOC111013826 [Momordica charantia])

HSP 1 Score: 409.5 bits (1051), Expect = 8.4e-110
Identity = 202/227 (88.99%), Postives = 206/227 (90.75%), Query Frame = 0

Query: 430 MFEYSLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ 489
           MFEY LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ
Sbjct: 1   MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ 60

Query: 490 LLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF 549
           LLACFEAKRIAKKPGRFY+CARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF
Sbjct: 61  LLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF 120

Query: 550 DVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAV 609
           DVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV
Sbjct: 121 DVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV 180

Query: 610 RPIESSRPNSELAMVCGFASNVKRKYRLRCCGRASLGRSSPSDRAGV 657
           RPIE SRPNS LAMVC FAS VKRK + R     +   S P   A V
Sbjct: 181 RPIEXSRPNSXLAMVCRFASGVKRKSKGRAHALEAAQSSKPPTPAVV 227

BLAST of Moc03g21560 vs. NCBI nr
Match: XP_022158122.1 (uncharacterized protein LOC111024680 [Momordica charantia])

HSP 1 Score: 392.9 bits (1008), Expect = 8.1e-105
Identity = 188/192 (97.92%), Postives = 190/192 (98.96%), Query Frame = 0

Query: 430 MFEYSLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ 489
           MFEY LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ
Sbjct: 1   MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ 60

Query: 490 LLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF 549
           LLACFEAKRIAKKPGRFY+CARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF
Sbjct: 61  LLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF 120

Query: 550 DVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAV 609
           DVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV
Sbjct: 121 DVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV 180

Query: 610 RPIESSRPNSEL 622
           RPIESSRPNSEL
Sbjct: 181 RPIESSRPNSEL 192

BLAST of Moc03g21560 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 56.6 bits (135), Expect = 1.8e-06
Identity = 32/87 (36.78%), Postives = 49/87 (56.32%), Query Frame = 0

Query: 3   LSIGVYSLVPKETTAKELLQALQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT 62
           LS  V + +  E TA+ +   L+  Y   +   K+ L  + + +HM EGT+  SH+N   
Sbjct: 67  LSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFN 126

Query: 63  DILNKLEGMSVKIEEEMKAMRLLTSLP 90
            ++ +L  + VKIEEE KA+ LL SLP
Sbjct: 127 GLITQLANLGVKIEEEDKAILLLNSLP 153

BLAST of Moc03g21560 vs. ExPASy TrEMBL
Match: A0A6J1DZB3 (uncharacterized protein LOC111025665 OS=Momordica charantia OX=3673 GN=LOC111025665 PE=4 SV=1)

HSP 1 Score: 575.9 bits (1483), Expect = 3.3e-160
Identity = 333/533 (62.48%), Postives = 385/533 (72.23%), Query Frame = 0

Query: 508 ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPEL 567
           +CARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL
Sbjct: 1   MCARKGTGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDVPTRFGNLVSIKLIPEL 60

Query: 568 TQASFDTLKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAVRPIESSRPNSELAMVCGF 627
            QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP VR IE+SRPNSELAMVCGF
Sbjct: 61  AQATFDTLKHYKDHFPRDRKIVTLVTDKLLLESGLLDYNPLVRLIEASRPNSELAMVCGF 120

Query: 628 ASNVKRKYRLRCCGRASL----------------GRSSPS----------DRAGVFWG-- 687
             +VKRK + R     ++                G S PS          D +G   G  
Sbjct: 121 TGSVKRKSKGRAHALKTVVGTEPVTPTVPRTXAQGNSGPSSAVPTPVIELDLSGGRSGEK 180

Query: 688 ---------------SFEGEAP-----------QGSDRGG-GRLALGRGDRVDDPKARMS 747
                             GE+P             S+ G  G L     D VDDP+ARM 
Sbjct: 181 RSREESEALDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADLVDDPEARMR 240

Query: 748 GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV 807
           GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+
Sbjct: 241 GTSNVRMRFGMEPSSSGVKDQVSRISATCLDRYLRRASKFVSDPGSVLQRTIDNVAEAFI 300

Query: 808 ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVETLKAEVETKA 867
           ASI  A+ VKAELDGRE LAA+E+E   AALEAA++ +K ELLKA  EV+ L+AEV+ K 
Sbjct: 301 ASIHLAVMVKAELDGREALAAKERENSFAALEAATT-LKGELLKAQGEVDILRAEVDAKV 360

Query: 868 ELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVK 927
           +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +K
Sbjct: 361 DLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTTELKDLK 420

Query: 928 ERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDVQIDLGGLKKRYAEQW 977
           ERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP +QIDL GLKK+Y+E+W
Sbjct: 421 ERLTNGTLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLNGLKKKYSEKW 480

BLAST of Moc03g21560 vs. ExPASy TrEMBL
Match: A0A6J1DXS5 (uncharacterized protein LOC111025502 OS=Momordica charantia OX=3673 GN=LOC111025502 PE=4 SV=1)

HSP 1 Score: 574.3 bits (1479), Expect = 9.5e-160
Identity = 287/322 (89.13%), Postives = 296/322 (91.93%), Query Frame = 0

Query: 335 MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGS 394
           MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGS
Sbjct: 1   MSSSISSNL--ESDLARRLESKLEEIENXRISDDGEDSDASTSGQGLEYPSRIPEHYLGS 60

Query: 395 LRRGFAIPDNILLRIPEEGERAGNPPEEWVTLYFKMFEYSLRLPLHPFVQEFLFRTGLAP 454
           LRRGFAIP+NILLR+PEEGERA NPPE WVTLYFKMFEY LRLPLHPFVQEFLFRTGLAP
Sbjct: 61  LRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAP 120

Query: 455 AQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGA 514
           AQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFY+CARKGA
Sbjct: 121 AQVAPNGWGVIFALAILFWLRARDSEEAELXDVDQLLACFEAKRIAKKPGRFYMCARKGA 180

Query: 515 GGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT 574
           GGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT
Sbjct: 181 GGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT 240

Query: 575 LKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK 634
           LKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Sbjct: 241 LKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRK 300

Query: 635 YRLRCCGRASLGRSSPSDRAGV 657
            + R     +   S P+  A V
Sbjct: 301 SKGRAHALEAAQSSKPATPAVV 320

BLAST of Moc03g21560 vs. ExPASy TrEMBL
Match: A0A6J1D971 (uncharacterized protein LOC111018538 OS=Momordica charantia OX=3673 GN=LOC111018538 PE=4 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 2.9e-124
Identity = 248/285 (87.02%), Postives = 265/285 (92.98%), Query Frame = 0

Query: 693 GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV 752
           G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFV
Sbjct: 16  GEQRILAKDRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFV 75

Query: 753 ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVETLKAEVETKA 812
           ASIQSALAVKAELDGREVLAAREKEEFSAALE ASS MKDELLKAHSEVETLKAEVE++A
Sbjct: 76  ASIQSALAVKAELDGREVLAAREKEEFSAALETASSTMKDELLKAHSEVETLKAEVESQA 135

Query: 813 ELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVK 872
           ELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET K
Sbjct: 136 ELLKKEEDRRQAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK 195

Query: 873 ERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDVQIDLGGLKKRYAEQW 932
           ERLSNG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPD+QIDL GLK+RYAE+W
Sbjct: 196 ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKW 255

Query: 933 ASGPSGTRGSQALVDKYVGDLDSDYSDLEEDQVDTTQEGVSQAGS 978
           ASGP GT G QALVD+YV DLDSDYSD EEDQV +TQEG S  GS
Sbjct: 256 ASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGASPTGS 300

BLAST of Moc03g21560 vs. ExPASy TrEMBL
Match: A0A6J1CR42 (uncharacterized protein LOC111013826 OS=Momordica charantia OX=3673 GN=LOC111013826 PE=4 SV=1)

HSP 1 Score: 409.5 bits (1051), Expect = 4.1e-110
Identity = 202/227 (88.99%), Postives = 206/227 (90.75%), Query Frame = 0

Query: 430 MFEYSLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ 489
           MFEY LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ
Sbjct: 1   MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ 60

Query: 490 LLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF 549
           LLACFEAKRIAKKPGRFY+CARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF
Sbjct: 61  LLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF 120

Query: 550 DVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAV 609
           DVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV
Sbjct: 121 DVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV 180

Query: 610 RPIESSRPNSELAMVCGFASNVKRKYRLRCCGRASLGRSSPSDRAGV 657
           RPIE SRPNS LAMVC FAS VKRK + R     +   S P   A V
Sbjct: 181 RPIEXSRPNSXLAMVCRFASGVKRKSKGRAHALEAAQSSKPPTPAVV 227

BLAST of Moc03g21560 vs. ExPASy TrEMBL
Match: A0A6J1DWD2 (uncharacterized protein LOC111024680 OS=Momordica charantia OX=3673 GN=LOC111024680 PE=4 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 3.9e-105
Identity = 188/192 (97.92%), Postives = 190/192 (98.96%), Query Frame = 0

Query: 430 MFEYSLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ 489
           MFEY LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ
Sbjct: 1   MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ 60

Query: 490 LLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF 549
           LLACFEAKRIAKKPGRFY+CARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF
Sbjct: 61  LLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFF 120

Query: 550 DVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDQLLLESGLLDYNPAV 609
           DVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV
Sbjct: 121 DVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV 180

Query: 610 RPIESSRPNSEL 622
           RPIESSRPNSEL
Sbjct: 181 RPIESSRPNSEL 192

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022159252.16.8e-16062.48uncharacterized protein LOC111025665 [Momordica charantia][more]
XP_022159063.12.0e-15989.13uncharacterized protein LOC111025502, partial [Momordica charantia][more]
XP_022150343.16.0e-12487.02uncharacterized protein LOC111018538 [Momordica charantia][more]
XP_022144034.18.4e-11088.99uncharacterized protein LOC111013826 [Momordica charantia][more]
XP_022158122.18.1e-10597.92uncharacterized protein LOC111024680 [Momordica charantia][more]
Match NameE-valueIdentityDescription
P109781.8e-0636.78Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Match NameE-valueIdentityDescription
A0A6J1DZB33.3e-16062.48uncharacterized protein LOC111025665 OS=Momordica charantia OX=3673 GN=LOC111025... [more]
A0A6J1DXS59.5e-16089.13uncharacterized protein LOC111025502 OS=Momordica charantia OX=3673 GN=LOC111025... [more]
A0A6J1D9712.9e-12487.02uncharacterized protein LOC111018538 OS=Momordica charantia OX=3673 GN=LOC111018... [more]
A0A6J1CR424.1e-11088.99uncharacterized protein LOC111013826 OS=Momordica charantia OX=3673 GN=LOC111013... [more]
A0A6J1DWD23.9e-10597.92uncharacterized protein LOC111024680 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 58..78
NoneNo IPR availableCOILSCoilCoilcoord: 798..821
NoneNo IPR availableCOILSCoilCoilcoord: 840..874
NoneNo IPR availablePFAMPF04195Transposase_28coord: 426..468
e-value: 1.5E-6
score: 28.3
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 8..91
e-value: 1.1E-20
score: 73.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 676..690
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 957..977
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 662..694
NoneNo IPR availablePANTHERPTHR34676:SF1ZINC FINGER, CCHC-TYPE, TUBBY C-TERMINAL-LIKE DOMAIN PROTEIN-RELATEDcoord: 13..92
NoneNo IPR availablePANTHERPTHR34676FAMILY NOT NAMEDcoord: 13..92

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc03g21560.1Moc03g21560.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0043167 ion binding