Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTCCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGAGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCATGCAAATGCGCTCCATGGAGGCGATGTATAACGAAATGGTGCTAGCTGCAGGCGCGGGGTCCCGATCTGAAAATCGGGCGACGCGCATGGACGTACGCGAGCAAAGGGGCTCCCACCTCGGCCCGGTCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGGCCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCGCCATCCCGCTCCCACATGAGCTCCAACCAGCAGGCTAAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAGTTCGATCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCTAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAGTCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGTATAAGACGCAATCAAATGCCGCGCCTTCCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTTTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGATTGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCTAAGTCCAAGGACAAAGGATCTTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCAACCCTACGAGCGCTTCACCCCAACAACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTTCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTGATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCGGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATGCCATCTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCCAACATCCTGTCCTTATCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGATTCTCTAGAGAGTCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCATGGTAGTTGACGGTAGGTCGACCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAACAAGCCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGATTCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCGCCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGAACGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATCGACCCGCGAATTATGATGCATCGCCTCAGCATAGATCCATCATTCCGACCTGTGAAACAAAAGAGAAGACCTATAAACAAGGAGATGAGTGATGTAATTGTTGAGGAAGTTAACAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGACTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACGCCACAGCTGGGCACGAACTGCTCACTTTCATAAACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTTATAACAGACCAAGGTCTGTACTACTACAAGGTCATGCCCTTCGGGTTAAAAAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGACCTGGCCGAAGCCTTCGAGGTTCTGAGGGCATATCAAATGAAGCTCAACCCTGCTAAGTGTGCCTTTGGAGTCTCCTCGGGAAAATTCCTTGGCTTCATGGTAAACAACCGGGGAATCGAGACCAACCCTGAAAAGATTAAAGCCGTGATCGAGATGGAGGCACCTAAAACGCTGAAACAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGCTTCAAGGTCAACAGACAAGTGCCTCCCATTCTTCAAGATCTTACGAAAGAAAGGACCGTTTGAATGGACAGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAACTACCTCTGTTCGGCACCCCTGCTTGCCAAGCCTATGCCGGGGGACAAGCTCCAATTGTACCTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACTAGATACCCTCGGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGACTTAGACCATACTTCCAAGCCCATACGGTGGTGGTGCTCACTAACCTGCCCCTTAAGAGTATCTTCCACAAGCCGGAAGCTTCCGGACGCCTGATGAAGTGGGCCATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGGCAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCAGAGCTGAGCGGGTCCGACCTGCCTTGGACAGTCTATGTCGACAGATCCTCCAATGAGAAGGGGTGCGGGGCCGAGGTCCTCTTGCTCGGACCAGGGGTTGAGCGATTTGAGTACGCCTTGCGGTTCAGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGCCTGCGAGTCGCTCAAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAGCCAGATCAAGGACGAGTACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGTAAGGTCAGGTCATACCTAGCCCAGTTTCGAACTTACGAAGTAAGCCGGATTCCGCAGGCAGAGAATTCTAATGCTGACGCCTTGGCCAAGTTAGTATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCCGAACTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGAACTTCATTACGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACGGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGTAGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCGGTGGTTGCTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAAGCGCTCTCCCACATAACGGAATCCAGGGTCACGTCCTTCGTATGGACGAGCATCATATGTCGCTTTGGTATACCACAGGCCATAGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCACCTCAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCGGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCCGAAGTTCTATGGTCGTACCGGACCACCCAACGGGGGTCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCAGTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTGGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTATTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATTGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTCCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGAGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCATGCAAATGCGCTCCATGGAGGCGATGTATAACGAAATGGTGCTAGCTGCAGGCGCGGGGTCCCGATCTGAAAATCGGGCGACGCGCATGGACGTACGCGAGCAAAGGGGCTCCCACCTCGGCCCGGTCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGGCCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCGCCATCCCGCTCCCACATGAGCTCCAACCAGCAGGCTAAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAGTTCGATCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCTAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAGTCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGCTGAGAAGGGAGTTCCTCGCCCAGTTTTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCTAAGTCCAAGGACAAAGGATCTTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCAACCCTACGAGCGCTTCACCCCAACAACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTTCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTGATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCGGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATGCCATCTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCCAACATCCTGTCCTTATCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGATTCTCTAGAGAGTCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCATGGTAGTTGACGGTAGGTCGACCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAACAAGCCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGATTCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCGCCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGTATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCCGAACTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGAACTTCATTACGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACGGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTGGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTATTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATTGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTCCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGAGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCATGCAAATGCGCTCCATGGAGGCGATGTATAACGAAATGGTGCTAGCTGCAGGCGCGGGGTCCCGATCTGAAAATCGGGCGACGCGCATGGACGTACGCGAGCAAAGGGGCTCCCACCTCGGCCCGGTCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGGCCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCGCCATCCCGCTCCCACATGAGCTCCAACCAGCAGGCTAAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAGTTCGATCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCTAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAGTCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGCTGAGAAGGGAGTTCCTCGCCCAGTTTTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCTAAGTCCAAGGACAAAGGATCTTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCAACCCTACGAGCGCTTCACCCCAACAACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTTCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTGATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCGGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATGCCATCTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCCAACATCCTGTCCTTATCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGATTCTCTAGAGAGTCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCATGGTAGTTGACGGTAGGTCGACCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAACAAGCCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGATTCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCGCCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGTATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCCGAACTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGAACTTCATTACGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACGGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTGGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTATTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATTGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEEQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSEDFDALQREMEAMRMQMRSMEAMYNEMVLAAGAGSRSENRATRMDVREQRGSHLGPVEEERPEDNGSEGYTRQRGGLREHLNRKRGSSLRKGQSPSRSHMSSNQQAKSSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQALRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSQPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQNQTRITQMAEFMVVDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETLRDGILEFEADLPRKEFAAPTEELELVPLLSPEKQLVSAYETDLARSVPVEILDNPSILEPELMEIGAPEPSWMDPIANFITGNSPQDPKERRKLARRAARFVIRDGALYRRGFSLPLLKCLTPEEGLVEHYEPSTNEEELLLNLDLWEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP
Homology
BLAST of Moc08g32590 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 874.0 bits (2257), Expect = 1.2e-249
Identity = 475/637 (74.57%), Postives = 508/637 (79.75%), Query Frame = 0
Query: 186 MSSNQQAKSSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPF 245
MSSNQQA+SSHNPA G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGESPF
Sbjct: 1 MSSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPF 60
Query: 246 TSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQALRREFLAQFSSRHYDKKT 305
TSDVLE APTVKSYDG+KDPKDYVEVFEGLMDFQA
Sbjct: 61 TSDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQA------------------ 120
Query: 306 ATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPT 365
A+ R + FQE+QLKVA SDDSAMCYFLTGLADEALTVKLG+EAP
Sbjct: 121 ASDAIKCRAFQIALTGSARLWFQEDQLKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPA 180
Query: 366 TFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAEYRR 425
TFAE I RGRSGKDE+AD KSKDKGSFSSGRAE+RR
Sbjct: 181 TFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRR 240
Query: 426 AENGPTRSQPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHR 485
A NGPTRS+PYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHR
Sbjct: 241 AVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHR 300
Query: 486 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVIN 545
EH HNTSD WELKRQIEDLIQD YFKKFVGKP T SAEKKEERK SRTP RR DRPAVIN
Sbjct: 301 EHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVIN 360
Query: 546 AIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAP 605
IFGGPSGGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAP
Sbjct: 361 TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAP 420
Query: 606 LIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLP 665
LIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFSRESVIPEGCIDLP
Sbjct: 421 LIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLP 480
Query: 666 VTLGQNQTRITQMAEFMVVDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVR 725
VTLG +QT++TQMAEF+V+DGRS YNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VR
Sbjct: 481 VTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVR 540
Query: 726 GEQAASRECYAAALKGPSVCALETL--RDGILEFEADLPRKEFAAPTEELELVPLLSPEK 785
GEQ ASRECYA+ALKG SVCALETL RDG LEF+A+LPR+EFAAPTEELELVPLL +
Sbjct: 541 GEQIASRECYASALKGSSVCALETLVSRDGTLEFKANLPRREFAAPTEELELVPLLRYKY 600
Query: 786 QLVSAYETDLARSVPVEILDNPSILEPELMEIGAPEP 793
+E +L + +D+ +E G PEP
Sbjct: 601 NENIDHEQELDEKSSLNKIDDDIGVE------GMPEP 605
BLAST of Moc08g32590 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 816.6 bits (2108), Expect = 2.2e-232
Identity = 431/528 (81.63%), Postives = 445/528 (84.28%), Query Frame = 0
Query: 191 QAKSSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 250
+A+SS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQA----------------------- 310
EAPIPPKFKAPTVK YDG+KDPKDYVEVFE LMDFQA
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 ------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
LRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAE-------------------------I 430
HCSDDSAMCYFLTGLADEALTVKLGEEAP TFAE I
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSQPYERFTPTTIPISEILTNIE 490
GRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRS+PYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPGTGSAEKKEERKRSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELARAARREVCII 610
GKP T SAEKKEERKRSRTPPRRTDRPAVIN IFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLSTYLAL 655
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSL TYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
BLAST of Moc08g32590 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 802.0 bits (2070), Expect = 5.7e-228
Identity = 458/790 (57.97%), Postives = 512/790 (64.81%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREVGAAAVEEQGHDGLAAEPLRRSARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQREVGA VE QGH+ L EPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPTSEDFDALQREMEAMRMQMRSMEAMYNEMVLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRATRMDVREQRGSHLGPVEEERPEDNGSEGYTRQRGGLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHMSSNQQAKSSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLG 240
A+SS+NP G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQA-------------- 300
E F+SD+LEA IPPKFK PT+K YDG+KDPKDYVEVFE LMDFQA
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 ---------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
LR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAE----------------- 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP TFAE
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 --------IGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSQPYERFTPTTI 480
I +GR+GKD+ +AD KS+DKG S SS R +YRR+ + +S+PYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDL 540
PI EILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELA 600
IQDGYFKKFVGKP + S EKKEERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLSTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQNQTRITQMAEFMVV 720
ILSLSTYLALGWTRSQLK+SPTPLVGFS ES+ EGCIDLPV++ Q+ T++TQMAEF+V+
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSV 725
DGRS YNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE SRECYA+ K SV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
BLAST of Moc08g32590 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 743.0 bits (1917), Expect = 3.1e-210
Identity = 382/439 (87.02%), Postives = 395/439 (89.98%), Query Frame = 0
Query: 340 MCYFLTGLADEALTVKLGEEAPTTFAE------------------IGRGRSGKD-ERADP 399
MCYFLTGLADEALTVKL EEAP TFAE IG+GRSGKD E DP
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTKIGQGRSGKDMENTDP 60
Query: 400 KSKDKGSFSSGRAEYRRAENGPTRSQPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKL 459
KSKDKGSFS+GRAEYRRAENGPTRS+PYERFTPTTIPISEILTNIE+SGMEKLLKRPEKL
Sbjct: 61 KSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKL 120
Query: 460 RGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEER 519
RGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP T SAEKKEER
Sbjct: 121 RGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSSAEKKEER 180
Query: 520 KRSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA 579
KRSRTPPRRTDRPAVIN IFGGPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD A
Sbjct: 181 KRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTCPITFDXA 240
Query: 580 DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKRSPTPL 639
DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSL TYLALGWTRSQLK+SPTPL
Sbjct: 241 DLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL 300
Query: 640 VGFSRESVIPEGCIDLPVTLGQNQTRITQMAEFMVVDGRSTYNAIFGRPIIHSFRAIPST 699
VGFS ESV+PEGCIDLPVTLGQ+QTR+TQMAEF+VVDGRS YNAIFGRPIIHSFRAIPST
Sbjct: 301 VGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPST 360
Query: 700 LHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETL--RDGILEFEADLPRKEF 758
LHQVLKY TPNGVGTVRGEQ ASRECYA+ LKG SVCALETL RDG LEFEADLP +EF
Sbjct: 361 LHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEADLPXREF 420
BLAST of Moc08g32590 vs. NCBI nr
Match:
XP_022157676.1 (uncharacterized protein LOC111024332 [Momordica charantia])
HSP 1 Score: 662.1 bits (1707), Expect = 7.1e-186
Identity = 352/435 (80.92%), Postives = 357/435 (82.07%), Query Frame = 0
Query: 347 LADEALTVKLGEEAPTTFAE-------------------------IGRGRSGKDERADPK 406
+ADEALTVKLGEEAP TFAE IGRGRSGKDERADPK
Sbjct: 1 MADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPK 60
Query: 407 SKDKGSFSSGRAEYRRAENGPTRSQPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR 466
SKDKGSFSSGRAEYRRAENGPTRS+PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR
Sbjct: 61 SKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR 120
Query: 467 GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERK 526
GAPERRSKDK T SAEKKEERK
Sbjct: 121 GAPERRSKDK---------------------------------------TSSAEKKEERK 180
Query: 527 RSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD 586
RSRTPPRRTDRPAVIN IFGGPSGGQSGHKRKELAR ARREVCIIREQGPTCPITFDGAD
Sbjct: 181 RSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGAD 240
Query: 587 LEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKRSPTPLV 646
LEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSL TYLALGWTRSQLKRSPTPLV
Sbjct: 241 LEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLV 300
Query: 647 GFSRESVIPEGCIDLPVTLGQNQTRITQMAEFMVVDGRSTYNAIFGRPIIHSFRAIPSTL 706
GFS ESVIPEGCIDLPVTLGQ+QTR+TQM EF+VVDGRSTYNAIFGRPIIHSFR IPSTL
Sbjct: 301 GFSGESVIPEGCIDLPVTLGQDQTRVTQMTEFVVVDGRSTYNAIFGRPIIHSFRXIPSTL 360
Query: 707 HQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETLRDGILEFEADLPRKEFAAP 757
HQVLKY TPNGVGTVRGEQ SRECYAAALKG SVCALETLRDG LE EADLPRKEFAAP
Sbjct: 361 HQVLKYSTPNGVGTVRGEQTVSRECYAAALKGSSVCALETLRDGTLELEADLPRKEFAAP 396
BLAST of Moc08g32590 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 874.0 bits (2257), Expect = 5.7e-250
Identity = 475/637 (74.57%), Postives = 508/637 (79.75%), Query Frame = 0
Query: 186 MSSNQQAKSSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPF 245
MSSNQQA+SSHNPA G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGESPF
Sbjct: 1 MSSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPF 60
Query: 246 TSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQALRREFLAQFSSRHYDKKT 305
TSDVLE APTVKSYDG+KDPKDYVEVFEGLMDFQA
Sbjct: 61 TSDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQA------------------ 120
Query: 306 ATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPT 365
A+ R + FQE+QLKVA SDDSAMCYFLTGLADEALTVKLG+EAP
Sbjct: 121 ASDAIKCRAFQIALTGSARLWFQEDQLKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPA 180
Query: 366 TFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAEYRR 425
TFAE I RGRSGKDE+AD KSKDKGSFSSGRAE+RR
Sbjct: 181 TFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRR 240
Query: 426 AENGPTRSQPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHR 485
A NGPTRS+PYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHR
Sbjct: 241 AVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHR 300
Query: 486 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVIN 545
EH HNTSD WELKRQIEDLIQD YFKKFVGKP T SAEKKEERK SRTP RR DRPAVIN
Sbjct: 301 EHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVIN 360
Query: 546 AIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAP 605
IFGGPSGGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAP
Sbjct: 361 TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAP 420
Query: 606 LIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLP 665
LIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFSRESVIPEGCIDLP
Sbjct: 421 LIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLP 480
Query: 666 VTLGQNQTRITQMAEFMVVDGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVR 725
VTLG +QT++TQMAEF+V+DGRS YNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VR
Sbjct: 481 VTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVR 540
Query: 726 GEQAASRECYAAALKGPSVCALETL--RDGILEFEADLPRKEFAAPTEELELVPLLSPEK 785
GEQ ASRECYA+ALKG SVCALETL RDG LEF+A+LPR+EFAAPTEELELVPLL +
Sbjct: 541 GEQIASRECYASALKGSSVCALETLVSRDGTLEFKANLPRREFAAPTEELELVPLLRYKY 600
Query: 786 QLVSAYETDLARSVPVEILDNPSILEPELMEIGAPEP 793
+E +L + +D+ +E G PEP
Sbjct: 601 NENIDHEQELDEKSSLNKIDDDIGVE------GMPEP 605
BLAST of Moc08g32590 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 816.6 bits (2108), Expect = 1.1e-232
Identity = 431/528 (81.63%), Postives = 445/528 (84.28%), Query Frame = 0
Query: 191 QAKSSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 250
+A+SS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQA----------------------- 310
EAPIPPKFKAPTVK YDG+KDPKDYVEVFE LMDFQA
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 ------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
LRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAE-------------------------I 430
HCSDDSAMCYFLTGLADEALTVKLGEEAP TFAE I
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSQPYERFTPTTIPISEILTNIE 490
GRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRS+PYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPGTGSAEKKEERKRSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELARAARREVCII 610
GKP T SAEKKEERKRSRTPPRRTDRPAVIN IFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLSTYLAL 655
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSL TYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
BLAST of Moc08g32590 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 802.0 bits (2070), Expect = 2.8e-228
Identity = 458/790 (57.97%), Postives = 512/790 (64.81%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREVGAAAVEEQGHDGLAAEPLRRSARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQREVGA VE QGH+ L EPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPTSEDFDALQREMEAMRMQMRSMEAMYNEMVLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRATRMDVREQRGSHLGPVEEERPEDNGSEGYTRQRGGLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHMSSNQQAKSSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLG 240
A+SS+NP G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQA-------------- 300
E F+SD+LEA IPPKFK PT+K YDG+KDPKDYVEVFE LMDFQA
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 ---------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
LR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAE----------------- 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP TFAE
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 --------IGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSQPYERFTPTTI 480
I +GR+GKD+ +AD KS+DKG S SS R +YRR+ + +S+PYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDL 540
PI EILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELA 600
IQDGYFKKFVGKP + S EKKEERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLSTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQNQTRITQMAEFMVV 720
ILSLSTYLALGWTRSQLK+SPTPLVGFS ES+ EGCIDLPV++ Q+ T++TQMAEF+V+
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSTYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSV 725
DGRS YNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE SRECYA+ K SV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
BLAST of Moc08g32590 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 743.0 bits (1917), Expect = 1.5e-210
Identity = 382/439 (87.02%), Postives = 395/439 (89.98%), Query Frame = 0
Query: 340 MCYFLTGLADEALTVKLGEEAPTTFAE------------------IGRGRSGKD-ERADP 399
MCYFLTGLADEALTVKL EEAP TFAE IG+GRSGKD E DP
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTKIGQGRSGKDMENTDP 60
Query: 400 KSKDKGSFSSGRAEYRRAENGPTRSQPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKL 459
KSKDKGSFS+GRAEYRRAENGPTRS+PYERFTPTTIPISEILTNIE+SGMEKLLKRPEKL
Sbjct: 61 KSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKL 120
Query: 460 RGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEER 519
RGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP T SAEKKEER
Sbjct: 121 RGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSSAEKKEER 180
Query: 520 KRSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA 579
KRSRTPPRRTDRPAVIN IFGGPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD A
Sbjct: 181 KRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTCPITFDXA 240
Query: 580 DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKRSPTPL 639
DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSL TYLALGWTRSQLK+SPTPL
Sbjct: 241 DLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL 300
Query: 640 VGFSRESVIPEGCIDLPVTLGQNQTRITQMAEFMVVDGRSTYNAIFGRPIIHSFRAIPST 699
VGFS ESV+PEGCIDLPVTLGQ+QTR+TQMAEF+VVDGRS YNAIFGRPIIHSFRAIPST
Sbjct: 301 VGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPST 360
Query: 700 LHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETL--RDGILEFEADLPRKEF 758
LHQVLKY TPNGVGTVRGEQ ASRECYA+ LKG SVCALETL RDG LEFEADLP +EF
Sbjct: 361 LHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEADLPXREF 420
BLAST of Moc08g32590 vs. ExPASy TrEMBL
Match:
A0A6J1DYW5 (uncharacterized protein LOC111024332 OS=Momordica charantia OX=3673 GN=LOC111024332 PE=4 SV=1)
HSP 1 Score: 662.1 bits (1707), Expect = 3.4e-186
Identity = 352/435 (80.92%), Postives = 357/435 (82.07%), Query Frame = 0
Query: 347 LADEALTVKLGEEAPTTFAE-------------------------IGRGRSGKDERADPK 406
+ADEALTVKLGEEAP TFAE IGRGRSGKDERADPK
Sbjct: 1 MADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPK 60
Query: 407 SKDKGSFSSGRAEYRRAENGPTRSQPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR 466
SKDKGSFSSGRAEYRRAENGPTRS+PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR
Sbjct: 61 SKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR 120
Query: 467 GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERK 526
GAPERRSKDK T SAEKKEERK
Sbjct: 121 GAPERRSKDK---------------------------------------TSSAEKKEERK 180
Query: 527 RSRTPPRRTDRPAVINAIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD 586
RSRTPPRRTDRPAVIN IFGGPSGGQSGHKRKELAR ARREVCIIREQGPTCPITFDGAD
Sbjct: 181 RSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGPTCPITFDGAD 240
Query: 587 LEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLSTYLALGWTRSQLKRSPTPLV 646
LEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSL TYLALGWTRSQLKRSPTPLV
Sbjct: 241 LEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLV 300
Query: 647 GFSRESVIPEGCIDLPVTLGQNQTRITQMAEFMVVDGRSTYNAIFGRPIIHSFRAIPSTL 706
GFS ESVIPEGCIDLPVTLGQ+QTR+TQM EF+VVDGRSTYNAIFGRPIIHSFR IPSTL
Sbjct: 301 GFSGESVIPEGCIDLPVTLGQDQTRVTQMTEFVVVDGRSTYNAIFGRPIIHSFRXIPSTL 360
Query: 707 HQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETLRDGILEFEADLPRKEFAAP 757
HQVLKY TPNGVGTVRGEQ SRECYAAALKG SVCALETLRDG LE EADLPRKEFAAP
Sbjct: 361 HQVLKYSTPNGVGTVRGEQTVSRECYAAALKGSSVCALETLRDGTLELEADLPRKEFAAP 396
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D9E1 | 5.7e-250 | 74.57 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1C7X5 | 1.1e-232 | 81.63 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 2.8e-228 | 57.97 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DD03 | 1.5e-210 | 87.02 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
A0A6J1DYW5 | 3.4e-186 | 80.92 | uncharacterized protein LOC111024332 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
Match Name | E-value | Identity | Description | |