Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGTGATGCCCACCAGAGGAAGGTCGGAGCAGCGGCGGCAGAAGGGCAAGGTCACGACGGCCTGGCAGCGGAACCCCCCCGCAGGTTGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGAGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGACATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACATCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAGTCGCCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCACAGGGATAATCACAAGGGAGGAATTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAACGACGATTCACTGAACGATGGCGACTTGGGAGAATCACCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAATGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGCTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGACACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTCGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAAGACAAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCGAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTACTGCCGCTTCCATCGAGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACTAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCAACCGACCTGCGGTCATCAACACCATTTTTGGAGGACCAAGCGGGGTCAATCCGGGCATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGAGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGC
TGGACGAGGTCGCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATCGACTTACCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTTTGCGCCCTCGAAACTCGCAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGCCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGACGTCTTTGCATGGTCCCATGAGGACATGCCTGGCATCGACCCGCGAATTATGACGCATCGCCTCAGCATAGATCCATCATTCCGACCTGTGAAACAAAAGAGAAGACCTATAAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAACAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTTTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGACTTTACGAACTTAAATAAGACATGTCCGAAGGATTGCTTCCCACTGCCAAGGATTGATCAGCTCGTGGACGCCACAGCTGGGCACGAACTGCTCACTTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTTATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGGTTAAAAAACGCAGGAGCGAACTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTGAGTCGCATCTCTCCGACCTGGCCGAAGCCTTCGAAGTTCTGAGGGCATATCAAATGAAGCTCAACCCTGCCAAGTGTGCCTTTGGAGTCTCCTCGGGAAAATTCCTTGGCTTCATGGTAAACAATTGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGATCGAGATGGAGGCACCTAAAACGCTGAAACAGCTTCAGTGCCTCAATGGCAGGATTGCAGCCCTGAACTGGTTTGTTTCAAGGTCAACGGACAAGTGCCTCCTTTTCTTTAAGGTTTTACGAAAGAAAGGGCCGTTTCAATGGACGGCGGAGTGCGAGCAAGCGCTTCAGCAATTGAAGAACTACCTCTGTTCGGCACCCTTGCTTGCCAAGCCTATGCCGGGAGACAAGCTCCAATTATACCTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCCGGCAAGAGGAAGCGCGGCAAAACCCGGTTTACTACACAAGCAAGGCTATGACCGAAGCCGAGGCTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGACTTAGACCATACTTCCAAGCCCATACGGTGGTGGTACTCACTAACTCGCCCCTTAAAAGTATCTTCCACAAGCCGGAAGCTTCCAGGCGCCTAATGAAGTGGGCGATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTAAAAGGGCAAGCAGCGGAATATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGGGACCGACCTGCCTTGGACAGTCTACGTCGACGGATCCTCCAATGAGAGGGGGTGCGGAGCCGAGGTCCTCTTGCTCGGACCAGGGGGTGAACGATTTGAGTATGCCTTGCGGTTCAGCTTCCG
GACTTCTAACAACGAGGCTGAGTATGAAGCATTTACTGCTGGCCTGCGAATCGCTCGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCTAGCTGGTTGTGAACCAGATCAAGGACGAATACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCATACCTCAACCAGTTTCGAACTTACGAAGTAAGCCGGATTCCGCGGGCGGAGAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACAGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCACAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGTGGATATCATTGGCCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCACCAAGTGGGTCGAGGCCGAAGCGCTCTCCCACATAACGGAATCCAGGGTCACGTCCTTCGTATGGACGAACATCATATGTCGCTTTGGTATACCACAGGCCATAGTGACAGACAATGGCAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCATCTCAGCTCGTCCCCCGCACATCCACAAGCAAATGGGCAGGTGGAGGCGGTCAACAAGATCATCAAGCGAGGCATCAAACTCAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCCGCAGTTCTATGGTCGTACCGGACCACCCAGCGGGGGTCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGACTCCGAAGCTGTAGTCCCGGTCGAGGTCGGCATGCCATCTGACAGAGTAGAGCGTTACGAGCCTTCGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGACGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAAGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTAGGACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGTGATGCCCACCAGAGGAAGGTCGGAGCAGCGGCGGCAGAAGGGCAAGGTCACGACGGCCTGGCAGCGGAACCCCCCCGCAGGTTGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGAGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGACATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACATCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAGTCGCCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCACAGGGATAATCACAAGGGAGGAATTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAACGACGATTCACTGAACGATGGCGACTTGGGAGAATCACCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAATGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGCTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGACACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTCGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAAGACAAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCGAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTACTGCCGCTTCCATCGAGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACTAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCAACCGACCTGCGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGAGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATCGACTTACCGGTCACGCTGGG
GCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTTTGCGCCCTCGAAACTCGCAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGCCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACAGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTACGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAAGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTAGGACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGTGATGCCCACCAGAGGAAGGTCGGAGCAGCGGCGGCAGAAGGGCAAGGTCACGACGGCCTGGCAGCGGAACCCCCCCGCAGGTTGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGAGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGACATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACATCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAGTCGCCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCACAGGGATAATCACAAGGGAGGAATTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAACGACGATTCACTGAACGATGGCGACTTGGGAGAATCACCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAATGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGCTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGACACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTCGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAAGACAAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCGAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTACTGCCGCTTCCATCGAGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACTAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCAACCGACCTGCGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGAGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATCGACTTACCGGTCACGCTGGG
GCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTTTGCGCCCTCGAAACTCGCAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGCCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACAGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTACGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAAGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTAGGACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQRKVGAAAAEGQGHDGLAAEPPRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRTQMRSMEAMYNDMVLAAGAGSRSENRATRIDAREQRGSHLGPAEEEHPEDNGSEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPTGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPAHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETRRDGTLEFKADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEPSWMDPIADFIRGNSPQDPKERRKLARQAARFVIRDGALYRRGFSLPLLKCLTPEEGLRAMAQLRLAEYQGKMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPRTYVLADPKGDVLAHPWNAEHLKRYYP
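The CDS and the protein sequence above should agree codon by codon. The following is a minimal sketch of that check, assuming the standard genetic code applies to this gene; the `translate` helper is illustrative and not part of any database tooling, and the two fragments are copied from the first 24 and last 18 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code with codons enumerated in T, C, A, G order.
# Assumption: the standard code (NCBI translation table 1) applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


cds_start = "ATGGTTCAACCAGCGAATTCGACC"  # first 24 nt of the CDS
cds_end = "AAGCGTTATTATCCTTGA"          # last 18 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The first fragment should reproduce the N-terminal residues of the protein sequence, and the second should end in `*`, the stop codon that is not shown in the protein sequence.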
Homology
BLAST of Moc07g20260 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 875.5 bits (2261), Expect = 4.1e-250
Identity = 484/771 (62.78%), Positives = 540/771 (70.04%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQRKVGAAAAEGQGHDGLAAEPPRRLARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQR+VGA EGQGH+ L EP R ARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRTQMRSMEAMYNDMVLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRATRIDAREQRGSHLGPAEEEHPEDNGSEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNP--TGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLG 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+CE+ + S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IPPKFK PT+KPYDG+KDP DYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTRTTI 480
KTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ + +SRPYE +T TTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDL 540
PI EILTNIE++GMEKLLKRPEKLRG E+R+ DKYCRFHR+HGHNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------------------H 600
IQDGYFKKFVGKP ++S EKKEERKR RTPPRR +RPA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVINKKKELAREARREVCIIREQ 600
Query: 601 GPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWT 660
PT I F+ DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWT
Sbjct: 601 RPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWT 644
Query: 661 RSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRP 720
RSQL++SPTPLVGFSGES+ EGCIDLPV++ Q+ T++TQMAEFVV+DGRSAYNAIFGRP
Sbjct: 661 RSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRP 644
Query: 721 IIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE 747
IIHSFRA+PSTLHQVLKY T +GVGTVRGE SRECYA+ K SVCALE
Sbjct: 721 IIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALE 644
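Each HSP in this report carries its statistics on two lines: a score line (bit score, raw score, E-value) and an identity line (matches over alignment length). The sketch below shows one way to pull those numbers out and recompute the percentages from the fractions. It assumes the exact line layout used on this page; the `parse_hsp` helper and its dictionary keys are hypothetical names chosen for illustration.

```python
import re

# Patterns assume the line layout used in this report.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# "Pos\w+" tolerates spelling variants of the positives label.
FRACTION_RE = re.compile(r"(Identity|Pos\w+) = (\d+)/(\d+)")


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract HSP statistics and recompute percentages from the fractions."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError(f"unrecognized score line: {score_line!r}")
    stats = {
        "bit_score": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    for label, hits, length in FRACTION_RE.findall(identity_line):
        key = "identity_pct" if label == "Identity" else "positives_pct"
        stats[key] = round(100 * int(hits) / int(length), 2)
    return stats


stats = parse_hsp(
    "HSP 1 Score: 875.5 bits (2261), Expect = 4.1e-250",
    "Identity = 484/771 (62.78%), Positives = 540/771 (70.04%), Query Frame = 0",
)
print(stats)
```

Recomputing the percentages from the fractions is a cheap consistency check on the reported values: 484/771 and 540/771 round to the 62.78% and 70.04% shown for the first hit.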
BLAST of Moc07g20260 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 870.2 bits (2247), Expect = 1.7e-248
Identity = 456/528 (86.36%), Positives = 470/528 (89.02%), Query Frame = 0
Query: 191 QAESSHN---PTGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVL 250
+AESS N P G+ITREEFDQLRG+LDAQVEALKAKCEQ + LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPTVKPYDG+KDP DYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIE 490
GRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFT TTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
+SGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPGTSSAEKKEERKRSRTPPRRTNRPA-------------------------------- 610
GKP TSSAEKKEERKRSRTPPRRT+RPA
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 --HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
PTCPITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFV 681
GWTRSQL++SPTPLVGFSGESVIPEG IDLPVTLGQ+QT++TQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc07g20260 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 861.3 bits (2224), Expect = 8.0e-246
Identity = 479/671 (71.39%), Postives = 509/671 (75.86%), Query Frame = 0
Query: 187 SSNQQAESSHNPT---GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFT 246
SSNQQAESSHNP G+ITREEFDQLRG+L+AQVEALKAKCEQ + LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APTVK YDG+KDP DYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILT 486
ER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFT TTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 487 NIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 546
NIE+SGMEKLLKRPEKLRGA ERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 547 KFVGKPGTSSAEKKEERKRSRTPPRRTNRPA----------------------------- 606
KFVGKP TSSAEKKEERK SRTP RR +RPA
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 607 -----HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 666
PTCPITFD DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 667 LALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYN 726
LALGWTRSQL++S TPLVGFS ESVIPEGCIDLPVTLG +QT++TQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 727 AIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALET-- 786
AIFGRPIIHSFRAIPSTLHQVLKY TP+GVG VRGEQ ASRECYA+ALKG SVCALET
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 787 RRDGTLEFKADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE 819
RDGTLEFKA+LPR+EFAAPTEELELVPLL + +E +L + +D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDD----- 605
BLAST of Moc07g20260 vs. NCBI nr
Match:
XP_022156542.1 (uncharacterized protein LOC111023421 [Momordica charantia])
HSP 1 Score: 706.8 bits (1823), Expect = 2.5e-199
Identity = 365/376 (97.07%), Positives = 367/376 (97.61%), Query Frame = 0
Query: 200 GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 259
GIITREEFDQLRGELDAQVEALKAKCEQ DDSLNDGDLGESPFTSDVLEAPIPPKFKAPT
Sbjct: 26 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 85
Query: 260 VKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQ 319
VKPYDGTKDP DYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQ
Sbjct: 86 VKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQ 145
Query: 320 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL 379
LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Sbjct: 146 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL 205
Query: 380 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERA 439
TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERA
Sbjct: 206 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERA 265
Query: 440 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPE 499
DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFT TTIPI EILTNIE+SGMEKLLKRPE
Sbjct: 266 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE 325
Query: 500 KLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKE 559
KLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP TSSAEKKE
Sbjct: 326 KLRGAPERRSKDKYCRFHREHGHNTSDFWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE 385
Query: 560 ERKRSRTPPRRTNRPA 575
ERKRSRTPPRRT+RPA
Sbjct: 386 ERKRSRTPPRRTDRPA 401
BLAST of Moc07g20260 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 696.4 bits (1796), Expect = 3.4e-196
Identity = 377/544 (69.30%), Positives = 419/544 (77.02%), Query Frame = 0
Query: 280 MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAT 339
MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 340 HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF 399
HLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 400 AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRR 459
EVLQKAKKVIDGQELLRTKTGRPE++I + + +++R AD KS+DKGS SS R EYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 460 AENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHR 519
E+GP+RSRPYER+T +TIPISEILTNIE+SGMEKLLKRPEKLRG LE+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 520 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA--- 579
+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKRSRTPPRR +RPA
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 580 -------------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAP 639
H PTC ITF DLE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 640 LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLP 699
LIDH +VRRVL+DG GCIDLP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 700 VTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVR 759
VT+GQ+ T++TQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TP+ VG VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 760 GEQTASRECYAAALKGPSVCALE--TRRDGTLEFKADLP---RKEFAAPTEELELVPLLS 783
GEQ SRECYA+ALKG +VCALE T R E +ADLP +++F PTEELELVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 504
BLAST of Moc07g20260 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 875.5 bits (2261), Expect = 2.0e-250
Identity = 484/771 (62.78%), Positives = 540/771 (70.04%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQRKVGAAAAEGQGHDGLAAEPPRRLARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQR+VGA EGQGH+ L EP R ARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRTQMRSMEAMYNDMVLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRATRIDAREQRGSHLGPAEEEHPEDNGSEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNP--TGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLG 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+CE+ + S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IPPKFK PT+KPYDG+KDP DYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTRTTI 480
KTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ + +SRPYE +T TTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDL 540
PI EILTNIE++GMEKLLKRPEKLRG E+R+ DKYCRFHR+HGHNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------------------H 600
IQDGYFKKFVGKP ++S EKKEERKR RTPPRR +RPA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVINKKKELAREARREVCIIREQ 600
Query: 601 GPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWT 660
PT I F+ DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWT
Sbjct: 601 RPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWT 644
Query: 661 RSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRP 720
RSQL++SPTPLVGFSGES+ EGCIDLPV++ Q+ T++TQMAEFVV+DGRSAYNAIFGRP
Sbjct: 661 RSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRP 644
Query: 721 IIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE 747
IIHSFRA+PSTLHQVLKY T +GVGTVRGE SRECYA+ K SVCALE
Sbjct: 721 IIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALE 644
BLAST of Moc07g20260 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 870.2 bits (2247), Expect = 8.3e-249
Identity = 456/528 (86.36%), Positives = 470/528 (89.02%), Query Frame = 0
Query: 191 QAESSHN---PTGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVL 250
+AESS N P G+ITREEFDQLRG+LDAQVEALKAKCEQ + LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPTVKPYDG+KDP DYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIE 490
GRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFT TTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
+SGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPGTSSAEKKEERKRSRTPPRRTNRPA-------------------------------- 610
GKP TSSAEKKEERKRSRTPPRRT+RPA
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 --HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
PTCPITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFV 681
GWTRSQL++SPTPLVGFSGESVIPEG IDLPVTLGQ+QT++TQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc07g20260 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 861.3 bits (2224), Expect = 3.9e-246
Identity = 479/671 (71.39%), Postives = 509/671 (75.86%), Query Frame = 0
Query: 187 SSNQQAESSHNPT---GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFT 246
SSNQQAESSHNP G+ITREEFDQLRG+L+AQVEALKAKCEQ + LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APTVK YDG+KDP DYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILT 486
ER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFT TTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 487 NIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 546
NIE+SGMEKLLKRPEKLRGA ERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 547 KFVGKPGTSSAEKKEERKRSRTPPRRTNRPA----------------------------- 606
KFVGKP TSSAEKKEERK SRTP RR +RPA
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 607 -----HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 666
PTCPITFD DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 667 LALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYN 726
LALGWTRSQL++S TPLVGFS ESVIPEGCIDLPVTLG +QT++TQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 727 AIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALET-- 786
AIFGRPIIHSFRAIPSTLHQVLKY TP+GVG VRGEQ ASRECYA+ALKG SVCALET
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 787 RRDGTLEFKADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE 819
RDGTLEFKA+LPR+EFAAPTEELELVPLL + +E +L + +D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDD----- 605
BLAST of Moc07g20260 vs. ExPASy TrEMBL
Match:
A0A6J1DS95 (uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023421 PE=4 SV=1)
HSP 1 Score: 706.8 bits (1823), Expect = 1.2e-199
Identity = 365/376 (97.07%), Postives = 367/376 (97.61%), Query Frame = 0
Query: 200 GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 259
GIITREEFDQLRGELDAQVEALKAKCEQ DDSLNDGDLGESPFTSDVLEAPIPPKFKAPT
Sbjct: 26 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 85
Query: 260 VKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQ 319
VKPYDGTKDP DYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQ
Sbjct: 86 VKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQ 145
Query: 320 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL 379
LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Sbjct: 146 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL 205
Query: 380 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERA 439
TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERA
Sbjct: 206 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERA 265
Query: 440 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPE 499
DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFT TTIPI EILTNIE+SGMEKLLKRPE
Sbjct: 266 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE 325
Query: 500 KLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKE 559
KLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP TSSAEKKE
Sbjct: 326 KLRGAPERRSKDKYCRFHREHGHNTSDFWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE 385
Query: 560 ERKRSRTPPRRTNRPA 575
ERKRSRTPPRRT+RPA
Sbjct: 386 ERKRSRTPPRRTDRPA 401
BLAST of Moc07g20260 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 696.4 bits (1796), Expect = 1.6e-196
Identity = 377/544 (69.30%), Positives = 419/544 (77.02%), Query Frame = 0
Query: 280 MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAT 339
MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 340 HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF 399
HLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 400 AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRR 459
EVLQKAKKVIDGQELLRTKTGRPE++I + + +++R AD KS+DKGS SS R EYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 460 AENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHR 519
E+GP+RSRPYER+T +TIPISEILTNIE+SGMEKLLKRPEKLRG LE+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 520 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA--- 579
+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKRSRTPPRR +RPA
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 580 -------------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAP 639
H PTC ITF DLE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 640 LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLP 699
LIDH +VRRVL+DG GCIDLP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 700 VTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVR 759
VT+GQ+ T++TQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TP+ VG VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 760 GEQTASRECYAAALKGPSVCALE--TRRDGTLEFKADLP---RKEFAAPTEELELVPLLS 783
GEQ SRECYA+ALKG +VCALE T R E +ADLP +++F PTEELELVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 504
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1DHB3 | 2.0e-250 | 62.78 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1C7X5 | 8.3e-249 | 86.36 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1D9E1 | 3.9e-246 | 71.39 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DS95 | 1.2e-199 | 97.07 | uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DZB9 | 1.6e-196 | 69.30 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024...
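The summary table above is ordered by E-value, which is not the same order as percent identity: the hit with the highest identity covers a shorter alignment and so ranks fourth. A small sketch of reading the pipe-delimited rows and re-ranking them follows. The `parse_summary` helper and its field names are hypothetical, and the parser assumes only that the first four pipe-separated fields are match name, E-value, identity, and description.

```python
def parse_summary(lines):
    """Parse pipe-delimited summary rows, skipping header and short lines."""
    rows = []
    for line in lines:
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 4 or fields[0] == "Match Name":
            continue  # header row or a line that is not a table row
        rows.append({
            "match": fields[0],
            "evalue": float(fields[1]),
            "identity": float(fields[2]),
            "description": fields[3],  # kept as-is, including any truncation
        })
    return rows


table = [
    "Match Name | E-value | Identity | Description",
    "A0A6J1DHB3 | 2.0e-250 | 62.78 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...",
    "A0A6J1DS95 | 1.2e-199 | 97.07 | uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023...",
]

rows = parse_summary(table)
by_identity = sorted(rows, key=lambda row: row["identity"], reverse=True)
print([row["match"] for row in by_identity])
```

Extra trailing fields on a row are ignored, so the same function works whether or not a row carries additional columns after the description.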